computation_communication_overlap.py
Benchmarks: Add Benchmark - Add computation and communication overlap micro benchmark ( #39 )
2021-04-14 18:07:06 +08:00
cpu_hpl_performance.py
Adding HPL benchmark ( #482 )
2023-03-21 16:44:08 +00:00
cpu_memory_bw_latency_performance.py
Benchmarks: Add Benchmark - Add mlc benchmark to superbench ( #216 )
2021-12-13 13:47:42 +08:00
cpu_stream_performance.py
Adding Stream Benchmark ( #473 )
2023-02-13 15:34:37 -05:00
cublas_function.py
Benchmarks: Add benchmark - add micro benchmark for cublas test ( #80 )
2021-05-31 10:31:53 +08:00
cuda_memory_bw_performance.py
Benchmarks: Add Benchmark - Add memory bandwidth benchmark for cuda. ( #114 )
2021-07-13 17:30:19 +08:00
cudnn_function.py
Benchmarks: Add benchmark - add micro benchmark for cudnn test ( #89 )
2021-06-01 22:24:35 +08:00
disk_performance.py
Benchmarks: Add Benchmark - Add disk performance benchmark ( #132 )
2021-07-23 14:49:05 +08:00
dist_inference.py
Benchmarks - Add distributed inference benchmark ( #493 )
2023-03-24 17:15:17 +08:00
fambench.py
Benchmarks: Add Benchmark - Add FAMBench based on docker benchmark ( #338 )
2022-04-11 15:31:07 +08:00
gemm_flops_cuda_performance.py
Benchmarks: Add Benchmark - Add FLOPs performance benchmark for cuda. ( #87 )
2021-06-02 09:15:58 +08:00
gpcnet_performance.py
Benchmarks: Add Benchmark - Add gpcnet microbenchmark ( #229 )
2021-10-22 08:40:01 +00:00
gpu_burn_test.py
Benchmarks: Add Feature - Add GPU-Burn as microbenchmark ( #324 )
2022-03-16 16:20:11 +08:00
gpu_copy_bw_performance.py
Benchmarks: Revise Code - Make data checking in gpu_copy optional ( #301 )
2022-02-08 10:59:27 +08:00
ib_loopback_performance.py
Benchmarks: Add Benchmark - Add IB Loopback performance benchmark. ( #112 )
2021-07-24 03:40:24 +08:00
ib_validation_performance.py
Benchmarks: Add Benchmark - Add ib traffic validation distributed benchmark ( #215 )
2021-11-10 01:18:41 +08:00
kernel_launch_overhead.py
Benchmarks: Fix Typo - Fix typo in description of kernel launch overhead example ( #244 )
2021-11-25 06:28:41 +00:00
matmul.py
unify arguments format by using whitespace. ( #50 )
2021-04-12 22:02:12 +08:00
nccl_bw_performance.py
Benchmarks: Add Benchmark - Revise and add rccl microbenchmark for rocm ( #143 )
2021-07-30 15:45:32 +08:00
ort_inference_performance.py
Benchmarks: Add Benchmark - Add ONNXRuntime inference benchmark based on ORT python API ( #245 )
2021-12-10 13:53:11 +00:00
pytorch_bert_large.py
Benchmarks: Code Revision - Fix some issue for BERT benchmark. ( #58 )
2021-04-16 13:17:42 +08:00
pytorch_cnn.py
expose interface of pin memory and modify cnn configuration ( #75 )
2021-05-19 10:52:45 +08:00
pytorch_gpt2_large.py
Benchmarks: Add Benchmark - Add GPT2 model benchmark. ( #57 )
2021-04-16 11:39:57 +08:00
pytorch_lstm.py
Benchmarks: Add Benchmark - Add LSTM model benchmarks. ( #60 )
2021-04-20 10:53:44 +08:00
rocm_gemm_flops_performance.py
Benchmarks: Add Benchmark - Add gemm flops microbenchmark for amd ( #152 )
2021-08-30 13:40:46 +08:00
rocm_memory_bw_performance.py
Benchmarks: Add Benchmark - Add memory bus bandwidth performance microbenchmark for amd ( #153 )
2021-08-27 21:17:39 +08:00
rocm_onnxruntime_model_benchmark.py
Benchmarks: Add Benchmark - Add onnx model benchmarks based on docker image. ( #227 )
2021-10-27 18:41:40 +08:00
sharding_matmul.py
Benchmarks: Add Benchmark - Add computation and communication overlap micro benchmark ( #39 )
2021-04-14 18:07:06 +08:00
tcp_connectivity.py
Benchmarks: Add Benchmark - Add tcp connectivity validation microbenchmark ( #217 )
2021-10-12 23:42:12 +00:00
tensorrt_inference_performance.py
Benchmarks - Add TensorRT inference benchmark ( #236 )
2021-11-12 15:27:16 +08:00