superbenchmark/examples/benchmarks
Ziyue Yang 4fa60be7cd
Benchmarks: Micro benchmark - Add one-to-all, all-to-one, all-to-all support to gpu_copy_bw_performance (#588)
**Description**
Add one-to-all, all-to-one, and all-to-all support to
gpu_copy_bw_performance, and fix a performance bug in gpu_copy.
2023-12-08 23:22:38 +08:00
computation_communication_overlap.py Benchmarks: Add Benchmark - Add computation and communication overlap micro benchmark (#39) 2021-04-14 18:07:06 +08:00
cpu_hpl_performance.py Adding HPL benchmark (#482) 2023-03-21 16:44:08 +00:00
cpu_memory_bw_latency_performance.py Benchmarks: Add Benchmark - Add mlc benchmark to superbench (#216) 2021-12-13 13:47:42 +08:00
cpu_stream_performance.py Adding Stream Benchmark (#473) 2023-02-13 15:34:37 -05:00
cublas_function.py Benchmarks: Add benchmark - add micro benchmark for cublas test (#80) 2021-05-31 10:31:53 +08:00
cuda_memory_bw_performance.py Benchmarks: Add Benchmark - Add memory bandwidth benchmark for cuda. (#114) 2021-07-13 17:30:19 +08:00
cudnn_function.py Benchmarks: Add benchmark - add micro benchmark for cudnn test (#89) 2021-06-01 22:24:35 +08:00
disk_performance.py Benchmarks: Add Benchmark - Add disk performance benchmark (#132) 2021-07-23 14:49:05 +08:00
dist_inference.py Benchmarks - Add distributed inference benchmark (#493) 2023-03-24 17:15:17 +08:00
fambench.py Benchmarks: Add Benchmark - Add FAMBench based on docker benchmark (#338) 2022-04-11 15:31:07 +08:00
gemm_flops_cuda_performance.py Benchmarks: Add Benchmark - Add FLOPs performance benchmark for cuda. (#87) 2021-06-02 09:15:58 +08:00
gpcnet_performance.py Benchmarks: Add Benchmark - Add gpcnet microbenchmark (#229) 2021-10-22 08:40:01 +00:00
gpu_burn_test.py Benchmarks: Add Feature - Add GPU-Burn as microbenchmark (#324) 2022-03-16 16:20:11 +08:00
gpu_copy_bw_performance.py Benchmarks: Micro benchmark - Add one-to-all, all-to-one, all-to-all support to gpu_copy_bw_performance (#588) 2023-12-08 23:22:38 +08:00
ib_loopback_performance.py Benchmarks: Add Benchmark - Add IB Loopback performance benchmark. (#112) 2021-07-24 03:40:24 +08:00
ib_validation_performance.py Benchmarks: Add Benchmark - Add ib traffic validation distributed benchmark (#215) 2021-11-10 01:18:41 +08:00
kernel_launch_overhead.py Benchmarks: Fix Typo - Fix typo in description of kernel launch overhead example (#244) 2021-11-25 06:28:41 +00:00
matmul.py unify arguments format by using whitespace. (#50) 2021-04-12 22:02:12 +08:00
nccl_bw_performance.py Benchmarks: Add Benchmark - Revise and add rccl microbenchmark for rocm (#143) 2021-07-30 15:45:32 +08:00
ort_inference_performance.py Benchmarks: Add Benchmark - Add ONNXRuntime inference benchmark based on ORT python API (#245) 2021-12-10 13:53:11 +00:00
pytorch_bert_large.py Benchmarks: Code Revision - Fix some issue for BERT benchmark. (#58) 2021-04-16 13:17:42 +08:00
pytorch_cnn.py expose interface of pin memory and modify cnn configuration (#75) 2021-05-19 10:52:45 +08:00
pytorch_gpt2_large.py Benchmarks: Add Benchmark - Add GPT2 model benchmark. (#57) 2021-04-16 11:39:57 +08:00
pytorch_lstm.py Benchmarks: Add Benchmark - Add LSTM model benchmarks. (#60) 2021-04-20 10:53:44 +08:00
rocm_gemm_flops_performance.py Benchmarks: Add Benchmark - Add gemm flops microbenchmark for amd (#152) 2021-08-30 13:40:46 +08:00
rocm_memory_bw_performance.py Benchmarks: Add Benchmark - Add memory bus bandwidth performance microbenchmark for amd (#153) 2021-08-27 21:17:39 +08:00
rocm_onnxruntime_model_benchmark.py Benchmarks: Add Benchmark - Add onnx model benchmarks based on docker image. (#227) 2021-10-27 18:41:40 +08:00
sharding_matmul.py Benchmarks: Add Benchmark - Add computation and communication overlap micro benchmark (#39) 2021-04-14 18:07:06 +08:00
tcp_connectivity.py Benchmarks: Add Benchmark - Add tcp connectivity validation microbenchmark (#217) 2021-10-12 23:42:12 +00:00
tensorrt_inference_performance.py Benchmarks - Add TensorRT inference benchmark (#236) 2021-11-12 15:27:16 +08:00