superbenchmark

История

Ziyue Yang 8daef211dd Benchmarks - Add distributed inference benchmark (#493 ) Description This PR adds a micro-benchmark of distributed model inference workloads. Major Revision - Add a new micro-benchmark dist-inference. - Add corresponding example and unit tests. - Update configuration files to include this new micro-benchmark. - Update micro-benchmark README. --------- Co-authored-by: Peng Cheng <chengpeng5555@outlook.com>		2023-03-24 17:15:17 +08:00
..
computation_communication_overlap.py	Benchmarks: Add Benchmark - Add computation and communication overlap micro benchmark (#39 )	2021-04-14 18:07:06 +08:00
cpu_hpl_performance.py	Adding HPL benchmark (#482 )	2023-03-21 16:44:08 +00:00
cpu_memory_bw_latency_performance.py	Benchmarks: Add Benchmark - Add mlc benchmark to superbench (#216 )	2021-12-13 13:47:42 +08:00
cpu_stream_performance.py	Adding Stream Benchmark (#473 )	2023-02-13 15:34:37 -05:00
cublas_function.py	Benchmarks: Add benchmark - add micro benchmark for cublas test (#80 )	2021-05-31 10:31:53 +08:00
cuda_memory_bw_performance.py	Benchmarks: Add Benchmark - Add memory bandwidth benchmark for cuda. (#114 )	2021-07-13 17:30:19 +08:00
cudnn_function.py	Benchmarks: Add benchmark - add micro benchmark for cudnn test (#89 )	2021-06-01 22:24:35 +08:00
disk_performance.py	Benchmarks: Add Benchmark - Add disk performance benchmark (#132 )	2021-07-23 14:49:05 +08:00
dist_inference.py	Benchmarks - Add distributed inference benchmark (#493 )	2023-03-24 17:15:17 +08:00
fambench.py	Benchmarks: Add Benchmark - Add FAMBench based on docker benchmark (#338 )	2022-04-11 15:31:07 +08:00
gemm_flops_cuda_performance.py	Benchmarks: Add Benchmark - Add FLOPs performance benchmark for cuda. (#87 )	2021-06-02 09:15:58 +08:00
gpcnet_performance.py	Benchmarks: Add Benchmark - Add gpcnet microbenchmark (#229 )	2021-10-22 08:40:01 +00:00
gpu_burn_test.py	Benchmarks: Add Feature - Add GPU-Burn as microbenchmark (#324 )	2022-03-16 16:20:11 +08:00
gpu_copy_bw_performance.py	Benchmarks: Revise Code - Make data checking in gpu_copy optional (#301 )	2022-02-08 10:59:27 +08:00
ib_loopback_performance.py	Benchmarks: Add Benchmark - Add IB Loopback performance benchmark. (#112 )	2021-07-24 03:40:24 +08:00
ib_validation_performance.py	Benchmarks: Add Benchmark - Add ib traffic validation distributed benchmark (#215 )	2021-11-10 01:18:41 +08:00
kernel_launch_overhead.py	Benchmarks: Fix Typo - Fix typo in description of kernel launch overhead example (#244 )	2021-11-25 06:28:41 +00:00
matmul.py	unify arguments format by using whitespace. (#50 )	2021-04-12 22:02:12 +08:00
nccl_bw_performance.py	Benchmarks: Add Benchmark - Revise and add rccl microbenchmark for rocm (#143 )	2021-07-30 15:45:32 +08:00
ort_inference_performance.py	Benchmarks: Add Benchmark - Add ONNXRuntime inference benchmark based on ORT python API (#245 )	2021-12-10 13:53:11 +00:00
pytorch_bert_large.py	Benchmarks: Code Revision - Fix some issue for BERT benchmark. (#58 )	2021-04-16 13:17:42 +08:00
pytorch_cnn.py	expose interface of pin memory and modify cnn configuration (#75 )	2021-05-19 10:52:45 +08:00
pytorch_gpt2_large.py	Benchmarks: Add Benchmark - Add GPT2 model benchmark. (#57 )	2021-04-16 11:39:57 +08:00
pytorch_lstm.py	Benchmarks: Add Benchmark - Add LSTM model benchmarks. (#60 )	2021-04-20 10:53:44 +08:00
rocm_gemm_flops_performance.py	Benchmarks: Add Benchmark - Add gemm flops microbenchmark for amd (#152 )	2021-08-30 13:40:46 +08:00
rocm_memory_bw_performance.py	Benchmarks: Add Benchmark - Add memory bus bandwidth performance microbenchmark for amd (#153 )	2021-08-27 21:17:39 +08:00
rocm_onnxruntime_model_benchmark.py	Benchmarks: Add Benchmark - Add onnx model benchmarks based on docker image. (#227 )	2021-10-27 18:41:40 +08:00
sharding_matmul.py	Benchmarks: Add Benchmark - Add computation and communication overlap micro benchmark (#39 )	2021-04-14 18:07:06 +08:00
tcp_connectivity.py	Benchmarks: Add Benchmark - Add tcp connectivity validation microbenchmark (#217 )	2021-10-12 23:42:12 +00:00
tensorrt_inference_performance.py	Benchmarks - Add TensorRT inference benchmark (#236 )	2021-11-12 15:27:16 +08:00