.. |
computation_communication_overlap.py
|
Benchmarks: Add Benchmark - Add computation and communication overlap micro benchmark (#39)
|
2021-04-14 18:07:06 +08:00 |
cpu_hpl_performance.py
|
Adding HPL benchmark (#482)
|
2023-03-21 16:44:08 +00:00 |
cpu_memory_bw_latency_performance.py
|
Benchmarks: Add Benchmark - Add mlc benchmark to superbench (#216)
|
2021-12-13 13:47:42 +08:00 |
cpu_stream_performance.py
|
Adding Stream Benchmark (#473)
|
2023-02-13 15:34:37 -05:00 |
cublas_function.py
|
Benchmarks: Add benchmark - add micro benchmark for cublas test (#80)
|
2021-05-31 10:31:53 +08:00 |
cuda_memory_bw_performance.py
|
Benchmarks: Add Benchmark - Add memory bandwidth benchmark for cuda. (#114)
|
2021-07-13 17:30:19 +08:00 |
cudnn_function.py
|
Benchmarks: Add benchmark - add micro benchmark for cudnn test (#89)
|
2021-06-01 22:24:35 +08:00 |
disk_performance.py
|
Benchmarks: Add Benchmark - Add disk performance benchmark (#132)
|
2021-07-23 14:49:05 +08:00 |
dist_inference.py
|
Benchmarks - Add distributed inference benchmark (#493)
|
2023-03-24 17:15:17 +08:00 |
fambench.py
|
Benchmarks: Add Benchmark - Add FAMBench based on docker benchmark (#338)
|
2022-04-11 15:31:07 +08:00 |
gemm_flops_cuda_performance.py
|
Benchmarks: Add Benchmark - Add FLOPs performance benchmark for cuda. (#87)
|
2021-06-02 09:15:58 +08:00 |
gpcnet_performance.py
|
Benchmarks: Add Benchmark - Add gpcnet microbenchmark (#229)
|
2021-10-22 08:40:01 +00:00 |
gpu_burn_test.py
|
Benchmarks: Add Feature - Add GPU-Burn as microbenchmark (#324)
|
2022-03-16 16:20:11 +08:00 |
gpu_copy_bw_performance.py
|
Benchmarks: Micro benchmark - Add one-to-all, all-to-one, all-to-all support to gpu_copy_bw_performance (#588)
|
2023-12-08 23:22:38 +08:00 |
ib_loopback_performance.py
|
Benchmarks: Add Benchmark - Add IB Loopback performance benchmark. (#112)
|
2021-07-24 03:40:24 +08:00 |
ib_validation_performance.py
|
Benchmarks: Add Benchmark - Add ib traffic validation distributed benchmark (#215)
|
2021-11-10 01:18:41 +08:00 |
kernel_launch_overhead.py
|
Benchmarks: Fix Typo - Fix typo in description of kernel launch overhead example (#244)
|
2021-11-25 06:28:41 +00:00 |
matmul.py
|
unify arguments format by using whitespace. (#50)
|
2021-04-12 22:02:12 +08:00 |
nccl_bw_performance.py
|
Benchmarks: Add Benchmark - Revise and add rccl microbenchmark for rocm (#143)
|
2021-07-30 15:45:32 +08:00 |
ort_inference_performance.py
|
Benchmarks: Add Benchmark - Add ONNXRuntime inference benchmark based on ORT python API (#245)
|
2021-12-10 13:53:11 +00:00 |
pytorch_bert_large.py
|
Benchmarks: Code Revision - Fix some issue for BERT benchmark. (#58)
|
2021-04-16 13:17:42 +08:00 |
pytorch_cnn.py
|
expose interface of pin memory and modify cnn configuration (#75)
|
2021-05-19 10:52:45 +08:00 |
pytorch_gpt2_large.py
|
Benchmarks: Add Benchmark - Add GPT2 model benchmark. (#57)
|
2021-04-16 11:39:57 +08:00 |
pytorch_lstm.py
|
Benchmarks: Add Benchmark - Add LSTM model benchmarks. (#60)
|
2021-04-20 10:53:44 +08:00 |
rocm_gemm_flops_performance.py
|
Benchmarks: Add Benchmark - Add gemm flops microbenchmark for amd (#152)
|
2021-08-30 13:40:46 +08:00 |
rocm_memory_bw_performance.py
|
Benchmarks: Add Benchmark - Add memory bus bandwidth performance microbenchmark for amd (#153)
|
2021-08-27 21:17:39 +08:00 |
rocm_onnxruntime_model_benchmark.py
|
Benchmarks: Add Benchmark - Add onnx model benchmarks based on docker image. (#227)
|
2021-10-27 18:41:40 +08:00 |
sharding_matmul.py
|
Benchmarks: Add Benchmark - Add computation and communication overlap micro benchmark (#39)
|
2021-04-14 18:07:06 +08:00 |
tcp_connectivity.py
|
Benchmarks: Add Benchmark - Add tcp connectivity validation microbenchmark (#217)
|
2021-10-12 23:42:12 +00:00 |
tensorrt_inference_performance.py
|
Benchmarks - Add TensorRT inference benchmark (#236)
|
2021-11-12 15:27:16 +08:00 |