superbenchmark

Граф коммитов

Автор	SHA1	Сообщение	Дата
Ziyue Yang	4fa60be7cd	Benchmarks: Micro benchmark - Add one-to-all, all-to-one, all-to-all support to gpu_copy_bw_performance (#588 ) Description Add one-to-all, all-to-one, all-to-all support to gpu_copy_bw_performance, and fix performance bug in gpu_copy	2023-12-08 23:22:38 +08:00
Ziyue Yang	8daef211dd	Benchmarks - Add distributed inference benchmark (#493 ) Description This PR adds a micro-benchmark of distributed model inference workloads. Major Revision - Add a new micro-benchmark dist-inference. - Add corresponding example and unit tests. - Update configuration files to include this new micro-benchmark. - Update micro-benchmark README. --------- Co-authored-by: Peng Cheng <chengpeng5555@outlook.com>	2023-03-24 17:15:17 +08:00
rafsalas19	655bd0aa59	Adding HPL benchmark (#482 ) Description - Adding HPL benchmark --------- Co-authored-by: Ubuntu <azureuser@sbtestvm.jzlku1oskncengjiado35wf1hd.ax.internal.cloudapp.net> Co-authored-by: Peng Cheng <chengpeng5555@outlook.com>	2023-03-21 16:44:08 +00:00
rafsalas19	32896ca477	Adding Stream Benchmark (#473 ) Description - Added stream benchmark - Added stream unit test - Added stream example - Modified docker files to build stream --------- Co-authored-by: Ubuntu <azureuser@sbtestvm.jzlku1oskncengjiado35wf1hd.ax.internal.cloudapp.net> Co-authored-by: Peng Cheng <chengpeng5555@outlook.com> Co-authored-by: Yifan Xiong <xiongyf@yandex.com>	2023-02-13 15:34:37 -05:00
guoshzhao	80dcc8aaec	Benchmarks: Add Benchmark - Add FAMBench based on docker benchmark (#338 ) Description Integrate FAMBench into superbench based on docker implementation: https://github.com/facebookresearch/FAMBench The script to run all benchmarks is: https://github.com/facebookresearch/FAMBench/blob/main/benchmarks/run_all.sh	2022-04-11 15:31:07 +08:00
rafsalas19	ff51a3cee9	Benchmarks: Add Feature - Add GPU-Burn as microbenchmark (#324 ) Description Modifications adding GPU-Burn to SuperBench. - added third party submodule - modified Makefile to make gpu-burn binary - added/modified microbenchmarks to add gpu-burn python scripts - modified default and azure_ndv4 configs to add gpu-burn	2022-03-16 16:20:11 +08:00
Ziyue Yang	682b2c120d	Benchmarks: Revise Code - Make data checking in gpu_copy optional (#301 ) This commit makes data checking in gpu_copy optional, because it will take too long time if message size is large.	2022-02-08 10:59:27 +08:00
Ziyue Yang	74421ffee0	Benchmarks: Add Feature - Add bidirectional test support in gpu_copy benchmark (#285 ) Description This commit adds bidirectional tests in gpu_copy benchmark for both device-host transfer and device-device transfer, and revises related tests.	2022-01-21 13:45:37 +08:00
Hossein Pourreza	b590409e0f	Benchmarks: Add Benchmark - Add mlc benchmark to superbench (#216 ) Description Add mlc memory bandwidth and latency micro benchmark to Superbench. Major Revision - Add mlc benchmark with test and example files	2021-12-13 13:47:42 +08:00
guoshzhao	4d85630abb	Benchmarks: Add Benchmark - Add ONNXRuntime inference benchmark based on ORT python API (#245 ) Description Add ONNXRuntime inference benchmark based on ORT python API. Major Revision - Add `ORTInferenceBenchmark` class to export pytorch model to onnx model and do inference - Add tests and example for `ort-inference` benchmark - Update the introduction docs.	2021-12-10 13:53:11 +00:00
Kaiyu Xie	6d85b03a6e	Benchmarks: Fix Typo - Fix typo in description of kernel launch overhead example (#244 ) Description Fix typo in description of kernel_launch_overhead.py	2021-11-25 06:28:41 +00:00
Yifan Xiong	8a00c8a03b	Benchmarks - Add TensorRT inference benchmark (#236 ) __Description__ Add TensorRT inference benchmark for torchvision models. __Major Revision__ - Measure TensorRT inference performance.	2021-11-12 15:27:16 +08:00
Yuting Jiang	54919424c3	Benchmarks: Add Benchmark - Add ib traffic validation distributed benchmark (#215 ) Description Add ib traffic validation distributed benchmark. Major Revision - Add ib traffic validation distributed benchmark, example and test	2021-11-10 01:18:41 +08:00
Ziyue Yang	008e0fe1d8	Benchmarks: Add Feature - Add CPU-initiated copy and dtod support to gpu-sm-copy benchmark (#230 ) Description This commit does the following: 1) Adds CPU-initiated copy benchmark; 2) Adds dtod benchmark; 3) Support scanning NUMA nodes and GPUs inside the benchmark program; 4) Change the name of gpu-sm-copy to gpu-copy.	2021-10-30 11:19:09 +08:00
guoshzhao	e98a68124e	Benchmarks: Add Benchmark - Add onnx model benchmarks based on docker image. (#227 ) Add RocmOnnxModelBenchmark class to run benchmarks packaged in superbench/benchmark:rocm4.3.1-onnxruntime1.9.0	2021-10-27 18:41:40 +08:00
Yuting Jiang	6003f2c2a2	Benchmarks: Add Benchmark - Add gpcnet microbenchmark (#229 ) Description Add gpcnet microbenchmark Major Revision - add 2 microbenmark for gpcnet, gpc-network-test, gpc-network-load-test - add related test and example file	2021-10-22 08:40:01 +00:00
Yuting Jiang	49cc8f9a8c	Benchmarks: Add Benchmark - Add tcp connectivity validation microbenchmark (#217 ) Description Add tcp connectivity validation microbenchmark which is to validate TCP connectivity between current node and several nodes in the hostfile. Major Revision - Add tcp connectivity validation microbenchmark and related test, example	2021-10-12 23:42:12 +00:00
Ziyue Yang	b97197f08e	Benchmarks: Add Benchmark - Add GPU SM copy benchmark (#169 ) Description This commit adds gpu_sm_copy benchmark and related tests.	2021-08-30 18:54:26 +08:00
Yuting Jiang	f3d53c3d5f	Benchmarks: Add Benchmark - Add gemm flops microbenchmark for amd (#152 ) Description Add gemm flops microbenchmark for amd. Major Revision - Add gemm flops microbenchmark for amd. - Add related example and test file.	2021-08-30 13:40:46 +08:00
Yuting Jiang	666e3a9471	Benchmarks: Add Benchmark - Add memory bus bandwidth performance microbenchmark for amd (#153 ) Description Add memory bus bandwidth performance microbenchmark for amd. Major Revision - Add memory bus bandwidth performance microbenchmark for amd. - Add related example and test file.	2021-08-27 21:17:39 +08:00
Yuting Jiang	157b4e2dd1	Benchmarks: Add Benchmark - Revise and add rccl microbenchmark for rocm (#143 ) Description Add rccl bandwidth microbenchmark for rocm. Major Revision - Register rccl-bw benchmark.	2021-07-30 15:45:32 +08:00
Yuting Jiang	e083a598cf	Benchmarks: Add Benchmark - Add NCCL performance benchmark (#113 ) Description Add NCCL performance microbenchmark. Major Revision - Add microbenchmark, example, test, config for NCCL	2021-07-26 10:54:47 +08:00
Yuting Jiang	b0c5addcac	Benchmarks: Add Benchmark - Add IB Loopback performance benchmark. (#112 ) Description Add RDMA Loopback performance microbenchmark. Major Revision - Add microbenchmark, example, test, config for RDMA Loopback	2021-07-24 03:40:24 +08:00
Ziyue Yang	db297fb4ed	Benchmarks: Add Benchmark - Add disk performance benchmark (#132 ) Description Add disk performance microbenchmark. Major Revision - Add microbenchmark, example, test, config for disk performance. Minor Revision - Fix bugs in executor unit test related to default enabled tests.	2021-07-23 14:49:05 +08:00
Yuting Jiang	f9550bd693	Benchmarks: Add Benchmark - Add memory bandwidth benchmark for cuda. (#114 ) Add microbenchmark, example, test, config for cuda memory performance and Add cuda-samples(tag with cuda version) as git submodule and update related makefile	2021-07-13 17:30:19 +08:00
guoshzhao	6c6f526937	Benchmarks: Add Benchmark - Add FLOPs performance benchmark for cuda. (#87 ) * add cuda flops performance benchmark.	2021-06-02 09:15:58 +08:00
Yuting Jiang	83235433b2	Benchmarks: Add benchmark - add micro benchmark for cudnn test (#89 ) * add python related cudnn microbenchmark	2021-06-01 22:24:35 +08:00
Yuting Jiang	18398fbaa2	Benchmarks: Add benchmark - add micro benchmark for cublas test (#80 ) * add benchmark for cublas test * format * revise error handling and test * add interface to read json file, revise json file path and include .json in packaging * add random_seed in arguments * revise preprocess of cublas benchmark * fix lint error and note error in source code * update according comments * revise input arguments from json file to custom str and convert json file to built-in dict list * restore package config * fit lint issue * update platform and comments * rename files to match source code dir and fix comments error Co-authored-by: root <root@sb-validation-000001.51z1chmys5fuzfqyo4niepozre.bx.internal.cloudapp.net>	2021-05-31 10:31:53 +08:00
guoshzhao	e977bbc17f	Benchmarks: Add Benchmark - Add kernel launch overhead benchmark. (#74 ) * add kernel launch overhead benchmark.	2021-05-19 17:06:55 +08:00
Yuting Jiang	b7d0ee329f	expose interface of pin memory and modify cnn configuration (#75 )	2021-05-19 10:52:45 +08:00
guoshzhao	65292ae55b	Benchmarks: Code Revision - Revise the settings of CNN example models. (#65 ) * revise example settings of cnn models.	2021-04-26 21:23:11 +08:00
guoshzhao	2a7ab691f1	Benchmarks: Add Benchmark - Add LSTM model benchmarks. (#60 ) * Benchmarks: Add Benchmark - Add LSTM model benchmarks.	2021-04-20 10:53:44 +08:00
guoshzhao	902ea211d1	Benchmarks: Add Benchmark - Add CNN model benchmarks. (#59 ) * Benchmarks: Add Benchmark - Add CNN model benchmarks.	2021-04-20 10:43:02 +08:00
guoshzhao	ce3ed24ab7	Benchmarks: Code Revision - Fix some issue for BERT benchmark. (#58 ) Benchmarks: Code Revision - Fix some issue for BERT benchmark. (#58)	2021-04-16 13:17:42 +08:00
guoshzhao	af567cf650	Benchmarks: Add Benchmark - Add GPT2 model benchmark. (#57 ) * Benchmarks: Add Benchmark - Add GPT2 model benchmark.	2021-04-16 11:39:57 +08:00
Yuting Jiang	435b2d5eeb	Benchmarks: Add Benchmark - Add computation and communication overlap micro benchmark (#39 ) * Benchmarks: Add Benchmark - add computation and communication overlap micro benchmark * Benchmarks: Add benchmark - fix some format issues and typo * Benchmarks: Add Benchmark - update according comments and add test * revise tests * skip multi gpu test due to no multi gpu Co-authored-by: v-yujiang <v-yujiang@microsoft.com>	2021-04-14 18:07:06 +08:00
guoshzhao	4664019ac3	unify arguments format by using whitespace. (#50 ) Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com> Co-authored-by: Yifan Xiong <yifan.xiong@microsoft.com>	2021-04-12 22:02:12 +08:00
guoshzhao	4561c83555	specify the framework for matmul example. (#37 ) Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>	2021-04-09 17:38:03 +08:00
guoshzhao	f0f65a719b	Benchmarks: Add Benchmark - Add op-sharding microbenchmark, including matmul and sharding_matmul. (#36 ) * add microbenchmark - sharding matmul. * address comments. Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>	2021-04-09 15:02:12 +08:00
guoshzhao	923ce2773f	Benchmarks: Code Revision - Revise BenchmarkRegistry interfaces for integration with executor. (#33 ) * revise BenchmarkRegistry interfaces. * address comments Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>	2021-04-08 23:17:03 +08:00
guoshzhao	28de899375	Benchmarks: Add Example - Add pytorch-bert-large as the model benchmark example. (#30 ) * add bert-large as the model benchmark example * add more arguments. * address comments. * delete duplicated file. Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>	2021-03-29 12:39:38 +08:00

41 Коммитов