Commit graph

40 commits

Author SHA1 Message Date
Ziyue Yang 8daef211dd
Benchmarks - Add distributed inference benchmark (#493)
**Description**
This PR adds a micro-benchmark of distributed model inference workloads.

**Major Revision**
- Add a new micro-benchmark dist-inference.
- Add corresponding example and unit tests.
- Update configuration files to include this new micro-benchmark.
- Update micro-benchmark README.

---------

Co-authored-by: Peng Cheng <chengpeng5555@outlook.com>
2023-03-24 17:15:17 +08:00
rafsalas19 655bd0aa59
Adding HPL benchmark (#482)
**Description**

- Adding HPL benchmark

---------

Co-authored-by: Ubuntu <azureuser@sbtestvm.jzlku1oskncengjiado35wf1hd.ax.internal.cloudapp.net>
Co-authored-by: Peng Cheng <chengpeng5555@outlook.com>
2023-03-21 16:44:08 +00:00
rafsalas19 32896ca477
Adding Stream Benchmark (#473)
**Description**

- Added stream benchmark
- Added stream unit test
- Added stream example
- Modified docker files to build stream

---------

Co-authored-by: Ubuntu <azureuser@sbtestvm.jzlku1oskncengjiado35wf1hd.ax.internal.cloudapp.net>
Co-authored-by: Peng Cheng <chengpeng5555@outlook.com>
Co-authored-by: Yifan Xiong <xiongyf@yandex.com>
2023-02-13 15:34:37 -05:00
guoshzhao 80dcc8aaec
Benchmarks: Add Benchmark - Add FAMBench based on docker benchmark (#338)
**Description**
Integrate FAMBench into superbench based on docker implementation:
https://github.com/facebookresearch/FAMBench

The script to run all benchmarks is:
https://github.com/facebookresearch/FAMBench/blob/main/benchmarks/run_all.sh
2022-04-11 15:31:07 +08:00
rafsalas19 ff51a3cee9
Benchmarks: Add Feature - Add GPU-Burn as microbenchmark (#324)
**Description**
Modifications adding GPU-Burn to SuperBench.
- added third party submodule
- modified Makefile to make gpu-burn binary
- added/modified microbenchmarks to add gpu-burn python scripts
- modified default and azure_ndv4 configs to add gpu-burn
2022-03-16 16:20:11 +08:00
Ziyue Yang 682b2c120d
Benchmarks: Revise Code - Make data checking in gpu_copy optional (#301)
This commit makes data checking in gpu_copy optional, because it takes too long when the message size is large.
2022-02-08 10:59:27 +08:00
Ziyue Yang 74421ffee0
Benchmarks: Add Feature - Add bidirectional test support in gpu_copy benchmark (#285)
**Description**
This commit adds bidirectional tests in gpu_copy benchmark for both device-host transfer and device-device transfer, and revises related tests.
2022-01-21 13:45:37 +08:00
Hossein Pourreza b590409e0f
Benchmarks: Add Benchmark - Add mlc benchmark to superbench (#216)
**Description**
Add mlc memory bandwidth and latency micro benchmark to Superbench.

**Major Revision**
- Add mlc benchmark with test and example files
2021-12-13 13:47:42 +08:00
guoshzhao 4d85630abb
Benchmarks: Add Benchmark - Add ONNXRuntime inference benchmark based on ORT python API (#245)
**Description**
Add ONNXRuntime inference benchmark based on ORT python API.

**Major Revision**
- Add `ORTInferenceBenchmark` class to export pytorch model to onnx model and do inference
- Add tests and example for `ort-inference` benchmark
- Update the introduction docs.
2021-12-10 13:53:11 +00:00
Kaiyu Xie 6d85b03a6e
Benchmarks: Fix Typo - Fix typo in description of kernel launch overhead example (#244)
**Description**
Fix typo in description of kernel_launch_overhead.py
2021-11-25 06:28:41 +00:00
Yifan Xiong 8a00c8a03b
Benchmarks - Add TensorRT inference benchmark (#236)
__Description__

Add TensorRT inference benchmark for torchvision models.

__Major Revision__
- Measure TensorRT inference performance.
2021-11-12 15:27:16 +08:00
Yuting Jiang 54919424c3
Benchmarks: Add Benchmark - Add ib traffic validation distributed benchmark (#215)
**Description**
Add ib traffic validation distributed benchmark.

**Major Revision**
- Add ib traffic validation distributed benchmark, example and test
2021-11-10 01:18:41 +08:00
Ziyue Yang 008e0fe1d8
Benchmarks: Add Feature - Add CPU-initiated copy and dtod support to gpu-sm-copy benchmark (#230)
**Description**
This commit does the following:
1) Adds CPU-initiated copy benchmark;
2) Adds dtod benchmark;
3) Supports scanning NUMA nodes and GPUs inside the benchmark program;
4) Changes the name of gpu-sm-copy to gpu-copy.
2021-10-30 11:19:09 +08:00
guoshzhao e98a68124e
Benchmarks: Add Benchmark - Add onnx model benchmarks based on docker image. (#227)
Add RocmOnnxModelBenchmark class to run benchmarks packaged in superbench/benchmark:rocm4.3.1-onnxruntime1.9.0
2021-10-27 18:41:40 +08:00
Yuting Jiang 6003f2c2a2
Benchmarks: Add Benchmark - Add gpcnet microbenchmark (#229)
**Description**
Add gpcnet microbenchmark

**Major Revision**
- add 2 microbenchmarks for gpcnet: gpc-network-test and gpc-network-load-test
- add related test and example file
2021-10-22 08:40:01 +00:00
Yuting Jiang 49cc8f9a8c
Benchmarks: Add Benchmark - Add tcp connectivity validation microbenchmark (#217)
**Description**
Add tcp connectivity validation microbenchmark, which validates TCP connectivity between the current node and several nodes in the hostfile.

**Major Revision**
- Add tcp connectivity validation microbenchmark and related test, example
2021-10-12 23:42:12 +00:00
Ziyue Yang b97197f08e
Benchmarks: Add Benchmark - Add GPU SM copy benchmark (#169)
**Description**
This commit adds gpu_sm_copy benchmark and related tests.
2021-08-30 18:54:26 +08:00
Yuting Jiang f3d53c3d5f
Benchmarks: Add Benchmark - Add gemm flops microbenchmark for amd (#152)
**Description**
Add gemm flops microbenchmark for amd.

**Major Revision**
- Add gemm flops microbenchmark for amd.
- Add related example and test file.
2021-08-30 13:40:46 +08:00
Yuting Jiang 666e3a9471
Benchmarks: Add Benchmark - Add memory bus bandwidth performance microbenchmark for amd (#153)
**Description**
Add memory bus bandwidth performance microbenchmark for amd.

**Major Revision**
- Add memory bus bandwidth performance microbenchmark for amd.
- Add related example and test file.
2021-08-27 21:17:39 +08:00
Yuting Jiang 157b4e2dd1
Benchmarks: Add Benchmark - Revise and add rccl microbenchmark for rocm (#143)
**Description**
Add rccl bandwidth microbenchmark for rocm.

**Major Revision**
- Register rccl-bw benchmark.
2021-07-30 15:45:32 +08:00
Yuting Jiang e083a598cf
Benchmarks: Add Benchmark - Add NCCL performance benchmark (#113)
**Description**
Add NCCL performance microbenchmark.

**Major Revision**
- Add microbenchmark, example, test, config for NCCL
2021-07-26 10:54:47 +08:00
Yuting Jiang b0c5addcac
Benchmarks: Add Benchmark - Add IB Loopback performance benchmark. (#112)
**Description**
Add RDMA Loopback performance microbenchmark.

**Major Revision**
- Add microbenchmark, example, test, config for RDMA Loopback
2021-07-24 03:40:24 +08:00
Ziyue Yang db297fb4ed
Benchmarks: Add Benchmark - Add disk performance benchmark (#132)
**Description**
Add disk performance microbenchmark.

**Major Revision**
- Add microbenchmark, example, test, config for disk performance.

**Minor Revision**
- Fix bugs in executor unit test related to default enabled tests.
2021-07-23 14:49:05 +08:00
Yuting Jiang f9550bd693
Benchmarks: Add Benchmark - Add memory bandwidth benchmark for cuda. (#114)
Add microbenchmark, example, test, and config for cuda memory performance. Add cuda-samples (tagged with the cuda version) as a git submodule and update the related makefile.
2021-07-13 17:30:19 +08:00
guoshzhao 6c6f526937
Benchmarks: Add Benchmark - Add FLOPs performance benchmark for cuda. (#87)
* add cuda flops performance benchmark.
2021-06-02 09:15:58 +08:00
Yuting Jiang 83235433b2
Benchmarks: Add benchmark - add micro benchmark for cudnn test (#89)
* add python related cudnn microbenchmark
2021-06-01 22:24:35 +08:00
Yuting Jiang 18398fbaa2
Benchmarks: Add benchmark - add micro benchmark for cublas test (#80)
* add benchmark for cublas test

* format

* revise error handling and test

* add interface to read json file, revise json file path and include .json in packaging

* add random_seed in arguments

* revise preprocess of cublas benchmark

* fix lint error and note error in source code

* update according to comments

* revise input arguments from json file to custom str and convert json file to built-in dict list

* restore package config

* fix lint issue

* update platform and comments

* rename files to match source code dir and fix comments error

Co-authored-by: root <root@sb-validation-000001.51z1chmys5fuzfqyo4niepozre.bx.internal.cloudapp.net>
2021-05-31 10:31:53 +08:00
guoshzhao e977bbc17f
Benchmarks: Add Benchmark - Add kernel launch overhead benchmark. (#74)
* add kernel launch overhead benchmark.
2021-05-19 17:06:55 +08:00
Yuting Jiang b7d0ee329f
expose interface of pin memory and modify cnn configuration (#75)
2021-05-19 10:52:45 +08:00
guoshzhao 65292ae55b
Benchmarks: Code Revision - Revise the settings of CNN example models. (#65)
* revise example settings of cnn models.
2021-04-26 21:23:11 +08:00
guoshzhao 2a7ab691f1
Benchmarks: Add Benchmark - Add LSTM model benchmarks. (#60)
* Benchmarks: Add Benchmark - Add LSTM model benchmarks.
2021-04-20 10:53:44 +08:00
guoshzhao 902ea211d1
Benchmarks: Add Benchmark - Add CNN model benchmarks. (#59)
* Benchmarks: Add Benchmark - Add CNN model benchmarks.
2021-04-20 10:43:02 +08:00
guoshzhao ce3ed24ab7
Benchmarks: Code Revision - Fix some issue for BERT benchmark. (#58)
2021-04-16 13:17:42 +08:00
guoshzhao af567cf650
Benchmarks: Add Benchmark - Add GPT2 model benchmark. (#57)
* Benchmarks: Add Benchmark - Add GPT2 model benchmark.
2021-04-16 11:39:57 +08:00
Yuting Jiang 435b2d5eeb
Benchmarks: Add Benchmark - Add computation and communication overlap micro benchmark (#39)
* Benchmarks: Add Benchmark - add computation and communication overlap micro benchmark

* Benchmarks: Add benchmark - fix some format issues and typo

* Benchmarks: Add Benchmark - update according to comments and add test

* revise tests

* skip multi gpu test due to no multi gpu

Co-authored-by: v-yujiang <v-yujiang@microsoft.com>
2021-04-14 18:07:06 +08:00
guoshzhao 4664019ac3
unify arguments format by using whitespace. (#50)
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
Co-authored-by: Yifan Xiong <yifan.xiong@microsoft.com>
2021-04-12 22:02:12 +08:00
guoshzhao 4561c83555
specify the framework for matmul example. (#37)
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
2021-04-09 17:38:03 +08:00
guoshzhao f0f65a719b
Benchmarks: Add Benchmark - Add op-sharding microbenchmark, including matmul and sharding_matmul. (#36)
* add microbenchmark - sharding matmul.
* address comments.

Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
2021-04-09 15:02:12 +08:00
guoshzhao 923ce2773f
Benchmarks: Code Revision - Revise BenchmarkRegistry interfaces for integration with executor. (#33)
* revise BenchmarkRegistry interfaces.
* address comments

Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
2021-04-08 23:17:03 +08:00
guoshzhao 28de899375
Benchmarks: Add Example - Add pytorch-bert-large as the model benchmark example. (#30)
* add bert-large as the model benchmark example

* add more arguments.

* address comments.

* delete duplicated file.

Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
2021-03-29 12:39:38 +08:00