pdr
479491279e
Dockerfile - Add support for arm64 build ( #660 )
...
Add support for arm64 build:
- Updated dockerfile for arm64 build
- extend cpu stream compilation for neoverse
- handle onnxruntime-gpu installation
- third party builds filtering based on arch
- disable cuda decode perf build for non x86
2024-11-06 23:16:12 +00:00
pdr
59d36f7ff0
Bug Fix - Fix numa error on grace cpu in gpu-copy ( #658 )
...
The current GPU Copy BW Performance fails on Nvidia Grace systems. This
is due to the memory only numa node and thus the numa_run_on_node fails
for such nodes and halts completely.
This fix checks for the presence of assigned CPU cores for the numa
node, on checking if it has no cpu cores assigned, it skips that
specific node during the args creation and continues.
2024-11-05 23:10:51 +00:00
Yifan Xiong
61770b8908
CI/CD - Update Image Build Pipeline ( #659 )
...
**Description**
Update image build.
**Major Revision**
* Remove ROCm 6.0 image due to outdated packages
* Remove build tag for ROCm
* Preserve build cache for 30 days
2024-11-02 04:50:50 +00:00
Yuting Jiang
949f9cb406
Release - SuperBench v0.11.0 ( #654 )
...
**Description**
Cherry pick bug fixes from v0.11.0 to main
**Major Revision**
* #645
* #648
* #646
* #647
* #651
* #652
* #650
---------
Co-authored-by: hongtaozhang <hongtaozhang@microsoft.com>
Co-authored-by: Yifan Xiong <yifan.xiong@microsoft.com>
2024-10-10 09:59:47 +08:00
dependabot[bot]
9f3231e99f
Bump serve-static and express in /website ( #643 )
...
Bumps [serve-static](https://github.com/expressjs/serve-static ) and [express](https://github.com/expressjs/express ). These dependencies needed to be updated together.
Updates `serve-static` from 1.15.0 to 1.16.2
- [Release notes](https://github.com/expressjs/serve-static/releases )
- [Changelog](https://github.com/expressjs/serve-static/blob/v1.16.2/HISTORY.md )
- [Commits](expressjs/serve-static@v1.15.0...v1.16.2)
Updates `express` from 4.19.2 to 4.21.0
- [Release notes](https://github.com/expressjs/express/releases )
- [Changelog](https://github.com/expressjs/express/blob/4.21.0/History.md )
- [Commits](expressjs/express@4.19.2...4.21.0)
---
updated-dependencies:
- dependency-name: serve-static
dependency-type: indirect
- dependency-name: express
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
2024-09-20 07:46:20 +08:00
Yang Wang
96cc4d9397
Bug: Executor - Fix executor for Benchmark Execution Without Explicit Framework Field ( #636 )
...
**Description**
Fix executor for Benchmark Execution Without Explicit Framework Field
2024-08-20 16:52:20 -07:00
Yuting Jiang
7af75df392
Bug Fix: Data Diagnosis - Fix bug of failure test and warning of pandas in data diagnosis ( #638 )
...
**Description**
Fix bug of failure test and warning of pandas in data diagnosis.
**Major Revision**
- fix warning of pandas in replace and fillna due to type downcast
- fix bug of failure check function only check one matched metric rather
than all matched metrics
- fix bug when converting regex into str of metrics when there're more
than one match group
2024-08-16 09:04:24 +08:00
Yang Wang
46a5792915
Bug Fix - Update Docker Exec Command for Persistent HPCX Environment ( #635 )
...
Add 10-hpcx.sh to /etc/profile.d
Update the Docker exec command to ensure a persistent HPCX environment.
2024-08-13 16:35:01 +00:00
Yang Wang
9de841bc95
Use `types-setuptools` as `types-pkg_resources` is Yanked ( #637 )
...
* https://pypi.org/project/types-pkg-resources/
* Use types-setuptools instead
2024-08-08 22:30:37 +08:00
Yuting Jiang
2101e933cc
CI/CD - Fix MSCCL build error in CUDA12.4 docker build pipeline ( #633 )
...
**Description**
Fix MSCCL build error in CUDA12.4 docker build pipeline due to OOM
issue.
2024-07-28 23:43:06 +00:00
Yuting Jiang
e304cf1572
Benchmarks: Micro benchmarks - add support for NVIDIA L4/L40/L40s GPUs in gemm-flops ( #634 )
...
**Description**
Add support GPU ARCH 8.9 for NVIDIA L4/L40/L40s GPUs in gemm-flops.
2024-07-26 02:42:17 +00:00
dependabot[bot]
4e27142a59
Bump express from 4.18.2 to 4.19.2 in /website ( #618 )
...
Bumps [express](https://github.com/expressjs/express ) from 4.18.2 to 4.19.2.
- [Release notes](https://github.com/expressjs/express/releases )
- [Changelog](https://github.com/expressjs/express/blob/master/History.md )
- [Commits](expressjs/express@4.18.2...4.19.2)
---
updated-dependencies:
- dependency-name: express
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
2024-07-26 09:12:11 +08:00
dependabot[bot]
b4945fb29c
Bump ws from 6.2.2 to 6.2.3 in /website ( #629 )
...
Bumps [ws](https://github.com/websockets/ws ) from 6.2.2 to 6.2.3.
- [Release notes](https://github.com/websockets/ws/releases )
- [Commits](websockets/ws@6.2.2...6.2.3)
---
updated-dependencies:
- dependency-name: ws
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
2024-07-25 15:59:48 +08:00
omahs
a4c87da0ac
Docs - fix typos ( #628 )
...
Docs - fix typos
2024-07-25 03:49:19 +00:00
dependabot[bot]
4102302a96
Bump ip from 1.1.5 to 1.1.9 in /website ( #610 )
...
Bumps [ip](https://github.com/indutny/node-ip ) from 1.1.5 to 1.1.9.
- [Commits](indutny/node-ip@v1.1.5...v1.1.9)
---
updated-dependencies:
- dependency-name: ip
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
2024-07-25 02:15:52 +00:00
dependabot[bot]
6e556d76e8
Bump follow-redirects from 1.14.8 to 1.15.6 in /website ( #613 )
...
Bumps [follow-redirects](https://github.com/follow-redirects/follow-redirects ) from 1.14.8 to 1.15.6.
- [Release notes](https://github.com/follow-redirects/follow-redirects/releases )
- [Commits](follow-redirects/follow-redirects@v1.14.8...v1.15.6)
---
updated-dependencies:
- dependency-name: follow-redirects
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
2024-07-24 23:59:58 +00:00
Yifan Xiong
1362732c79
Docs - Add BibTeX in README and repo ( #632 )
...
Add BibTeX for citation in README and repo.
2024-07-23 18:31:21 -07:00
Yang Wang
9a3ce39d5a
Update omegaconf version to 2.3.0 ( #631 )
...
Update `omegaconf` version to
[2.3.0](https://pypi.org/project/omegaconf/2.3.0/ ) as omegaconf 2.0.6
has a non-standard dependency specifier PyYAML>=5.1.*. pip 24.1 will
enforce this behaviour change.
Discussion can be found at https://github.com/pypa/pip/issues/12063 .
2024-07-23 14:46:28 -07:00
Yuting Jiang
7435f10a22
Dockerfile - Add CUDA 12.4 dockerfile ( #619 )
...
**Description**
Add CUDA 12.4 dockerfile.
**Major Revision**
- upgrade nvidia docker into 23.04
**Minor Revision**
- upgrade hpcx into 2.18
2024-04-22 06:36:19 +00:00
Yuting Jiang
dc3846cbd4
Dockerfile - Upgrade mlc to v3.11 ( #620 )
...
**Description**
Upgrade mlc to v3.11.
2024-04-18 10:59:36 +08:00
Ziyue Yang
cc89ee591c
Benchmarks: Revise Code - Add hipblasLt tuning to dist-inference cpp implementation ( #616 )
...
**Description**
Adds hipblasLt tuning to dist-inference cpp implementation.
2024-04-02 09:56:33 +08:00
Yang Wang
eeaa9b1ac9
Bug Fix - Bug fix for cuda 12.2 dockerfile LD_LIBRARY_PATH issue ( #614 )
...
**Description**
Cuda 12.2 image will report undfined symbol error due to incomplete
LD_LIBRARY_PATH:
![image](https://github.com/microsoft/superbenchmark/assets/25875482/1a7c48c7-cb6b-4e3a-abbe-dde23007a96b )
### How to reproduce:
1. Deploy sb with cuda12.2 image
```
sb deploy -f local.ini -i superbench/superbench:v0.10.0-cuda12.2
```
2. Enter to the container
```
sudo docker exec -it sb-workspace bash
```
3. Execute `mpirun`:
```
root@sb-container:~# mpirun
mpirun: symbol lookup error: mpirun: undefined symbol: opal_libevent2022_event_base_loop
```
### Fix to fix
* Append hpcx_load into /etc/bash.bashrc for updaing env LD_LIBRARY_PATH in each time
---------
2024-03-21 15:05:55 +00:00
Yifan Xiong
2c88db907f
Release - SuperBench v0.10.0 ( #607 )
...
**Description**
Cherry-pick bug fixes from v0.10.0 to main.
**Major Revisions**
* Benchmarks: Microbenchmark - Support different hipblasLt data types in dist_inference #590
* Benchmarks: Microbenchmark - Support in-place for NCCL/RCCL benchmark #591
* Bug Fix - Fix NUMA Domains Swap Issue in NDv4 Topology File #592
* Benchmarks: Microbenchmark - Add data type option for NCCL and RCCL tests #595
* Benchmarks: Bug Fix - Make metrics of dist-inference-cpp aligned with PyTorch version #596
* CI/CD - Add ndv5 topo file #597
* Benchmarks: Microbenchmark - Improve AMD GPU P2P performance with fine-grained GPU memory #593
* Benchmarks: Build Pipeline - fix nccl and nccl test version to 2.18.3 to resolve hang issue in cuda12.2 docker #599
* Dockerfile - Bug fix for rocm docker build and deploy #598
* Benchmarks: Microbenchmark - Adapt to hipblasLt data type changes #603
* Benchmarks: Micro benchmarks - Update hipblaslt metric unit to tflops #604
* Monitor - Upgrade pyrsmi to amdsmi python library. #601
* Benchmarks: Micro benchmarks - add fp8 and initialization for hipblaslt benchmark #605
* Dockerfile - Add rocm6.0 dockerfile #602
* Bug Fix - Bug fix for latest megatron-lm benchmark #600
* Docs - Upgrade version and release note #606
Co-authored-by: Ziyue Yang <ziyyang@microsoft.com>
Co-authored-by: Yang Wang <yangwang1@microsoft.com>
Co-authored-by: Yuting Jiang <yutingjiang@microsoft.com>
Co-authored-by: guoshzhao <guzhao@microsoft.com>
2024-01-08 05:40:52 +00:00
Ziyue Yang
2c2096ed83
Benchmark: Revision - Fix -O2 option passing in gpu_copy ROCm build ( #589 )
...
**Description**
`add_compile_options` will not work for ROCm build, change it to setting
`CMAKE_CXX_FLAGS`.
2023-12-11 04:34:51 +00:00
Ziyue Yang
719a427fe7
Benchmarks: Microbenchmark - Add distributed inference benchmark cpp implementation ( #586 )
...
**Description**
Add distributed inference benchmark cpp implementation.
2023-12-11 06:53:51 +08:00
Yuting Jiang
1f5031bd74
Dockerfile - Upgrade to rocm5.7 dockerfile ( #587 )
...
**Description**
upgrade to rocm5.7 dockerfile.
---------
Co-authored-by: yukirora <yuting.jiang@microsoft.com>
2023-12-09 17:41:12 +00:00
Ziyue Yang
4fa60be7cd
Benchmarks: Micro benchmark - Add one-to-all, all-to-one, all-to-all support to gpu_copy_bw_performance ( #588 )
...
**Description**
Add one-to-all, all-to-one, all-to-all support to
gpu_copy_bw_performance, and fix performance bug in gpu_copy
2023-12-08 23:22:38 +08:00
Ziyue Yang
6ef3a0110f
Benchmarks: Add MSCCL Support for Nvidia GPU ( #584 )
...
**Description**
Add MSCCL support for Nvidia GPU
2023-12-07 19:57:28 +08:00
Yuting Jiang
dd5a6329ed
Benchmarks: Add benchmark: Megatron-LM/Megatron-Deepspeed GPT pretrain benchmark ( #582 )
...
**Description**
Megatron-LM/Megatron-Deepspeed GPT pretrain benchmark
2023-12-07 09:37:09 +08:00
Ziyue Yang
254ea7feba
Benchmarks: Micro benchmark - Add graph mode in NCCL/RCCL benchmarks for latency metrics ( #583 )
...
**Description**
Revise NCCL/RCCL benchmarks to graph mode add latency metrics.
2023-12-05 16:48:13 +08:00
Yuting Jiang
9ae8c67093
Benchmarks: micro benchmark - Support cpu-gpu and gpu-cpu in ib-validation ( #581 )
...
**Description**
Benchmarks: micro benchmark - Support cpu-gpu and gpu-cpu in
ib-validation
**Major Revision**
- Support cpu-gpu and gpu-cpu in ib-validation
**Minor Revision**
- support multi msg size, multi direction, multi ib commands in
ib-validation
2023-12-04 22:20:46 +08:00
guoshzhao
028819b388
Monitor - Add support for AMD GPU. ( #580 )
...
**Description**
Add AMD support in monitor.
**Major Revision**
- Add library pyrsmi to collect metrics.
- Currently can get device_utilization, device_power, device_used_memory
and device_total_memory.
2023-11-27 18:45:56 +08:00
Yifan Xiong
1ad1c21c38
Dockerfile - Upgrade Docker image to CUDA 12.2 ( #577 )
...
Upgrade Docker image to CUDA 12.2 for H100:
* upgrade base image to 23.10
* fix onnxruntime version in python3.10
* fix compilation errors
2023-11-22 13:48:18 +00:00
Yuting Jiang
2235e084ab
Benchmarks: Micro benchmark - add initialization options for rocm gemm flops ( #578 )
...
**Description**
add initialization options for rocm gemm flops.
2023-11-22 12:52:22 +00:00
Yuting Jiang
79089b6517
Benchmarks: Micro benchmark - Add hipBLASLt function benchmark ( #576 )
...
**Description**
hipblaslt function benchmark and rebase cublaslt function benchmark.
2023-11-22 19:48:10 +08:00
guoshzhao
9f4880cb8e
Analyzer - Generate baseline given results from multiple nodes. ( #575 )
...
**Description**
Generate baseline given results from multiple nodes.
**Major Revision**
- Add sub command `sb result generate-baseline`
- Add UT and docs
---------
Co-authored-by: 454314380 <454314380@qq.com>
Co-authored-by: Yuting Jiang <yutingjiang@microsoft.com>
2023-11-22 14:42:32 +08:00
Yuting Jiang
f53d941a22
Benchmarks: micro benchmarks - add int8 support for cublaslt function ( #574 )
...
**Description**
add int8 support for cublaslt function.
2023-11-20 11:21:20 +08:00
Yuting Jiang
c7800bb8e0
Bug Fix - remove cp ptx file command in gpu burn test ( #567 )
...
**Description**
remove cp ptx file in gpu burn test since the command is run inside
self.args.bin_dir dir.
d246bab430/superbench/benchmarks/micro_benchmarks/micro_base.py (L183)
2023-11-14 03:52:56 +00:00
dependabot[bot]
ce3737f98b
Bump @babel/traverse from 7.14.5 to 7.23.2 in /website ( #566 )
...
Bumps [@babel/traverse](https://github.com/babel/babel/tree/HEAD/packages/babel-traverse ) from 7.14.5 to 7.23.2.
- [Release notes](https://github.com/babel/babel/releases )
- [Changelog](https://github.com/babel/babel/blob/main/CHANGELOG.md )
- [Commits](https://github.com/babel/babel/commits/v7.23.2/packages/babel-traverse )
---
updated-dependencies:
- dependency-name: "@babel/traverse"
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
2023-11-07 10:36:42 +08:00
dependabot[bot]
07477c3bae
Bump postcss from 8.3.5 to 8.4.31 in /website ( #564 )
...
Bumps [postcss](https://github.com/postcss/postcss ) from 8.3.5 to 8.4.31.
- [Release notes](https://github.com/postcss/postcss/releases )
- [Changelog](https://github.com/postcss/postcss/blob/main/CHANGELOG.md )
- [Commits](postcss/postcss@8.3.5...8.4.31)
---
updated-dependencies:
- dependency-name: postcss
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
2023-11-05 11:35:49 +00:00
Yuting Jiang
d246bab430
Dockerfile - update mlc version into 3.10 for cuda and rocm dockerfiles ( #562 )
...
**Description**
Update mlc version into 3.10 for cuda and rocm dockerfiles to be
consistent with cuda12 dockerfile
Co-authored-by: yukirora <yuting.jiang@microsoft.com>
2023-10-23 11:21:17 +08:00
Yuting Jiang
27a10811af
Benchmarks: micro benchmark - source code for evaluating NVDEC decoding performance ( #560 )
...
**Description**
source code for evaluating NVDEC decoding performance.
---------
Co-authored-by: yukirora <yuting.jiang@microsoft.com>
2023-08-22 10:56:33 +00:00
Yuting Jiang
6c0205cece
Benchmarks: micro benchmarks - add source code for DirectXRenderPerf ( #549 )
...
**Description**
add source code for DirectXRenderPerf.
---------
Co-authored-by: yukirora <yuting.jiang@microsoft.com>
2023-08-18 05:17:04 +00:00
pnunna93
67f2aa7237
Benchmarks: model benchmarks - change torch.distributed.launch to torchrun ( #556 )
...
This PR has following changes
- torch.distributed.launch changed to torchrun. torch.distributed.launch
is deprecated in latest Pytorch and is recommended to move to torchrun -
https://pytorch.org/docs/stable/elastic/run.html
- Changes to AMD GPU detection logic. The AMD GPU detection logic throws
warning when containers have only renderD in /dev/dri, this change would
resolve those warnings
---------
Co-authored-by: Yuting Jiang <yutingjiang@microsoft.com>
2023-08-08 13:03:32 +08:00
Yuting Jiang
e1df877bfe
Release - SuperBench v0.9.0 ( #558 )
...
**Description**
Cherry-pick bug fixes from v0.9.0 to main.
**Major Revision**
- CI/CD: pipeline - clean more disk space to fix rocm building image
pipeline(#555 )
- Benchmarks: bug fix - use absolute path for input file in
DirectXEncodingLatency(#554 )
- CI/CD - add push win docker image on release branch in pipeline (#552 )
- Docs - Upgrade version and release note(#557 )
2023-07-27 10:42:31 +08:00
dependabot[bot]
466b477e9d
Bump semver from 5.7.1 to 5.7.2 in /website ( #550 )
...
Bumps [semver](https://github.com/npm/node-semver ) from 5.7.1 to 5.7.2.
- [Release notes](https://github.com/npm/node-semver/releases )
- [Changelog](https://github.com/npm/node-semver/blob/v5.7.2/CHANGELOG.md )
- [Commits](npm/node-semver@v5.7.1...v5.7.2)
---
updated-dependencies:
- dependency-name: semver
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
2023-07-24 17:07:35 +08:00
Yuting Jiang
e8ac0b1e28
Benchmarks: micro benchmarks - add python code for DirectXGPUEncodingLatency ( #548 )
...
**Description**
add python code for DirectXGPUEncodingLatency.
2023-07-06 15:31:28 +08:00
Yuting Jiang
c8c079c2af
Benchmarks: micro benchmarks - add python code for DirectXGPUCopy ( #546 )
...
**Description**
add python code for DirectXGPUCopy.
2023-07-06 00:15:32 +08:00
Yuting Jiang
af4cfd5bbf
Benchmarks: micro benchmarks - add python code for DirecXGPUMemBw ( #547 )
...
**Description**
add python code for DirecXGPUMemBw.
2023-07-05 22:07:13 +08:00
Yuting Jiang
f1d608aef7
Benchmarks: micro benchmarks - add python code for DirectXGPUCoreFlops ( #542 )
...
**Description**
add python code for DirectX core flops and init DirectX test pipeline.
**Major Revision**
- add python code for DirectX core flops
- init DirectX test pipeline
**Minor Revision**
- add test for DirectX core flops
2023-07-05 16:56:21 +08:00