superbenchmark

History

Yifan Xiong 2c88db907f Release - SuperBench v0.10.0 (#607)	2024-01-08 05:40:52 +00:00

Description
Cherry-pick bug fixes from v0.10.0 to main.

Major Revisions
* Benchmarks: Microbenchmark - Support different hipblasLt data types in dist_inference #590
* Benchmarks: Microbenchmark - Support in-place for NCCL/RCCL benchmark #591
* Bug Fix - Fix NUMA Domains Swap Issue in NDv4 Topology File #592
* Benchmarks: Microbenchmark - Add data type option for NCCL and RCCL tests #595
* Benchmarks: Bug Fix - Make metrics of dist-inference-cpp aligned with PyTorch version #596
* CI/CD - Add ndv5 topo file #597
* Benchmarks: Microbenchmark - Improve AMD GPU P2P performance with fine-grained GPU memory #593
* Benchmarks: Build Pipeline - fix nccl and nccl test version to 2.18.3 to resolve hang issue in cuda12.2 docker #599
* Dockerfile - Bug fix for rocm docker build and deploy #598
* Benchmarks: Microbenchmark - Adapt to hipblasLt data type changes #603
* Benchmarks: Micro benchmarks - Update hipblaslt metric unit to tflops #604
* Monitor - Upgrade pyrsmi to amdsmi python library. #601
* Benchmarks: Micro benchmarks - add fp8 and initialization for hipblaslt benchmark #605
* Dockerfile - Add rocm6.0 dockerfile #602
* Bug Fix - Bug fix for latest megatron-lm benchmark #600
* Docs - Upgrade version and release note #606

Co-authored-by: Ziyue Yang <ziyyang@microsoft.com>
Co-authored-by: Yang Wang <yangwang1@microsoft.com>
Co-authored-by: Yuting Jiang <yutingjiang@microsoft.com>
Co-authored-by: guoshzhao <guzhao@microsoft.com>
monitor	Monitor: Integration - Integrate monitor into Superbench (#259)	2021-12-10 09:33:13 +00:00
cuda_memory_d2d_bw.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
cuda_memory_d2h_bw.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
cuda_memory_h2d_bw.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
diagnosis_summary.html	Release - SuperBench v0.6.0 (#409)	2022-09-06 18:06:05 +08:00
diagnosis_summary.json	Release - SuperBench v0.6.0 (#409)	2022-09-06 18:06:05 +08:00
diagnosis_summary.jsonl	Release - SuperBench v0.6.0 (#409)	2022-09-06 18:06:05 +08:00
diagnosis_summary.md	Release - SuperBench v0.6.0 (#409)	2022-09-06 18:06:05 +08:00
diagnosis_summary.xlsx	Analyzer - Add failure check feature in data diagnosis (#378)	2022-08-01 12:35:35 +08:00
diagnosis_summary_json.json	Release - SuperBench v0.6.0 (#409)	2022-09-06 18:06:05 +08:00
disk_performance.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
dist_inference.log	Release - SuperBench v0.10.0 (#607)	2024-01-08 05:40:52 +00:00
gpcnet_network_load.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
gpcnet_network_load_error.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
gpcnet_network_test.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
gpcnet_network_test_error.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
gpu_burn.log	Benchmarks: Add Feature - Add GPU-Burn as microbenchmark (#324)	2022-03-16 16:20:11 +08:00
gpu_copy_bw_performance.log	Benchmarks: Micro benchmark - Add one-to-all, all-to-one, all-to-all support to gpu_copy_bw_performance (#588)	2023-12-08 23:22:38 +08:00
hpl_results.log	Adding HPL benchmark (#482)	2023-03-21 16:44:08 +00:00
ib_loopback_8M_size.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
ib_loopback_all_sizes.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
ib_traffic_topo_aware_expected_config	Support topo-aware IB performance validation (#373)	2022-07-26 16:56:19 -07:00
ib_traffic_topo_aware_hostfile	Support topo-aware IB performance validation (#373)	2022-07-26 16:56:19 -07:00
ib_traffic_topo_aware_ibnetdiscover.txt	Support topo-aware IB performance validation (#373)	2022-07-26 16:56:19 -07:00
ib_traffic_topo_aware_ibstat.txt	Release - SuperBench v0.6.0 (#409)	2022-09-06 18:06:05 +08:00
megatron_deepspeed.log	Benchmarks: Add benchmark: Megatron-LM/Megatron-Deepspeed GPT pretrain benchmark (#582)	2023-12-07 09:37:09 +08:00
mpi_pattern.txt	Runner - Generate host groups file in mpi mode (#458)	2023-01-04 19:49:14 +08:00
nccl_allgather.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
nccl_allreduce.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
nccl_alltoall.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
nccl_broadcast.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
nccl_reduce.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
nccl_reducescatter.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
results_summary.html	Analyzer: Add feature - Add result summary in excel,md,html format (#320)	2022-03-24 15:32:01 +08:00
results_summary.md	Analyzer: Add feature - Add result summary in excel,md,html format (#320)	2022-03-24 15:32:01 +08:00
results_summary.xlsx	Analyzer: Add feature - Add result summary in excel,md,html format (#320)	2022-03-24 15:32:01 +08:00
rocm_memory_d2h_bw.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
rocm_memory_h2d_bw.log	Release - SuperBench v0.4.0 (#278)	2021-12-30 16:24:00 +08:00
streamResult.log	Adding Stream Benchmark (#473)	2023-02-13 15:34:37 -05:00
tensorrt_inference.1.log	Benchmarks - Update result parsing in tensorrt inference (#541)	2023-06-30 11:22:46 +08:00
tensorrt_inference.2.log	Benchmarks - Update result parsing in tensorrt inference (#541)	2023-06-30 11:22:46 +08:00
test.yaml	Runner: Support `topo-aware` and `k-batch` pattern in 'mpi' mode (#437)	2023-01-03 10:28:35 +00:00