Yifan Xiong
9b8df883ae
Gracefully exit on timeout (#383)
* Gracefully exit on timeout; add a corresponding log message and return code.
* Set the minimum timeout to 1 minute and increase the Ansible timeout.
2022-08-04 13:05:34 +08:00
Yuting Jiang
54919424c3
Benchmarks: Add Benchmark - Add IB traffic validation distributed benchmark (#215)
**Description**
Add IB traffic validation distributed benchmark.
**Major Revision**
- Add IB traffic validation distributed benchmark, example, and test
2021-11-10 01:18:41 +08:00
guoshzhao
37d5dfd5ed
Benchmarks: Code Revision - Revise the DockerBenchmark base class (#179)
**Description**
Revise the DockerBenchmark base class to support image pull, image rm, etc.
**Major Revision**
- image pull in _preprocess()
- image clean in _postprocess()
- execute customized commands in _benchmark()
- add unit tests
2021-09-01 22:15:42 +08:00
Yuting Jiang
b0c5addcac
Benchmarks: Add Benchmark - Add IB Loopback performance benchmark. (#112)
**Description**
Add RDMA Loopback performance microbenchmark.
**Major Revision**
- Add microbenchmark, example, test, config for RDMA Loopback
2021-07-24 03:40:24 +08:00
guoshzhao
03b41be145
Benchmarks: Fix Bug - Fix OOM issue when running PyTorch models sequentially. (#93)
* Clean up the cache.
2021-06-07 10:19:05 +08:00
guoshzhao
2d9be807a9
Benchmarks: Fix Bug - Fix return code overwrite issue (#94)
* Fix return code reset issue.
2021-06-04 18:02:12 +08:00
guoshzhao
6c6f526937
Benchmarks: Add Benchmark - Add FLOPs performance benchmark for CUDA. (#87)
* Add CUDA FLOPs performance benchmark.
2021-06-02 09:15:58 +08:00
guoshzhao
729e04ab94
Benchmarks: Code Revision - Revise MicroBenchmark class to be more flexible. (#66)
* Revise MicroBenchmark class to be more flexible.
* Use the command index, not the command itself, as the parameter.
* Apply changes agreed on in discussion.
2021-05-13 18:58:47 +08:00
guoshzhao
0172968f25
Add _post_process() interface. (#40)
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
2021-04-12 09:58:53 +08:00
guoshzhao
9388f8f56a
Add more checks for model base (#12)
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
2021-03-04 13:20:13 +08:00
guoshzhao
4c87a3e419
Benchmarks: Initialization - Add base class, registry, and result (#1)
* Benchmarks init.
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
2021-02-24 12:43:24 +08:00