Commit Graph

11 Commits

Author SHA1 Message Date
Yifan Xiong 9b8df883ae
Gracefully exit when timeout (#383)
* Gracefully exit when timeout, add corresponding log and return code.
* Set minimum timeout to 1 minute and enlarge Ansible timeout.
2022-08-04 13:05:34 +08:00
Yuting Jiang 54919424c3
Benchmarks: Add Benchmark - Add ib traffic validation distributed benchmark (#215)
**Description**
Add ib traffic validation distributed benchmark.

**Major Revision**
- Add ib traffic validation distributed benchmark, example and test
2021-11-10 01:18:41 +08:00
guoshzhao 37d5dfd5ed
Benchmarks: Code Revision - revise the DockerBenchmark base class (#179)
**Description**
Revise the DockerBenchmark base to support image pull, image rm etc.

**Major Revision**
- image pull in _preprocess()
- image clean in _postprocess()
- execute customized commands in _benchmark()
- add unit tests
2021-09-01 22:15:42 +08:00
Yuting Jiang b0c5addcac
Benchmarks: Add Benchmark - Add IB Loopback performance benchmark. (#112)
**Description**
Add RDMA Loopback performance microbenchmark.

**Major Revision**
- Add microbenchmark, example, test, config for RDMA Loopback
2021-07-24 03:40:24 +08:00
guoshzhao 03b41be145
Benchmarks: Fix Bug - Fix OOM issue when run pytorch models sequentially. (#93)
* Clean up the cache.
2021-06-07 10:19:05 +08:00
guoshzhao 2d9be807a9
Benchmarks: Fix Bug - Fix return code overwrite issue (#94)
* fix return code reset issue
2021-06-04 18:02:12 +08:00
guoshzhao 6c6f526937
Benchmarks: Add Benchmark - Add FLOPs performance benchmark for cuda. (#87)
* add cuda flops performance benchmark.
2021-06-02 09:15:58 +08:00
guoshzhao 729e04ab94
Benchmarks: Code Revision - Revise MicroBenchmark class to be more flexible. (#66)
* Revise MicroBenchmark class to be more flexible.
* use command index not the command as the parameter.
* changes according to discussion.
2021-05-13 18:58:47 +08:00
guoshzhao 0172968f25
add _post_process() interface. (#40)
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
2021-04-12 09:58:53 +08:00
guoshzhao 9388f8f56a
add more checks for model base (#12)
Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
2021-03-04 13:20:13 +08:00
guoshzhao 4c87a3e419
Benchmarks: Initialization - Add base class, registry, and result (#1)
* benchmarks init.

Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
2021-02-24 12:43:24 +08:00