mirror of https://github.com/microsoft/DeepSpeed.git
Add support for p100 in transformer kernels (#470)
Add compute capability 6.0 to support the P100.
Parent: 1afca8f722
Commit: 7ddfda8526
setup.py: 4 lines changed
@@ -217,6 +217,8 @@ if BUILD_MASK & DS_BUILD_TRANSFORMER:
  '-gencode',
  'arch=compute_61,code=compute_61',
+ '-gencode',
+ 'arch=compute_60,code=compute_60',
  '-gencode',
  'arch=compute_70,code=compute_70',
  '-std=c++14',
  '-U__CUDA_NO_HALF_OPERATORS__',
@@ -248,6 +250,8 @@ if BUILD_MASK & DS_BUILD_TRANSFORMER:
  '-gencode',
  'arch=compute_61,code=compute_61',
+ '-gencode',
+ 'arch=compute_60,code=compute_60',
  '-gencode',
  'arch=compute_70,code=compute_70',
  '-std=c++14',
  '-U__CUDA_NO_HALF_OPERATORS__',
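Both hunks repeat the same `-gencode` pairs by hand. As an illustration only, and not how setup.py is actually written, the list could be generated from compute capabilities. The helper name `gencode_flags` and the constant `NVCC_ARCH_FLAGS` are hypothetical. The sketch assumes the same `code=compute_XX` form used in the diff, which asks nvcc to embed PTX for that virtual architecture.

```python
def gencode_flags(capabilities):
    """Build nvcc -gencode arguments for (major, minor) compute capabilities.

    Hypothetical helper for illustration; not part of DeepSpeed's setup.py.
    """
    flags = []
    for major, minor in capabilities:
        arch = f"compute_{major}{minor}"
        # Same form as the diff: arch and code both name the virtual architecture.
        flags += ["-gencode", f"arch={arch},code={arch}"]
    return flags


# Capabilities that appear in the hunks: 6.1, 6.0 (P100, added by this commit), 7.0.
NVCC_ARCH_FLAGS = gencode_flags([(6, 1), (6, 0), (7, 0)])
```

With such a helper, supporting another GPU generation would mean adding one tuple rather than editing two separate flag lists.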