* Update README.rst

* Update cuda_score_updater.cu

Co-authored-by: James Lamb <jaylamb20@gmail.com>
This commit is contained in:
Nikita Titov 2022-08-03 19:28:22 +03:00 коммит произвёл GitHub
Родитель e2dfcd69e6
Коммит c7102e56b2
Не найден ключ, соответствующий данной подписи
Идентификатор ключа GPG: 4AEE18F83AFDEB23
2 изменённых файлов: 2 добавлений и 2 удалений

Просмотреть файл

@ -125,7 +125,7 @@ All requirements from `Build from Sources section <#build-from-sources>`__ apply
Recently, a new CUDA version with better efficiency is implemented as an experimental feature. To build the new CUDA version, replace ``--cuda`` with ``--cuda-exp`` in the above commands. Please note that new version requires **CUDA** 10.0 or later libraries. Note that this new version uses twice the memory, since it stores data row-wise as well as column-wise in memory to improve performance (see this `issue <https://github.com/microsoft/LightGBM/issues/5318>`__ for discussion).
To use the regular or experimental cuda versions within Python, pass ``{"device":"cuda"}`` or ``{"device":"cuda_exp"}`` respectively as parameters.
To use the regular or experimental CUDA versions within Python, pass ``{"device": "cuda"}`` or ``{"device": "cuda_exp"}`` respectively as parameters.
Build HDFS Version
~~~~~~~~~~~~~~~~~~

Просмотреть файл

@ -22,7 +22,7 @@ __global__ void AddScoreConstantKernel(
void CUDAScoreUpdater::LaunchAddScoreConstantKernel(const double val, const size_t offset) {
const int num_blocks = (num_data_ + num_threads_per_block_) / num_threads_per_block_;
Log::Warning("adding init score = %f", val);
Log::Debug("Adding init score = %lf", val);
AddScoreConstantKernel<<<num_blocks, num_threads_per_block_>>>(val, offset, num_data_, cuda_score_);
}