This commit is contained in:
Fei Gao 2017-12-18 20:34:33 +08:00 коммит произвёл GitHub
Родитель 3278bbc5f2
Коммит de9cca422c
Не найден ключ, соответствующий данной подписи
Идентификатор ключа GPG: 4AEE18F83AFDEB23
1 изменённых файлов: 4 добавлений и 4 удалений

Просмотреть файл

@ -1,4 +1,4 @@
#LightLDA
# LightLDA
LightLDA is a distributed system for large scale topic modeling. It implements a distributed sampler that enables very large data sizes and models. LightLDA improves sampling throughput and convergence speed via a fast O(1) metropolis-Hastings algorithm, and allows small cluster to tackle very large data and model sizes through model scheduling and data parallelism architecture. LightLDA is implemented with C++ for performance consideration.
@ -6,7 +6,7 @@ We have sucessfully trained big topic models (with trillions of parameters) on b
For documents, please view our website [http://www.dmtk.io](http://www.dmtk.io).
##Why LightLDA
## Why LightLDA
The highlight features of LightLDA are
@ -14,13 +14,13 @@ The highlight features of LightLDA are
* **Fast**: The sampler can sample millions of tokens per second per multi-core node.
* **Lightweight**: Such big tasks can be trained with as few as tens of machines.
##Quick Start
## Quick Start
Run ``` $ sh build.sh ``` to build lightlda.
Run ``` $ sh example/nytimes.sh ``` for a simple example.
##Reference
## Reference
Please cite LightLDA if it helps in your research: