Родитель
de1023bb36
Коммит
dab1a810fd
|
@ -8,7 +8,7 @@ Proceedings of the International Symposium on Information Theory (ISIT), 2021. [
|
|||
|
||||
Our hope is that this dataset will enable further research progress by allowing objective comparisons between the algorithms. The dataset is represented by two files:
|
||||
|
||||
- **Centers.txt** This files contains 10,000 random strings of length $110$ in the alphabet {A,C,G,T}.
|
||||
- **Centers.txt** This files contains 10,000 random strings of length 110 in the alphabet {A,C,G,T}.
|
||||
- **Clusters.txt** This file contains 269,709 noisy nanopore reads of DNA sequences corresponding to strings in the file **Centers.txt**. Reads are arranged into clusters separated by lines of multiple "=" signs. Clusters follow the same order as the strings in the file **Centers.txt**, i.e., the first cluster contains reads corresponding to the DNA sequence represented by first string in *Centers.txt*, the second cluster contains reads corresponding to the DNA sequence resented by the second string in **Centers.txt**, etc.
|
||||
|
||||
|
||||
|
|
Загрузка…
Ссылка в новой задаче