7851b51ee3
* add initial tiktoken support
* add vector hash and equal for bpe ranks map
* change lambda comparator
* move phi-3-small files
* final changes
* move tiktoken files from data2 to data
* add unit test
* add tokenizer module
* merge json and tiktoken impl
* fix tiktoken encoding problem
* address comments
* remove dummy tokens

---------

Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>

||
---|---|---|
.. | ||
audio | ||
azure | ||
cuda | ||
cv2 | ||
math | ||
text | ||
tokenizer | ||
vision | ||
ocos_operators_placeholder.cc |