* Fix CodeGenTokenizer issues and the related code refactoring.
* refactor the trie-tree
* temp check-ins
* code complete
* correctness fixing
* Update _hf_cvt.py
* more test cases fixing
* more refinement
* linux crash fixing
* Update test_autotokenizer.py
* Nodes can be called concurrently and Compute needs to be stateless due to that.
Update the kernels to make Compute const.
* Fix test that uses ustring.h.
Would be better to not have duplicate declarations for GetTensorMutableDataString and FillTensorDataString in ustring.h and string_tensor.h.
* initial PR
* add the attributes for op
* cmake update
* add the missing symbol
* add a unit test case
* fix the unit test
* fix some corner case.
* format Python code with autopep8
* fix the break in release pipeline
* code cleanup and the warnings fixing.
* Update ci.yml for Azure Pipelines
* Update ci.yml for Azure Pipelines
* fix linux build
* one more fixing
* again?
* fixing for macOS