Wenbing Li
b045e66396
make onnx package to be optional. ( #653 )
...
* putting onnx package to be optional
* update the ci.yml
* add more message of missing ONNX package
2024-02-15 14:09:04 -08:00
Wenbing Li
394377b988
add a Mistral model unit test case ( #593 )
...
* add a Mistral model unit test case
* add the dependency
* Update test_autotokenizer.py
* Update test_autotokenizer.py
* Update test_autotokenizer.py
* Update test_autotokenizer.py
* Update test_autotokenizer.py
* Update test_autotokenizer.py
* Update test_autotokenizer.py
* Update test_autotokenizer.py 11
* Update ci.yml for Azure Pipelines
* Update ci.yml
* Update test_autotokenizer.py 12
* Update test_autotokenizer.py 13
2023-11-07 09:53:54 -08:00
Sayan Shaw
92d17b0a52
remove ftfy imports and add warning if not installed ( #578 )
...
Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
2023-10-19 14:09:55 -07:00
Sayan Shaw
4d2930e35a
Fix newline and apostrophe handling for BPE ( #574 )
...
* Fix certain BPE issues
* minor changes
* change newline handling for unix/linux/windows builds
* small test case
* move apostrophe testing into test_cliptok.py
* fix CLIP inconsistency with ftfy install
* add ftfy to requirements-dev.txt
* remove HF CLIP bug testing
---------
Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
2023-10-19 00:17:34 -07:00
Wenbing Li
978ada6d60
Add TrieTokenizer for RWKV-like LLM models ( #509 )
...
* Add TrieTokenizer for RWKV-like LLM models
* add more tests
* fix the windows build
* downloading file instead of check in the vocab file
* a small bug fixing
2023-08-08 16:47:38 -07:00
Wenbing Li
1c04e95bda
upgrade all dependency versions ( #466 )
...
* release the dependency package version
* upgrade the dependency versions
* more fixing for 3.11
* Update ci.yml
* upgrade torch version in MacOS
* fix torch version in MacOS
* upgrade Python support versions
* pybind11 switch URL mode
* more URL modes
2023-06-03 20:09:41 -07:00
Sayan Shaw
77cf3e6d2c
Remove onnx<1.14 from requirements.txt ( #447 )
...
* remove onnx<1.14 from requirements.txt
* downgrade protobuf
* move protobuf req to requirements-dev.txt
---------
Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2023-05-21 23:34:09 -07:00
Wenbing Li
0f45fef2d9
Compatible with onnxruntime-gpu package ( #410 )
...
* be compatible without onnxruntime-gpu version
* some fixing
2023-04-26 17:17:23 -07:00
Edward Chen
e8611d036d
Include requirements.txt from requirements-dev.txt. ( #328 )
...
Include requirements.txt from requirements-dev.txt so that pip has context to avoid installing incompatible dependencies.
2022-12-13 10:21:32 -08:00
Wenbing Li
8df25bf5a5
Fix ci pipelines failure caused by the latest transformer package ( #324 )
...
* test ci pipeline
* transformer 4.25 broken on pytorch 1.9/1.10
2022-12-06 15:21:06 -08:00
Wenbing Li
72790873e5
suppress the protobuf warning. ( #299 )
2022-10-05 00:29:46 -07:00
shaahji
a7deb7b52c
Update CI build workflow matrix
...
Upgraded the onnxruntime headers from v1.6 to v1.9
Update workflow matrix so it's consistent across the platforms and using
newer versions of the dependencies. Current supported matrix -
+------------+------------------------+-----------------------+-----------------------+----------------------+
|Python | 3.7 | 3.8 | 3.9 | 3.10 |
+------------+------------------------+-----------------------+-----------------------+----------------------+
|Onnxruntime | 1.9.0 (Sept 22, 2021) | 1.10.0 (Dec 7, 2021) | 1.11.0 (Mar 26, 2022) | 1.12.1 (Aug 4, 2022) |
|Torch | 1.9.1 (Sept 22, 2021) | 1.10.0 (Oct 21, 2021) | 1.11.0 (Mar 10, 2022) | 1.12.1 (Aug 5, 2022) |
|TorchVision | 0.10.1 (Jun 15, 2021) | 0.11.1 (Oct 21, 2021) | 0.12.0 (Mar 10, 2022) | 0.13.1 (Aug 5, 2022) |
|TorchAudio | 0.9.0 (Jun 15, 2021) | 0.10.0 (Oct 21, 2021) | 0.11.0 (Mar 10, 2022) | 0.12.1 (Aug 5, 2022) |
+------------+------------------------+-----------------------+-----------------------+----------------------+
Release versions strictly follow the convention of onnxruntime being one
release ahead of all its dependencies.
2022-09-02 23:05:58 -07:00
Mojimi
dddd85397d
Add android pipeline ( #183 )
...
* build locally success
* update
* fix pipeline
* fix pipeline
* fix pipeline
* fix tool chain file
* validate quickly
* fix tool chain
* update
* finished
* fix bug
* fix bugs
* remove tree
* update android ndk
* fix bugs
* remove java install
* bring back build
* fix model
* resolve conflict
* remove uncessary file
* remove tensorflow_text version 2.6.0
Co-authored-by: Ze Tao <zetao@microsoft.com>
2021-11-09 10:25:03 -08:00
Xavier Dupré
a98c29f6d2
Implement custom operators for sentancepiece ( #41 )
...
* implements sentancepiecetokozenizer
* add ragged to sparse
* move one input to attribute
2021-01-27 23:55:50 +01:00
Wenbing Li
c7b2f864c6
Add Huggingface GPT2Tokenizer Support ( #35 )
...
* initialize a bbpe tokenizer
* add the json library.
* gpt2 tokenizer cpp implementation.
* Tom/add tutorial (#32 )
* Added getting started instructions for Windows
Signed-off-by: Tom Wildenhain <tomwi@microsoft.com>
* Created a tutorial for converting models with custom ops. WIP
* Removed long outputs
* Changed to keras syntax and added setup instructions
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
* rename gpt2 test case file
* polish the symbol names in the sources
* polish it again.
* fix the build issue on macos
* another fixing
* another fixing 3
Co-authored-by: TomWildenhain-Microsoft <67606533+TomWildenhain-Microsoft@users.noreply.github.com>
2020-12-21 17:12:32 -08:00
Xavier Dupré
e36205ee83
Handles dummy python operators for double and strings ( #7 )
...
* refactor tests
* Update mshost.yaml
* Implements dummy operators with double and strings
* udpate CI
* Implements StringUpper C++ version
* Fix runtime issue preventing from registering multiple python ops
* add c++ operator StringJoin
* Support multi output for python and C++ operators
* remove torch in requirements-dev.txt
2020-10-30 11:20:18 +01:00