Wenbing Li
70aa18e14e
add a native unit test for regex_split op ( #166 )
...
* add a native unit test for regex_split op
* fix the case of shape [1, 0]
* Update mshost.yaml
* downgrade the test model version.
* upgrade torch version on Windows CI
* disable windows python 3.7 pipeline.
2021-10-06 15:58:46 -07:00
joburkho
4d7004bf6e
Correct memory indexing issue. ( #165 )
...
* Correct memory reservation.
* Fix the vmImage version for MacOS CI pipeline.
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2021-10-04 16:26:34 -07:00
Wenbing Li
4ebb8667da
Integrate the changes from the 0.4 release branch. ( #162 )
...
* generate the release packages on release branch
* no pyd file on Windows
* undo setup.py simplify
2021-09-29 13:18:36 -07:00
Mojimi
4290400ed3
Add doc for new operators ( #161 )
...
* add initial doc
* update doc
* finish all docs
Co-authored-by: Ze Tao <zetao@microsoft.com>
2021-09-29 07:59:09 +08:00
joburkho
b6e65f8780
Change farmhash to use uint128_t from the local namespace. ( #160 )
2021-09-27 19:49:03 -07:00
Wenbing Li
9eef22cb81
embed farmhash source files to apply some fixes ( #158 )
...
* embed farmhash source files to apply some fixes.
* update CMakefile
2021-09-27 16:04:46 -07:00
joburkho
cddb3de158
Add ORT_API_CALL to AddExternalCustomOp so that the calling convention of the declaration in ocos.h matches the calling convention of the definition in ortcustomops.cc for x86 builds. ( #159 )
2021-09-27 16:04:28 -07:00
Wenbing Li
6ad95f6df1
upgrade ort and ir version mapping ( #157 )
...
* some bug fixes for the gpt2 tools
* upgrade ort and ir version mapping
2021-09-27 14:56:22 -07:00
Mojimi
2d6cf0b4ea
Reduce bert tokenize memory usage ( #156 )
...
* add BertTokenizerVocab
* improve format
Co-authored-by: Ze Tao <zetao@microsoft.com>
2021-09-27 11:19:57 -07:00
Mojimi
d8cdb8e042
reduce memory usage ( #154 )
...
Co-authored-by: Ze Tao <zetao@microsoft.com>
2021-09-27 13:45:47 +08:00
Wenbing Li
9f3abe20fd
Prepare for 0.4.0 release ( #151 )
...
* new CI configuration
* Set up CI with Azure Pipelines
[skip ci]
* install numpy in cibuildwheel
* add pyproject.toml
* upgrade vmImage
* update the build python versions
* remove the pytest
* move the wheel build files
* enable sdist setup.py as well.
* use git command line
* Update wheels.yml for Azure Pipelines
* disable the pypy package for macos;
* fix the external repo code tag
* fix the ctest problem
* fix the unicode 8217.
* fix the locale base test
2021-09-25 00:40:12 -07:00
Mojimi
98c32dfe4a
Add new operator StringMapping ( #148 )
...
* update
* update python wrapper
* fix comments
Co-authored-by: Ze Tao <zetao@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2021-09-14 09:13:08 +08:00
Mojimi
519755fcc0
fix bug ( #149 )
2021-09-13 10:29:08 -07:00
Mojimi
3542434876
Fix op selection ( #147 )
...
* fix segment_extraction
* update
Co-authored-by: Ze Tao <zetao@microsoft.com>
2021-09-11 01:23:35 +08:00
Mojimi
d4b2aff0c8
Improve regex ( #146 )
...
* add test
* bring back test case
* add ignore case for regex
Co-authored-by: Ze Tao <zetao@microsoft.com>
2021-09-09 13:24:49 +08:00
Zuwei Zhao
6d7a865913
Disable c++ exceptions in onnxruntime-extensions. ( #143 )
...
* Disable c++ exceptions in onnxruntime-extensions.
* Remove cxx flags for extensions.
* Remove redundant lines.
Co-authored-by: Zuwei Zhao <zuzhao@microsoft.com>
2021-09-09 08:21:40 +08:00
Mojimi
8649d98839
update ( #145 )
...
Co-authored-by: Ze Tao <zetao@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2021-09-08 14:26:26 +08:00
Mojimi
cce66310b2
Improve recently checked-in operators ( #144 )
...
* update
* update
* update
* remove tokenizer space
* fix bugs
Co-authored-by: Ze Tao <zetao@microsoft.com>
2021-09-07 13:34:47 +08:00
Wenbing Li
2842d2208e
support the non-exception compiling for the text domain. ( #142 )
...
* support the non-exception compiling for the text domain.
* fix a path error.
2021-09-02 11:19:18 -07:00
Mojimi
97ec950751
Add SegmentExtraction and BertTokenizerDecoder ( #140 )
...
* status
* update
* update
* fix bug
Co-authored-by: Ze Tao <zetao@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2021-08-27 13:54:02 -07:00
Wenbing Li
cb47ee4d44
Standardize the public header files. ( #139 )
...
* a couple of fixes
* add a library alias
2021-08-27 12:51:02 -07:00
Zuwei Zhao
8d47fb7925
Fix comma error. ( #138 )
...
Co-authored-by: Zuwei Zhao <zuzhao@microsoft.com>
2021-08-27 10:15:38 -07:00
Mojimi
aef5ef1ef1
Add BertTokenizer ( #135 )
...
* init
* update
* update
* update
* update
* update
* update
* Modify relative path of generated cmake file.
* update
* update
* fix the bug
* update
* fix bugs
Co-authored-by: Ze Tao <zetao@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
Co-authored-by: Zuwei Zhao <zuzhao@microsoft.com>
2021-08-26 13:50:03 -07:00
Wenbing Li
6c3b496e3f
Support the domain name in the PyOp. ( #136 )
2021-08-26 11:04:45 -07:00
Mojimi
00448bc78c
Replace Re2 with std::re2 ( #129 )
...
* initial commit
* update
* bring selectedoplist back
* remove unnecessary change
* update
* fix unittest
* remove test
* fix windows building
* update
* update
* undo the changes on test cases
* add the missing C++ flags
Co-authored-by: Ze Tao <zetao@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
Co-authored-by: Wenbing Li <wenli@microsoft.com>
2021-08-25 16:09:35 -07:00
Zuwei Zhao
f74770feed
Customize string operators list for cmake build. ( #134 )
...
* Customize operators list for build.
* Modify cmake_helper.py to not depend on onnxruntime-extensions build/install.
* Add new line at last.
* Move cmake_helper.py to ci_build/tools/ folder.
* undo some unwanted changes.
* Update CMakeLists.txt
* Update gen_selectedops.py
Co-authored-by: Zuwei Zhao <zuzhao@microsoft.com>
Co-authored-by: Wenbing Li <wenli@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2021-08-24 12:31:05 -07:00
Wenbing Li
52922ecc54
Update wheels.yml
2021-08-20 16:42:37 -07:00
Wenbing Li
e5e766d6dc
Update wheels.yml
2021-08-20 16:38:07 -07:00
Wenbing Li
4777213e31
Create wheels.yml ( #133 )
2021-08-20 16:29:29 -07:00
Wenbing Li
ef4e07c5f4
Add a dockerfile and update for build script. ( #131 )
...
* Add a dockerfile and update for build script.
* update
2021-08-19 10:38:05 -07:00
Mojimi
1ae69c0f7a
Users/zetao/fix link order ( #125 )
...
* fix link order
* fix format
* update
* bring option back
* bring selectedoplist back
Co-authored-by: Ze Tao <zetao@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2021-08-17 13:21:12 +08:00
Wenbing Li
4eaa5ac4ae
support non-exception build ( #126 )
...
* support non-exception build
* fix the ir_version issue in the test
2021-08-16 11:08:03 -07:00
Wenbing Li
3fa03e1855
Update README.md
...
Update the build status link
2021-08-13 15:53:39 -07:00
Wenbing Li
0c57e02e1f
fix the ir_version issue due to the latest onnx release ( #128 )
2021-08-13 15:03:14 -07:00
Wenbing Li
983de7c0fe
support the operator list for build flags ( #122 )
...
* support the operator list for build flags
* revert the flag
* update the file name
* little refinement
2021-07-30 12:43:47 -07:00
Wenbing Li
a428be447c
optimize the gpt2-bs model after the generation ( #119 )
2021-07-08 15:36:11 -07:00
Wenbing Li
7fd96c8e97
fix the input shape in the gptbs model. ( #118 )
2021-07-02 18:00:09 -07:00
Wenbing Li
b708aeaaf4
add a disable_tokenizer option for the beam search process ( #116 )
...
* add an enable_tokenizer option
* a formal name
2021-06-29 15:05:14 -07:00
Mojimi
3e82549bcb
Add TextToSentenceTokenizer ( #113 )
...
* add dependency
* init
* add test
* implement
* finished
* find file path through pathlib
* rename the op
* update reminder
Co-authored-by: Ze Tao <zetao@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2021-06-24 14:29:16 +08:00
Wenbing Li
800e360ef3
add convert gpt-2 function in the bs script as well ( #114 )
2021-06-17 14:51:47 -07:00
Wenbing Li
fd1216b027
fix some build and env issues. ( #111 )
2021-06-10 09:57:49 -07:00
Wenbing Li
f4e1be286a
add sentencepiece pre-trained model test ( #110 )
2021-06-09 15:50:12 -07:00
Mojimi
cb81344392
install emsdk out of source ( #109 )
...
Co-authored-by: Ze Tao <zetao@microsoft.com>
2021-06-09 14:45:26 +08:00
Zuwei Zhao
803d9a3db9
Update patch file to integrate CustomOps into ONNXRuntime. ( #107 )
...
Co-authored-by: Zuwei Zhao <zuzhao@microsoft.com>
2021-06-08 18:26:14 -07:00
Wenbing Li
ba200b4a0e
import the opset version from the embedded ONNX model ( #105 )
...
* import the opset version from the embedded ONNX model
* real fix
2021-06-04 18:43:18 -07:00
Wenbing Li
88a3c0e42d
fix the onnxprocess for the empty input and name ( #104 )
...
* fixing the onnxprocess for the empty input and name
* fix the crash on onnxruntime 1.8
2021-06-03 21:23:13 -07:00
Wenbing Li
0851eacfeb
Update mshost.yaml for Azure Pipelines ( #102 )
2021-06-01 22:58:22 -07:00
Wenbing Li
48be0efea2
more refinement on the release doc and script ( #99 )
...
* more refinement on the release doc and script
* fix the build script.
2021-06-01 14:02:55 -07:00
Wenbing Li
e7b7eb5960
prepare the 0.3.0 release ( #98 )
...
* prepare the 0.3.0 release
* fix the install wheel
* Update mshost.yaml for Azure Pipelines
* update the yaml file
* update the yaml file
* update the yaml file
* put the wheels in one zipped file
* Revert "put the wheels in one zipped file"
This reverts commit bf4a086c94.
2021-05-30 23:46:33 -07:00
Wenbing Li
567527b3b3
Create linux_release_x86_64.yml ( #97 )
...
* Create linux_release_x86_64.yml
* update release scripts
* Create linux_release_x86_64.yml
* temp1
* Update linux_release_x86_64.yml
kick off
* Update linux_release_x86_64.yml
* Update linux_release_x86_64.yml
* Update linux_release_x86_64.yml
* fix the cmake file.
* Update linux_release_x86_64.yml
* Update linux_release_x86_64.yml
* Update linux_release_x86_64.yml
* Fix the python3.x.lib missing issue.
2021-05-29 20:17:15 -07:00