Commit graph

44 commits

Author SHA1 Message Date
Wenbing Li be29e28dd7
support tokenizers build only in C API mode (#783)
* support tokenizer build only in C API mode

* fix the python build.

* fix the selectedops build

---------

Co-authored-by: Sayan Shaw <52221015+sayanshaw24@users.noreply.github.com>
2024-08-02 13:28:58 -07:00
Wenbing Li 620050fbe0
reimplement resize cpu kernel for image processing (#768)
* reimplement resize cpu kernel for image processing

* accuracy fixing and code refinement

* fix the build issues

* fix Linux build issue

* more fixings

* Fix the pipeline issue

* fix the ci script

* try to fix CUDA machine pool
2024-07-23 15:40:52 -07:00
Wenbing Li 8153bc1a3a
Feature extraction C API for whisper model (#755)
* Feature extraction C API for whisper model

* Update the docs

* Update the docs2

* refine the code

* fix some issues

* fix the Linux build

* fix more data consistency issue

* More code refinements
2024-07-11 11:20:36 -07:00
cao lei 95d65e4ec0
sync to flash attention kernel 2.5.9 and add document of how to write custom op (#757)
* sync to flash attention kernel 2.5.9

* support users to overload GetMayInplace and ReleaseMayInplace

* Undo the change for pybind11 dependency
2024-07-10 07:09:40 -07:00
Wenbing Li 3c22daa4cf
disable azure ops from ci/packaging pipelines (#746)
* disable azure ops from ci/packaging pipelines

* add the missed files

* keep the env variable flag

* add python to the path?
2024-06-13 21:11:55 -07:00
cao lei 3649db065d
Introduce ONNXRUNTIME_PKG_DIR parameter to pip install (#719)
* introduce ONNXRUNTIME_PKG_DIR parameter to pip install

* update document
2024-05-17 05:58:37 -07:00
Wenbing Li 5d558bd34f
Refine the cuda build support (#680) 2024-03-27 10:08:07 -07:00
cao lei 2234001184
refactor ORT-Extension for the coming GroupQueryAttention work (#674)
* refactor ORT-Extension for the coming GroupQueryAttention work

* fix typo and add #if ORT_API_VERSION >= 15 for GetOrtAllocator

* fix cuda build
2024-03-20 10:55:04 -07:00
Wenbing Li 972cb17a82
clean up requirements.txt and developer docs (#655) 2024-02-16 15:47:44 -08:00
Wenbing Li dce0d9c72c
Add the missing tasks and fix the issues in CUDA python unit test pipelines (#615)
* add an experimental CUDA python unit test pipeline

* typo

* in ci.yml?

* winpycuda

* move it in optional

* enable cuda pytest in linuxbuild

* build in docker

* add the cuda pytest for windows

* cuda flag fixing

* minor fixing

* typo

---------

Co-authored-by: Yi Zhang <zhanyi@microsoft.com>
2023-12-08 10:19:54 -08:00
Wenbing Li fb2a8c2841
Enhancing CUDA Support in Python Package Build and Testing (#608)
* initial commit

* Add the cuda support for python package

* format the code

* refine it a little bit
2023-11-27 15:39:52 -08:00
Sayan Shaw 5fd6bcf4d6
Add test .nuspec for Windows and respective docs (#585)
* add test .nuspec for Windows and respective docs

* minor doc change

* keep only x64 file entry and use build.bat output dll

---------

Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2023-10-30 23:34:51 -07:00
RandySheriffH 4cc985fa32
Update Azure Op doc for main (#554)
* Add details to Azure ops documentation (#552)

* doc ops

* typo

---------

Co-authored-by: Randy Shuai <rashuai@microsoft.com>

* rename the input and output as example

---------

Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2023-08-31 12:57:47 -07:00
Sayan Shaw d39e8444b6
Add initial API docs generation and publishing (#540)
* initial api-docs implementation

* change pip install extensions

* add examples README as it is needed to build Sphinx gallery + some minor changes

* call sphinx directly from workflow

* fixed commit id issue for workflow

* fix path issue in publish-python-apidocs.yml

* add tools API and cleanup

* update copyright from current year

---------

Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
2023-08-29 20:28:48 -07:00
RandySheriffH d853d31fc1
Document azure ops. (#530)
* doc ops (#529)

* Try and make CIs pass with Azure ops enabled by default.
Misc. other cleanups

* Fix some CI issues.
Cleanups some bits and pieces.

* Fix a couple of issues.

* Fix arg to build.bat

* Increase warning in triton client build to make binskim happy (hopefully).

* Try patching the warning level in the triton grpc branch as well. Shouldn't matter but...

* Run triton patch command for windows as well.

* Add patch.exe directly so windows builds work.

* override auth gen for AOAI

* fix build

* switch to windows-static

* update model for azure chat

* document triton invoker

* doc chat endpoint

* document triton invoker

* format

* format

* format

---------

Co-authored-by: Scott McKay <Scott.McKay@microsoft.com>
Co-authored-by: Randy Shuai <rashuai@microsoft.com>

* address comments

* move doc sect

* typo

* typo

---------

Co-authored-by: Scott McKay <Scott.McKay@microsoft.com>
Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2023-08-17 14:12:02 -07:00
Wenbing Li 981cb049ff
Add a new API for building data processing graph from Huggingface transformers processor/tokenizer (#482)
* initial checkins

* test pass

* basic impl

* first unit test pass

* merge error

* refine a little bit

* add more unit test

* fix unit test

* Fix the unit test.

* add one more whisper audiodecoder test case

* update the docs

* More updates
2023-07-17 16:50:58 -07:00
Wenbing Li a6012b383e
support vc runtime static linkage (#472)
* support vc runtime static linkage

* typo

* add unit test

* Update ci.yml
2023-06-20 16:40:29 -07:00
Wenbing Li 1c04e95bda
upgrade all dependency versions (#466)
* release the dependency package version

* upgrade the dependency versions

* more fixing for 3.11

* Update ci.yml

* upgrade torch version in MacOS

* fix torch version in MacOS

* upgrade Python support versions

* pybind11 switch URL mode

* more URL modes
2023-06-03 20:09:41 -07:00
Wenbing Li 46efcb9051
PyOp attribute supports int and float data type (#425) 2023-05-05 19:35:59 -07:00
Wenbing Li b5dce955f0
Add an audio decoder custom op for whisper end-to-end processing (#385)
* evaluate the audio decoder library

* MP3 Decoder

* rename it to test_audio_codec

* add the audio decoder to whisper model

* whisper end-to-end draft

* fix the mp3 decoder

* Running with ONNX models

* Add more audio format supports

* refine the end-to-end script

* Update operators/audio/audio_decoder.hpp

Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>

* Update operators/audio/audio_decoder.hpp

Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>

* Update operators/audio/audio_decoder.hpp

Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>

* some fixings of comments and more test cases.

* changes for review comments.

* Update audio_decoder.hpp

* Update audio_decoder.hpp

* code refinement

* Update operators/audio/audio_decoder.hpp

Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>

---------

Co-authored-by: Sayan Shaw <52221015+sayanshaw24@users.noreply.github.com>
Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>
2023-04-11 14:47:10 -07:00
Wenbing Li 9c93fade6f
add the onnxruntime 1.14 release into the CI pipeline (#387)
* add the onnxruntime 1.14 release into the CI pipeline

* torch 2.0 crashed on Linux
2023-03-27 16:52:08 -07:00
Edward Chen 63fe165ee8
Android package build updates (#344)
- Generate AAR with associated artifacts (javadoc.jar, sources.jar, pom).
- Fix JAR/AAR package descriptions.
- Remove `gradle -b build-android.gradle` invocation from CMake. This creates the AAR and it is not necessary to run it for each Android ABI build.
- Support building on Windows.
2023-01-13 14:06:00 -08:00
Edward Chen 280ec289cb
Use `object` instead of `np.object` and `str` instead of `np.str`. (#337) 2022-12-19 18:45:07 -05:00
Wenbing Li 13a438234e
Fix the include path issue on the ORT combined build. (#325)
* test ci pipeline

* trigger ci

* correct ORT include path

* remove unwanted script

* fixing supres code with codecs off
2022-12-16 10:48:45 -08:00
Wenbing Li 69e6ec7cf1
doc of how to add an operator 2022-12-09 10:52:40 -08:00
Wenbing Li 0647ce6d14
update the main doc and add a developer doc (#321)
* update the main doc and add a developer doc

* add it back

* fix some typo

* Update README.md

Co-authored-by: Nat Kershaw (MSFT) <nakersha@microsoft.com>
2022-12-02 16:27:58 -08:00
Wenbing Li 08659eae90
Initial Java API for the JAR package. (#292)
* more C++ code fixing and polish for release

* fixing for android build

* build flags for android release

* add missing exporting function

* imint

* first version

* more C++ code fixing and polish for release (#275)

* more C++ code fixing and polish for release

* fixing for android build

* build flags for android release

* add missing exporting function

* support build_id on Python package building (#281)

* support buildid in package building

* undo the change on build.sh

* build.sh issue on macos

* Add `$schema` to `cgmanifest.json` (#284)

Co-authored-by: Jamie Magee <jamie.magee@microsoft.com>

* test package with a simple java app

* demo app

* some fixing for windows platform

* refine the example app

* fix the missing symbols issue for Linux build

* fix the package build issue

* typo

* a missing change

* fix PythonOp

* fix Android test issue

* one more Android change

* replace build flags in ci pipeline

* android AAR package build

* refine the code for android package

Co-authored-by: Jamie Magee <jamie.magee@gmail.com>
Co-authored-by: Jamie Magee <jamie.magee@microsoft.com>
2022-10-04 16:22:28 -07:00
Wenbing Li a7f1a728d0
the doc for android/java custom build. (#260)
* the doc for android/java custom build.

* Update custom_build.md

* Update custom_build.md

* Update custom_build.md
2022-07-08 10:26:00 -07:00
Nat Kershaw (MSFT) f12f42a007
Update README.md (#238) 2022-07-06 16:18:35 -07:00
Mojimi abdd5b1bd8
Add MaskedFill (#182)
* add StringRemove

* update

Co-authored-by: Ze Tao <zetao@microsoft.com>
2021-11-04 09:29:17 +08:00
Mojimi 4290400ed3
Add doc for new operators (#161)
* add initial doc

* update doc

* finish all docs

Co-authored-by: Ze Tao <zetao@microsoft.com>
2021-09-29 07:59:09 +08:00
Wenbing Li 06c902253f
re-package onnx process code (#93)
* re-package onnx process code

* update the docs and examples as well.
2021-05-14 09:23:50 -07:00
Wenbing Li 3806e29421
rename the package to onnxruntime_extensions (#92)
* rename the package to onnxruntime_extensions

* fix the cmake file
2021-05-12 12:02:57 -07:00
Zuwei Zhao 794890985e
Support custom operator: StringToVector. (#84)
Co-authored-by: Zuwei Zhao <zuzhao@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2021-04-22 13:29:43 +08:00
Wenbing Li aa846c3116
Support the pre/post processing Python and pytorch code snippets to be converted and be merged into the ONNX model. (#76)
* initial checkins

* restructure the implementation.

* refine the Python interface

* Finalize the interface.

* Add the customop class for the customization.

* Test the eager_op with vector_to_string customop

* Refine the customop conversion interface.

* initial onnx builder

* Runnable with incorrect result.

* reformat the onnx_ops calls

* a few of operators working on tracing

* handcraft all op conversion

* Add the unit testing for mytorch

* unit test passed.

* Add some documents...

* Move non-torch API into onnxruntime_customops.utils module.

* Fix the unit test issues.

* Fix some typos.
2021-03-12 10:39:21 -08:00
Xavier Dupré 4bc5c962b0
Add WordpieceTokenizer (#72)
* add Wordpiece tokenizer
* add RaggedTensorToDense
* update documentation
2021-03-11 19:19:49 +01:00
Mojimi 9653f52341
Add batch query and attention mask support for GPT2Tokenizer (#75)
* add batch_mode and padding for GPT2Tokenizer

* fix text

* fix test and add doc

* fix test

* fix comments

* delete header

Co-authored-by: Ze Tao <zetao@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2021-03-10 13:57:07 -08:00
Mojimi 2378ca116b
add StringConcat (#70)
Co-authored-by: Ze Tao <zetao@microsoft.com>
2021-02-25 10:44:30 +08:00
Mojimi da41b75467
Add op: VectorToString (#57)
* add vector_to_string

* fix merge conflict

* fix building failure

* remove debug code

* fix test

* move back unicode

* fix typo

* move base64 back

* move the right place

* support only int64_t

Co-authored-by: Ze Tao <zetao@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2021-02-09 09:41:00 -08:00
Xavier Dupré b3a300d7bf
Add attribute global_replace to StringRegexReplace (#55)
* Add attribute global_replace to StringRegexReplace

Signed-off-by: xavier dupré <xavier.dupre@gmail.com>

* fix potential wrong pointer

Signed-off-by: xavier dupré <xavier.dupre@gmail.com>

* update sep

Signed-off-by: xavier dupré <xavier.dupre@gmail.com>

Co-authored-by: xavier dupré <xavier.dupre@gmail.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>

> It seems to be working now.

I enabled some less secured option in pipeline. let's see how it goes.
2021-02-05 09:08:02 -08:00
Xavier Dupré 4c201e7800
Change Fix type of nbtest_size into int64 (#54)
Signed-off-by: xavier dupré <xavier.dupre@gmail.com>

Co-authored-by: xavier dupré <xavier.dupre@gmail.com>
2021-01-29 14:21:50 -05:00
Xavier Dupré a32f9bc28c
Documentation for SentencepieceTokenizer (#52) 2021-01-28 19:07:48 +01:00
Mojimi a9a498501c
Improve GPT2 (#48)
* test attribute

* finish improvement

Co-authored-by: Ze Tao <zetao@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2021-01-27 15:26:37 -08:00
Mojimi 4a0f892949
Operator Schemas of text processing ops (#42)
* add a schema doc for new Op
Co-authored-by: Ze Tao <zetao@microsoft.com>
2021-01-21 15:07:32 +08:00