Commit graph

564 commits

Author SHA1 Message Date
Sayan Shaw 0d5d19f67b
fix prefast warning (#809)
Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
2024-09-15 22:34:07 -07:00
Chester Liu 8d842d85e3
Rm zlib when linking ocos_operators (#807) 2024-09-13 07:07:10 +08:00
Sayan Shaw 8bc8e43da1
Add C++ regex support for Llama3, Standard Library, and Custom Cases (#804)
* add C++ standard library regex support for GPT2 case

* reorder regex handling

* try without STL

* missing case

* add llama3 regex support

* add custom regex impl

* change regex based on model

* modify tests, add docs, and code cleanup

* add regex test and const strings

---------

Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
2024-09-10 23:17:49 -07:00
Scott McKay 9164f54e5d
Don't disable vision operators in a catalyst build. (#805)
* Don't disable vision operators in a catalyst build.

* Patch to exclude NSImage on Mac-catalyst as it's not supported.
2024-09-10 08:58:09 +10:00
Wenbing Li 90d8f33172 Revert "some data calc fixing"
This reverts commit dae9510dbb.
2024-09-05 09:30:19 -07:00
Wenbing Li dae9510dbb some data calc fixing
really split the images

test with sus
2024-09-05 09:26:05 -07:00
Wenbing Li 1b80794903
Remove OpenCV dependency from C_API mode (#800)
* Remove OpenCV dependency from C_API model

* fix build on Windows

* switch ci build flag

* try to fix the macOS build issue

* more fixing

* fix the macOS build issue

* list jpeg source

* verified on MacOS

* update the pp_api too

* avoid the codecs library conflicts

* Add the unit tests

* move the codec test

* add the missing dl lib for extensions test

* refine the code

* a smaller fixing for Windows Python
2024-09-04 16:50:05 -07:00
Kyle 7c3ce36af8
Add Files Signature Validation after Signed by ESRP (#801)
* validate sign after ESRP

* blank line

* format
2024-09-02 17:17:03 +08:00
Wenbing Li b8b2ebfb85
optimize spm tokenizer for long text (#799)
* optimize spm tokenizer for long text

* refine the split logic

* re-trigger CI pipeline.
2024-08-30 14:58:40 -07:00
Prathik Rao 6f532376c9
bump (#791)
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2024-08-27 18:58:18 -07:00
Wenbing Li 2d02a687be
Optimize the tokenizer for efficiency (#797)
* optimize the tokenizer for efficiency

* fix the unit test failures.

* fix the api test case failures

* removed the unused code.

* More test cases fixings

* One more fixing

* fix macOS build issues

* refine the test

* add more diagnosis info.

* fix unit test in CI Linux

* fix the pp_api test failure
2024-08-27 18:57:50 -07:00
Yi Zhang 2d044adbf9
sign with the correct key code (#796)
Fixes incorrect dll signature
2024-08-26 16:48:29 +08:00
Wenbing Li 8f2c35fad0
Add more tests for pre-processing C APIs (#793)
* initial api for tokenizer

* More fixings and test data refinement

* add a simple wrapper for pre-processing APIs

* fix the test issues

* test if the tokenizer is spm based

* fix the failed test cases

* json pointer does not work
2024-08-21 16:48:39 -07:00
Zhipeng Han 85ffb94169
Update custom_ops.md (#795)
add domain for SentencePiece Op
2024-08-21 09:52:54 -07:00
Wenbing Li 711a2cfa69
add a convert_token_string_to_an_id API for the prompt ids (#794)
* add a convert token string to an id API for the prompt ids

* fix the build issues on Linux
2024-08-19 16:44:07 -07:00
vraspar 6ce22f8ac4
Update nuget extraction path for iOS xcframework (#792)
* Update nuget extraction path for iOS xcframework

* Update nuget extraction path for iOS xcframework
2024-08-16 10:34:40 +10:00
vraspar 8b5354fb67
Update macosx framework packaging to follow apple guidelines (#776)
* Update macosx framework packaging to follow apple guidelines

* Test path fix

* Update tools/ci_build/extract_nuget_files.ps1

---------
2024-08-13 10:37:22 +10:00
Wenbing Li be29e28dd7
support tokenizers build only in C API mode (#783)
* support tokenizer build only in C API mode

* fix the python build.

* fix the selectedops build

---------

Co-authored-by: Sayan Shaw <52221015+sayanshaw24@users.noreply.github.com>
2024-08-02 13:28:58 -07:00
Sayan Shaw 7851b51ee3
Add initial tiktoken and Phi3SmallTokenizer support (#729)
* add initial tiktoken support

* add vector hash and equal for bpe ranks map

* change lambda comparator

* move phi-3-small files

* final changes

* move tiktoken files from data2 to data

* add unit test

* add tokenizer module

* merge json and tiktoken impl

* fix tiktoken encoding problem

* address comments

* remove dummy tokens

---------

Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2024-08-02 10:24:02 -07:00
Wenbing Li 46998e96fb
Update build-package-for-windows.yml (#784) 2024-08-01 14:45:26 -07:00
Wenbing Li 4bb63dd2aa
Upgrade ESRP signing task from v2 to v5 (#780)
* Upgrade ESRP signing task from v2 to v5

* Upgrade ESRP signing task from v2 to v5 in win

---------

Co-authored-by: Sayan Shaw <52221015+sayanshaw24@users.noreply.github.com>
2024-08-01 09:57:59 -07:00
Wenbing Li 8b002b86ab
Fix the case that bos_token is null (#781) 2024-07-31 17:50:20 -07:00
Wenbing Li b4ebfc9519
Fix spm converted FastTokenizer issue on non-ascii char (#778)
* Fix spm converted tokenizer issue on non-ascii char

* remove pkg_resource in python
2024-07-31 14:22:25 -07:00
Prathik Rao e113ed30f1
removed OpenAIAudioToText from config (#777) 2024-07-31 10:40:43 -07:00
Wenbing Li c9c11b4846
Fix the windows API missing issue and Linux shared library size issue for Java packaging. (#774)
* Fix the java packaging issues

* add the jar path example for Linux build with a default configuration
2024-07-29 16:03:58 -07:00
Wenbing Li c3145b8f52
add the decoder_prompt_id for whisper tokenizer (#775)
* add the decoder_prompt_id for whisper tokenizer

* temporarily disable android prebuilt

* disable the prebuilt for android

* disable the prebuilt for android 2

* Add a unit test

* correct test ids
2024-07-29 14:21:17 -07:00
Wenbing Li 620050fbe0
reimplement resize cpu kernel for image processing (#768)
* reimplement resize cpu kernel for image processing

* accuracy fixing and code refinement

* fix the build issues

* fix Linux build issue

* more fixings

* Fix the pipeline issue

* fix the ci script

* try to fix CUDA machine pool
2024-07-23 15:40:52 -07:00
Prathik Rao d79299e733
increase timeout (#773) 2024-07-22 17:48:27 -07:00
Prathik Rao 735041e59f
increase timeout (#772) 2024-07-22 14:31:44 -07:00
Wenbing Li bfcca2cb76
Fix the win32 Python packaging pipeline (#771) 2024-07-19 17:24:40 -07:00
Changming Sun e95ae84ba6
Switch aiinfra-Linux-CPU machine pool to onnxruntime-Ubuntu2204-AMD-CPU (#765) 2024-07-17 13:53:30 -07:00
Wenbing Li 38a3d85f8f
switch cmake cmp0169 flag to new (#762)
* switch cmake cmp0169 flag to new

* the missing spm code.

* more refinement on cmake build targets

* Update ci.yml

* Update ci.yml

* update the jpg files after using libjpeg instead of libjpeg-turbo

* exclude cutlass too

* upgrade the protobuf library to be consistent with ORT

* update the protoc generated files

* use the right patch name

* Update cutlass.cmake
2024-07-15 23:28:49 -07:00
Wenbing Li 8153bc1a3a
Feature extraction C API for whisper model (#755)
* Feature extraction C API for whisper model

* Update the docs

* Update the docs2

* refine the code

* fix some issues

* fix the Linux build

* fix more data consistency issue

* More code refinements
2024-07-11 11:20:36 -07:00
cao lei 95d65e4ec0
sync to flash attention kernel 2.5.9 and add document of how to write custom op (#757)
* sync to flash attention kernel 2.5.9

* support users to overload GetMayInplace and ReleaseMayInplace

* Undo the change for pybind11 dependency
2024-07-10 07:09:40 -07:00
Wenbing Li b436d09459
Fix the CI pipeline for the latest PyTorch release. (#759) 2024-07-08 16:21:48 -07:00
Wenbing Li f1abea14e8
Update CMakeLists.txt (#754) 2024-06-25 11:12:21 -07:00
Chester Liu 0f1f454867
Fix C4459 warning in custom_op_lite.h (#751)
Internal workitem: https://task.ms/aii/29719

Co-authored-by: Xavier Dupré <xadupre@users.noreply.github.com>
2024-06-25 10:28:27 +08:00
Wenbing Li 3b275b16bc
Upgrade pybind11 2.12 to support both numpy 1.x and 2.x (#750) 2024-06-20 15:18:17 -07:00
Wenbing Li cbed8fd575
Add a generic image processor and its C API (#745)
* Add a generic image processor

* add more tests

* Fix the test failures

* Update runner.hpp
2024-06-20 10:53:49 -07:00
Chester Liu 58b552388f
Fix several C5038 warnings (#748) 2024-06-20 08:20:30 +08:00
Xavier Dupré bef5f07e33
Add custom ops ReplaceZero (#739)
* Add custom ops ReplaceZero

* fix merge conflicts
2024-06-18 11:36:14 +02:00
Xavier Dupré 05df33b302
Add missing documentation for fused kernels (#744) 2024-06-18 10:54:15 +02:00
Wenbing Li 2f9595af74
Fix the pipeline breaks due to the MSVC 19.40 and numpy 2.0 release (#747)
* add "-allow-unsupported-compiler" flags to Windows CUDA flags

inspired by this PR: https://github.com/microsoft/onnxruntime/pull/21004

* switch to cmake command line

* handle the issues caused by the latest MSVC release

* correct the typo

* correct the parameter

* try one dash again

* use the installed cmake

* use cmake standalone installation firstly

* use the standalone cmake in win32 python too

* fix it more

* one more try

* fix the MacOS pipeline issue

* fix the pip command line
2024-06-17 16:58:11 -07:00
Wenbing Li 3c22daa4cf
disable azure ops from ci/packaging pipelines (#746)
* disable azure ops from ci/packaging pipelines

* add the missed files

* keep the env variable flag

* add python to the path?
2024-06-13 21:11:55 -07:00
Xavier Dupré 690bed71b6
Add operator MulSigmoid, MulMulSigmoid (#741)
* Add operator MulSigmoid

* add mul mul sigmoid

* add comments

* Apply suggestions from code review

---------

Co-authored-by: Wei-Sheng Chin <wechi@microsoft.com>
2024-06-12 10:29:42 +02:00
Xavier Dupré f5055466d5
Add custom kernel ScatterNDOfShape (#705)
* first draft

* clang

* Draft for ScatterNFOfShape

* fix build

* disable test when cuda is missing

* fix implementation

* update test

* add MaskedScatterNdOfShape

* fix merge conflicts
2024-06-11 09:59:46 +02:00
Xavier Dupré 79f3b048d4
Add custom op Transpose2DCast (#737)
* Add custom op Transpose2DCast

* fix compilation issues

* fix compilation issues
2024-06-06 17:44:21 +02:00
Xavier Dupré 1e8c1211a5
Add custom kernels AddSharedInput, MulSharedInput (#734)
* Add custom kernel AddSharedInput, MulSharedInput

* fix compilation

* compilation issue

* fix unit test
2024-06-05 10:42:22 +02:00
Sayan Shaw c9bba37f91
update java packaging pipeline to have .pom and .jar files in correct location (#735)
Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
2024-06-03 14:43:31 -07:00
Wenbing Li ca433cbea7
Refactor the unit tests and cmake build script (#726)
* refine the build script

* complete the unit tests.

* remove the commented code
2024-05-30 14:16:14 -07:00