Граф коммитов

511 Коммитов

Автор SHA1 Сообщение Дата
Scott McKay a1285d8f36
Fix CG warnings. (#731)
- update protobuf version being used by sentencepiece and the java tests
- ignore unused language bindings from protobuf and triton
- specify the CG config file with ignored directories where required

Fix cgmanifest.json
- 'git' entries require a commit hash not version
- use 'other' for opencv third party code that is included directly in the opencv repo
  - the path isn't a valid repositoryUrl value to be provided as a 'git' entry
- update version numbers/commit hashes to match the latest code

Co-authored-by: Sayan Shaw <52221015+sayanshaw24@users.noreply.github.com>
2024-05-29 07:47:55 +10:00
Adam Pocock fcf28fe83a
Setting Java module name to com.microsoft.onnxruntime.extensions. (#730) 2024-05-28 19:58:39 +10:00
Wenbing Li a1e342acb9
Fix the Linux and MacOS wheel build for packaging issues (#727)
* Update wheels_macos.yml

* remove mulslinux support

* Update wheels_linux.yml

* Update wheels_linux.yml

* remove appcenter testing
2024-05-24 14:12:27 -07:00
Wenbing Li 474540d8a5
Fix the image processing output data discrepancy (#722)
* some data calc fixing

* Update image_transforms.hpp

* really split the images

* Update image_transforms.hpp
2024-05-20 12:44:48 -07:00
Wenbing Li c3c5f1cbb1
Remove C++ filesystem library dependency for the compatibility of old system (#721)
* Remove C++ filesystem library dependency for the compatibility of old OS.

* Update file_sys.h
2024-05-18 07:23:45 -07:00
Wenbing Li 97ee9eb56f
Refactor OrtxStatus to be header-only implmentation. (#720) 2024-05-17 15:40:11 -07:00
Tang, Cheng f0ef40d074
add move constructor and Release API for tensor (#717)
Co-authored-by: Cheng Tang <chenta@microsoft.com@onnxruntime-a10.bxgbzpva45kedp3rhbsbit4phb.jx.internal.cloudapp.net>
2024-05-17 11:50:20 -07:00
cao lei 3649db065d
Introduce ONNXRUNTIME_PKG_DIR parameter to pip install (#719)
* introduce ONNXRUNTIME_PKG_DIR parameter to pip install

* update document
2024-05-17 05:58:37 -07:00
Wenbing Li 4781a9d1d8
Add ci pipeline for pre-processing API testing (#718)
* Add ci pipeline for pre-processing API testing

* update cmake for testing

* add test cases back

* add other two pipelines

* fix macos pipeline
2024-05-16 15:39:52 -07:00
Wenbing Li 311dd35401
Add ImageProcessor for Multimodel model Pre-processing (#715)
* only keep the image decoder from opencv

* initial build

* refine the code

* Add clear functions

* Update CMakeLists.txt

* Update opencv.cmake

* change the output type to float

* get the result

* align image-process with original Python

* move the LoadRawImages into library

* fix the calculation error

* fix the pipeline build issue

* fix the build breaks in ci pipeline

* support json configuration file and refactor the code.
2024-05-15 14:35:14 -07:00
Scott McKay beb9fbbae2
Fix warnings (#713) 2024-05-13 10:14:37 +10:00
Baiju Meswani 660af0d79a
Return added_tokens_ by reference (#711) 2024-05-07 11:47:57 -07:00
Wenbing Li c58c930739
Ignore all streaming output of invalid utf-8 string (#704)
* Ignore all streaming output of invalid utf-8 string

* Update bpe_streaming.hpp

* add the phi-3 tokenizer test

* add a streaming test for phi-3 model

* fix the utf-8 validation

* fix the utf-8 validation 2

* fix the utf-8 validation 3

* fix the utf-8 validation 4
2024-05-06 16:46:55 -07:00
cao lei e645cdab8d
Introduce flash attention and cutlass library (#708)
* refactor cuda ops, remove contrib folder

* introduce flash attention and cutlass

* resolve comments

---------

Co-authored-by: Lei Cao <leca@microsoft.com@onnxruntime-a10.bxgbzpva45kedp3rhbsbit4phb.jx.internal.cloudapp.net>
2024-05-05 22:52:28 -07:00
cao lei dfdf52e759
refactor cuda ops, remove contrib folder (#707)
Co-authored-by: Lei Cao <leca@microsoft.com@onnxruntime-a10.bxgbzpva45kedp3rhbsbit4phb.jx.internal.cloudapp.net>
2024-05-03 12:18:59 -07:00
Wenbing Li 8645a846fb
A tutorial of build ort-extensions from source as a static library (#703)
* The tutorial of build from source as a static library

* update test flag control

* add the tutorial
2024-05-01 13:46:27 -07:00
Tang, Cheng 3b889fc42f
update custom op v2 struct to be able to invoke from eager mode (#700)
Co-authored-by: Cheng Tang <chenta@a100.crj0ad2y1kku1j4yxl4sj10o4e.gx.internal.cloudapp.net>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2024-04-30 13:53:39 -07:00
Yi-Hong Lyu 0175f90216
Bump up version in main from 0.11.0 to 0.12.0 (#702)
Bump up version in main from 0.11.0 to 0.12.0 since the release branch has been cut.
2024-04-30 11:45:33 -07:00
Rachel Guo 3dfec76c10
Revert net7.0 update for now (#701)
* revert net 7.0 changes for now

* minor update

---------

Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net>
2024-04-29 21:40:56 -07:00
Wenbing Li a8bce4328b
Add the tokenizer C ABI (#693)
* initial checkins

* fix the selectedops build failures

* add the tokenization implementation

* update the windows DEF file for c abi in cmake file

* fix the build on linux

* fix some warnings and remove the unused code

* initial import of unit tests from tfmtok

* add streaming API support

* fix the merges loading issues

* complete export from tfmtok - needs input id fixing

* fix the unit test failures.

* fix all unit test failure

* refactor streaming code

* remove the unused code

---------

Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
2024-04-29 16:45:49 -07:00
Tang, Cheng 1f31d33ed4
Eager mode: cuda kernel support (#694)
* add UT for neg_pos_cuda in eager mode and fix build break in Windows

* fix Linux build break

* adjust argument and path

* remove old cudaContext

* add ort cuda test back

* fix cuda tests

* undo debug code

* undo useless change

---------

Co-authored-by: jslhcl <jslhcl@gmail.com>
Co-authored-by: Cheng Tang <chenta@a100.crj0ad2y1kku1j4yxl4sj10o4e.gx.internal.cloudapp.net>
Co-authored-by: Sayan Shaw <52221015+sayanshaw24@users.noreply.github.com>
2024-04-24 12:49:00 -07:00
Rachel Guo 627e93ac08
fix version in renaming (#692)
Co-authored-by: rachguo <rachguo@rachguos-Mac-mini.local>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2024-04-19 10:30:43 -07:00
Wenbing Li f9290e8bac
Add a status class for future tokenizer API implementation (#690)
* Add a status class for future API implementation

* Update bpe_kernels.cc

* fix the ios package pipeline

* update mistral test model name
2024-04-18 21:12:14 -07:00
Wenbing Li 646462790b
Refactor the header file directory and integrate the eager tensor implementation (#689)
* refactor the header file in include folder

* fix the basic-token eager unit test case

* a more flexible way to handle string tensor shape.

* fix the unit test path issue

* remove the multi-inherits to avoid issue during pointer casting

* add api cmake build support

* undo some temporary changes

* code refinement

* fix variadic arg

* only expose the context for ort version >= 17

* fix a shape bug

* fix the cuda build issue

* change ifdef condition of GetAllocator

* finalize the ort c abi wrapper file name

* fix the iOS build break

* align gtest version with triton

* Update ext_apple_framework.cmake for iOS header files

---------

Co-authored-by: Cheng Tang <chenta@a100.crj0ad2y1kku1j4yxl4sj10o4e.gx.internal.cloudapp.net>
2024-04-17 12:58:19 -07:00
Rachel Guo fe8cd9ee8d
Add extensions catalyst support (#684)
* save work

* add flags to disable build that's currently not supported on macabi

* update

* update

* fix arg type

* update ext_test.cmake

* fix

* try add flags

* exclude ext_test

* add missing downloading steps

* nameing minor fix

* update ios version etc.

* fix

* minor fix

* minor fix

* clean up

* nuget package changes for catalyst extensions (#686)

Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net>

* update to net7.0 and add help comments

---------

Co-authored-by: rachguo <rachguo@rachguos-Mac-mini.local>
Co-authored-by: rachguo <rachguo@rachguos-Mini.attlocal.net>
2024-04-17 10:43:35 -07:00
Wenbing Li a96ed42beb
Update ext_java.cmake (#688) 2024-04-10 16:22:12 -07:00
Wenbing Li 00a594f13d
Standardize the inputs for ONNX STFT op for Whisper model (#681)
* Standardize the inputs for ONNX STFT op for Whisper model

* undo the format change

* Update _torch_cvt.py
2024-03-29 11:13:30 -07:00
cao lei 5aefc7e54d
pass CMAKE_CUDA_ARCHITECTURE from command line (#683) 2024-03-28 15:08:36 -07:00
Wenbing Li 5d558bd34f
Refine the cuda build support (#680) 2024-03-27 10:08:07 -07:00
Kaz Nishimura 31f129c779
Improve handling of missing `vocab_file` attribute in HFTokenizerConverter (#677)
This commit updates `HFTokenizerConverter` to handle cases where the `hf_tokenizer` object might not have a `vocab_file` attribute.

Changes:

* Uses `getattr` to retrieve the `vocab_file` attribute for flexibility
* Stores the retrieved value in a separate variable `vocab_file` for clarity
* Checks if `vocab_file` is `None` before checking its existence

This ensures the converter works correctly even with tokenizers that don't define a `vocab_file` attribute.
2024-03-26 08:56:28 -07:00
Changming Sun 29a4b49a5a
Update OneBranch.Official.yml for Azure Pipelines (#675) 2024-03-22 11:53:11 -07:00
cao lei 2234001184
refactor ORT-Extension for the coming GroupQueryAttention work (#674)
* refactor ORT-Extension for the coming GroupQueryAttention work

* fix typo and add #if ORT_API_VERSION >= 15 for GetOrtAllocator

* fix cuda build
2024-03-20 10:55:04 -07:00
Sayan Shaw 232132992b
Combine JAR Packages for Windows, Linux, and MacOS (#669)
* unpack all JARs

* fix macos job

* remove -f for macos tree

* move build files from java and linux into macos directory for combined JAR

* test with download and publish pipeline artifacts instead

* use full download task name rather than shortcut

* add job dependencies

* combine JAR packages into one

* update version from version.txt

* change relative path for version.txt

* test

* typo

* Update java_packaging.yml for Azure Pipelines

* Update java_packaging.yml for Azure Pipelines

* test without output variable

* Update java_packaging.yml for Azure Pipelines

* test with type rather than cat command

* Update java_packaging.yml for Azure Pipelines

* Update java_packaging.yml for Azure Pipelines

* Update java_packaging.yml for Azure Pipelines

* Update java_packaging.yml for Azure Pipelines

* Update java_packaging.yml for Azure Pipelines

* set version in each job

* Update java_packaging.yml for Azure Pipelines

* Update java_packaging.yml for Azure Pipelines

* add back dependencies

* final

---------

Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
2024-03-15 11:30:19 -07:00
Wenbing Li ebc652d4af
Add python build option for cuda-archs (#672)
* Add python build option for cuda-archs

* ignore the nvidia-smi error

* apply multi-archs more places

* fix the semicolon issue

* Update ci_optional.yml

* Update ci.yml
2024-03-14 12:29:28 -07:00
Wenbing Li 01e3a6364c
Add the cstdint header for GCC 13 (#671)
* Add the cstdint header for GCC 13

* update the common header file as well
2024-03-14 11:05:09 -07:00
Hariharan Seshadri 2869f302c5
More Nuget pipeline fixes (#673) 2024-03-13 15:42:26 -07:00
Hariharan Seshadri 960e5510b1
Fix nuget pipeline (#670) 2024-03-13 11:27:48 -07:00
Wenbing Li 6ac6fb6fbd
using the huggingface whisper config instead of fixed numbers (#667)
* using the huggingface whisper config instead of fixed numbers

* refactor a little bit
2024-03-06 14:29:49 -08:00
Wenbing Li 61369fb970
Unify the spm/bpe tokenizers (#666)
* Unify the spm/bpe tokenizers

* fix the build error

* fix the decoding issue

* add model name in exported onnx

* fixing the unit tests

* revert the unneccesary file format changes
2024-03-06 10:07:05 -08:00
Sayan Shaw 62bbcb59a2
Update ort version to 1.17.1 (#665)
* update ort version to 1.17.1

* revert python versions

* change python range to 3.8-3.12 and skip requirements install and python test for WindowsPython

* add conditions for WindowsPython for Python 3.12

* typo

---------

Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
2024-03-04 15:02:41 -08:00
Sayan Shaw 2bf26008f0
Add DLL sign for Java Windows Package (#661)
* test unpack jar on windows

* typo

* add DLL sign and jar repack

* change templates path

* change DLL path

* change repack

* update artifact name to reflect DLL sign

---------

Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
2024-02-22 18:26:53 -08:00
Nat Kershaw (MSFT) 5c53aaad62
Update Yolov8 tutorial with arguments (#658) 2024-02-21 20:11:45 -08:00
Scott McKay 0c93c20761
Disable c++ tests on iOS build targeting iOS < 13. (#659) 2024-02-22 11:47:33 +10:00
Wenbing Li 69a08ffb1d
Remove numpy dependency from its Python binary build (#657) 2024-02-21 09:54:17 -08:00
Wenbing Li 972cb17a82
clean up requirements.txt and developer docs (#655) 2024-02-16 15:47:44 -08:00
Wenbing Li bf88aa8479
support python 3.12 for packaging pipeline (#650)
* support python 3.12 for packaging pipeline

* turn unit test build off for python package

* update ctest flag for selected_ops build
2024-02-16 08:00:47 -08:00
Wenbing Li b045e66396
make onnx package to be optional. (#653)
* putting onnx package to be optional

* update the ci.yml

* add more message of missing ONNX package
2024-02-15 14:09:04 -08:00
Wenbing Li fc275e623f
fix the unit test flag issue on packaging pipelines (#652) 2024-02-14 16:46:04 -08:00
Wenbing Li 307e712f20
Enable unit test build flag by default (#648)
* the new tokenizer API

* set the api cmake option off

* fix the pipeline

* more pipelines fixing

* fix the ci pipeline of macOS Python

* added token supporting

* fix the simdjson issue on gcc

* undo tfmtok

* remove the simdjson
2024-02-13 10:56:21 -08:00
Wenbing Li 9bc7623f24
Update version.txt (#647) 2024-02-12 19:23:52 -08:00