* Add separate step to wait for simulator to boot.
* Add -parallel-testing-enabled NO xcodebuild option.
* Apply changes to ios_packaging.yml.
* quote other display names in same job for consistency
* fix linux wheel pipeline by updating the linux wheel build configuration
* update the command line
* add manylinux_aarch64
* Update wheels_linux.yml for Azure Pipelines
* remove the AzureOp from aarch64
* fix unit tests for ORT 1.16
* Initial checkins for returnable ORT ABIs
* fix for linux build
* more fixes on Python, test...
* remove the statusmsg
* fix native unit tests
* fix Python unit tests
* fix last unit test
* Add details to Azure ops documentation (#552)
* doc ops
* typo
---------
Co-authored-by: Randy Shuai <rashuai@microsoft.com>
* rename the input and output as example
---------
Co-authored-by: Randy Shuai <rashuai@microsoft.com>
* initial api-docs implementation
* change pip install extensions
* add examples README as it is needed to build Sphinx gallery + some minor changes
* call sphinx directly from workflow
* fixed commit id issue for workflow
* fix path issue in publish-python-apidocs.yml
* add tools API and cleanup
* update copyright from current year
---------
Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
- Split ci.yml into stages. This makes it easier to only run a subset of the jobs in the pipeline.
- Use existing iOS simulator instead of creating a new one.
---------
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
- Update build script to copy over test files and run tests on an Android device.
- Set up a CI job to run tests on the emulator.
- Fix a test failure encountered when running on an actual Android device. Update string_lower to call ToLower on each Unicode code point instead of each UTF-8 byte.
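
A minimal Python sketch of why the string_lower change matters. It is an illustration only, not the C++ code from this change: the helper names are hypothetical, and Python's `bytes.lower()` stands in for an ASCII-only per-byte ToLower (the real per-byte behaviour on Android may differ).

```python
def lower_bytewise(text: str) -> str:
    """Lowercase each UTF-8 byte on its own.

    bytes.lower() only maps ASCII A-Z, so the bytes of a multi-byte
    character such as U+00C9 (0xC3 0x89) are left untouched.
    """
    return text.encode("utf-8").lower().decode("utf-8")


def lower_per_code_point(text: str) -> str:
    """Lowercase each Unicode code point on its own."""
    return "".join(ch.lower() for ch in text)


if __name__ == "__main__":
    sample = "\u00c9COLE"  # capital E with acute accent, then "COLE"
    print(lower_bytewise(sample))        # the accented capital is not lowered
    print(lower_per_code_point(sample))  # every letter is lowered
```

Working per code point keeps multi-byte characters intact and lets non-ASCII letters be lowered as well.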
---------
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
Disable in-memory cert store and loading certs from model.
- TBD if it will be needed - need to know how reliable using the Android system certs will be and whether any scenarios need to have custom cert management.
* Use in-memory certs for curl on Android
- could not get curl+openssl to be able to use the system ones
* Use static build for curl and openssl
- smaller binary size
* Cache the x509 certificate store so we don't need to re-create for every request.
* Read certs from node attribute for now.
---------
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
* Add `timeout_seconds` attribute for per-node timeout. Defaults to (arbitrary) value of 15 seconds.
* Fix datatype - onnx only has int64_t attributes.
Update test model to validate timeout is read correctly.
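
A rough Python sketch of the attribute handling described above, assuming the node attributes are available as a plain dict. `read_timeout_seconds` and the dict layout are hypothetical; only the attribute name, the 15 second default and the int64 constraint come from the commit message.

```python
DEFAULT_TIMEOUT_SECONDS = 15  # default named in the commit message

INT64_MIN = -(2 ** 63)
INT64_MAX = 2 ** 63 - 1


def read_timeout_seconds(attributes: dict) -> int:
    """Return the per-node timeout, falling back to the default.

    ONNX integer attributes are int64, so anything that is not an
    integer in that range is rejected here (hypothetical policy).
    """
    value = attributes.get("timeout_seconds", DEFAULT_TIMEOUT_SECONDS)
    # bool is a subclass of int in Python, so exclude it explicitly.
    if isinstance(value, bool) or not isinstance(value, int):
        raise TypeError("timeout_seconds must be an integer attribute")
    if not INT64_MIN <= value <= INT64_MAX:
        raise ValueError("timeout_seconds does not fit in int64")
    return value


if __name__ == "__main__":
    print(read_timeout_seconds({}))                       # default applies
    print(read_timeout_seconds({"timeout_seconds": 30}))  # explicit value
```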
* Update ci.yml for Azure Pipelines
* Fix the command lines
* is requirements-dev.txt
* switch to windows
* Update windows task
* Update ci.yml for Azure Pipelines
* add cmake in path on windows
* Update ci.yml for Azure Pipelines
* add explicit azure python build flag
---------
Co-authored-by: Sayan Shaw <52221015+sayanshaw24@users.noreply.github.com>
* doc ops (#529)
* Try and make CIs pass with Azure ops enabled by default.
Misc. other cleanups
* Fix some CI issues.
Clean up some bits and pieces.
* Fix a couple of issues.
* Fix arg to build.bat
* Increase warning in triton client build to make binskim happy (hopefully).
* Try patching the warning level in the triton grpc branch as well. Shouldn't matter but...
* Run triton patch command for windows as well.
* Add patch.exe directly so windows builds work.
* override auth gen for AOAI
* fix build
* switch to windows-static
* update model for azure chat
* document triton invoker
* doc chat endpoint
* document triton invoker
* format
* format
* format
---------
Co-authored-by: Scott McKay <Scott.McKay@microsoft.com>
Co-authored-by: Randy Shuai <rashuai@microsoft.com>
* address comments
* move doc sect
* typo
* typo
---------
Co-authored-by: Scott McKay <Scott.McKay@microsoft.com>
Co-authored-by: Randy Shuai <rashuai@microsoft.com>
* Build fixes
- zlib needs to come from vcpkg if Azure ops are being built and opencv isn't enabled
- set the IR version to 8 for some of the azure ops test models so they can be tested when ORT 1.14 is used
- pass through new ort version value so that a consistent version is used to a) pull the ORT package for the c++ unit tests and b) disable azure ops if ORT version is too old.
* Update to automatically chain package to avoid build errors during the install if cmake runs commands in parallel
* Define simplified ORT_FILE for older ORT versions
* - Ensure we log an error message before throwing on Android
- message in exception will be lost due to how the shared libraries are built (both onnxruntime and extensions use static libc++ so there are no shared exception types between them)
- support static or dynamic build of curl/openssl on android
- TBD which we want to use.
- add infra for anything deriving from BaseKernel to log messages using the ORT logger
- ensures messages from custom kernels end up in the same place as messages from ORT
* Fix GPT2 and Falcon tokenizer conversion for the AutoTokenizer implementation
* fix fast tokenizer issue
* small fix
* use slow tokenizer in test script
---------
Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
* Fix LNK4098 warning from sentencepiece forcibly changing the build flags.
> LINK : warning LNK4098: defaultlib 'LIBCMT' conflicts with use of other libs; use /NODEFAULTLIB:library
* Use CMAKE_MSVC_RUNTIME_LIBRARY to determine whether /MT should be used.
* Add TrieTokenizer for RWKV-like LLM models
* add more tests
* fix the windows build
* download the vocab file instead of checking it in
* a small bug fix
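
A minimal Python sketch of a trie-based tokenizer, for orientation only. The commit messages do not describe the algorithm, so this assumes greedy longest-match over UTF-8 bytes; the class names, vocabulary and token ids are hypothetical and do not mirror the C++ operator.

```python
from typing import Dict, List, Optional


class TrieNode:
    def __init__(self) -> None:
        self.children: Dict[int, "TrieNode"] = {}
        self.token_id: Optional[int] = None  # set when a token ends here


class TrieTokenizer:
    """Greedy longest-match tokenizer over a byte-level trie."""

    def __init__(self, vocab: Dict[bytes, int]) -> None:
        self.root = TrieNode()
        for token, token_id in vocab.items():
            node = self.root
            for byte in token:
                node = node.children.setdefault(byte, TrieNode())
            node.token_id = token_id

    def encode(self, text: str) -> List[int]:
        data = text.encode("utf-8")
        ids: List[int] = []
        pos = 0
        while pos < len(data):
            node = self.root
            best_id, best_end = None, pos
            i = pos
            # Walk the trie as far as the input allows, remembering
            # the longest prefix that is a complete token.
            while i < len(data) and data[i] in node.children:
                node = node.children[data[i]]
                i += 1
                if node.token_id is not None:
                    best_id, best_end = node.token_id, i
            if best_id is None:
                raise ValueError(f"no token matches at byte offset {pos}")
            ids.append(best_id)
            pos = best_end
        return ids


if __name__ == "__main__":
    toy_vocab = {b"a": 1, b"ab": 2, b"abc": 3, b"b": 4, b"c": 5}
    print(TrieTokenizer(toy_vocab).encode("abcab"))
```

With the toy vocabulary above, "abcab" is split as "abc" followed by "ab", since the longest matching token is taken at each position.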
* Refactor setup for Azure ops to try and make common things more re-usable, and for the actual ops to simply layer in the specific input/output constraints for that type of request.
Currently builds on Linux, Windows (x64 only) and Android
Android requires a manual pre-build of openssl and curl.
Linux requires a manual pre-install of openssl.
Windows currently only works for x64. Other targets need the triplet adjusted.
* Address PR comments
* Fix a couple of Android build warnings.
* Update .gitignore to remove old path
* Fix build break from merge