The pre- and post- processing library for ONNX Runtime

Перейти к файлу

Scott McKay a1285d8f36 Fix CG warnings. (#731 ) - update protobuf version being used by sentencepiece and the java tests - ignore unused language bindings from protobuf and triton - specify the CG config file with ignored directories where required Fix cgmanifest.json - 'git' entries require a commit hash not version - use 'other' for opencv third party code that is included directly in the opencv repo - the path isn't a valid repositoryUrl value to be provided as a 'git' entry - update version numbers/commit hashes to match the latest code Co-authored-by: Sayan Shaw <52221015+sayanshaw24@users.noreply.github.com>		2024-05-29 07:47:55 +10:00
.config	…
.github	clean up requirements.txt and developer docs (#655 )	2024-02-16 15:47:44 -08:00
.pipelines	Fix CG warnings. (#731 )	2024-05-29 07:47:55 +10:00
.pyproject	Introduce ONNXRUNTIME_PKG_DIR parameter to pip install (#719 )	2024-05-17 05:58:37 -07:00
base	Remove C++ filesystem library dependency for the compatibility of old system (#721 )	2024-05-18 07:23:45 -07:00
cmake	Fix CG warnings. (#731 )	2024-05-29 07:47:55 +10:00
docs	Introduce ONNXRUNTIME_PKG_DIR parameter to pip install (#719 )	2024-05-17 05:58:37 -07:00
include	Remove C++ filesystem library dependency for the compatibility of old system (#721 )	2024-05-18 07:23:45 -07:00
java	Fix CG warnings. (#731 )	2024-05-29 07:47:55 +10:00
nuget	Revert net7.0 update for now (#701 )	2024-04-29 21:40:56 -07:00
onnxruntime_extensions	Standardize the inputs for ONNX STFT op for Whisper model (#681 )	2024-03-29 11:13:30 -07:00
operators	Remove C++ filesystem library dependency for the compatibility of old system (#721 )	2024-05-18 07:23:45 -07:00
prebuild	Enable using system certs on Android. (#543 )	2023-08-24 12:17:07 +10:00
pyop	Remove numpy dependency from its Python binary build (#657 )	2024-02-21 09:54:17 -08:00
shared	Fix the image processing output data discrepancy (#722 )	2024-05-20 12:44:48 -07:00
test	Fix the image processing output data discrepancy (#722 )	2024-05-20 12:44:48 -07:00
tools	Add extensions catalyst support (#684 )	2024-04-17 10:43:35 -07:00
tutorials	Ignore all streaming output of invalid utf-8 string (#704 )	2024-05-06 16:46:55 -07:00
.clang-format	…
.clang-tidy	Refactor the header file directory and integrate the eager tensor implementation (#689 )	2024-04-17 12:58:19 -07:00
.flake8	…
.gitignore	Enhancing CUDA Support in Python Package Build and Testing (#608 )	2023-11-27 15:39:52 -08:00
.sscignore	…
.swift-format	Update OrtExtensionsUsage to also use the ORT Objective-C API. (#483 )	2023-09-25 07:35:37 -07:00
CMakeLists.txt	Add ImageProcessor for Multimodel model Pre-processing (#715 )	2024-05-15 14:35:14 -07:00
CODEOWNERS	Update CODEOWNERS	2023-08-22 17:02:17 -07:00
CODE_OF_CONDUCT.md	…
LICENSE	…
MANIFEST.in	Refactor the header file directory and integrate the eager tensor implementation (#689 )	2024-04-17 12:58:19 -07:00
README.md	make onnx package to be optional. (#653 )	2024-02-15 14:09:04 -08:00
SECURITY.md	…
ThirdPartyNotices.txt	Enable C++ unit tests on iOS (#560 )	2023-09-18 18:52:30 -05:00
build.android	…
build.bat	…
build.ios_xcframework	…
build.sh	Eager mode: cuda kernel support (#694 )	2024-04-24 12:49:00 -07:00
build_lib.bat	…
build_lib.sh	Enable C++ unit tests on iOS (#560 )	2023-09-18 18:52:30 -05:00
cgmanifest.json	Fix CG warnings. (#731 )	2024-05-29 07:47:55 +10:00
pyproject.toml	Remove numpy dependency from its Python binary build (#657 )	2024-02-21 09:54:17 -08:00
requirements-dev.txt	make onnx package to be optional. (#653 )	2024-02-15 14:09:04 -08:00
setup.cfg	Minor changes to test CI PR trigger (#634 )	2024-01-16 10:31:04 -08:00
setup.py	make onnx package to be optional. (#653 )	2024-02-15 14:09:04 -08:00
version.txt	Bump up version in main from 0.11.0 to 0.12.0 (#702 )	2024-04-30 11:45:33 -07:00

README.md

ONNXRuntime-Extensions

What's ONNXRuntime-Extensions

Introduction: ONNXRuntime-Extensions is a library that extends the capability of the ONNX models and inference with ONNX Runtime, via ONNX Runtime Custom Operator ABIs. It includes a set of ONNX Runtime Custom Operator to support the common pre- and post-processing operators for vision, text, and nlp models. And it supports multiple languages and platforms, like Python on Windows/Linux/macOS, some mobile platforms like Android and iOS, and Web-Assembly etc. The basic workflow is to enhance a ONNX model firstly and then do the model inference with ONNX Runtime and ONNXRuntime-Extensions package.

Quickstart

Python installation

pip install onnxruntime-extensions

Nightly Build

on Windows

pip install --index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/ORT-Nightly/pypi/simple/ onnxruntime-extensions

Please ensure that you have met the prerequisites of onnxruntime-extensions (e.g., onnx and onnxruntime) in your Python environment.

on Linux/macOS

Please make sure the compiler toolkit like gcc(later than g++ 8.0) or clang are installed before the following command

python -m pip install git+https://github.com/microsoft/onnxruntime-extensions.git

Usage

1. Generation of Pre-/Post-Processing ONNX Model

The onnxruntime-extensions Python package provides a convenient way to generate the ONNX processing graph. This can be achieved by converting the Huggingface transformer data processing classes into the desired format. For more detailed information, please refer to the API below:

help(onnxruntime_extensions.gen_processing_models)

NOTE:

The generation of model processing requires the ONNX package to be installed. The data processing models generated in this manner can be merged with other models using the onnx.compose if needed.

2. Using Extensions for ONNX Runtime inference

Python

There are individual packages for the following languages, please install it for the build.

import onnxruntime as _ort
from onnxruntime_extensions import get_library_path as _lib_path

so = _ort.SessionOptions()
so.register_custom_ops_library(_lib_path())

# Run the ONNXRuntime Session, as ONNXRuntime docs suggested.
# sess = _ort.InferenceSession(model, so)
# sess.run (...)

C++

  // The line loads the customop library into ONNXRuntime engine to load the ONNX model with the custom op
  Ort::ThrowOnError(Ort::GetApi().RegisterCustomOpsLibrary((OrtSessionOptions*)session_options, custom_op_library_filename, &handle));

  // The regular ONNXRuntime invoking to run the model.
  Ort::Session session(env, model_uri, session_options);
  RunSession(session, inputs, outputs);

Java

var env = OrtEnvironment.getEnvironment();
var sess_opt = new OrtSession.SessionOptions();

/* Register the custom ops from onnxruntime-extensions */
sess_opt.registerCustomOpLibrary(OrtxPackage.getLibraryPath());

C#

SessionOptions options = new SessionOptions()
options.RegisterOrtExtensions()
session = new InferenceSession(model, options)

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.microsoft.com.

When you submit a pull request, a CLA-bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., label, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

License

MIT License