The pre- and post-processing library for ONNX Runtime


Zuwei Zhao 05f7ded825 Add check for empty input in StringJoin operator and fix empty string input error in BlingFire sentence breaker. (#175 ) * Add test cases and fix empty string error in BlingFire sentence breaker. * Throw error if input text to join is empty array. * Fix scalar support and access violation. * Resolve comments. * Resolve comments. Co-authored-by: Zuwei Zhao <zuzhao@microsoft.com>		2021-10-27 20:21:16 +08:00
.az	Prepare for 0.4.0 release (#151 )	2021-09-25 00:40:12 -07:00
.github/workflows	Update wheels.yml	2021-08-20 16:42:37 -07:00
ci_build	add a native unit test for regex_split op (#166 )	2021-10-06 15:58:46 -07:00
cmake	Change farmhash to use uint128_t from the local namespace. (#160 )	2021-09-27 19:49:03 -07:00
docker	Add a dockerfile and update for build script. (#131 )	2021-08-19 10:38:05 -07:00
docs	Add doc for new operators (#161 )	2021-09-29 07:59:09 +08:00
includes	add a native unit test for regex_split op (#166 )	2021-10-06 15:58:46 -07:00
onnxruntime_extensions	Add test for StringECMARegexReplace (#176 )	2021-10-26 10:28:06 +08:00
operators	Add check for empty input in StringJoin operator and fix empty string input error in BlingFire sentence breaker. (#175 )	2021-10-27 20:21:16 +08:00
pyop	Support the domain name in the PyOp. (#136 )	2021-08-26 11:04:45 -07:00
shared	support the non-exception compiling for the text domain. (#142 )	2021-09-02 11:19:18 -07:00
test	Add check for empty input in StringJoin operator and fix empty string input error in BlingFire sentence breaker. (#175 )	2021-10-27 20:21:16 +08:00
tools	Fix ::tolower error when locale is not 'C' (#174 )	2021-10-20 20:59:29 -07:00
tutorials	upgrade ort and ir version mapping (#157 )	2021-09-27 14:56:22 -07:00
.clang-format	initial checkins	2020-10-12 10:52:52 -07:00
.clang-tidy	initial checkins	2020-10-12 10:52:52 -07:00
.flake8	initial checkins	2020-10-12 10:52:52 -07:00
.gitignore	Add native test for bert tokenizer (#173 )	2021-10-19 11:09:38 -07:00
CMakeLists.txt	embedded farmhash source files to have some fixing (#158 )	2021-09-27 16:04:46 -07:00
CODEOWNERS	Create CODEOWNERS	2021-04-21 16:46:21 -07:00
CODE_OF_CONDUCT.md	Initial CODE_OF_CONDUCT.md commit	2020-10-05 12:36:41 -07:00
LICENSE	Updating LICENSE to template content	2020-10-05 12:36:43 -07:00
MANIFEST.in	Prepare for 0.4.0 release (#151 )	2021-09-25 00:40:12 -07:00
README.md	Update README.md	2021-08-13 15:53:39 -07:00
SECURITY.md	Initial SECURITY.md commit	2020-10-05 12:36:44 -07:00
build.android	Prepare for 0.4.0 release (#151 )	2021-09-25 00:40:12 -07:00
build.bat	upgrade ort and ir version mapping (#157 )	2021-09-27 14:56:22 -07:00
build.sh	Add a dockerfile and update for build script. (#131 )	2021-08-19 10:38:05 -07:00
pyproject.toml	Prepare for 0.4.0 release (#151 )	2021-09-25 00:40:12 -07:00
requirements-dev.txt	Implement custom operators for sentancepiece (#41 )	2021-01-27 23:55:50 +01:00
requirements.txt	Customize string operators list for cmake build. (#134 )	2021-08-24 12:31:05 -07:00
setup.cfg	A more formal build process and the fixing of unix-like environment. (#39 )	2021-01-11 13:44:17 -08:00
setup.py	Integrate the changes from the 0.4 release branch. (#162 )	2021-09-29 13:18:36 -07:00

README.md

ONNXRuntime Extensions

Introduction

ONNXRuntime Extensions is a comprehensive package that extends the capabilities of ONNX model conversion and inference.

A CustomOp C++ library for ONNX Runtime, built on the ONNXRuntime CustomOp API.
The PyOp feature, which lets you implement a custom op as a Python function.
Building an all-in-one ONNX model from pre/post-processing code; see docs/pre_post_processing.md for details.
Per-operator debugging in Python; see hook_model_op in the onnxruntime_extensions Python package.

Quick Start

The following code shows how to run an ONNX model together with an ONNXRuntime custom op in a straightforward way.

import numpy
from onnxruntime_extensions import PyOrtFunction, VectorToString
# <ProjectDir>/tutorials/data/gpt-2/gpt2_tok.onnx
encode = PyOrtFunction.from_model('gpt2_tok.onnx')
# https://github.com/onnx/models/blob/master/text/machine_comprehension/gpt-2/model/gpt2-lm-head-10.onnx
gpt2_core = PyOrtFunction.from_model('gpt2-lm-head-10.onnx')
decode = PyOrtFunction.from_customop(VectorToString, map={' a': [257]}, unk='<unknown>')

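input_text = ['It is very cool to have']
# Tokenize the text into GPT-2 token ids with the tokenizer model.
input_ids, *_ = encode(input_text)
output, *_ = gpt2_core(input_ids)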
next_id = numpy.argmax(output[:, :, -1, :], axis=-1)
print(input_text[0] + decode(next_id).item())

This is a simplified version of GPT-2 inference, for demonstration only. A comprehensive solution for the GPT-2 model and its variants is under development, and an experimental version is available.
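To make the last two steps of the snippet above concrete, here is a minimal sketch that uses a synthetic logits array in place of the real GPT-2 output. It assumes the output has shape (batch, 1, sequence, vocab), which is what the indexing output[:, :, -1, :] implies. The five-entry vocabulary and the lookup dictionary are made up for illustration; they only mimic the idea of the VectorToString mapping, not its implementation.

```python
import numpy

# Synthetic stand-in for the model output: (batch=1, 1, sequence=3, vocab=5).
# The real GPT-2 vocabulary is far larger; these numbers are made up.
output = numpy.zeros((1, 1, 3, 5), dtype=numpy.float32)
output[0, 0, -1, :] = [0.1, 0.3, 2.5, 0.2, 0.0]  # logits for the last position

# Select the last position along the sequence axis, then take the most likely id.
next_id = numpy.argmax(output[:, :, -1, :], axis=-1)  # shape (1, 1)

# Hypothetical id-to-text table playing the role of the map argument above.
lookup = {2: ' a'}
next_token = lookup.get(int(next_id.item()), '<unknown>')

print('It is very cool to have' + next_token)
```

With these made-up logits the selected id is 2, so the looked-up token is ' a'. Any id missing from the table falls back to '<unknown>', as the unk argument does in the snippet above.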

Android/iOS

The Python processing code above can be converted into an all-in-one model that runs on Android/iOS mobile platforms without a Python runtime or any third-party dependencies. Here is the tutorial

CustomOp Conversion

The mainstream ONNX converters support generating a custom op when an operation from the original framework cannot be expressed with standard ONNX operators. Check the following two examples on how to do this.

Inference with CustomOp library

The CustomOp library is written in C++, so it can run models from native binaries. The following is a C++ example.

  // Register the custom op library with the session options so that ONNXRuntime can load a model that uses the custom ops.
  void* handle = nullptr;  // receives the handle of the loaded library
  Ort::ThrowOnError(Ort::GetApi().RegisterCustomOpsLibrary((OrtSessionOptions*)session_options, custom_op_library_filename, &handle));

  // The regular ONNXRuntime invoking to run the model.
  Ort::Session session(env, model_uri, session_options);
  RunSession(session, inputs, outputs);

In Python this is even simpler, since PyOrtFunction directly turns an ONNX model into a Python function. If you prefer to use the ONNXRuntime Python API with the custom op library instead, the inference process looks like this:

import onnxruntime as _ort
from onnxruntime_extensions import get_library_path as _lib_path

so = _ort.SessionOptions()
so.register_custom_ops_library(_lib_path())

# Run the ONNXRuntime Session.
# sess = _ort.InferenceSession(model, so)
# sess.run(...)

More CustomOp

Contributions of custom op C++ implementations directly to this repository are welcome and will benefit other users widely. Besides C++, if you want to quickly verify an ONNX model that uses custom operators from Python, PyOp can help:

import numpy
from onnxruntime_extensions import PyOp, onnx_op

# Implement the CustomOp by decorating a function with onnx_op
@onnx_op(op_type="Inverse", inputs=[PyOp.dt_float])
def inverse(x):
    # the user custom op implementation here:
    return numpy.linalg.inv(x)

# Run the model with this custom op
# from onnxruntime_extensions import PyOrtFunction
# model_func = PyOrtFunction.from_model(model_path)
# outputs = model_func(inputs)
# ...
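Because the body of a PyOp is an ordinary Python function, its numerical logic can be checked with NumPy before it is wired into a model. The sketch below repeats the body of inverse under a hypothetical name, inverse_reference, without the onnx_op decorator, and verifies it on a small made-up matrix. It does not exercise ONNX Runtime itself.

```python
import numpy

def inverse_reference(x):
    # Same computation as the body of the inverse custom op above.
    return numpy.linalg.inv(x)

# A small invertible matrix chosen only for illustration.
x = numpy.array([[4.0, 7.0], [2.0, 6.0]], dtype=numpy.float32)
y = inverse_reference(x)

# The product of a matrix and its inverse should be close to the identity.
assert numpy.allclose(x @ y, numpy.eye(2), atol=1e-5)
print(y)
```

A check like this catches shape or dtype mistakes early, before they surface as harder-to-read errors inside an inference session.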

Build and Development

This project supports Python and can be built from source easily; alternatively, a plain cmake build produces the library without any Python dependency.

Python package

Install Visual Studio with the C++ development tools on Windows, gcc on Linux, or Xcode on macOS, plus cmake on Unix-like platforms. (Hint: on Windows, if the cmake bundled with Visual Studio is used, first set VCVARS=%ProgramFiles(x86)%\Microsoft Visual Studio\2019\<Edition>\VC\Auxiliary\Build\vcvars64.bat)
Prepare a Python environment and install the pip packages listed in requirements.txt.
Run python setup.py install to build and install the package.
Or run python setup.py develop to install the package in development mode, which is more convenient for developers because the package does not need to be reinstalled after every build.

Test:

Run pytest test in the project root directory.

If only the DLL/shared library is needed, without any Python dependencies, run build.bat or bash ./build.sh to build it. By default the library is generated in the directory out/<OS>/<FLAVOR>. A unit test is included to help verify the build.
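Once the build has finished, the resulting library can be located programmatically and its path passed to register_custom_ops_library. The sketch below is an illustration resting on assumptions, not part of the project: the function name find_built_library is hypothetical, and it simply returns the first file with a shared-library extension under the given output directory, since the exact file name depends on the platform and build flavor.

```python
from pathlib import Path

# File extensions used for shared libraries on Windows, Linux and macOS.
SHARED_LIB_SUFFIXES = ('.dll', '.so', '.dylib')

def find_built_library(out_dir='out'):
    """Return the first shared library found under out_dir, or None.

    Hypothetical helper: it only assumes the out/<OS>/<FLAVOR> layout
    described above and makes no assumption about the library's file name.
    """
    root = Path(out_dir)
    if not root.is_dir():
        return None
    # Sort the candidates so the result is deterministic across runs.
    for path in sorted(root.rglob('*')):
        if path.is_file() and path.suffix.lower() in SHARED_LIB_SUFFIXES:
            return path
    return None

# Prints None when no build output exists in the current directory.
print(find_built_library())
```

If several libraries end up under the output directory, a stricter filter on the file name would be needed; that name is deliberately left open here.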

Static library and linking with ONNXRuntime

To reduce binary size, the project can be built as a static library and linked into ONNXRuntime. Here is the script to do this, which is especially useful when building a mobile release.

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.microsoft.com.

When you submit a pull request, a CLA-bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., label, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

License

MIT License