Граф коммитов

13 Коммитов

Автор SHA1 Сообщение Дата
Scott McKay c81981b74c
Enable using system certs on Android. (#543)
Disable in-memory cert store and loading certs from model.
  - TBD if it will be needed - need to know how reliable using the Android system certs will be and whether any scenarios need to have custom cert management.
2023-08-24 12:17:07 +10:00
Scott McKay 613c5c0c9d
Make Azure ops work on Android (#532)
* Use in-memory certs for curl on Android
  - could not get curl+openssl to be able to use the system ones
* Use static build for curl and openssl
  - smaller binary size

* Cache the x509 certificate store so we don't need to re-create for every request.
* Read certs from node attribute for now.

---------

Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2023-08-23 12:34:40 +10:00
Scott McKay eb5aef38fb
Make Azure op timeout an attribute (#539)
* Add `timeout_seconds` attribute for per-node timeout. Defaults to (arbitrary) value of 15 seconds.

* Fix datatype - onnx only has int64_t attributes.
Update test model to validate timeout is read correctly.
2023-08-23 08:21:37 +10:00
RandySheriffH d853d31fc1
Document azure ops. (#530)
* doc ops (#529)

* Try and make CIs pass with Azure ops enabled by default.
Misc. other cleanups

* Fix some CI issues.
Cleanups some bits and pieces.

* Fix a couple of issues.

* Fix arg to build.bat

* Increase warning in triton client build to make binskim happy (hopefully).

* Try patching the warning level in the triton grpc branch as well. Shouldn't matter but...

* Run triton patch command for windows as well.

* Add patch.exe directly so windows builds work.

* override auth gen for AOAI

* fix build

* switch to windows-static

* update model for azure chat

* document triton invoker

* doc chat endpoint

* document triton invoker

* format

* format

* format

---------

Co-authored-by: Scott McKay <Scott.McKay@microsoft.com>
Co-authored-by: Randy Shuai <rashuai@microsoft.com>

* address comments

* move doc sect

* typo

* typo

---------

Co-authored-by: Scott McKay <Scott.McKay@microsoft.com>
Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2023-08-17 14:12:02 -07:00
Scott McKay 486c2b6d79
Android fixes/improvements (#522)
* - Ensure we log an error message before throwing on Android
  - message in exception will be lost due to how the shared libraries are built (both onnxruntime and extensions use static libc++ so there are no shared exception types between them)
- support static or dynamic build of curl/openssl on android
  - TBD which we want to use.
- add infra for anything deriving from BaseKernel to log messages using the ORT logger
  - ensures messages from custom kernels end up in the same place as messages from ORT
2023-08-16 15:17:13 +10:00
Scott McKay 4842e9d6ae
Make CIs pass with Azure ops enabled by default. (#518)
* Make CIs pass with Azure ops enabled by default.
2023-08-12 17:45:59 +10:00
Scott McKay f77a3b8ad2
Update domain in triton test models (#519)
* Update domain in triton test models

* Use 'model_name' everywhere. Test py and model/op were inconsistent.
2023-08-12 12:40:21 +10:00
Scott McKay 2bde82fce9
Refactor setup for Azure ops. Add Android support. (#507)
* Refactor setup for Azure ops to try and make common things more re-usable, and for the actual ops to simply layer in the specific input/output constraints for that type of request.

Currently builds on Linux, Windows (x64 only) and Android

Android requires a manual pre-build of openssl and curl.

Linux requires a manual pre-install of openssl.

Windows currently only works for x64. Other targets need the triplet adjusted.

* Address PR comments

* Fix could of android build warnings.

* Update .gitignore to remove old path

* Fix build break from merge
2023-08-08 19:54:30 +10:00
RandySheriffH 911c2b2340
Assemble file name for azure audio (#508)
* fix name

* document pipeline param

---------

Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2023-08-03 16:21:29 -07:00
RandySheriffH 9e7f8e5b1d
Add UT for Azure Ops during packaging (#502)
* set before-test

* test cmd

* clean in yml

* restore toml

* add ut for triton endpoints

* reset working path

* rename suffix

* install ort

* pip install

* make env

* add extra env

* make executable

* set dir for linux

* add switch

* set env default

* skip tests

* simplify env

* clean env for official

---------

Co-authored-by: Randy Shuai <rashuai@microsoft.com>
Co-authored-by: Wenbing Li <10278425+wenbingl@users.noreply.github.com>
2023-08-02 17:01:09 -07:00
Scott McKay e448676a5e
Make kernel Compute method implementations const (#500)
* Nodes can be called concurrently and Compute needs to be stateless due to that.

Update the kernels to make Compute const.

* Fix test that uses ustring.h.

Would be better to not have duplicate declarations for GetTensorMutableDataString and FillTensorDataString in ustring.h and string_tensor.h.
2023-07-28 09:25:36 +10:00
RandySheriffH 06d5a8d781
Enable AzureOp packaging (#495)
* generalize azure op

* implement text endpoint

* add parameter to linux whl pipeline

* tune grammer

* 2

* 3

* 4

* 5

* 6

* 7

* 9

* 10

* 11

* 12

* 13

* 14

* 15

* 16

* cancel azure build in ci

* tune cpu tensor check

* try mem name

* address comments

* fix path

* clean env

---------

Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2023-07-20 14:16:08 -07:00
RandySheriffH 27132ced71
Implement azure invokers (#487)
* Implement azure invokers (#486)

* draft azure ops

* migrate triton client

* AzureAudioInvoker works

* triton client builds

* triton invoker works

* limit version

* restore setup.py

* limit ort version

* upgrade version

* pip install cmake

* add ut

* promote ort header version to 1.15.1

* register as cpu op

* limit triton invoker to 1.14 and newer

* remove test

* install rapidjson

* install dep

* sudo install

* install version script

* print err msg

* fix pipeline

* disable from web assembly

* install cmake

* Fix pipelines (#479)

* 1

* 2

* 3

* 4

* 5

* 6

* 7

* 8

* 9

* 10

* 11

* 12

* 13

* 14

* 15

* 16

* 17

* 18

* 19

* 20

* 21

* 22

* 23

* 24

* 25

* 26

* 27

* 28

* 29

* 30

* 31

* 32

* 33

* 34

* 35

* 36

* 37

* 38

* 39

* 40

* 41

* 42

* 43

* 44

* 45

* 46

* 47

* 47

* 48

* 49

* 50

* 51

* 52

* 53

* 54

* 55

* 56

* 57

* 58

* 59

* 60

* 61

* 62

* 62

* 63

* 64

* 65

* 66

* 67

* 68

* 69

* 70

* 71

* 72

* 73

* 74

* 75

* 76

* 77

* 78

* 79:

* 80:

---------

Co-authored-by: Randy Shuai <rashuai@microsoft.com>

* fix pipelines (#481)

* 1

* 2

* 3

* 4

* 5

* 6

* 7

* 8

* 9

* 10

* 11

* 12

* 13

* 14

* 15

* 16

* 17

* 18

* 19

* 20

* 21

* 22

* 23

* 24

* 25

* 26

* 27

* 28

* 29

* 30

* 31

* 32

* 33

* 34

* 35

* 36

* 37

* 38

* 39

* 40

* 41

* 42

* 43

* 44

* 45

* 46

* 47

* 47

* 48

* 49

* 50

* 51

* 52

* 53

* 54

* 55

* 56

* 57

* 58

* 59

* 60

* 61

* 62

* 62

* 63

* 64

* 65

* 66

* 67

* 68

* 69

* 70

* 71

* 72

* 73

* 74

* 75

* 76

* 77

* 78

* 79:

* 80:

* 81

* 82

* 83

* 84

* 85

* 86

---------

Co-authored-by: Randy Shuai <rashuai@microsoft.com>

* test as cpu op

* add ut

* add ut

* move cond

* tune ut

* tune pipeline

* promote to ort 141

* reset header version

* restore cmake

---------

Co-authored-by: Randy Shuai <rashuai@microsoft.com>

* trim changes

* revert req txt

---------

Co-authored-by: Randy Shuai <rashuai@microsoft.com>
2023-07-10 10:07:33 -07:00