Commit graph

11 Commits

Author SHA1 Message Date
Wenbing Li 8153bc1a3a
Feature extraction C API for whisper model (#755)
* Feature extraction C API for whisper model

* Update the docs

* Update the docs2

* refine the code

* fix some issues

* fix the Linux build

* fix more data consistency issue

* More code refinements
2024-07-11 11:20:36 -07:00
Wenbing Li c71e2ae090
Refactor String and Audio operators with status-return prototype. (#576)
* Refactor String and Audio operators with status-return prototype.

* complete the whole text domain

---------

Co-authored-by: Sayan Shaw <52221015+sayanshaw24@users.noreply.github.com>
2023-10-19 10:40:58 -07:00
Scott McKay e448676a5e
Make kernel Compute method implementations const (#500)
* Nodes can be called concurrently and Compute needs to be stateless due to that.

Update the kernels to make Compute const.

* Fix test that uses ustring.h.

Would be better to not have duplicate declarations for GetTensorMutableDataString and FillTensorDataString in ustring.h and string_tensor.h.
2023-07-28 09:25:36 +10:00
Wenbing Li 62d8598b6b
Update whisper model test cases and e2e example (#496)
* Update whisper model test cases and e2e example

* fix unit test on windows

* more refinement

* utest fix
2023-07-21 15:27:02 -07:00
Wenbing Li 981cb049ff
Add a new API for building data processing graph from Huggingface transformers processor/tokenizer (#482)
* initial checkins

* test pass

* basic impl

* first unit test pass

* merge error

* refine a little bit

* add more unit test

* fix unit test

* Fix the unit test.

* add one more whisper audiodecoder test case

* update the docs

* More updates
2023-07-17 16:50:58 -07:00
Wenbing Li bab1989644
refine audiodecoder with new api (#489)
* refine audiodecoder with new api

* update std::optional usage for macOS
2023-07-12 13:11:58 -07:00
Wenbing Li 1f0c76cefa
fix some prefast warnings (#467)
2023-06-07 16:26:45 -07:00
Wenbing Li 507358545d
improve lowpass filter with a higher order one. (#463)
* improve lowpass filter with a higher order one.

* Update test_sampling.cc

* remove the unnecessary throw in the code
2023-06-01 14:12:05 -07:00
Wenbing Li 2fa0b710ea
Adding down-sampling and stereo mixing features for AudioDecoder (#420)
* initial draft

* second

* third

* polishing

* fix the M_PI name in LINUX platform

* fix bessel function issue

* add a unit test case

* fix the unit test name
2023-05-04 13:30:10 -07:00
Wenbing Li 997fa892c2
more code fixing related whisper models (#403)
2023-04-21 09:26:44 -07:00
Wenbing Li b5dce955f0
Add an audio decoder custom op for whisper end-to-end processing (#385)
* evaluate the audio decoder library

* MP3 Decoder

* rename it to test_audio_codec

* add the audio decoder to whisper model

* whisper end-to-end draft

* fix the mp3 decoder

* Running with ONNX models

* Add more audio format supports

* refine the end-to-end script

* Update operators/audio/audio_decoder.hpp

Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>

* Update operators/audio/audio_decoder.hpp

Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>

* Update operators/audio/audio_decoder.hpp

Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>

* some fixings of comments and more test cases.

* changes for review comments.

* Update audio_decoder.hpp

* Update audio_decoder.hpp

* code refinement

* Update operators/audio/audio_decoder.hpp

Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>

---------

Co-authored-by: Sayan Shaw <52221015+sayanshaw24@users.noreply.github.com>
Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>
2023-04-11 14:47:10 -07:00