onnxruntime-extensions

Commit graph

Author	SHA1	Message	Date
Wenbing Li	8153bc1a3a	Feature extraction C API for whipser model (#755) * Feature extraction C API for whipser model * Update the docs * Update the docs2 * refine the code * fix some issues * fix the Linux build * fix more data consistency issue * More code refinements	2024-07-11 11:20:36 -07:00
Wenbing Li	c71e2ae090	Refactor String and Audio operators with status-return prototype. (#576) * Refactor String and Audio operators with status-return prototype. * complete the whole text domain --------- Co-authored-by: Sayan Shaw <52221015+sayanshaw24@users.noreply.github.com>	2023-10-19 10:40:58 -07:00
Scott McKay	e448676a5e	Make kernel Compute method implementations const (#500) * Nodes can be called concurrently and Compute needs to be stateless due to that. Update the kernels to make Compute const. * Fix test that uses ustring.h. Would be better to not have duplicate declarations for GetTensorMutableDataString and FillTensorDataString in ustring.h and string_tensor.h.	2023-07-28 09:25:36 +10:00
Wenbing Li	62d8598b6b	Update whisper model test cases and e2e example (#496) * Update whisper model test cases and e2e example * fix unit test on windows * more refinement * utest fix	2023-07-21 15:27:02 -07:00
Wenbing Li	981cb049ff	Add a new API for building data processing graph from Huggingface transformers processor/tokenizer (#482) * initial checkins * test pass * basic impl * first unit test pass * merge error * refine a little bit * add more unit test * fix unit test * Fix the unit test. * add one more whisper audiodecoder test case * update the docs * More updates	2023-07-17 16:50:58 -07:00
Wenbing Li	bab1989644	refine audiodecoder with new api (#489) * refine audiodecoder with new api * update std::optional usage for macOS	2023-07-12 13:11:58 -07:00
Wenbing Li	1f0c76cefa	fix some prefast warnings (#467)	2023-06-07 16:26:45 -07:00
Wenbing Li	507358545d	improve lowpass filter with a higher order one. (#463) * improve lowpass filter with a higer order one. * Update test_sampling.cc * remove the unneccerary throw in the code	2023-06-01 14:12:05 -07:00
Wenbing Li	2fa0b710ea	Adding down-sampling and stereo mixing features for AudioDecoder (#420) * initial draft * second * third * polishing * fix the M_PI name in LINUX platform * fix bessel function issue * add a unit test case * fix the unit test name	2023-05-04 13:30:10 -07:00
Wenbing Li	997fa892c2	more code fixing related whisper models (#403)	2023-04-21 09:26:44 -07:00
Wenbing Li	b5dce955f0	Add an audio decoder custom op for whisper end-to-end processing (#385) * evaluate the audio decoder library * MP3 Decoder * rename it to test_audio_codec * add the audio decoder to whisper model * whisper end-to-end draft * fix the mp3 decoder * Running with ONNX models * Add more audio format supports * refine the end-to-end script * Update operators/audio/audio_decoder.hpp Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com> * Update operators/audio/audio_decoder.hpp Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com> * Update operators/audio/audio_decoder.hpp Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com> * some fixings of comments and more test cases. * changes for review comments. * Update audio_decoder.hpp * Update audio_decoder.hpp * code refinement * Update operators/audio/audio_decoder.hpp Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com> --------- Co-authored-by: Sayan Shaw <52221015+sayanshaw24@users.noreply.github.com> Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>	2023-04-11 14:47:10 -07:00

11 commits