245 Commits

Author SHA1 Message Date
Karla Saur 8dda60023d
fixing non-ASCII char in long_description (#392) 2020-12-15 10:02:43 -08:00
Matteo Interlandi 7d0dee52a5
fix problem with doubles in PCA (#391) 2020-12-15 09:59:48 -08:00
masahi 71c951d31c
Introduce BatchContainer for batch by batch prediction use case (#377)
* introducing BatchContainer

* BatchContainer basic functionality done

* pass test_input to _convert

* introduce convert_batch API

* use convert_batch in the benchmark

* store _batch_size attribute

* test working

* run black, add concat output option, fix benchmark

* fix getattr

* fix operator benchmark

* support transform and decision function

* make sure input is tuple not list

* fix torch backend prediction

* begin fixing tests

* squeeze and ravel on onnx regression output

* all tests in test_extra_conf.py working

* restore BATCH_SIZE and k neighbor test

* fix onnxml test

* run black on test_extra_conf.py

* fix test_sklearn_normalizer_converter.py

* fix test_lightgbm_converter.py

* fixing more onnxml tests

* fixed remaining onnxml tests

* use format, fix pylint

* fix typo

* add document

* add missing doc

* fix typo

* doc update, remove unused stuff
2020-12-14 14:09:37 -08:00
Matteo Interlandi 779c1cc683
Fix imputation for tvm (#386)
* fix imputation for tvm
add tests for tvm and torch.jit
2020-12-14 09:43:58 -08:00
Karla Saur ea37a5e5b2
trying with newer numpy (#390) 2020-12-14 09:33:52 -08:00
Karla Saur 8f1b6166b9
trying to get pipeline working (#387)
* trying to get pipeline working

* updating string

* rtol atol

* pinning to macos version, reverting to 1 cache. removing rtol for now..

* macos 10.14

* rtol back
2020-12-12 08:13:02 -08:00
Matteo Interlandi 8802aaf426
fix f64 problem (#382)
* fix f64 problem
2020-12-09 13:18:46 -08:00
Matteo Interlandi 857287356c
Fix to linear ONNX (#379)
* fix problem with wrong multiclass in onnx linear model
2020-12-07 22:29:26 -08:00
Matteo Interlandi 63ea67f6db
Fix problem with ONNX models (#378)
* Fix problem with inputs of shape == 0
Use proper data types for scalers
Remove ONNX_INPUTS since is not used

* Add missing test
fix problem with N_features when no test input is passed

* add explicit case for concat
2020-12-07 19:23:00 -08:00
masahi 6cab8ee804
Refactor TVM compilation and prediction code (#376)
* refactor tvm compile and predict code

* run black
2020-12-04 09:50:12 -08:00
masahi f631a4df05
[Benchmark] Add batch_benchmark option to do benchmark on a single batch (not on a whole data) (#369)
* add xgb gpu support

* clean up predict

* fix setting predict_fn

* fix corner cases

* run black

* restore xgb batching

* fix max depth option ignored

* avoid allocating prediction array for xgb when batch bench

* fix onnx ml backend for batch benchmark

* create session once in onnxml backend

Co-authored-by: Masahiro Masuda <masahi129@gmail.com>
2020-12-03 14:23:03 -08:00
Karla Saur 9714f4c2b9
black for any python3 (ex:python3.7), lint FIRST (#375)
* black for any python3 (ex:python3.7), lint FIRST

* pulling up flake8 install
2020-12-03 09:20:06 -08:00
Karla Saur d30655eb25
adding notebook with tvm example and graph (#367)
* adding notebook with tvm example and graph

* adding installation link for tvm
2020-11-17 17:25:57 -08:00
Matteo Interlandi d17e278585
Add `to` to Pytorch container (#365)
* add `to` to pytorch container

* make to return self
2020-11-16 11:58:19 -08:00
Supun Nakandala dac08f4ff4
increasing the precision for Naive Bayes method (#360) 2020-11-09 10:36:47 -08:00
Karla Saur 50f1f5c598 Merge branch 'main' of github.com:microsoft/hummingbird into main 2020-11-04 14:19:56 -08:00
Karla Saur 469807fa8b moving refs from master to main 2020-11-04 14:19:51 -08:00
Karla Saur 937c87336b
trying to make pytest run on all except ubu w/python 3.7 (#359)
* trying to make pytest run on all except ubu w/python 3.7

* fix problem with concat types in pt < 1.6

* add coverage for 3.5

* remove unnecessary type conversions

Co-authored-by: Matteo Interlandi <mainterl@microsoft.com>
2020-11-03 18:12:17 -08:00
Karla Saur 49f078ebd1 moving refs from master to main 2020-11-03 14:55:10 -08:00
Matteo Interlandi 409c09a937
Add TVM backend (#236)
* add containers for onnx models

* add tvm_installed, initial work on topology

* add containers
add tvm backend to supported
add few tests

* fix type error in TVM
tree_trav and perf_tree_trav now work

* Add TVM_MAX_FUSE_DEPTH option
Add BATCH_SIZE option
Tree trav generate indexes based on batch size (if available)
TVM takes the max fuse depth configuration if set
2020-11-03 13:21:02 -08:00
Karla Saur db65391556
making readme links have raw paths so they can be displayed on Pypi (#357) 2020-10-30 21:46:26 -07:00
Karla Saur 2171044f8d
Binarizer for ONNX-ML (#353)
* starting to implement onnx-ml binarizer

* adding missing init

* adding remaining tests, fixing up

* fixing copypaste

* removing print stmt
2020-10-30 15:15:10 -07:00
Karla Saur acd159d9ff
bump to v0.1.0 (#355) 2020-10-30 11:51:07 -07:00
Matteo Interlandi 6e5f439083
Fix few issues with the benchmarks (#354)
* fix score scripts

* pandas datasets -> numpy
2020-10-30 08:35:40 -07:00
Karla Saur 8f72e07314
adding HB logo (#350)
* adding logo to readme

* bg color img

* resizing images readme

* adding logo to sphinx docs
2020-10-28 11:59:38 -07:00
Matteo Interlandi 37bc1f5df5
pin versions of install requires deps (#349)
* pin version of install requires deps

* bump pytorch version in pipeline
2020-10-28 10:19:33 -07:00
Supun Nakandala 0765c6907b
Issue 346 (#348)
* fixing issue with KBinsDiscretizer

* unifying the approach in sklearn and sparkml discretizer
2020-10-27 17:59:31 -07:00
Supun Nakandala e791641d4b
fixing issue with KBinsDiscretizer (#347) 2020-10-27 17:38:18 -07:00
Matteo Interlandi 6fe2443240
Fix for pytorch 1.7 (#345)
* fix error with threads on mac

* momentarily fix the build

* remove assertion on threading in container
2020-10-27 12:03:03 -07:00
Matteo Interlandi 0e969857f8
Add pipeline benchmark (#331)
* add benchmark code for trees

* add pipeline benchmark scripts
add readme for all benchmarks
2020-10-27 09:35:31 -07:00
Karla Saur 4394740bb6
Using Sphinx for docs/web (#343)
* initial attempt at sphinx

* moving from shell script to just single command in yml

* adding make command, deleting sh

* ignoring website/sphinx/_build/

* fixed missing hb prefix. testing out pipeline that will fail so hard

* fixing readme

* excluding web files from coverage. fixing deps

* inner makefile

* adding secret for gh-pages for deploy

* using makefile

* changing subfolder to push

* excluding web files from coverage. fixing deps

* fixing broken link, fixing doc tree depth

* purging pdoc
2020-10-26 13:19:01 -07:00
Matteo Interlandi 0a34f53856
Add scripts for the operators bench (#330)
* add benchmark code for trees

* device can be added directly to convert

* add code for tvm

* refactoring of the tree benchmark files

* add operators scripts
few fixes in the tree bench
2020-10-23 15:40:46 -07:00
Matteo Interlandi 02f310a3c9
merge fix (#342) 2020-10-23 11:27:49 -07:00
Karla Saur 38699d6c26
trying out only running coverage on a single of the nodes (#341) 2020-10-23 09:50:16 -07:00
Matteo Interlandi cb05a38fc8
Add benchmark scripts for trees (#328)
* add benchmark code for trees

* device can be added directly to convert

* add code for tvm

* refactoring of the tree benchmark files
2020-10-22 15:06:13 -07:00
Matteo Interlandi b775cbaee4
add support for onnx models with multiple inputs (#339)
* no need for adding input schemas to onnx anymore

* remove ONNX_INITIAL_TYPES

* add support for onnx models with multiple inputs

* add test case when onnx input data type is not supported

* add missing test
fix a bug in cast for onnx
2020-10-21 11:42:45 -07:00
Matteo Interlandi 5fd2cf4467
Add batching to output containers (#323) 2020-10-21 07:51:44 -07:00
Matteo Interlandi 554f5382f9
Remove requirements on providing input schemas for onnx models (#334)
* no need for adding input schemas to onnx anymore

* remove ONNX_INITIAL_TYPES
2020-10-20 16:10:51 -07:00
Matteo Interlandi 137d84a10b
move n_features from supported to internal (#336) 2020-10-19 20:37:20 -07:00
Matteo Interlandi 2ed2d8e799
Add capability of setting number of threads to the container (#319)
* add capability of setting number of threads to the container

* remove tuples from setup

* add psutil

* fix import problem

* fix xgb import

* add missing skipIf for onnx tests

* add back spark deps

* addressing Karla's comments
2020-10-18 16:09:57 -07:00
Matteo Interlandi 0aad3cee0a
remove unwanted symbols in docs (#332) 2020-10-16 17:14:03 -07:00
Matteo Interlandi 8813aa3f7a
Update the link to the paper (#329) 2020-10-15 15:19:53 -07:00
Karla Saur acf58228fd
removing html from commit hooks (#324) 2020-10-14 16:49:59 -07:00
Supun Nakandala e56ab57a87
Adding support for SparkML (#310)
* wip

* fixing tests

* fixing formatting issues

* adding support to pass pyspark dataframe as test data

* adding support for pipelinemodel

* named input selection

* adding support for named inputs in spark-ml

* fixing issue

* fixing name error

* end-to-end spark-ml pipeline working

* more tests

* addressing review comments

* adding pyspark to ci/cd

* fixing bug

* sparkml requires torch >= 1.6.0 which has automatic type promotion

* input selection happening inside pytorch container

* fixing pytorch version

* reusing Concat from pipeline_implementations.py

* fixing Concat issue
2020-10-12 17:01:20 -07:00
Karla Saur 28e13a6391
adding downloads badge from https://pepy.tech/project/hummingbird-ml (#313) 2020-10-05 17:21:08 -07:00
Supun Nakandala 4a1ffa830b
Issue-301: adding more distance metric types for kneighbors (#307)
* adding more distance metric types for kneighbors

* typo
2020-09-25 17:36:36 -07:00
Supun Nakandala 02b1a368e5
Issue #303: Adding support Sklearn KNeighbors Regressor (#304)
* wip

* reducing the minkowski power from 5 to 3 in order to avoid precision issues

* rebasing on top of 217 pull request
2020-09-21 14:44:27 -07:00
Supun Nakandala 826b5f087e
Issue #217: Adding support for sklearn KNeighborsClassifier (#296)
* wip

* knn classifier brute force algorithm with minkowski metric type

* fixing wrong probabilities issue

* addressing review comments

* fixing flake issues
2020-09-21 13:57:31 -07:00
Matteo Interlandi 26411642f0
Add support for pandas dataframes both at conversion time and inference time (#300)
* add support for pandas inputs
fix a couple of bugs
add support for double and long onnx input types

* makes OHE working over multiple inputs
add more tests
2020-09-17 09:02:51 -07:00
Karla Saur b8332d12fc
Checking for strings in pipeline col index (#295)
* commenting out lines of _get_column_index for 'string' as it's not supported

* fixing input

* fixing inputs (last was import)
2020-09-14 10:30:07 -07:00