dowhy

Граф коммитов

Автор	SHA1	Сообщение	Дата
Patrick Bloebaum	b7e921719a	Change default parameter of the ICC method This should decrease runtime without much loss of accuracy. Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-03-15 12:04:29 -07:00
Patrick Bloebaum	58f314c81c	Change default parameter for estimating continous KL divergence This should reduce variance in the estimation, increase runtime and, generally, accuracy. Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-03-15 12:04:16 -07:00
Patrick Bloebaum	b8fb120033	Revise early stopping convergence critera for Shapley value estimator - In the "early stopping" method, the runs were converging slowly when a Shapley value is exactly zero, because the change was always considered as 100%. Now, if the Shapley value between two consecutive runs are exactly zero, then it is considered as 'converging'. - Since the Shapley estimator supports the estimation of multiple vectors of Shapley values at the same time, this change also introduces some logic to keep track of the convergence for each vector individually. Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-03-15 12:03:30 -07:00
Patrick Bloebaum	df08efd2db	Revise gcm arrow strength - Speed up by reducing tolerance threshold - Increase default value for num_samples_conditional to improve accuracy - Fix bug when num_samples_conditional is set to a higher number than the actual number of given samples - Clarify tolerance parameter in docstring Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-03-15 12:03:14 -07:00
Patrick Bloebaum	cf8a0b8112	Revise gcm Shapley estimator - Shapley config parameter now clearly distinguishes between num_subset_sampling and num_permutations - Using a quasi-random sequence generator now to improve uniform sampling of permutation. This should improve convergence (i.e., more accurate estimations) - Fix bug in early stopping where one can get negative percentages and, thus, the estimator would falsely stop too early. - Early stopping criteria is now applied to the change of each Shapley value individually instead of the average change. This should ensure less variance in the results between two runs and more accurate results. - Change minimum change in percentage threshold to 0.05 by default for early stopping. This balances out the potential slower runtime from the change above. Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-03-14 09:42:16 -07:00
Patrick Bloebaum	f05b9d6315	Fix bug in gcm is_probability_matrix function Before, numeric arrays with the wrong dtype would cause an error when summing them up and using np.isclose. Now, the dtype is explicitly changed to float64. Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-03-14 09:40:59 -07:00
Patrick Bloebaum	7e370f9c64	Add and remove some unit tests in gcm module Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-03-07 11:10:43 -08:00
Patrick Bloebaum	7ca6de528a	Revise arrow strength user guide Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-03-07 08:14:39 -08:00
Patrick Bloebaum	d7b7cc65c5	Revise gcm user guide entry for counterfactuals Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-03-07 08:14:22 -08:00
Patrick Bloebaum	5d449be765	Revise attributing distributional changes user guide entry Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-03-07 08:14:00 -08:00
Patrick Bloebaum	12168ea7bd	Revise user guide entry for intrinsic causal influence Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-03-07 08:13:44 -08:00
Patrick Bloebaum	a1dcccbc80	Change one-hot-encoding behaviour in gcm module Before, one dimension is dropped in the encoding. However, this requires to assume that there are no unknown categories, since these would be mapped to a zero vector as well, which then coincides with one of the categories. Now, there are as many dimensions as categories, which allow to map unknown categories to a zero vector. Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-03-06 07:00:00 -08:00
Patrick Bloebaum	b1e5804904	Renaming gcm feature.py module to feature_relevance.py The new name 'feature_relevance' will help clarify the type of functionality to expect from the module. Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-03-06 06:59:12 -08:00
Patrick Bloebaum	9bb8678959	Add a new paramter to fit_and_compute for confidence intervals The new parameter allows to indicate whether existing causal mechanisms should be overridden by inferred ones based on the data. This would also include uncertainties of the model selection when computing confidence intervals. Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-03-06 06:58:23 -08:00
Patrick Bloebaum	8b12ba7f92	Fix an issue with kernel_based test in gcm when the data is almost constant Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-03-06 06:57:56 -08:00
Patrick Bloebaum	7db3fbc20c	Add unit test for gcm config Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-03-06 06:57:21 -08:00
allcontributors[bot]	9af8f8f570	docs: add MFreidank as a contributor for code (#869 ) * docs: update CONTRIBUTING.md [skip ci] * docs: update .all-contributorsrc [skip ci] --------- Co-authored-by: allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>	2023-02-14 12:25:07 -08:00
Moritz Freidank	23214a9850	Enhancement: warn about unobserved graph variables in `causal_model.identify_effect`. (#846 ) Enhancement: warn about unobserved graph variables Make `causal_model.CausalModel` constructor emit a `UserWarning` in case there are graph variables (common causes, instruments, effect modifiers) that are not contained in the observed data (`self._data`). Furthermore log additional information to logger level `logging.WARNING` to help debug typos in variable names and point out data variables that are not used in a given graph. Signed-off-by: Moritz Freidank <freidankm@gmail.com> Co-authored-by: Moritz Freidank <freidankm@gmail.com>	2023-02-14 12:21:28 -08:00
Patrick Bloebaum	2a2b3f4f7d	Rename remaining gcm unit tests Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-02-13 10:27:33 -08:00
dependabot[bot]	de5ddd54f6	Bump ipython from 8.9.0 to 8.10.0 (#863 ) Bumps [ipython](https://github.com/ipython/ipython) from 8.9.0 to 8.10.0. - [Release notes](https://github.com/ipython/ipython/releases) - [Commits](https://github.com/ipython/ipython/compare/8.9.0...8.10.0) --- updated-dependencies: - dependency-name: ipython dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-02-11 09:01:19 -08:00
Patrick Bloebaum	224b65e1a9	Sort names in gcm bar_plot function alphabetically Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-02-10 06:17:38 -08:00
Andres Morales	b21d2ffe06	Fix issue #820 - Estimate Effect fails with Econml DML estimator (#828 ) * Fix issue #820 - Estimate Effect fails with Econml DML estimator Signed-off-by: Andres Morales <andresmor@microsoft.com> * Add float support to parse_state Signed-off-by: Andres Morales <andresmor@microsoft.com> --------- Signed-off-by: Andres Morales <andresmor@microsoft.com>	2023-02-10 09:52:46 +05:30
Patrick Bloebaum	3b5989078a	Rename gcm stats unit tests Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-02-09 11:11:06 -08:00
Patrick Bloebaum	86e0b1c78d	Rename shapley unit tests Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-02-09 11:10:47 -08:00
Patrick Bloebaum	7f4640f368	Make approx indep test with random seed less flaky Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-02-09 11:09:08 -08:00
Patrick Bloebaum	2c20fab418	Rename regression based unit tests Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-02-09 05:47:50 -08:00
Patrick Bloebaum	e584ada049	Rename some gcm unit tests Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-02-09 05:47:34 -08:00
Patrick Bloebaum	3272a16018	Make unit test for auto assign linear model less flaky Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-02-09 05:47:14 -08:00
Patrick Bloebaum	c2bfea0349	Change np.int to np.int32 in shapley.py np.int has been deprecated. Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-02-08 06:50:53 -08:00
Patrick Bloebaum	6fd637638c	Set gcm polynomial regressor to 2 degree by default This should reduce potential overfitting issues. Signed-off-by: Patrick Bloebaum <bloebp@amazon.com>	2023-02-08 06:50:42 -08:00
allcontributors[bot]	3a89d5af05	docs: add Klesel as a contributor for code (#848 ) * docs: update CONTRIBUTING.md [skip ci] * docs: update .all-contributorsrc [skip ci] --------- Co-authored-by: allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>	2023-02-06 18:33:01 -08:00
Michael Klesel	1dd9429b58	Fix default values for simulation runs and typo related to alpha (#842 ) * fix: significance_level https://github.com/py-why/dowhy/issues/809 Signed-off-by: Michael Klesel <michael@klesel.info> * fix: default number of simulations for CI https://github.com/py-why/dowhy/issues/841 Signed-off-by: Michael Klesel <michael@klesel.info> * fix: significance_level https://github.com/py-why/dowhy/issues/809 Signed-off-by: Michael Klesel <michael@klesel.info> * fix: default number of simulations for CI https://github.com/py-why/dowhy/issues/841 Signed-off-by: Michael Klesel <michael@klesel.info> * remove: self._linear_model (not used in class) Signed-off-by: Michael Klesel <michael@klesel.info> --------- Signed-off-by: Michael Klesel <michael@klesel.info>	2023-02-06 18:29:18 -08:00
Peter Goetz	377b913571	Make EconML optional Signed-off-by: Peter Goetz <pego@amazon.com>	2023-02-06 14:29:09 +01:00
Michael Marien	f340d30727	fixed sphinx docstring format for causal model (#843 ) Signed-off-by: Michael <marien.mich@gmail.com>	2023-02-03 10:54:53 +05:30
Emre Kıcıman	dc64d8608e	Fixed broken URL to MSR Webinar in README.rst (#845 ) Fixed broken URL to MSR Webinar. Signed-off-by: Emre Kıcıman <emrek@microsoft.com>	2023-02-02 08:22:28 -08:00
Padarn Wilson	7bce030b6a	Bug fix: Missing use of arguments in estimate effect (#840 ) * use arguments in estimate effect that were not being passed to the functional version Signed-off-by: Padarn Wilson <padarn.wilson@grabtaxi.com> * fix formatting --------- Signed-off-by: Padarn Wilson <padarn.wilson@grabtaxi.com>	2023-02-02 11:10:41 +05:30
Peter Goetz	52c08ad90a	Remove rpy2 dependency It's not needed anymore. Signed-off-by: Peter Goetz <pego@amazon.com>	2023-02-01 22:26:18 +01:00
Peter Goetz	b68bdb83b4	Add docs deps in advanced-on-demand workflow This is required to have all necessary dependencies and it also makes this workflow consistent with the nightly tests. Signed-off-by: Peter Goetz <pego@amazon.com>	2023-01-27 17:26:03 +01:00
Peter Goetz	077c353914	Point user guide link to user guide instead of intro This is mainly for consistency, so we can get better aggregate metrics in Google Analytics. For users it probably makes no difference. Signed-off-by: Peter Goetz <pego@amazon.com>	2023-01-27 10:49:33 +01:00
Peter Goetz	ca340303e9	Pin Numpy version to 1.23.1 This makes it compatible with Numba version 0.56.4. The way the Numpy version was specified before, install version 1.24.0 which is incompatible, see also https://github.com/numba/numba/pull/8691. Numba is a transient dependency coming in through econml and sparse. Signed-off-by: Peter Goetz <pego@amazon.com>	2023-01-26 17:19:34 +01:00
dependabot[bot]	96a055ae44	Bump torch from 1.12.1 to 1.13.1 (#826 ) Bumps [torch](https://github.com/pytorch/pytorch) from 1.12.1 to 1.13.1. - [Release notes](https://github.com/pytorch/pytorch/releases) - [Changelog](https://github.com/pytorch/pytorch/blob/master/RELEASE.md) - [Commits](https://github.com/pytorch/pytorch/compare/v1.12.1...v1.13.1) --- updated-dependencies: - dependency-name: torch dependency-type: direct:development ... Signed-off-by: dependabot[bot] <support@github.com> Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-01-24 12:32:24 -08:00
Andres Morales	c3e1c75696	Enhancement/remove data from estimators (#811 ) * Initial commit * Add data to CausalEstimate (workaround for methods that depend on data) Signed-off-by: Andres Morales <andresmor@microsoft.com> * remove treatment_name from estimator object Signed-off-by: Andres Morales <andresmor@microsoft.com> * remove outcome_name from causal_estimators * Remove _outcome, _treatment, _outcome_name from estimator object * Restore notebooks Signed-off-by: Andres Morales <andresmor@microsoft.com> * Add removed code Signed-off-by: Andres Morales <andresmor@microsoft.com> * Restore notebook Signed-off-by: Andres Morales <andresmor@microsoft.com> * Remove treatment and outcome as parameters Signed-off-by: Andres Morales <andresmor@microsoft.com> * apply formatting Signed-off-by: Andres Morales <andresmor@microsoft.com> * Remove treatment and outcome parameters Signed-off-by: Andres Morales <andresmor@microsoft.com> * Apply formatting Signed-off-by: Andres Morales <andresmor@microsoft.com> Signed-off-by: Andres Morales <andresmor@microsoft.com>	2023-01-20 16:36:38 +05:30
allcontributors[bot]	325cf4e245	docs: add moberst as a contributor for code (#825 ) * docs: update CONTRIBUTING.md [skip ci] * docs: update .all-contributorsrc [skip ci] Co-authored-by: allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>	2023-01-18 19:16:54 +05:30
Michael Oberst	3ab0e2a3e0	[New refutation] Add OverRule for learning Boolean rules to describe support/overlap (#791 ) * feat: Add scaffolding for overrule, including basic test Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Update dependencies for overrule Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Add the full set of overrule code Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * style: Black styling on overrule code style: Black formatting on beam_search style: Black formatting for ruleset style: Black format utils style: Black formatting on load_process_data_BCS Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * fix: Change np.matmul to np.dot Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * fix: Update to appropriate matmul notation for latest CVXPY Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Remove unnecessary overrule code Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Minimum working example for OverRule Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * test: Minimum viable test for OverRule test: Fix test to work with new interface Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Print rules with option to recompute metrics Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Improve printing of results for refutation Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Pass in additional arguments to OverRule Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * docs: Add docstrings to ruleset.py docs: Docstrings for assess_overlap.py Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Update logger Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * docs: Add additional docstrings Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * fix: Path bug Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Add notebook with a toy example to demonstrate OverRule Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Add back using LP coeff by default Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * docs: Typing and docstrings docs: Consistent module docstrings docs: Add docstrings to overrule/utils.py docs: Add docstrings to overrule/BCS/beam_search.py docs: Add doctstrings and typing to load_process_data_BCS docs: Add docstrings and type hints to BCS/overlap_boolean_rule.py Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * docs: Fix and rename notebook for demonstrating overrule Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Use default_rng instead of setting a global seed in sample_Unif Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * fix: Use rng in place of numpy.random Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * fix: Replace list with numpy array to fix type error Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * docs: Fix type hint on ref_range Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Add option to only fit overlap or support rules Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * docs: Flesh out example notebook with parameters Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Add thresh_override as a argument for configuration Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Functional API for overrule has defaults Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * docs: Add API reference Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * ci: Fix support rule test Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * chore: Update poetry.lock for cxvpy dependency Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Remove `XGBClassifier` as default classifier To avoid dependency on `xgboost`, replace `XGBClassifier` as the default propensity score model with `RandomForestClassifier` from `sklearn` Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * fix: Fix logic so that when verbose=True, silent=False Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Remove seaborn dependency from overrule notebook Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Add option to pass random seed to support estimation Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * docs: Update notebook to use random seed on support estimation Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * fix: Typo Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Add PSID dataset (observational controls for Lalonde) Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * fix: Prevent fitting overlap rules if all samples in overlap region One of the assertions in OverlapBooleanRule will trip if all samples are in the overlap region. This commit adds a more informative error if the assertion gets tripped, and raises a more informative warning upstream if all samples are in the overlap region Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Add function that can be used with target_units `refute.filter_dataframe(df)` can be used to filter a dataframe to units that are in the overlap/support region. Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * docs: Clarify notebook intro Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Clarify how to read rules in refuter output Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * fix: Return a copy when filtering Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * docs: Update notebook with Lalonde example Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * docs: Add return to docstring Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * docs: Add citation for pricing problem Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * feat: Change progressbar error to warning Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> * chore: Update lockfile Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com> Signed-off-by: Michael Oberst <michael.k.oberst@gmail.com>	2023-01-18 19:16:04 +05:30
Michael Marien	c0de39078b	removed causal-ml from poetry setup instructions (#822 ) Signed-off-by: Michael <marien.mich@gmail.com> Signed-off-by: Michael <marien.mich@gmail.com>	2023-01-16 09:40:31 +05:30
Amit Sharma	c2938d5329	changed copyright to pywhy	2023-01-12 12:58:55 +01:00
allcontributors[bot]	56051a930a	docs: add Padarn as a contributor for code (#817 ) * docs: update CONTRIBUTING.md [skip ci] * docs: update .all-contributorsrc [skip ci] Co-authored-by: allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>	2023-01-10 21:23:46 +05:30
Padarn Wilson	a4f422acf3	Bootstrap p value fix (#806 ) * Fix bootstrap p-value Signed-off-by: Padarn Wilson <padarn.wilson@grabtaxi.com> * fix formatting Signed-off-by: Padarn Wilson <padarn.wilson@grabtaxi.com> Signed-off-by: Padarn Wilson <padarn.wilson@grabtaxi.com>	2023-01-10 20:00:28 +05:30
Jivat Neet	1ac8354e44	Adding causal prediction functionality using CACM algorithm with demo notebook (#729 ) * add causal prediction using cacm + demo notebook Signed-off-by: jivatneet <jivatneet@gmail.com> * Adding causal prediction code using CACM with demo notebook Signed-off-by: jivatneet <jivatneet@gmail.com> * addressing comments: fixing documentation in base_algorithm and cacm Signed-off-by: jivatneet <jivatneet@gmail.com> * addressing comments: fixing documentation in base_dataset Signed-off-by: jivatneet <jivatneet@gmail.com> * rename Algorithm class Signed-off-by: jivatneet <jivatneet@gmail.com> * updating base_dataset for format check Signed-off-by: jivatneet <jivatneet@gmail.com> * adding exception for optimizer; resolving list bug Signed-off-by: jivatneet <jivatneet@gmail.com> * adding docs for base algo Signed-off-by: jivatneet <jivatneet@gmail.com> * added torch to dev dependency * added torchlightning to dev dependency Signed-off-by: Amit Sharma <amit_sharma@live.com> * updated poetry.lock Signed-off-by: Amit Sharma <amit_sharma@live.com> * updated gpu to devices Signed-off-by: Amit Sharma <amit_sharma@live.com> Signed-off-by: jivatneet <jivatneet@gmail.com> Signed-off-by: Amit Sharma <amit_sharma@live.com> Co-authored-by: Amit Sharma <amit_sharma@live.com>	2022-12-22 19:21:12 +05:30
allcontributors[bot]	b6a3ae500c	docs: add eeulig as a contributor for code (#799 ) * docs: update CONTRIBUTING.md [skip ci] * docs: update .all-contributorsrc [skip ci] Co-authored-by: allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>	2022-12-21 11:35:08 +05:30

... 3 4 5 6 7 ...

1038 Коммитов Все ветки Поиск

1038 Коммитов

Все ветки