* Initial version with first working test
* Make sure representative values in generalizations for 1-hot encoded features are consistent.
* Updated notebooks for one-hot encoded data
* Review comments
Signed-off-by: abigailt <abigailt@il.ibm.com>
* Support 1-hot encoded features in anonymization (#72)
* Fix anonymization adult notebook + new notebook to demonstrate anonymization on 1-hot encoded data
* Minimizer: no default encoder; if none is provided, data is passed to the model as is. Fix data type of representative values. Fix and add more tests.
Signed-off-by: abigailt <abigailt@il.ibm.com>
* Limit scikit-learn to versions between 0.22.2 and 1.1.3; remove use of deprecated load_boston().
* Set pytest configuration option to show test progress in detail.
* Change np.int to int according to DeprecationWarning
Signed-off-by: Maya Anderson <mayaa@il.ibm.com>
* Squashed commit of wrappers:
Wrapper minimizer
* apply dataset wrapper on minimizer
* apply changes on minimization notebook
* add black_box_access and unlimited_queries params
Dataset wrapper anonymizer
Add features_names to ArrayDataset and allow providing feature names in QI and Cat features, not just indexes
update notebooks
categorical features and QI passed by indexes
dataset includes feature names and is_pandas param
add pytorch Dataset
Remove redundant code.
Use data wrappers in model wrapper APIs.
add generic dataset components
Create initial version of wrappers for models
* Fix handling of categorical features
* add german credit notebook to showcase new features (minimize only some features and categorical features)
* add notebook to show minimization data on a regression problem
* Fixes related to corner cases in calculating generalizations
* Fix print
* Fix corner cases in transform as well
* Improve prints + bug fixes in calculation of feature to remove
* Notebook demonstrating AI minimization