Commit graph

18 commits

Author SHA1 Message Date
abigailgold
13a0567183
Make data minimization more consistent and performant (#83)
* Update requirements

* Update incompatible scipy version

* Reduce runtime of dataset assessment tests

* ncp is now a class that contains 3 values: fit_score, transform_score and generalizations_score so that it doesn't matter in what order the different methods are called, all calculated ncp scores are stored.
Generalizations can now be applied either from tree cells or from global generalizations struct depending on the value of generalize_using_transform. Representative values can also be computed from global generalizations.
Removing a feature from the generalization can also be applied in either mode.

* Compute generalizations with test data when possible (for computing better representatives).

* Externalize common test code to methods.
2023-08-21 18:39:15 +03:00
andersonm-ibm
e9a225501f
Limit scikit-learn version because of API changes (#81)
* Limit scikit-learn versions between 0.22.2 and 1.1.3, remove deprecated load_boston().

* Set pytest configuration option to show test progress in detail.

* Change np.int to int according to DeprecationWarning

Signed-off-by: Maya Anderson <mayaa@il.ibm.com>
2023-05-14 08:52:06 +03:00
abigailgold
8a9ef80146
Increase version to 0.2.0 (#74)
* Remove tensorflow dependency if not using keras model
* Remove xgboost dependency if not using xgboost model
* Documentation updates

Signed-off-by: abigailt <abigailt@il.ibm.com>
2023-05-08 12:50:55 +03:00
andersonm-ibm
782edabd58
Add dataset privacy risk assessment example notebook. (#73)
* Add dataset assessment notebook and reference to module from project README

Signed-off-by: Maya Anderson <mayaa@il.ibm.com>
2023-05-04 12:21:42 +03:00
Maya Anderson
dbb958f791 Merge pull request #71 from IBM/dataset_assessment
Add AI privacy Dataset assessment module with two attack implementations.

Signed-off-by: Maya Anderson <mayaa@il.ibm.com>
2023-03-20 14:21:29 +02:00
Maya Anderson
89bdcfc00e Prepare project for CI: cleanup dependencies, fix test data location, cleanup assert.
Signed-off-by: Maya Anderson <mayaa@il.ibm.com>
2022-12-20 16:00:36 +02:00
abigailt
dc5cc793ee Merge with main 2022-08-01 18:11:34 +03:00
abigailt
77a6e08c8e Keras regression support 2022-07-24 18:45:50 +03:00
Ron Shmelkin
521a2ccda9
add art to requirements.txt 2022-07-24 15:34:04 +03:00
Ron Shmelkin
c77e34e373
update pytorch wrapper to use torch loaders
fix tests
and dataset style
2022-07-24 14:31:47 +03:00
abigailgold
c6eb553a9f
Blackbox predict method (#43)
* Support output probabilities
* Support black box classifier with predict method
* Update requirements (security alert #1)
2022-06-30 18:23:53 +03:00
abigailgold
fd6be8e778
Documentation updates (#29)
* Bump version to 0.1.0 (breaking changes to some APIs)

* Update documentation

* Update requirements

* gitignore
2022-05-02 11:46:18 +03:00
abigailgold
014aed9670
Update requirements (#28)
* Bump version to 0.1.0 (breaking changes to some APIs)
* Update documentation
* Update requirements
2022-05-01 15:49:18 +03:00
abigailgold
2b2dab6bef
Data and Model wrappers (#26)
* Squashed commit of wrappers:

    Wrapper minimizer

    * apply dataset wrapper on minimizer
    * apply changes on minimization notebook
    * add black_box_access and unlimited_queries params

    Dataset wrapper anonymizer

    Add features_names to ArrayDataset
    and allow providing features names in QI and Cat features not just indexes

    update notebooks

    categorical features and QI passed by indexes
    dataset include feature names and is_pandas param

    add pytorch Dataset

    Remove redundant code.
    Use data wrappers in model wrapper APIs.

    add generic dataset components 

    Create initial version of wrappers for models

* Fix handling of categorical features
2022-04-27 12:33:27 +03:00
abigailt
3d82db80c4 Remove redundant code.
Use data wrappers in model wrapper APIs.
More typing.
2022-03-06 21:15:07 +02:00
abigailt
d1dded59ac Fix requirements 2021-07-12 16:02:54 +03:00
abigailgold
f2e1364b43
Add data minimization functionality to the ai-privacy-toolkit (#3)
* Fix directory issue when running tests for first time

* Initial version of data minimization

* Update version and documentation

* Fix documentation
2021-07-12 15:56:42 +03:00
abigailt
5665c2e79d Initial commit 2021-04-28 14:00:19 +03:00