mirror of https://github.com/IBM/ai-privacy-toolkit.git synced 2026-07-02 16:01:00 +02:00

A toolkit for tools and techniques related to the privacy and compliance of AI models. https://aip360.res.ibm.com

ai ai-models anonymization artificial-intelligence gdpr machine-learning ml mlops privacy python trustworthy-ai

Find a file

abigailgold 2b2dab6bef Data and Model wrappers (#26 ) * Squashed commit of wrappers: Wrapper minimizer * apply dataset wrapper on minimizer * apply changes on minimization notebook * add black_box_access and unlimited_queries params Dataset wrapper anonymizer Add features_names to ArrayDataset and allow providing features names in QI and Cat features not just indexes update notebooks categorical features and QI passed by indexes dataset include feature names and is_pandas param add pytorch Dataset Remove redundant code. Use data wrappers in model wrapper APIs. add generic dataset components Create initial version of wrappers for models * Fix handling of categorical features		2022-04-27 12:33:27 +03:00
apt	Data and Model wrappers (#26 )	2022-04-27 12:33:27 +03:00
datasets	Add data minimization functionality to the ai-privacy-toolkit (#3 )	2021-07-12 15:56:42 +03:00
docs	Update version and docs	2022-02-23 19:22:54 +02:00
notebooks	Data and Model wrappers (#26 )	2022-04-27 12:33:27 +03:00
tests	Data and Model wrappers (#26 )	2022-04-27 12:33:27 +03:00
.gitattributes	Ignore Jupyter Notebooks in git language detection	2021-04-28 16:34:02 +03:00
.readthedocs.yaml	Try to fix documentation	2021-06-07 17:01:21 +03:00
LICENSE	Initial commit	2021-04-28 06:25:00 -04:00
pyproject.toml	Files for pypi dist	2021-08-02 11:48:05 +03:00
README.md	Add link to Slack	2021-11-02 14:19:22 +02:00
requirements.txt	Data and Model wrappers (#26 )	2022-04-27 12:33:27 +03:00
setup.cfg	Update version and docs	2022-02-23 19:22:54 +02:00

README.md

ai-privacy-toolkit

A toolkit for tools and techniques related to the privacy and compliance of AI models.

The anonymization module contains methods for anonymizing ML model training data, so that when a model is retrained on the anonymized data, the model itself will also be considered anonymous. This may help exempt the model from different obligations and restrictions set out in data protection regulations such as GDPR, CCPA, etc.

The minimization module contains methods to help adhere to the data minimization principle in GDPR for ML models. It enables to reduce the amount of personal data needed to perform predictions with a machine learning model, while still enabling the model to make accurate predictions. This is done by by removing or generalizing some of the input features.

Official ai-privacy-toolkit documentation: https://ai-privacy-toolkit.readthedocs.io/en/latest/

Installation: pip install ai-privacy-toolkit

For more information or help using or improving the toolkit, please contact Abigail Goldsteen at abigailt@il.ibm.com, or join our Slack channel: https://aip360.mybluemix.net/community.

Related toolkits:

ai-minimization-toolkit - has been migrated into this toolkit.

differential-privacy-library: A general-purpose library for experimenting with, investigating and developing applications in, differential privacy.

adversarial-robustness-toolbox: A Python library for Machine Learning Security. Includes an attack module called inference that contains privacy attacks on ML models (membership inference, attribute inference, model inversion and database reconstruction) as well as a privacy metrics module that contains membership leakage metrics for ML models.