Commit graph

21 commits

Author SHA1 Message Date
abigailt
b8d2535231 Fix handling of categorical features 2022-04-25 17:41:15 +03:00
abigailt
a37ff06df8 Squashed commit of the following:
commit d53818644e
Author: olasaadi <92303887+olasaadi@users.noreply.github.com>
Date:   Mon Mar 7 20:12:55 2022 +0200

    Build the dt on all features anon (#23)

    * add param to build the DT on all features and not just on QI
    * one-hot encoding only for categorical features

commit c47819a031
Author: abigailt <abigailt@il.ibm.com>
Date:   Wed Feb 23 19:40:11 2022 +0200

    Update docs

commit 7e2ce7fe96
Merge: 7fbd1e4 752871d
Author: abigailt <abigailt@il.ibm.com>
Date:   Wed Feb 23 19:26:44 2022 +0200

    Merge remote-tracking branch 'origin/main' into main

commit 7fbd1e4b90
Author: abigailt <abigailt@il.ibm.com>
Date:   Wed Feb 23 19:22:54 2022 +0200

    Update version and docs

commit 752871dd0c
Author: olasaadi <92303887+olasaadi@users.noreply.github.com>
Date:   Wed Feb 23 14:57:12 2022 +0200

    add minimization notebook (#22)

    * add german credit notebook to showcase new features (minimize only some features and categorical features)

    * add notebook to show minimization data on a regression problem
2022-04-25 17:39:30 +03:00
olasaadi
31e278fe60 fix bug 2022-03-28 00:07:54 +03:00
olasaadi
6172385d91 update docstring 2022-03-28 00:07:23 +03:00
olasaadi
3bc1341dad update docstring 2022-03-28 00:07:04 +03:00
olasaadi
b54f0a2382 fix tests 2022-03-24 19:35:26 +02:00
olasaadi
66c86dc595 fix notebook and add features_names to ArrayDataset
and allow providing features names in QI and Cat features not just indexes
2022-03-24 19:32:24 +02:00
olasaadi
06158c8508 update 2022-03-23 17:54:37 +02:00
olasaadi
312469212e fix docstring and fix assert in test 2022-03-22 13:59:28 +02:00
olasaadi
5b3476071f fix 2022-03-21 21:59:14 +02:00
olasaadi
8aa7bb8281 categorical features and QI passed by indexes
dataset include feature names and is_pandas param
2022-03-21 21:57:11 +02:00
olasaadi
3263f92bee anonymizer works with numpy and return numpy/pandas as original dataset 2022-03-19 18:06:10 +02:00
olasaadi
a2b560920f apply changes 2022-03-15 10:52:45 +02:00
olasaadi
b4eddabd37 apply changes after rebase with wrappers 2022-03-10 13:49:05 +02:00
olasaadi
6afb175d6f add dataset wrapper to docstring 2022-03-10 13:42:49 +02:00
olasaadi
eefad206cd fix bug and update test_model 2022-03-10 13:42:47 +02:00
olasaadi
1280b849f4 using dataset wrapper on anonymizer 2022-03-10 13:41:57 +02:00
olasaadi
cb9278ddb5
Support regression models (#19)
* support DecisionTreeRegressor

* support regression models

* Update membership_inference_dp_diabetes_reg.ipynb
2022-01-26 14:30:58 +02:00
olasaadi
a9a93c8a3a
Train just on qi (#15)
* QI updates
* update code to support training ML on QI features
* fix code so features that are not from QI should not be part of generalizations
and add description
* merging two branches, training on QI and on all data
* adding tests and asserts
2022-01-12 17:01:27 +02:00
abigailgold
bcc3d67ba4
Small fix + unified approach to numpy and pandas categorical data (#2) 2021-07-11 17:42:48 +03:00
abigailt
5665c2e79d Initial commit 2021-04-28 14:00:19 +03:00