Fix error with pandas dataframes (#92)

* Fix error with pandas dataframes in _columns_different_distributions + add appropriate test
* Update documentation of classes to reflect that all data should be encoded and scaled.

---------

Signed-off-by: abigailt <abigailt@il.ibm.com>
This commit is contained in:
abigailgold 2024-02-13 08:56:12 -05:00 committed by GitHub
parent cb70ca10e6
commit e00535d120
No known key found for this signature in database
GPG key ID: B5690EEEBB952194
6 changed files with 28 additions and 30 deletions

View file

@ -71,9 +71,11 @@ class DatasetAttackMembershipClassification(DatasetAttackMembership):
config: DatasetAttackConfigMembershipClassification = DatasetAttackConfigMembershipClassification(),
dataset_name: str = DEFAULT_DATASET_NAME, categorical_features: list = None):
"""
:param original_data_members: A container for the training original samples and labels
:param original_data_non_members: A container for the holdout original samples and labels
:param synthetic_data: A container for the synthetic samples and labels
:param original_data_members: A container for the training original samples and labels. Should be encoded and
scaled.
:param original_data_non_members: A container for the holdout original samples and labels. Should be encoded
and scaled.
:param synthetic_data: A container for the synthetic samples and labels. Should be encoded and scaled.
:param config: Configuration parameters to guide the attack, optional
:param dataset_name: A name to identify this dataset, optional
"""