Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixed str to int conversion issue #484

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

padmajaborwankar
Copy link

@padmajaborwankar padmajaborwankar commented Sep 22, 2023

Fixed str to int conversion issue
Signed-off-by: Padmaja Borwankar [email protected]

Fixed str to int conversion issue

Signed-off-by: padmajaborwankar <[email protected]>
@@ -10,7 +10,7 @@ def custom_preprocessing(df):
"""

# Group age by decade
df['Age (decade)'] = df['age'].apply(lambda x: x//10*10)
df['Age (decade)'] = df['age'].astype(int).apply(lambda x: x//10*10)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we group integers or it is only feasible to group str?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The 'age' had a String datatype due to which it was typecasted to integer and grouped as a decade.

@yehee
Copy link

yehee commented Sep 22, 2023

It'd be great if you can link to the issue #421 👍

@anupamamurthi anupamamurthi changed the title Signed-off-by: Padmaja Borwankar <[email protected]> Fixed str to int conversion issue Sep 27, 2023
@mnagired mnagired linked an issue Sep 27, 2023 that may be closed by this pull request
Copy link
Collaborator

@mnagired mnagired left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

although I was unable to recreate the bug itself, the changes don't break anything (tested on aif360==0.5.0 env) so it looks good to me

P.S. when doing print(type(df['age'].values[0])) on the line before the comment "Group age by decade", it looks like the type is <class 'numpy.int64'>, which explains why the issue wasn't able to be recreated.

Regardless, it's good to explicitly do astype(int)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Broken load_preproc_data_adult
4 participants