Fixed str to int conversion issue #484
Fixed str to int conversion issue Signed-off-by: padmajaborwankar <[email protected]>
@@ -10,7 +10,7 @@ def custom_preprocessing(df):
     """

     # Group age by decade
-    df['Age (decade)'] = df['age'].apply(lambda x: x//10*10)
+    df['Age (decade)'] = df['age'].astype(int).apply(lambda x: x//10*10)
Can we group integers, or is it only feasible to group strings?
The 'age' column had a string datatype, so it is cast to integer before being grouped by decade.
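To illustrate the point, here is a minimal sketch of why the cast matters when 'age' arrives as strings. The toy DataFrame is hypothetical and is not taken from the dataset in this PR; only the grouping expression comes from the diff.

```python
import pandas as pd

# Hypothetical toy data: ages stored as strings, as described in the comment above.
df = pd.DataFrame({'age': ['25', '37', '61']})

# Without the cast, floor division on a str raises TypeError.
try:
    df['age'].apply(lambda x: x // 10 * 10)
    raised = False
except TypeError:
    raised = True

# With the cast, each age is floored to the start of its decade.
df['Age (decade)'] = df['age'].astype(int).apply(lambda x: x // 10 * 10)
decades = df['Age (decade)'].tolist()

print(raised)   # True
print(decades)  # [20, 30, 60]
```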
It'd be great if you could link to issue #421 👍
Although I was unable to recreate the bug itself, the changes don't break anything (tested in an aif360==0.5.0 env), so it looks good to me.
P.S. When doing print(type(df['age'].values[0])) on the line before the comment "Group age by decade", the type is <class 'numpy.int64'>, which explains why I wasn't able to recreate the issue.
Regardless, it's good to explicitly do astype(int).
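The check described above can be sketched as follows, assuming a hypothetical toy column that is already integer-typed (the real dataset is not loaded here). In this case the explicit cast should not change the result, which is consistent with the change being safe.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data: ages already stored as 64-bit integers.
df = pd.DataFrame({'age': np.array([25, 37, 61], dtype=np.int64)})

# Same inspection as in the comment above.
first_type = type(df['age'].values[0])
print(first_type)

# Grouping with and without the explicit cast gives the same decades.
without_cast = df['age'].apply(lambda x: x // 10 * 10)
with_cast = df['age'].astype(int).apply(lambda x: x // 10 * 10)
same = without_cast.tolist() == with_cast.tolist()
print(same)
```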
Fixed str to int conversion issue
Signed-off-by: Padmaja Borwankar [email protected]