Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Got error when change some data in WAERS BUKRS KTOSL PRCTR BSCHL HKONT fields #3

Open
stevenshisg opened this issue May 22, 2019 · 0 comments

Comments

@stevenshisg
Copy link

stevenshisg commented May 22, 2019

I have played around the fraud_dataset_v2.csv dataset. It was successful when I used the original dataset.

However, I tried to change data in some of value in WAERS BUKRS KTOSL PRCTR BSCHL HKONT fields as a new dataset and load trained model. It will throw different number of input data column error message.

I think the issue is caused by less data categories in one of the data fields. When perform one-hot encoding, the newly generated column is less than the original input data used by trained model.

My question:
If we want to evaluate abnormality on a new dataset with the same data fields / columns, is it required to re-train the model?

If no, do you have any (pre-processing) recommendation to fit new input data into trained model? For example, play around with one-hot encoding or other pre-processing approach?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant