Skip to content

Assessing the impact of different pre-processing techniques for classifying the sentiment of movie reviews

Notifications You must be signed in to change notification settings

fusi3/natural_language_coursework

Repository files navigation

natural_language_coursework

This worked focused on attempting various kinds of preprocessing to see the impact that they may have on the classification of sentiment for movie reviews. The preprocessing was tested with Multilayer Perceptrons and Support Vector Machines. This included various n-gram levels through BoW and TF-IDF for stemming and lemming. The effect of Latent Semantic Analysis was also assessed, however it seemed that the best performance came through using stemming and uni gram tf-idf. Please read the other read me to run the notebooks.

About

Assessing the impact of different pre-processing techniques for classifying the sentiment of movie reviews

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published