Reuters-21578 Corpus is a collection of documents consisting of news articles which appeared on Reuters newswire in 1987. The corpus is available in NLTK package in Python. Topic Modelling has been conducted on this Reuters-21578 corpus of news documents using Latent Dirichlet Allocation (LDA). The obtained topics have been visualized using proportional topics and words distributions, and also, topic word clouds. The file lda_topic_modelling.py contains the code for the project.
-
Notifications
You must be signed in to change notification settings - Fork 1
Reuters-21578 Corpus is a collection of documents consisting of news articles which appeared on Reuters newswire in 1987. The corpus is available in NLTK package in Python. Topic Modelling has been conducted on this Reuters-21578 corpus of news documents using Latent Dirichlet Allocation (LDA). The obtained topics have been visualized using prop…
nileshsachdeva/Reuters-21578_LDA_Topic_Modelling
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Reuters-21578 Corpus is a collection of documents consisting of news articles which appeared on Reuters newswire in 1987. The corpus is available in NLTK package in Python. Topic Modelling has been conducted on this Reuters-21578 corpus of news documents using Latent Dirichlet Allocation (LDA). The obtained topics have been visualized using prop…
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published