Big_Data_Project_US-Airlines_Tweet_Processing_and_Analysis

Big data application of Machine Learning concepts for sentiment classification of US Airlines tweets. The focus is on the usage of pyspark libraries (ml-lib) on big data to solve a problem using Machine Learning algorithms and not about the choice of algorithm used in the ML model creation. It also involves data pre-processing using NLP techniques, cross-validation and parameter-grid builder.

Input and output files used have been attached in the repositories, the urls used are only for the sake of usage in the databricks cluster.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Input file		Input file
Output-metrics files		Output-metrics files
Big data airlines tweets analysis .ipynb		Big data airlines tweets analysis .ipynb
Project details.pdf		Project details.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Big_Data_Project_US-Airlines_Tweet_Processing_and_Analysis

About

Releases

Packages

Languages

yvgupta03/Big_Data_Project_US-Airlines_Tweet_Processing_and_Analysis

Folders and files

Latest commit

History

Repository files navigation

Big_Data_Project_US-Airlines_Tweet_Processing_and_Analysis

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages