
This project aims to classify emails into spam and ham emails


Darius2527/SpamClassifier-Project


SpamClassifier-Project

This project classifies emails as spam or ham (non-spam). We first preprocess the text, then convert it into vectors using two representations: bag of words (CountVectorizer) and TF-IDF (TfidfVectorizer). The vectors are then passed to a Naive Bayes classifier, and we measure the accuracy of its predictions.

A. Pre-processing

Removal of Special Characters

Removal of Numbers

Lowercase Conversion

Tokenization

Removal of Stop words

Stemming
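
The pre-processing steps above can be sketched as a single function. This is a minimal illustration using only the standard library, not the project's actual code: `STOP_WORDS`, `toy_stem`, and `preprocess` are hypothetical names, the stop word list is deliberately tiny, and the suffix stripper is a crude stand-in for a real stemmer such as a Porter stemmer.

```python
import re

# Hypothetical, deliberately small stop word list. A real pipeline would
# likely use a fuller list from an NLP library.
STOP_WORDS = {"a", "an", "the", "is", "are", "to", "of", "and", "in",
              "you", "your", "for"}


def toy_stem(token):
    """Strip a few common suffixes (assumed stand-in for a real stemmer)."""
    for suffix in ("ing", "ed", "es", "s"):
        # Keep at least three characters so short words are left alone.
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Apply the listed steps in order and return a list of stemmed tokens."""
    text = re.sub(r"[^A-Za-z\s]", " ", text)   # special characters and numbers
    text = text.lower()                        # lowercase conversion
    tokens = text.split()                      # whitespace tokenization
    tokens = [t for t in tokens if t not in STOP_WORDS]  # stop word removal
    return [toy_stem(t) for t in tokens]       # stemming


tokens = preprocess("WINNER!! You have won 1000 prizes, claiming is free")
```

The tokens can be joined back into a string with `" ".join(tokens)` before being handed to a vectorizer.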

B. Feature Extraction

Bag of words

TF-IDF
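
As a rough sketch of the two feature extraction options, the example below runs scikit-learn's `CountVectorizer` and `TfidfVectorizer` over a made-up three-message corpus. The corpus and variable names are assumptions for illustration only, and both vectorizers are left at their default settings.

```python
import numpy as np
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer

# Made-up, already pre-processed messages (illustrative only).
corpus = [
    "win free prize now",
    "meeting schedule for monday",
    "free free offer",
]

# Bag of words: each column is a vocabulary term, each cell a raw count.
bow = CountVectorizer()
X_bow = bow.fit_transform(corpus)

# TF-IDF: counts reweighted so terms common across messages count for less.
# With default settings each row is scaled to unit L2 norm.
tfidf = TfidfVectorizer()
X_tfidf = tfidf.fit_transform(corpus)

free_col = bow.vocabulary_["free"]        # column index of the term "free"
free_count = X_bow[2, free_col]           # count of "free" in the third message
row_norms = np.sqrt(X_tfidf.multiply(X_tfidf).sum(axis=1))
```

Both calls return sparse matrices with one row per message and one column per vocabulary term, so either can be passed to the classifier in the next step.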

C. Classification

Naive Bayes Algorithm (GaussianNB, MultinomialNB, BernoulliNB)
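
A minimal sketch of the classification step, assuming scikit-learn and a tiny hand-written dataset rather than the project's real email data. It fits two of the Naive Bayes variants on bag-of-words counts and scores them with `accuracy_score`; all texts, labels, and variable names here are illustrative.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.metrics import accuracy_score
from sklearn.naive_bayes import BernoulliNB, MultinomialNB

# Tiny made-up training and test sets (illustrative only).
train_texts = [
    "win free prize now", "free cash offer", "claim your free prize",
    "meeting at noon tomorrow", "project report attached",
    "lunch tomorrow with team",
]
train_labels = ["spam", "spam", "spam", "ham", "ham", "ham"]
test_texts = ["free prize offer", "team meeting tomorrow"]
test_labels = ["spam", "ham"]

# Fit the vocabulary on the training data only, then reuse it for the test data.
vectorizer = CountVectorizer()
X_train = vectorizer.fit_transform(train_texts)
X_test = vectorizer.transform(test_texts)

accuracies = {}
for model in (MultinomialNB(), BernoulliNB()):
    model.fit(X_train, train_labels)
    predictions = model.predict(X_test)
    accuracies[type(model).__name__] = accuracy_score(test_labels, predictions)

# GaussianNB expects dense input, so it would need X_train.toarray()
# instead of the sparse matrix used above.
```

On real data, accuracy should be computed on a held-out split that the classifier and the vectorizer never saw during fitting, as done here with the separate test messages.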
