report-cleaning

Ferdowsi University of Mashhad Information Retrieval Indexing and Retrieval Models

Table of Contents

About The Project
- Built With
Getting Started
- Prerequisites
- Installation
Usage
Roadmap
Contributing
License
Contact
Acknowledgments

About The Project

preprocess text:

Normalization
Stemming
Lemmatization
Remove stop words
Remove punctuations

TD-IDF:

the frequency of words to determine how relevant those words are to a given document.

Libraries:

pandas
numpy
json
ast
math
scipy
threading
hazm
sklearn
google

Built With

Technologies and Tools Utilized in this Project

(back to top)

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!

Fork the Project
Create your Feature Branch (git checkout -b feature/AmazingFeature)
Commit your Changes (git commit -m 'Add some AmazingFeature')
Push to the Branch (git push origin feature/AmazingFeature)
Open a Pull Request

(back to top)

License

Distributed under the MIT License. See LICENSE for more information.

(back to top)

Contact

Javid Chaji - @JavidChaji - [email protected]

Project Link: https://github.com/JavidChaji/FUM-Information-Retrieval-Indexing-and-Retrieval-Models

(back to top)

Acknowledgments

(back to top)

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
Code/news_ir_processing		Code/news_ir_processing
Assignment_1401.pdf		Assignment_1401.pdf
Information_Retrival_Practice.ipynb		Information_Retrival_Practice.ipynb
LICENSE		LICENSE
News.json		News.json
Posting_Lists.csv		Posting_Lists.csv
README.md		README.md
TF-IDF_Model.py		TF-IDF_Model.py
main.txt		main.txt
pre.txt		pre.txt
summary.docx.pdf		summary.docx.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

report-cleaning

About The Project

Built With

Contributing

License

Contact

Acknowledgments

About

Releases

Packages

Contributors 2

Languages

License

JavidChaji/FUM-Information-Retrieval-Indexing-and-Retrieval-Models

Folders and files

Latest commit

History

Repository files navigation

report-cleaning

About The Project

Built With

Contributing

License

Contact

Acknowledgments

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages