sentiment-analysis-proj

NLP project This project is for the course 22933 introduction to NLP in the open university of israel This project was created by Elad Biran and Elad Shoham

##how to run.

To run the train please run the following:

./project.sh train <file> <number of rows to take>

./project.sh test <file> <number of rows to take>

##data

This project uses glove's database

please download it from https://nlp.stanford.edu/projects/glove/ (we used this version http://nlp.stanford.edu/data/glove.6B.zip) Please make sure that the glove file is in the same folder as the python script.

make sure the training or test files are in the same folder as the python script.

##returns

note that our test script will only output the scalar test loss (less relevant), and the accuracy

There will not be an output file with the predictions of each sentence, and there will be no prints of the recall, f1, precision and such.

In the code there are functions that show these parameters. If needed, one can use them to see the scores.

##Installation/Prerequisites

We use many imports that need installation before hand.

We used anaconda to install and run the imports.

here is the list.

pickle

numpy

conda install -c anaconda numpy

pandas

conda install -c anaconda pandas

tensorFlow

pip install tensorflow-gpu
or
conda install -c conda-forge tensorflow

keras

conda install -c conda-forge keras 
or
sudo pip install keras

sklearn

pip install -U scikit-learn
or
conda install scikit-learn

pyswarms

conda install -c auto pyswarm

please note that this is a heavy project for the computer. You might need a small corpus, a lot of running time, and a 64bit computer (for tensorFlow).

about the folders and files

in finals there are 4 folders.

old: our old model with a static alpha

new: our new model with dynamic alpha

baseline50000: the baseline trained on 50,000 sentences

baseline10000: the baseline trained on 10,000 sentences

sample_200_test.ft.txt - a file of 200 test samples to run tests on.

each folder has other files in it. they are the models and other data needed. This means you can run the test without training.

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
finals		finals
README.md		README.md
sample_200_test.ft.txt		sample_200_test.ft.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

sentiment-analysis-proj

about the folders and files

About

Releases

Packages

Contributors 2

Languages

x-lad/sentiment-analysis-proj

Folders and files

Latest commit

History

Repository files navigation

sentiment-analysis-proj

about the folders and files

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages