Skip to content

musabbozkurt/data-collection-framework

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

92 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Prerequisites

  • Python 3.8+ should be installed
  • Docker should be installed
  • Pycharm IDE, IntelliJ IDEA, etc. can be installed to open the project (Recommended)
  • Update the following Twitter API credentials with your configs (DON'T SHARE THEM) in ConfigFile.config file
    • consumer_key =
    • consumer_secret =
    • access_key =
    • access_secret =

How to Run the Application

  1. Run via Terminal (OPTION 1)
    1. Run docker-compose up --build command
    2. cd data_collection_framework/
    3. python3 -m pip install -r requirements.txt
    4. chmod +x Server.py
    5. ./Server.py
  2. Run via Pycharm IDE, IntelliJ IDEA, etc. (OPTION 2)
    1. Run docker-compose up --build command via Terminal
    2. Right-click on the Server.py and click Run

How to Test the Application

  1. http://localhost:3000/
  2. Sign up
  3. Sign in

docker-compose.yml contains the followings


Other variables in ConfigFile.config file can be reviewed

  • word_list_for_streaming = ['python', 'java', '#java', 'javascript'] modify the list according to the needs
  • file_path_for_timeline_outputs = Path for timeline i.e. user path/timeline/
  • file_path_for_followers_outputs = Path for followers i.e user path/followers/
  • file_path_for_list_of_username = Path for userList file can be text file contains Twitter usernames
  • file_path_for_mongo = filepath/of/text/files/to/store/mongodb/*.txt
  • file_path_for_tokenization = File path contains text files
  • file_path_for_cross_val = File path must have at least 2 folders contains text files