IMDb Scrapper
stores a given number of movies from the Internet Movie Database as JSON.
- python 2.7
Download and install required libs and data:
pip install imdbpie
Collect and store the first 10,000 movies from the IMDb:
python imdb-api-scrapper.py 10000 save
Collect and store the movies 15,000 to 25,000 from the IMDb:
python imdb-api-scrapper.py --start 15000 10000 save
Collect and store the movies 30,000 to 35,000 from the IMDb and append them to the existing data in the movies.json:
python imdb-api-scrapper.py --start 30000 5000 --overwrite no save
Collect and store the movies and episodes 1 to 2,000 from the IMDb:
python imdb-api-scrapper.py 2000 --episodes yes save
Collect and store the most voted feature movies from 1950 to the current year from the IMDb:
python imdb-page-scrapper.py save
Collect and store the first 80,000 most voted feature movies from 1950 to the current year from the IMDb:
python imdb-page-scrapper.py 80000 save