# Linkedin-sildeshare-scraper

A LinkedIn SlideShare web crawler that downloads files from SlideShare.

## Installation

- Python 2.7.*
- Beautiful Soup 4

  ```shell
  $ pip install bs4
  ```

- Selenium WebDriver

  ```shell
  $ pip install selenium
  ```

- Replace or update `chromedriver` to the latest version for your OS (see the ChromeDriver downloads page).

## Usage

1. Open `sharesilde_crawler.py` in a text editor.
2. Set the parameters (see the "scraper settings" screenshot):
   - `output_path`: the directory where downloaded files are saved. Use an ABSOLUTE path.
   - `start_point`: the page where scraping starts. A default is set for you, but you can change it.
   - `username`: your LinkedIn account. It is safer to register a separate test account in case LinkedIn blocks it (with Selenium that seems unlikely, but just in case).
   - `password`: your LinkedIn password.
   - `search_depth`: how deep to search. The crawl uses DFS and stops at this depth; you can also stop the program manually.
3. Run the program:

   ```shell
   $ python sharesilde_crawler.py
   ```
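For reference, the parameters from step 2 are plain variables near the top of the script. A minimal sketch of what the settings block might look like (variable names taken from the list above; every value here is a placeholder to replace with your own):

```python
# Scraper settings -- all values below are placeholders.
output_path = "/home/you/slideshare_downloads"  # ABSOLUTE path for saved files
start_point = "https://www.slideshare.net/"     # page where the crawl starts
username = "test-account@example.com"           # a throwaway LinkedIn account
password = "your-password"
search_depth = 3                                # DFS stops past this depth
```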

LinkedIn limits the number of downloads per account in each 24-hour window, so you may need several test accounts.
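The depth-limited DFS behind `search_depth` can be sketched as below. `get_links` and `download` are hypothetical stand-ins for the Selenium page-parsing and file-saving steps in the real script:

```python
def crawl(url, depth, search_depth, visited, get_links, download):
    """Depth-first crawl: download a page, then recurse into its links
    until search_depth is exceeded or the page was already visited."""
    if depth > search_depth or url in visited:
        return
    visited.add(url)
    download(url)
    for link in get_links(url):
        crawl(link, depth + 1, search_depth, visited, get_links, download)

# Tiny in-memory demo: a -> b -> c -> d, crawled to depth 2.
graph = {"a": ["b"], "b": ["c"], "c": ["d"], "d": []}
downloaded = []
crawl("a", 0, search_depth=2, visited=set(),
      get_links=graph.__getitem__, download=downloaded.append)
print(downloaded)  # → ['a', 'b', 'c']
```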

## Results

*(screenshot: results)*

The scraper automatically downloads files (résumés) into your output directory.

## ToDoList

- Headless mode doesn't work yet.
- Maybe add multiprocessing.
- Detect duplicate downloaded files.
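For the duplicate-detection item, one straightforward approach (not part of the current script) is to group files in `output_path` by a content hash; `find_duplicates` below is a hypothetical helper, not an existing function in this repo:

```python
import hashlib
from pathlib import Path

def file_digest(path, chunk_size=65536):
    """SHA-256 of a file, read in chunks so large decks stay out of memory."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

def find_duplicates(output_path):
    """Return groups of file names under output_path with identical content."""
    by_hash = {}
    for p in sorted(Path(output_path).iterdir()):
        if p.is_file():
            by_hash.setdefault(file_digest(p), []).append(p.name)
    return [names for names in by_hash.values() if len(names) > 1]
```

Hashing by content rather than comparing file names catches the common case where the same deck is downloaded twice under different names.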