Skip to content

midknight24/mangaCrawler

Repository files navigation

A Scrapy crawler that crawls manga from a pre-selected website.

Splash is used to render the javascript on the page such that Scrapy can get the image url, which is rendered on load in the browser

A few bash scripts to automate the process.

By default, it will crawl three times. Each time it checks against MySQL database and local file system to see if all pages of a chapter are downloaded. Next time it will retry downloading missing pages.

Made the crawlManga.sh into a crontab that runs every 1 hours to check for new chapters. Will receive wechat notification if found.

pyasn1 1.4 works, while pyasn 1.6 does not for unknown reason

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published