-
Notifications
You must be signed in to change notification settings - Fork 141
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cloudflare-protected site responds with 503 Service Temporarily Unavailable #205
Comments
You sure that the site is up? Also, are you sure that you aren't banned? |
I can still go into regular chrome.. no problem at all. |
weird. maybe the site requires JS and if you don't have it, bans you? otherwise idk |
@TheTechRobo please don't speculate like this in the issues, try to reproduce the issue yourself if you're interested in it. Anyway, I see
in the resulting WARC when trying to crawl this forum. cloudflare is known to block bots sending the wrong TLS fingerprint. It is probably picking up on grab-site's 'incorrect' TLS fingerprint, which does not match the browser it claims to be (Firefox). We might be able to fix that in ludios_wpull. |
@ivan Gotcha. 👍 |
I installed grab-site on ubuntu 20.04 using nix.
The command I use is 'grab-site https://www.forexfactory.com/forums --concurrency=1' .
Example.com and other sites completed crawling, but the 'https://www.forexfactory.com/' site failed to crawl. I've also tried with sub-addresses.
Below is the log.
The text was updated successfully, but these errors were encountered: