-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Turkey Ministry Of Interior Terrorist Wanted List #108
Comments
I wonder if this is behind a crawl protecting CDN; if so we'd need to manually capture it down. |
I don't know exactly, but it looks like it is so |
I've managed to manually get the data with:
However when trying to replicate the call with python requests I'm getting a
From urllib3/urllib3#2653 I learned that apparently this is what happens with OpenSSL 3.0 when connecting to legacy websites that disable renegotiation without signalling it correctly. Is saving a local copy of the data in the repo such as in lt_illegal_websites advisable or best to use a workaround such as urllib3/urllib3#2653 (comment)? |
yeah I think it's fine to enable the unsafe negotiation strategy, on the basis that we have another sanctions list that is http. I'll notify them of the issue and ask that they look into upgrading. Could you also add something like
in the crawl() function? |
Data URL
https://en.terorarananlar.pol.tr/tarananlar
Publisher
The Ministry Of Interior
Publisher country/territory code
No response
Type of data
Crime/Wanted/Suspected (Persons suspected or convicted of crimes and listed by official law enforcement)
Coverage region
region:Global
Can you tell us more?
No response
This is a suggestion or request
The text was updated successfully, but these errors were encountered: