Wiki dump 20230720 has been removed from https://dumps.wikimedia.org #38

Open
junjiechen-chris opened this issue Jul 12, 2024 · 1 comment
Labels
bug Something isn't working

Comments


junjiechen-chris commented Jul 12, 2024

Hi,
Thanks for releasing such fantastic resources. Unfortunately, running the command

python download_data.py en_wiki --output_dir data/download/en_wiki --overwrite

fails with the following error:

FileNotFoundError: Couldn't find file at https://dumps.wikimedia.org/enwiki/20230720/dumpstatus.json

When I checked https://dumps.wikimedia.org, I found that the 20230720 dump has been removed for both the Japanese (jawiki) and English (enwiki) wikis.

junjiechen-chris added the bug label on Jul 12, 2024
junjiechen-chris changed the title from "Wiki dump 20230720 has been removed from https://dumps.wikimedia.org/jawiki/" to "Wiki dump 20230720 has been removed from https://dumps.wikimedia.org" on Jul 12, 2024

KeshavSingh29 commented Jul 24, 2024

I'm facing the same issue!
As a workaround, I updated the date parameter in download_data.py to 20240720.
It seems Wikimedia keeps publishing new dumps and removing older ones.
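
Since any hard-coded date will eventually be removed as well, it may help to look up which dump dates are currently available before editing download_data.py. The following is only a rough sketch, not part of this repository: the helper names (dumpstatus_url, list_dump_dates, fetch_index) are made up for illustration, and it assumes the index page at https://dumps.wikimedia.org/enwiki/ lists each dump as a link of the form href="YYYYMMDD/". The dumpstatus.json URL pattern is taken from the error message above.

```python
import re
import urllib.request
from typing import List

DUMPS_ROOT = "https://dumps.wikimedia.org"


def dumpstatus_url(wiki: str, date: str) -> str:
    """Build the status URL in the same shape as the one in the error message."""
    return f"{DUMPS_ROOT}/{wiki}/{date}/dumpstatus.json"


def list_dump_dates(index_html: str) -> List[str]:
    """Extract dump dates from an index page.

    Assumption: each dump directory appears as a link like href="20240720/".
    Non-date entries such as "latest/" or "../" are ignored.
    """
    return sorted(set(re.findall(r'href="(\d{8})/"', index_html)))


def fetch_index(wiki: str) -> str:
    """Download the directory listing for one wiki, e.g. "enwiki" or "jawiki"."""
    with urllib.request.urlopen(f"{DUMPS_ROOT}/{wiki}/", timeout=30) as resp:
        return resp.read().decode("utf-8", errors="replace")
```

A possible way to use it: call list_dump_dates(fetch_index("enwiki")), pick one of the returned dates, and set it as the date parameter in download_data.py. The most recent directory may still be in progress, so it is probably safer to open dumpstatus_url("enwiki", date) and confirm the dump is complete before relying on it.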
