-
Notifications
You must be signed in to change notification settings - Fork 0
-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dirty Poetry #5
Comments
Love this! |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Poems made from the dirtiest OCR data available from the Library of Congress's Chronicling America dataset of historical Newspapers.
And by dirty, I mean OCR that looks like:
y tho?
Every data project starts with a long slog of cleaning up the data to be useful. Well, this project is conceived under the idea of
"What if we leaned into our bad data? What if we took it on it's own terms, and didn't judge it. Where might it take us?"
And the answer is: this absolute nonsense right here.
SHOW ME THE POEMS
They looks like this:
More poems, plus a short write up and the code at https://github.com/bibliotechy/dirty-poetry
The text was updated successfully, but these errors were encountered: