
Gran Diccionari de la llengua catalana (Kindle LookUp Edition): Python code to edit the commercial ebook into a lookup dictionary for the Catalan language for use with a Kindle Paperwhite 2018 and other compatible ebook readers.


ptoche/GDLC


WARNING!

Project completed in July 2020. A dictionary was produced with Kindle Previewer (free Amazon software) and successfully tested on a few words. Then real life took over. I hope to come back to this project and polish it someday.

The code has not been optimized. Most of it was written over two days without prior planning, refactored over another two days, and polished a little in my spare time since. Much of the code does little more than help debug a few key functions used to create lookup references.

Overview

Python code to edit an ebook dictionary into a Kindle lookup dictionary that can be associated with the Catalan language. The edited source files may then be used to rebuild the dictionary.
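As a rough illustration of what "lookup" markup means here, the sketch below wraps a single entry in the `idx:` tags that, to my understanding of Amazon's Kindle publishing guidelines, make a headword searchable on the device. It is not the code from this repository: the function name `make_lookup_entry`, its parameters, and the sample entry are hypothetical, and it builds plain strings with the standard library rather than editing the ebook's HTML with BeautifulSoup.

```python
from html import escape


def make_lookup_entry(headword, definition_html, inflections=()):
    """Wrap one dictionary entry in Kindle lookup (idx) markup.

    Hypothetical helper for illustration only. `definition_html` is
    inserted as-is, so it is assumed to be well-formed HTML already.
    """
    # Optional inflected forms that should also resolve to this headword.
    infl = ""
    if inflections:
        forms = "".join(
            '<idx:iform value="{}"/>'.format(escape(form, quote=True))
            for form in inflections
        )
        infl = "<idx:infl>{}</idx:infl>".format(forms)

    # The value attribute of idx:orth is the string the reader looks up.
    orth = '<idx:orth value="{}"><b>{}</b>{}</idx:orth>'.format(
        escape(headword, quote=True), escape(headword), infl
    )
    return (
        '<idx:entry name="default" scriptable="yes" spell="yes">'
        "{}<p>{}</p></idx:entry>".format(orth, definition_html)
    )


if __name__ == "__main__":
    # Made-up sample entry, not taken from the dictionary.
    print(make_lookup_entry("casa", "Edifici destinat a habitació.", ["cases"]))
```

The real entries in the source files carry much more structure than this; the point is only that each headword ends up inside an `idx:entry` with an `idx:orth` value the Kindle can index.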

Reader: Kindle Paperwhite 2018 aka Paperwhite 4 (Lookup dictionaries are not supported by all Kindles).

Source: Gran Diccionari de la llengua catalana (Kindle Edition). Available for purchase at https://www.amazon.com/Gran-Diccionari-Llengua-Catalana-Catalan-ebook/dp/B00DZWFUG4/ for less than 10 U.S. dollars. A free sample may be requested from Amazon. As sold, this ebook does not support lookup (it is not listed as a dictionary under the Catalan language). The source was purchased in April 2020 and delivered in mobi format. The mobi file was converted to the azw format with calibre (https://calibre-ebook.com/). The azw file was then broken into its components using the KindleUnpack plugin (https://wiki.mobileread.com/wiki/KindleUnpack).

Software:

Installation

You need Python 3 and the BeautifulSoup library. As the files to be processed are large and the content is proprietary, online Python environments are probably not the appropriate tool. You must have a copy of the dictionary. A free sample may be found inside the GDLC directory.

Python: Make sure you have an appropriate parser library installed, e.g. the lxml parser. The html.parser parser ships with the Python standard library and needs no installation; lxml and html5lib are installed separately. Examples:

    pip install lxml         # pip, the Python package installer
    conda install html5lib   # Anaconda environment manager
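To check which of these parsers are usable before running the code, a small standard-library sketch like the following may help. The function name `available_parsers` is my own and not part of this repository; the parser names are the strings that BeautifulSoup accepts as its second argument.

```python
from importlib.util import find_spec

# Parser names as passed to BeautifulSoup(markup, parser_name), mapped to
# the module that must be importable for that parser to work.
PARSER_MODULES = {
    "html.parser": "html.parser",  # part of the standard library
    "lxml": "lxml",
    "html5lib": "html5lib",
}


def available_parsers():
    """Return the parser names whose backing module can be found."""
    return [
        parser
        for parser, module in PARSER_MODULES.items()
        if find_spec(module) is not None
    ]


if __name__ == "__main__":
    print("Parsers found:", ", ".join(available_parsers()))
```

If lxml is missing from the list, installing it with one of the commands above and rerunning the check should confirm that it is picked up.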

Getting Help

File an issue and let's see if I can help. Let me know if you can contribute. Do you already have a Catalan lookup dictionary? Let me know! Are you looking for one? Let me know.

History

My original plan was to make a lookup dictionary for Aranese and Occitan. I started with Catalan because I happen to own an electronic copy of the dictionary.

Suggestions for improvement welcome!

@author: Patrick Toche.
