The Data Schema JSON throws an error if it includes non-ASCII characters.
In my case this is triggered by Swedish characters such as "Ö" and "Ä".
This is the line of code where it happens. I made a quick workaround by replacing it with:
descriptor = json.loads(value.encode('utf-8'))
I would like some feedback on how to approach this: should there be an option somewhere to declare the encoding of the file, or would you prefer a PR with this workaround?
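To make the encoding question concrete, here is a minimal sketch of a helper that accepts the schema as either text or bytes and decodes it with a declared encoding before parsing. The name `load_descriptor` and its `encoding` parameter are hypothetical and not part of ckanext-validation; the sketch assumes Python 3.

```python
import json


def load_descriptor(value, encoding="utf-8"):
    """Parse a schema descriptor given as text or bytes.

    Hypothetical helper for illustration only, not ckanext-validation code.
    """
    # Bytes are decoded explicitly so the encoding is never guessed.
    if isinstance(value, bytes):
        value = value.decode(encoding)
    return json.loads(value)


# Example schema with Swedish characters in the field names.
schema = '{"fields": [{"name": "Ödeshög"}, {"name": "Älvsjö"}]}'

from_text = load_descriptor(schema)
from_utf8 = load_descriptor(schema.encode("utf-8"))
from_latin1 = load_descriptor(schema.encode("latin-1"), encoding="latin-1")
```

All three calls should yield the same dictionary with the accented names intact, which is the behaviour an encoding option would need to guarantee.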
Please preserve this line to notify @amercader (maintainer of this repository)
JVickery-TBS pushed a commit to JVickery-TBS/ckanext-validation that referenced this issue on Jul 23, 2024.
I am facing a similar issue: ckanext-validation strips French accents from the header names. We previously implemented a fix in xloader (in loader.py), but ckanext-validation does not seem to go through xloader's load_csv() function, where the unidecode option can be checked.
Can you please advise where in ckanext-validation such a check can be made to preserve accents in the headers? Thanks!
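For reference, the kind of check being asked about could look roughly like the sketch below. It is not the xloader or ckanext-validation implementation: `normalize_header` and its `keep_accents` flag are hypothetical, and the standard-library `unicodedata` module stands in for `unidecode` so the example has no third-party dependency.

```python
import unicodedata


def normalize_header(name, keep_accents=True):
    """Return a header name, optionally with accents stripped.

    Hypothetical helper; the keep_accents flag plays the role of a
    configuration option that decides whether headers are transliterated.
    """
    if keep_accents:
        return name
    # Decompose accented characters, then drop the combining marks.
    decomposed = unicodedata.normalize("NFKD", name)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


headers = ["Prénom", "Région", "Âge"]
preserved = [normalize_header(h) for h in headers]
stripped = [normalize_header(h, keep_accents=False) for h in headers]
```

With the flag left at its default, the headers pass through unchanged; stripping only happens when it is explicitly requested.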