You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Feb 25, 2023. It is now read-only.
I'm not sure what format this is being outputted to, but when I view it in notepad++ it says it's UCS 2 LE BOM. When I convert it to UTF-8 nothing changes, and when I use SHIFT-JIS it gives Japanese characters but it's all nonsense.
I've attached an excerpt of my output, as well as another file showing the original entries from that excerpt for two headwords I was able to track down.
{
"heading": "かみかぜ【神風】(和英)",
"text": "かみかぜ【神風】\n(1) a divine wind;the timely rescue of Providence.(2) a Kamikaze;a suicide pilot (特攻隊員).\n‖神風運転手 a reckless driver.\n"
}
かみかぜ【神風】(和英)
かみかぜ【神風】
(1) a divine wind;the timely rescue of Providence.(2) a Kamikaze;a suicide pilot (特攻隊員).
‖神風運転手 a reckless driver.
I looked on Wikipedia and also here: https://stackoverflow.com/questions/1778619/encoding-conversion-from-jis-x-208-to-unicode
but I'm not sure what's the best thing to use. I'm not sure #1 what the right encoding is, and #2 what the character set for it is written as in C. If it's like SHIFT_JIS/SHIFT-JIS, I could try changing that, but I don't really want to build this thing (by the way, I'm on Windows).
Also, all my dictionaries give the same type of result. I tried 大辞林, 新明解, 大辞泉, 新辞林, 明鏡, etc.
Sign up for freeto subscribe to this conversation on GitHub.
Already have an account?
Sign in.
I'm not sure what format this is being outputted to, but when I view it in notepad++ it says it's UCS 2 LE BOM. When I convert it to UTF-8 nothing changes, and when I use SHIFT-JIS it gives Japanese characters but it's all nonsense.
I've attached an excerpt of my output, as well as another file showing the original entries from that excerpt for two headwords I was able to track down.
Excerpt:
Original Entries:
The text was updated successfully, but these errors were encountered: