xenial-black•3mo ago
Creating Japanese parsing exceptions
Heyo!
I remember way back when, before the Migaku brand, that the Anki add-on developed by Yoga which is the equivalent of today's Migaku Anki add-on, used to have a feature that allowed you override parsing rules for Japanese when generating furigana and pitch accent syntax.
There are a few words that the current parsers gets consistently wrong for me, e.g. なる always gets generated with heiban pitch (my guess is that it parses it as 鳴る), when it should have atamadaka. Counters, months, years, and geography is something that it consistently fails at as well. 一人 often becomes 一[いち;o] 人[にん;a] instead of 一人[ひとり;n2], and so on.
City and prefecture names also gets "split up", generating e.g. 京都[きょうと;a] 府[ふ;o] 京都[きょうと;a] 市[し;o] instead of 京都府[きょうとふ;n3] 京都市[きょうとし;n3]
I'd like to override these as I encounter them.
I haven't been able to find the equivalent functionality in the current Migaku Anki add-on. Any plans on reintroducing this feature?
3 Replies
deep-jade•3mo ago
Can't you just do a mass find and replace? :ThinkingFace: @iGrapefruit
xenial-blackOP•3mo ago
That does work, but it does introduce the same manual steps from me every time I'm done creating cards for the day. It's not just every now and then - the parser forces atamadaka into common particles like これ, それ, から, etc. when they're most commonly used with heiban.
I know it's not the biggest deal, but to me it would be nice to be able to remove these manual steps, and just create custom permanent parsing overrides 😄
deep-jade•3mo ago
I'm honestly not even sure that ever was a thing but regardless definitely wouldn't be against it's reintroduction but it'll be a long while