xenial-blackβ’2mo ago
Example sentences for Cantonese are too long to be usable for language learning
This is an extremely common problem: I want to make a flash card for a word, and there are several example sentences, but they are all at a highly advanced reading level and far too long and complicated for a simple flash card. I will post some examples.
These "example sentences" (more like example paragraphs) really don't come anywhere close to being useful for language learning, because in order to understand them, I have to learn 10 other more complex words than the simple word I was trying to learn in the first place. I understand that these come from a web resource that Migaku is not responsible for. However, I would ask that you identify which web resources are problematic and remove them, or use programmatic means to identify which example sentences are too complicated to be helpful.
The examples I have posted screenshots of are not cherry-picked. They are the last flash cards I made, in order.




3 Replies
quickest-silverβ’2mo ago
@Rococo They're just pulled from Tatoeba. Fixing would require manual curation which isn't worth it rn
Do you know of another open source setence bank for Cantonese?
xenial-blackOPβ’2mo ago
I mentioned words.hk before. I hear they don't have an API but there are many example sentences even within the downloadable dictionary - they're included in the definition entry currently, which I'm sure could be fixed with a bit of parsing. That's actually what I normally use for example sentences currently. I just copy over from the words.hk definition (already in Migaku)
@HulK (ping reply me)ο½ππ° (8k)
quickest-silverβ’2mo ago
ya that's true
I'll work on it