(Korean language) Frequency of syllables and words in Korean
|There have been surveys on the frequency of the use of Korean words since the 1950, the latest based on 90s' novels published this year by National Academy of Korean Language, says a column in Hankyoreh. (See the bottom of the article for a list of articles on language, mostly concerned with "purification"; or bringing forth the native elements of Korean.) |
The most frequent word is -ida, "to be", which takes 3.34% of the all vocabulary frequency. After that prononun na (I), kôt (thing etc.), su (as in hal su itta, 'is able to do'), verbs itta (to be, to have), hada (to do), ôpta (to not be, to not have), toeda (to become), anida (to not be), katta (to be like). These are mostly words with a very wide meaning and usage. The highest position of a Chinese character (hanja) based word is yôja (女子) at 33; among the 100 most frequent, eight are hanja words.
The most frequent syllables are i, ta, nûn, ûl, ka, ko, kû, e, chi, ô (이 다 는 을 가 고 에 지 어) in that order. Of the sounds of language (or however that should be expressed), the most frequent are vowels a (ㅏ), i (ㅣ) and û (ㅡ), and consonants n (ㄴ) and k (ㄱ).It should be noted that the survey quoted in the column is based on literary works. What would be the results based on actual speech? How are words like ajôssi, ajumma, sajang(nim) placed on the frequency charts?
I tried to find a link to the survey at the Academy site, but didn't succeed. There's still tons of interesting material.Here's a notice of publication of a compiled Korean language purification (sunhwa) guidelines. There are some interesting examples of recommended expressions which have and have not taken root. Examples of the latter are computing words, which remain mostly English:
무른모 (← 소프트웨어)
My opinion on the purification? Use of pure Korean or more exact hanja expressions is a recommendable policy, but I don't expect that the association of "foreign" (read English-language) expressions with development and modernity is going to change soon, especially with administrative recommendations. Korean hasn't exactly been resistant towards non-Korean vocabulary throughout its history.
Categories at del.icio.us/hunjang: Koreanlanguage