近日,北外语料库团队协助西北师范大学武和平教授团队建成东干语语料库。
目前东干语单语及东干语-汉语平行语料库已部署在北外语料库团队CQPweb网站。
详见:http://114.251.154.212/cqp/
The Dungan Corpus, The Dungan-Chinese Corpus
账号密码皆为:test
北外语料库团队为濒危语言保护作出了自己的贡献。
东干语语料库的建成受到了著名汉学家梅维恒(Victor Mair)的关注。
https://languagelog.ldc.upenn.edu/nll/?p=51189
![](/__local/B/B5/4C/7348308480E153885D9E6587762_F7F473D3_17C54.png)
The Dungan Corpus (Dungan is spoken primarily in Kyrgyzstan, with speakers in Kazakhstan, Uzbekistan, and Russia as well. The Dungan ethnic group are the descendants of refugees from China who migrated west into Central Asia in the Qing Dynasty.) The Dungan language is Northwest Mandarin but written in Cyrillic script.