[Adapt] [Seminar] CSCD-IME: Correcting Spelling Errors Generated by Pinyin IME

陈浩平 13701970825 at 126.com
Wed Mar 22 12:15:24 CST 2023


Hi Adapters,



    Chinese spelling correction (CSC) aims to detect and correct spelling errors in text. Most Chinese text is typed with a pinyin input method (IME), so studying the spelling errors this process produces is both practical and valuable. According to the authors, however, no prior research has been dedicated to this scenario. In this paper, they first present CSCD-IME, a Chinese Spelling Correction Dataset for errors generated by pinyin IME, which contains 40,000 annotated sentences drawn from real posts by official media accounts on Sina Weibo. They then propose a novel method that automatically constructs large-scale, high-quality pseudo data by simulating input through a pinyin IME.
    Last time I gave an overall introduction to CSC tasks, and today's talk builds on it.
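
    For anyone who wants a concrete picture of what "simulating input through a pinyin IME" could look like, here is a minimal sketch. It is not the authors' method: the candidate table `CANDIDATES`, the function `simulate_ime_errors`, and the error rate `p` are all hypothetical names made up for illustration, and a real IME would rank candidates with a large vocabulary and a language model. The sketch only shows the basic idea of swapping a word for another word with the same pinyin and keeping the (noisy, correct) pair as training data.

```python
import random

# Hypothetical toy lexicon: toneless pinyin -> words an IME might offer.
# This table is invented for illustration; it is not from the paper.
CANDIDATES = {
    "quanli": ["权利", "权力"],
    "yijian": ["意见", "一件"],
    "zaijian": ["再见", "再建"],
}

# Reverse index: word -> its pinyin key in the toy lexicon.
WORD_TO_PINYIN = {
    word: pinyin for pinyin, words in CANDIDATES.items() for word in words
}


def simulate_ime_errors(words, rng, p=0.3):
    """Return (noisy_sentence, correct_sentence) for a segmented sentence.

    Each word found in the toy lexicon is, with probability p, replaced by
    a different candidate that shares its pinyin, imitating a user who
    picks the wrong entry from the IME candidate list.
    """
    noisy = []
    for word in words:
        if word in WORD_TO_PINYIN and rng.random() < p:
            alternatives = [
                c for c in CANDIDATES[WORD_TO_PINYIN[word]] if c != word
            ]
            noisy.append(rng.choice(alternatives))
        else:
            noisy.append(word)
    return "".join(noisy), "".join(words)


if __name__ == "__main__":
    rng = random.Random(0)
    # p=1.0 forces an error on every word that is in the lexicon.
    print(simulate_ime_errors(["他", "有", "权利"], rng, p=1.0))
```

    Because the replacement always has the same pinyin and the same number of characters in this toy table, the noisy and correct sentences stay aligned character by character, which is the form most CSC models expect.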



Presenter: apple
Time: Wed 4:00pm