Investigating Glyph Phonetic Information for Chinese Spell Checking: What Works and What's Next

2022
While pre-trained Chinese language models have demonstrated impressive performance on a wide range of NLP tasks, the Chinese Spell Checking (CSC) task remains a challenge. Previous research has explored using information such as glyphs and phonetics to improve the ability to distinguish misspelled characters, with good results. However, the generalization ability of these models is not well understood: it is unclear whether they incorporate glyph-phonetic information and, if so, whether this information is fully utilized. In this paper, we aim to better understand the role of glyph-phonetic information in the CSC task and suggest directions for improvement. Additionally, we propose a new, more challenging, and practical setting for testing the generalizability of CSC models. All code is made publicly available.
Reference Key
qiu2022investigating
Authors Xiaotian Zhang; Yanjun Zheng; Hang Yan; Xipeng Qiu
Journal arXiv
Year 2022