TY - JOUR
T1 - Automated error detection and correction of Chinese characters in written essays based on weighted finite-state transducer
AU - Hao, Shudong
AU - Gao, Zongtian
AU - Zhang, Mingqing
AU - Xu, Yanyan
AU - Peng, Hengli
AU - Su, Kaile
AU - Ke, Dengfeng
PY - 2013
Y1 - 2013
N2 - Chinese text error detection and correction is widely applicable, but existing methods are not robust enough for industrial use. This paper proposes a new method based on a tri-gram-modeled Weighted Finite-State Transducer (WFST). The method integrates a confusing-character table, beam search, and A* search, and is evaluated on real test essays. Experiments show that the proposed method is effective, with a recall rate of 85.68%, a detection accuracy of 91.22%, and a correction accuracy of 87.30%.
AB - Chinese text error detection and correction is widely applicable, but existing methods are not robust enough for industrial use. This paper proposes a new method based on a tri-gram-modeled Weighted Finite-State Transducer (WFST). The method integrates a confusing-character table, beam search, and A* search, and is evaluated on real test essays. Experiments show that the proposed method is effective, with a recall rate of 85.68%, a detection accuracy of 91.22%, and a correction accuracy of 87.30%.
KW - Error correction
KW - Error detection
KW - N-gram language model
KW - Weighted Finite-State Transducer (WFST)
UR - http://www.scopus.com/inward/record.url?scp=84889568099&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84889568099&partnerID=8YFLogxK
U2 - 10.1109/ICDAR.2013.156
DO - 10.1109/ICDAR.2013.156
M3 - Conference article
AN - SCOPUS:84889568099
SN - 1520-5363
SP - 763
EP - 767
JO - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
JF - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
M1 - 6628721
T2 - 12th International Conference on Document Analysis and Recognition, ICDAR 2013
Y2 - 25 August 2013 through 28 August 2013
ER -