TY - GEN
T1 - Korean spacing by improving viterbi segmentation
AU - Hong, Gumwon
AU - Rim, Hae Chang
PY - 2007
Y1 - 2007
N2 - This paper presents a Korean spacing approach which employs an improved Viterbi segmentation model. Traditional Viterbi segmentation using the word imigram language model is simple and fast, but has two problems: data sparseness and impmper preference of fewer segments. To overcome these limitations, the segmentation model is extended by employing a split probability based on character bigram. Contextual information is selectively used for further resolution of spacing ambiguities without much increase of the complexity. Experimental results show that the extended model performs better than the traditional segmentation model. Futhennore, compared to the state of the art system, our approach achieves better efficiency in terms of processing time without losing significant accuracy.
AB - This paper presents a Korean spacing approach which employs an improved Viterbi segmentation model. Traditional Viterbi segmentation using the word imigram language model is simple and fast, but has two problems: data sparseness and impmper preference of fewer segments. To overcome these limitations, the segmentation model is extended by employing a split probability based on character bigram. Contextual information is selectively used for further resolution of spacing ambiguities without much increase of the complexity. Experimental results show that the extended model performs better than the traditional segmentation model. Futhennore, compared to the state of the art system, our approach achieves better efficiency in terms of processing time without losing significant accuracy.
UR - http://www.scopus.com/inward/record.url?scp=50049100975&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=50049100975&partnerID=8YFLogxK
U2 - 10.1109/ALPIT.2007.84
DO - 10.1109/ALPIT.2007.84
M3 - Conference contribution
AN - SCOPUS:50049100975
SN - 0769529305
SN - 9780769529301
T3 - Proceedings - ALPIT 2007 6th International Conference on Advanced Language Processing and Web Information Technology
SP - 75
EP - 80
BT - Proceedings - ALPIT 2007 6th International Conference on Advanced Language Processing and Web Information Technology
T2 - 6th International Conference on Advanced Language Processing and Web Information Technology, ALPIT 2007
Y2 - 22 August 2007 through 24 August 2007
ER -