Title | Component-enhanced Chinese character embeddings |
Authors | Li, Yanran Li, Wenjie Sun, Fei Li, Sujian |
Affiliation | Department of Computing, Hong Kong Polytechnic University, Hong Kong, Hong Kong Institute of Computing Technology, Chinese Academy of Sciences, China Key Laboratory of Computational Linguistics, Peking University, MOE, China |
Issue Date | 2015 |
Publisher | Conference on Empirical Methods in Natural Language Processing, EMNLP 2015 |
Citation | Conference on Empirical Methods in Natural Language Processing, EMNLP 2015.Lisbon, Portugal,2015/1/1. |
Abstract | Distributed word representations are very useful for capturing semantic information and have been successfully applied in a variety of NLP tasks, especially on English. In this work, we innovatively develop two component-enhanced Chinese character embedding models and their bigram extensions. Distinguished from English word embeddings, our models explore the compositions of Chinese characters, which often serve as semantic indictors inherently. The evaluations on both word similarity and text classification demonstrate the effectiveness of our models. ? 2015 Association for Computational Linguistics. |
URI | http://hdl.handle.net/20.500.11897/436966 |
ISSN | 9781941643327 |
Indexed | EI |
Appears in Collections: | 计算语言学教育部重点实验室 |