[1]
|
TAO T,AHAI C X.Mining comparable bilingual text corpora for cross-language information integration[C].Proceedings of the 2005 ACM SIGKDD International Conference on Knowledge on Knowledge and Datamining,Chicago,Illinois,USA,2005:691-696. |
[2]
|
SUN C N,ZHENG C,XIA Q S.Chinese text similarity computing based on LDA[J].Computer Technology and Development,2013,23(1):217-220. |
[3]
|
YANG Y,JIN F,KAMEL,et al.Survey of clustering validity evaluation[J].Application Research of Computers,2008,25(6):1630-1632. |
[4]
|
THUY Vu,Ai Ti Aw,ZHANG M.Feature-based method for document alignment in comparable news corpora[C].Proceedings of the 12th Conference of the European Chapter of the ACL.Athens Greece,2009:843-851. |
[5]
|
TALVENSAARI T,LAURIKKALA J,JARVELIN K,et al.Creating and exploiting a comparable corpus in cross language information retrieval[J].ACM Transactions on Information Systems,2007,25(1):322-334. |
[6]
|
OTERO P G,LOPEZ I G.Wikipedia as multilingual source of comparable corpora[C].Proceedings of the 3rd Workshop on BUCC,LREC2010,Malta,2010:21-25. |
[7]
|
JUDITA P.Identifying comparable corpora using LDA[C].Proceedings of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.Montr'eal,Canada,2012:558-562. |
[8]
|
ZHU Z D,LI M,CHEN L,et al.Building comparable corpora based on bilingual LDA model[C].Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics,Sofia,Bulgaria,2013:278-282. |
[9]
|
BLEI D M,NG A Y,JORDAN M I.Latent dirichlet allocation[J].Journal of Machine Learning Research,2003,3:993-1022. |
[10]
|
孫昌年,鄭誠,夏青松.基于LDA的中文文本相似度計算[J].計算機技術與發展,2013,23(1):217-220. |
[11]
|
LI R P,LIU T.Application research on BP artificial neural network and LM algorithms in the fixed assets investment performance evaluation[C].Proc of the 7th International Conference on Machine Learning and Cybernetics,Kunming,2008:12-15. |
[12]
|
JIN R,HAUPTMANN A G.Title generation for machine translated documents[C].IJCAI#x02019;01,Proc of the 17th International Joint Conference on Artificial Intelligence,2001:1229-1234. |
[13]
|
HU X H,ZHANG X D,LU C M,et al.Exploiting wikipedia as external knowledge for document clustering[C].Proc of the 15th ACM SIGKDD Int#x02019;l Conf on Knowledge Discovery and Data Mining,2009:389-396 doi: 10.1145/1557019.1557066. |
[14]
|
MURTAGH F.Clustering in massive data sets[M]//Handbook of Massive Data Sets.Springer US,2002:501-543. |
[15]
|
MUNTEANU D S,FRASER A M,MARCU D.Improved machine translation performance via parallel sentence extraction from comparable corpora[C]//Proceeding of Hlt-Naacl,2004:265-272. |
[16]
|
楊燕,靳蕃,KAMEL,et al.聚類有效性評價綜述[J].計算機應用研究,2008,25(6):1630-1632. |
[17]
|
CONRAD J G,AL-KOFAHI K,ZHAO Y,et al.Effective document clustering for large heterogeneous law firm collections[C]// Proceedings of the 10th International Conference on Artificial Intelligence and Law,ACM,2005:177-187. |