Foundations of statistical natural language processing
Foundations of statistical natural language processing
Algorithms for the Longest Common Subsequence Problem
Journal of the ACM (JACM)
MFCS '94 Proceedings of the 19th International Symposium on Mathematical Foundations of Computer Science 1994
Aligning Multiword Terms Using a Hybrid Approach
CICLing '02 Proceedings of the Third International Conference on Computational Linguistics and Intelligent Text Processing
Multiword Expressions: A Pain in the Neck for NLP
CICLing '02 Proceedings of the Third International Conference on Computational Linguistics and Intelligent Text Processing
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
A unified statistical model for the identification of English baseNP
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Character contiguity in N-gram-based word matching: the case for Arabic text searching
Information Processing and Management: an International Journal
A nonparametric method for extraction of candidate phrasal terms
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Multiword expressions in spoken language: An exploratory study on pronunciation variation
Computer Speech and Language
The design, implementation, and use of the Ngram statistics package
CICLing'03 Proceedings of the 4th international conference on Computational linguistics and intelligent text processing
Improving effectiveness of mutual information for substantival multiword expression extraction
Expert Systems with Applications: An International Journal
ICCOMP'09 Proceedings of the WSEAES 13th international conference on Computers
Hi-index | 0.00 |
This paper proposes a new approach for Multi-word Expression (MWE)extraction on the motivation of gene sequence alignment because textual sequence is similar to gene sequence in pattern analysis. Theory of Longest Common Subsequence (LCS) originates from computer science and has been established as affine gap model in Bioinformatics. We perform this developed LCS technique combined with linguistic criteria in MWE extraction. In comparison with traditional n-gram method, which is the major technique for MWE extraction, LCS approach is applied with great efficiency and performance guarantee. Experimental results show that LCS-based approach achieves better results than n-gram.