A statistical approach to machine translation
Computational Linguistics
Information Theory and Reliable Communication
Information Theory and Reliable Communication
Robust learning, smoothing, and parameter tying on syntactic ambiguity resolution
Computational Linguistics
Improving statistical language model performance with automatically generated word hierarchies
Computational Linguistics
Learning bias and phonological-rule induction
Computational Linguistics
Textual context analysis for information retrieval
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
A study of probability kinematics in information retrieval
ACM Transactions on Information Systems (TOIS)
Distributional clustering of words for text classification
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
A Review of Statistical Language Processing Techniques
Artificial Intelligence Review
Applications of linear algebra in information retrieval and hypertext analysis
PODS '99 Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Similarity-Based Models of Word Cooccurrence Probabilities
Machine Learning - Special issue on natural language learning
Postprocessing of Recognized Strings Using Nonstationary Markovian Models
IEEE Transactions on Pattern Analysis and Machine Intelligence
Toward natural language interfaces for robotic agents: grounding linguistic meaning in sensors
AGENTS '00 Proceedings of the fourth international conference on Autonomous agents
Document clustering using word clusters via the information bottleneck method
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Using latent semantic analysis to find different names for the same entity in free text
Proceedings of the 4th international workshop on Web information and data management
Exploiting the Similarity of Non-Matching Terms at RetrievalTime
Information Retrieval
Toward a unified approach to statistical language modeling for Chinese
ACM Transactions on Asian Language Information Processing (TALIP)
The disambiguation of nominalizations
Computational Linguistics
Stochastic k-testable Tree Languages and Applications
ICGI '02 Proceedings of the 6th International Colloquium on Grammatical Inference: Algorithms and Applications
Implementing a Semantic Lexicon
ICCS '99 Proceedings of the 7th International Conference on Conceptual Structures: Standards and Practices
User-Centred Ontology Learning for Knowledge Management
NLDB '02 Proceedings of the 6th International Conference on Applications of Natural Language to Information Systems-Revised Papers
Tree k-Grammar Models for Natural Language Modelling and Parsing
Proceedings of the Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
Optimization of Association Word Knowledge Base through Genetic Algorithm
DaWaK 2000 Proceedings of the 4th International Conference on Data Warehousing and Knowledge Discovery
FASTY - A Multi-lingual Approach to Text Prediction
ICCHP '02 Proceedings of the 8th International Conference on Computers Helping People with Special Needs
Representation and Discovery of Vertical Patterns in Music
ICMAI '02 Proceedings of the Second International Conference on Music and Artificial Intelligence
Statistical Decision Making from Text and Dialogue Corpora for Effective Plan Recognition
TSD '02 Proceedings of the 5th International Conference on Text, Speech and Dialogue
Machine Learning in Human Language Technology
Machine Learning and Its Applications, Advanced Lectures
L&H Lexicography Toolkit for Machine Translation
AMTA '00 Proceedings of the 4th Conference of the Association for Machine Translation in the Americas on Envisioning Machine Translation in the Information Future
Word reordering and a dynamic programming beam search algorithm for statistical machine translation
Computational Linguistics
A neural probabilistic language model
The Journal of Machine Learning Research
Task adaptation in stochastic language model for Chinese homophone disambiguation
ACM Transactions on Asian Language Information Processing (TALIP)
The mathematics of statistical machine translation: parameter estimation
Computational Linguistics - Special issue on using large corpora: II
A class-based approach to word alignment
Computational Linguistics
Introduction to the special issue on word sense disambiguation: the state of the art
Computational Linguistics - Special issue on word sense disambiguation
Automatic word sense discrimination
Computational Linguistics - Special issue on word sense disambiguation
Generalizing case frames using a thesaurus and the MDL principle
Computational Linguistics
Word clustering and disambiguation based on co-occurrence data
Natural Language Engineering
Natural Language Engineering
Verb sense disambiguation based on dual distributional similarity
Natural Language Engineering
A reestimation algorithm for probabilistic dependency grammars
Natural Language Engineering
Topic-based mixture language modelling
Natural Language Engineering
A fast method for statistical grammar induction
Natural Language Engineering
Fast statistical parsing of noun phrases for document indexing
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Review of "Statistical language learning" by Eugene Charniak. The MIT Press 1993.
Computational Linguistics
Grouping words using statistical context
EACL '95 Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics
An efficient method for determining bilingual word classes
EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
New models for improving supertag disambiguation
EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
Similarity-based methods for word sense disambiguation
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Predicting the semantic orientation of adjectives
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Intonational boundaries, speech repairs and discourse markers: modeling spoken dialog
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Memory-based learning: using similarity for smoothing
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Word clustering and disambiguation based on co-occurrence data
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
A stochastic language model using dependency and its improvement by word clustering
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
MindNet: acquiring and structuring semantic information from text
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Modeling with structures in statistical machine translation
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Redundancy: helping semantic disambiguation
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Learning a syntagmatic and paradigmatic structure from language data with a bi-multigram model
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Automatic acquistion of language model based on head-dependent relation between words
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Contextual word similarity and estimation from sparse data
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Part-of-speech induction from scratch
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Raisins, sultanas, and currants: lexical classification and abstraction via context priming
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Statistical sense disambiguation with relatively small corpora using dictionary definitions
ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Bayesian grammar induction for language modeling
ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Statistical decision-tree models for parsing
ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Precise n-gram probabilities from stochastic context-free grammars
ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Similarity-based estimation of word cooccurrence probabilities
ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Morphological cues for lexical semantics
ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
An English to Korean transliteration model of extended Markov window
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
A rule-based approach to prepositional phrase attachment disambiguation
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2
Word class discovery for postprocessing Chinese handwriting recognition
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2
Clustering words with the MDL principle
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Concept clustering and knowledge integration from a children's dictionary
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
N-th order Ergodic Multigram HMM for modeling of languages without marked word boundaries
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Automatic extraction of semantic relations from specialized corpora
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
A statistical approach to the processing of metonymy
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
Redefining similarity in a thesaurus by using corpora
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
Hierarchical clustering of words
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
Finding aliases on the web using latent semantic analysis
Data & Knowledge Engineering - Special issue: WIDM 2002
Corpus structure, language models, and ad hoc information retrieval
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Multimodal model integration for sentence unit detection
Proceedings of the 6th international conference on Multimodal interfaces
Supervised learning for the legacy document conversion
Proceedings of the 2004 ACM symposium on Document engineering
Distributional term representations: an experimental comparison
Proceedings of the thirteenth ACM international conference on Information and knowledge management
Distributional similarity models: clustering vs. nearest neighbors
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Automatic construction of a hypernym-labeled noun hierarchy from text
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Word prediction using a clustered optimal binary search tree
Information Processing Letters
Evaluating and combining approaches to selectional preference acquisition
EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Combining distributional and morphological information for part of speech induction
EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Using evolutionary optimization to improve markov-based classification with limited training data
GECCO '05 Proceedings of the 7th annual conference on Genetic and evolutionary computation
Probabilistic Finite-State Machines-Part I
IEEE Transactions on Pattern Analysis and Machine Intelligence
Chinese named entity identification using class-based language model
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Wordform- and class-based prediction of the components of German nominal compounds in an AAC system
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
An unsupervised learning method for associative relationships between verb phrases
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Evaluating smoothing algorithms against plausibility judgements
ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
Active learning for statistical natural language parsing
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Exploring asymmetric clustering for statistical language modeling
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Frequency estimates for statistical word similarity measures
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Generalized algorithms for constructing statistical language models
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Building a large ontology for machine translation
HLT '93 Proceedings of the workshop on Human Language Technology
Hypothesizing word association from untagged text
HLT '93 Proceedings of the workshop on Human Language Technology
Augmenting lexicons automatically: clustering semantically related adjectives
HLT '93 Proceedings of the workshop on Human Language Technology
Semantic classes and syntactic ambiguity
HLT '93 Proceedings of the workshop on Human Language Technology
Scalable collaborative filtering using cluster-based smoothing
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Statistical Language Models for On-line Handwritten Sentence Recognition
ICDAR '05 Proceedings of the Eighth International Conference on Document Analysis and Recognition
Co-occurrence Retrieval: A Flexible Framework for Lexical Distributional Similarity
Computational Linguistics
Inducing syntactic categories by context distribution clustering
ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
Topic analysis using a finite mixture model
EMNLP '00 Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 13
Detection of language (model) errors
EMNLP '00 Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 13
Using HLT for acquiring, retrieving and publishing knowledge in AKT: position paper
HLTKM '01 Proceedings of the workshop on Human Language Technology and Knowledge Management - Volume 2001
Improvements in automatic thesaurus extraction
ULA '02 Proceedings of the ACL-02 workshop on Unsupervised lexical acquisition - Volume 9
Using eigenvectors of the bigram graph to infer morpheme identity
MPL '02 Proceedings of the ACL-02 workshop on Morphological and phonological learning - Volume 6
MPL '02 Proceedings of the ACL-02 workshop on Morphological and phonological learning - Volume 6
Ensemble methods for automatic thesaurus extraction
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Exploiting headword dependency and predictive clustering for language modeling
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Two-dimensional clustering for text categorization
COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
An efficient clustering algorithm for class-based language models
CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
Modeling of long distance context dependency in Chinese
SIGHAN '03 Proceedings of the second SIGHAN workshop on Chinese language processing - Volume 17
Extracting redundancy-aware top-k patterns
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
MM&Sec '06 Proceedings of the 8th workshop on Multimedia and security
Scalable search-based image annotation of personal images
MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Salience modeling based on non-verbal modalities for spoken language understanding
Proceedings of the 8th international conference on Multimodal interfaces
Melodic analysis with segment classes
Machine Learning
Journal of Biomedical Informatics - Special issue: Dialog systems for health communications
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
An end-to-end discriminative approach to machine translation
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Modeling of long distance context dependency
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Automatic learning of language model structure
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Toward unsupervised whole-corpus tagging
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Characterising measures of lexical distributional similarity
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Training neural network language models on very large corpora
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
A salience driven approach to robust input interpretation in multimodal conversational systems
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Hidden-variable models for discriminative reranking
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Paraphrasing for automatic evaluation
HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Improving statistical machine translation using shallow linguistic knowledge
Computer Speech and Language
A practical solution to the problem of automatic part-of-speech induction from text
ACLdemo '05 Proceedings of the ACL 2005 on Interactive poster and demonstration sessions
Detecting Irrelevant Subtrees to Improve Probabilistic Learning from Tree-structured Data
Fundamenta Informaticae - Advances in Mining Graphs, Trees and Sequences
Continuous space language models
Computer Speech and Language
Text mining techniques for patent analysis
Information Processing and Management: an International Journal
An intelligent human-expert forum system based on fuzzy information retrieval technique
Expert Systems with Applications: An International Journal
Examining the content load of part of speech blocks for information retrieval
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
ACM Transactions on Asian Language Information Processing (TALIP)
Automatic playlist composition in a dynamic music landscape
SADPI '07 Proceedings of the 2007 international workshop on Semantically aware document processing and indexing
Ontology learning: state of the art and open issues
Information Technology and Management
DynaSpeak: SRI's scalable speech recognizer for embedded and mobile systems
HLT '02 Proceedings of the second international conference on Human Language Technology Research
Video search re-ranking via multi-graph propagation
Proceedings of the 15th international conference on Multimedia
Automatic extraction of the multiple semantic and syntactic categories of words
AIAP'07 Proceedings of the 25th conference on Proceedings of the 25th IASTED International Multi-Conference: artificial intelligence and applications
A syntactically-based query reformulation technique for information retrieval
Information Processing and Management: an International Journal
Improving Speech Recognition and Understanding using Error-Corrective Reranking
ACM Transactions on Asian Language Information Processing (TALIP)
Advertising keyword suggestion based on concept hierarchy
WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Improving dialogue systems in a home automation environment
Proceedings of the 1st international conference on Ambient media and systems
The application of hidden Markov models in speech recognition
Foundations and Trends in Signal Processing
Similarity based smoothing in language modeling
Acta Cybernetica
Statistical machine translation
ACM Computing Surveys (CSUR)
Applications of corpus-based semantic similarity and word segmentation to database schema matching
The VLDB Journal — The International Journal on Very Large Data Bases
Unsupervised learning of multilingual short message service (SMS) dialect from noisy examples
Proceedings of the second workshop on Analytics for noisy unstructured text data
IEA/AIE '08 Proceedings of the 21st international conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems: New Frontiers in Applied Artificial Intelligence
Implement Web Learning System Based on Genetic Algorithm and Pervasive Agent Ontology
ICIC '08 Proceedings of the 4th international conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications - with Aspects of Artificial Intelligence
A Comparison of Language Models for Dialog Act Segmentation of Meeting Transcripts
TSD '08 Proceedings of the 11th international conference on Text, Speech and Dialogue
User language model for collaborative personalized search
ACM Transactions on Information Systems (TOIS)
Word Topic Models for Spoken Document Retrieval and Transcription
ACM Transactions on Asian Language Information Processing (TALIP)
Modeling Documents by Combining Semantic Concepts with Unsupervised Statistical Learning
ISWC '08 Proceedings of the 7th International Conference on The Semantic Web
User Interaction with Word Prediction: The Effects of Prediction Quality
ACM Transactions on Accessible Computing (TACCESS)
ACS'08 Proceedings of the 8th conference on Applied computer scince
Clusters, language models, and ad hoc information retrieval
ACM Transactions on Information Systems (TOIS)
Concept vector extraction from Wikipedia category network
Proceedings of the 3rd International Conference on Ubiquitous Information Management and Communication
Word Clustering for Collocation-Based Word Sense Disambiguation
CICLing '07 Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing
Part of Speech Based Term Weighting for Information Retrieval
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Client-centered multimedia content adaptation
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Improving Markov chain classification using string transformations and evolutionary search
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Implement web learning environment based on data mining
Knowledge-Based Systems
Multi-documents Automatic Abstracting based on text clustering and semantic analysis
Knowledge-Based Systems
Smoothing clickthrough data for web search ranking
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Categorizing local contexts as a step in grammatical category induction
CACLA '09 Proceedings of the EACL 2009 Workshop on Cognitive Aspects of Computational Language Acquisition
Improving product review search experiences on general search engines
Proceedings of the 11th International Conference on Electronic Commerce
Design challenges and misconceptions in named entity recognition
CoNLL '09 Proceedings of the Thirteenth Conference on Computational Natural Language Learning
Towards full automation of lexicon construction
CLS '04 Proceedings of the HLT-NAACL Workshop on Computational Lexical Semantics
A powerful and general approach to context exploitation in natural language processing
CLS '04 Proceedings of the HLT-NAACL Workshop on Computational Lexical Semantics
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Using hidden Markov random fields to combine distributional and pattern-based word clustering
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Automation of treebank annotation
NeMLaP3/CoNLL '98 Proceedings of the Joint Conferences on New Methods in Language Processing and Computational Natural Language Learning
Choosing a distance metric for automatic word categorization
NeMLaP3/CoNLL '98 Proceedings of the Joint Conferences on New Methods in Language Processing and Computational Natural Language Learning
Domain adaptation with structural correspondence learning
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Unsupervised learning of Bulgarian POS tags
MorphSlav '03 Proceedings of the 2003 EACL Workshop on Morphological Processing of Slavic Languages
Refining generative language models using discriminative learning
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Coarse-to-fine syntactic machine translation using language projections
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
A graph-theoretic model of lexical syntactic acquisition
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Multi-speaker language modeling
HLT-NAACL-Short '04 Proceedings of HLT-NAACL 2004: Short Papers
Factored neural language models
NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Semi-supervised sequence modeling with syntactic topic models
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Performance prediction for exponential language models
NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Shrinking exponential language models
NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
POS tagging of dialectal Arabic: a minimally supervised approach
Semitic '05 Proceedings of the ACL Workshop on Computational Approaches to Semitic Languages
English-to-Czech factored machine translation
StatMT '07 Proceedings of the Second Workshop on Statistical Machine Translation
Phrase-based and deep syntactic English-to-Czech statistical machine translation
StatMT '08 Proceedings of the Third Workshop on Statistical Machine Translation
Exploiting long distance collocational relations in predictive typing
TextEntry '03 Proceedings of the 2003 EACL Workshop on Language Modeling for Text Entry Methods
Approximate searching for distributional similarity
DeepLA '05 Proceedings of the ACL-SIGLEX Workshop on Deep Lexical Acquisition
UMSLLS '09 Proceedings of the Workshop on Unsupervised and Minimally Supervised Learning of Lexical Semantics
A word clustering approach for language model-based sentence retrieval in question answering systems
Proceedings of the 18th ACM conference on Information and knowledge management
Estimation of stochastic context-free grammars and their use as language models
Computer Speech and Language
Using semantic analysis to improve speech recognition performance
Computer Speech and Language
Using morphology and syntax together in unsupervised learning
PMHLA '05 Proceedings of the Workshop on Psychocomputational Models of Human Language Acquisition
Reducing the annotation effort for letter-to-phoneme conversion
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
A metric-based framework for automatic taxonomy induction
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Phrase clustering for discriminative learning
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Improving generative statistical parsing with semi-supervised word clustering
IWPT '09 Proceedings of the 11th International Conference on Parsing Technologies
IWPT '09 Proceedings of the 11th International Conference on Parsing Technologies
Semi-supervised semantic role labeling using the latent words language model
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
An empirical study of semi-supervised structured conditional models for dependency parsing
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
A joint language model with fine-grain syntactic tags
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Discriminative training of clustering functions: theory and experiments with entity identification
CONLL '05 Proceedings of the Ninth Conference on Computational Natural Language Learning
Morphology induction from term clusters
CONLL '05 Proceedings of the Ninth Conference on Computational Natural Language Learning
Word prediction using a clustered optimal binary search tree
Information Processing Letters
Word prediction and communication rate in AAC
Telehealth/AT '08 Proceedings of the IASTED International Conference on Telehealth/Assistive Technologies
Probabilistic logic with minimum perplexity: Application to language modeling
Pattern Recognition
Smoothing and compression with stochastic k-testable tree languages
Pattern Recognition
Extracting learning concepts from educational texts in intelligent tutoring systems automatically
Expert Systems with Applications: An International Journal
Generalized Expectation Criteria for Semi-Supervised Learning with Weakly Labeled Data
The Journal of Machine Learning Research
Document retrieval: shallow data, deep theories; historical reflections, potential directions
ECIR'03 Proceedings of the 25th European conference on IR research
Extending weighting models with a term quality measure
SPIRE'07 Proceedings of the 14th international conference on String processing and information retrieval
Topic-Dependent Language Model with Voting on Noun History
ACM Transactions on Asian Language Information Processing (TALIP)
Segment-based classes for language modeling within the field of CSR
CIARP'07 Proceedings of the Congress on pattern recognition 12th Iberoamerican conference on Progress in pattern recognition, image analysis and applications
A semantics-enhanced language model for unsupervised word sense disambiguation
CICLing'08 Proceedings of the 9th international conference on Computational linguistics and intelligent text processing
An estimate method of the minimum entropy of natural languages
CICLing'03 Proceedings of the 4th international conference on Computational linguistics and intelligent text processing
Optimizing language models for polarity classification
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Spectral clustering for Chinese word
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 1
A composite kernel for named entity recognition
Pattern Recognition Letters
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Word representations: a simple and general method for semi-supervised learning
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Improved unsupervised POS induction through prototype discovery
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Long distance bigram models applied to word clustering
Pattern Recognition
Unsupervised Part-of-Speech Tagging in the Large
Research on Language and Computation
SPMRL '10 Proceedings of the NAACL HLT 2010 First Workshop on Statistical Parsing of Morphologically-Rich Languages
Improved unsupervised POS induction using intrinsic clustering quality and a Zipfian constraint
CoNLL '10 Proceedings of the Fourteenth Conference on Computational Natural Language Learning
Online entropy-based model of lexical category acquisition
CoNLL '10 Proceedings of the Fourteenth Conference on Computational Natural Language Learning
Two decades of unsupervised POS induction: how far have we come?
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Uptraining for accurate deterministic question parsing
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
The necessity of combining adaptation methods
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Training continuous space language models: some practical issues
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Discovery of numerous specific topics via term co-occurrence analysis
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Exploiting background knowledge for relation extraction
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Structuring ordered nominal data for event sequence discovery
Proceedings of the international conference on Multimedia
Hierarchical Bayesian language models for conversational speech recognition
IEEE Transactions on Audio, Speech, and Language Processing
Clustering product features for opinion mining
Proceedings of the fourth ACM international conference on Web search and data mining
Update Legal Documents Using Hierarchical Ranking Models and Word Clustering
Proceedings of the 2010 conference on Legal Knowledge and Information Systems: JURIX 2010: The Twenty-Third Annual Conference
DiG: a task-based approach to product search
Proceedings of the 16th international conference on Intelligent user interfaces
Benchmarking of statistical dependency parsers for French
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Covariance in Unsupervised Learning of Probabilistic Grammars
The Journal of Machine Learning Research
An information-theoretic, vector-space-model approach to cross-language information retrieval*
Natural Language Engineering
Editorial: Mining business process variants: Challenges, scenarios, algorithms
Data & Knowledge Engineering
Recognizing named entities in tweets
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Semi-supervised relation extraction with large-scale word clustering
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Exploiting syntactico-semantic structures for relation extraction
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
A hierarchical Pitman-Yor process HMM for unsupervised part of speech induction
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Integrating history-length interpolation and classes in language modeling
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Deterministic statistical mapping of sentences to underspecified semantics
IWCS '11 Proceedings of the Ninth International Conference on Computational Semantics
Improved modeling of out-of-vocabulary words using morphological classes
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Language models as representations for weakly-supervised NLP tasks
CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Adapting text instead of the model: an open domain approach
CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Gauging the internet doctor: ranking medical claims based on community knowledge
Proceedings of the 2011 workshop on Data mining for medicine and healthcare
Improving subtree-based question classification classifiers with word-cluster models
NLDB'11 Proceedings of the 16th international conference on Natural language processing and information systems
Code completion of multiple keywords from abbreviated input
Automated Software Engineering
Controlling complexity in part-of-speech induction
Journal of Artificial Intelligence Research
Passage retrieval for incorporating global evidence in sequence labeling
Proceedings of the 20th ACM international conference on Information and knowledge management
Trained trigger language model for sentence retrieval in QA: bridging the vocabulary gap
Proceedings of the 20th ACM international conference on Information and knowledge management
Toward multimodal situated analysis
ICMI '11 Proceedings of the 13th international conference on multimodal interfaces
Statistical modelling in continuous speech recognition (CSR)
UAI'01 Proceedings of the Seventeenth conference on Uncertainty in artificial intelligence
Computational Linguistics
Natural Language Processing (Almost) from Scratch
The Journal of Machine Learning Research
An experimental study of boosting model classifiers for chinese text categorization
ICADL'04 Proceedings of the 7th international Conference on Digital Libraries: international collaboration and cross-fertilization
Statistical and linguistic clustering for language modeling in ASR
CIARP'05 Proceedings of the 10th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis and Applications
Intelligent data recognition of DNA sequences using statistical models
PReMI'05 Proceedings of the First international conference on Pattern Recognition and Machine Intelligence
An english-hindi statistical machine translation system
IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
Statistical behavior analysis of smoothing methods for language models of mandarin data sets
AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
Traffic models for community-based ranking and navigation
WINE'05 Proceedings of the First international conference on Internet and Network Economics
DNA sequence identification by statistics-based models
FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II
WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
The CMU-ARK German-English translation system
WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Factored translation with unsupervised word clusters
WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Generative models of monolingual and bilingual gappy patterns
WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
A similarity-based approach to data sparseness problem of chinese language modeling
MICAI'05 Proceedings of the 4th Mexican international conference on Advances in Artificial Intelligence
Quasi-synchronous phrase dependency grammars for machine translation
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
A Bayesian mixture model for part-of-speech induction using multiple features
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
A fast, accurate, non-projective, semantically-enriched parser
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Unsupervised dependency parsing without gold part-of-speech tags
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Cross-cutting models of lexical semantics
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Named entity recognition in tweets: an experimental study
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Empirical study of utilizing morph-syntactic information in SMT
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Principles of non-stationary hidden markov model and its applications to sequence labeling task
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Rethinking language models within the framework of dynamic bayesian networks
AI'05 Proceedings of the 18th Canadian Society conference on Advances in Artificial Intelligence
Are morphosyntactic taggers suitable to improve automatic transcription?
TSD'06 Proceedings of the 9th international conference on Text, Speech and Dialogue
Signature recognition methods for identifying influenza sequences
AIME'05 Proceedings of the 10th conference on Artificial Intelligence in Medicine
On the assessment of text corpora
NLDB'09 Proceedings of the 14th international conference on Applications of Natural Language to Information Systems
Is the contextual information relevant in text clustering by compression?
Expert Systems with Applications: An International Journal
FSMNLP '11 Proceedings of the 9th International Workshop on Finite State Methods and Natural Language Processing
A word clustering approach to domain adaptation: effective parsing of biomedical texts
IWPT '11 Proceedings of the 12th International Conference on Parsing Technologies
French parsing enhanced with a word clustering method based on a syntactic lexicon
SPMRL '11 Proceedings of the Second Workshop on Statistical Parsing of Morphologically Rich Languages
The latent words language model
Computer Speech and Language
A semi supervised learning model for mapping sentences to logical form with ambiguous supervision
NLDB'12 Proceedings of the 17th international conference on Applications of Natural Language Processing and Information Systems
Detecting Irrelevant Subtrees to Improve Probabilistic Learning from Tree-structured Data
Fundamenta Informaticae - Advances in Mining Graphs, Trees and Sequences
A Bayesian approach to unsupervised semantic role induction
EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Generalization methods for in-domain and cross-domain opinion holder extraction
EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Cutting the long tail: hybrid language models for translation style adaptation
EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Continuous space translation models with neural networks
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Taxonomy induction using hierarchical random graphs
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Cross-lingual word clusters for direct transfer of linguistic structure
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Unsupervised translation sense clustering
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Knowing your enemy: understanding and detecting malicious web advertising
Proceedings of the 2012 ACM conference on Computer and communications security
Deep unsupervised feature learning for natural language processing
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop
Non-atomic classification to improve a semantic role labeler for a low-resource language
SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Extraction and analysis of the structure of labels in biomedical ontologies
Proceedings of the 2nd international workshop on Managing interoperability and compleXity in health systems
Clustered word classes for preordering in statistical machine translation
ROBUS-UNSUP '12 Proceedings of the Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP
Clinical entity recognition using structural support vector machines with rich features
Proceedings of the ACM sixth international workshop on Data and text mining in biomedical informatics
Informing determiner and preposition error correction with word clusters
Proceedings of the Seventh Workshop on Building Educational Applications Using NLP
The PASCAL Challenge on Grammar Induction
WILS '12 Proceedings of the NAACL-HLT Workshop on the Induction of Linguistic Structure
Hierarchical clustering of word class distributions
WILS '12 Proceedings of the NAACL-HLT Workshop on the Induction of Linguistic Structure
A class-based agreement model for generating accurately inflected translations
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Attacking parsing bottlenecks with unlabeled data and relevant factorizations
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Measuring the influence of long range dependencies with neural network language models
WLM '12 Proceedings of the NAACL-HLT 2012 Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT
Concurrent acquisition of word meaning and lexical categories
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Learning syntactic categories using paradigmatic representations of word context
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Wiki-ly supervised part-of-speech tagging
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Rel-grams: a probabilistic model of relations in text
AKBC-WEKEX '12 Proceedings of the Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction
Interactive data-driven discovery of temporal behavior models from events in media streams
Proceedings of the 20th ACM international conference on Multimedia
Role-explicit query identification and intent role annotation
Proceedings of the 21st ACM international conference on Information and knowledge management
Joint bilingual name tagging for parallel corpora
Proceedings of the 21st ACM international conference on Information and knowledge management
Phrase-based statistical language modeling from bilingual parallel corpus
ESCAPE'07 Proceedings of the First international conference on Combinatorics, Algorithms, Probabilistic and Experimental Methodologies
International Journal of Automation and Computing
Increasing adaptability of a speech into sign language translation system
Expert Systems with Applications: An International Journal
Two-stage NER for tweets with clustering
Information Processing and Management: an International Journal
What is middleware made of?: exploring abstractions, concepts, and class names in modern middleware
Proceedings of the 11th International Workshop on Adaptive and Reflective Middleware
A text input method for half-sized keyboard using keying interval
Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia
Design of n-gram based dynamic pre-fetching for DSM
ICA3PP'12 Proceedings of the 12th international conference on Algorithms and Architectures for Parallel Processing - Volume Part II
Named entity recognition for tweets
ACM Transactions on Intelligent Systems and Technology (TIST) - Special section on twitter and microblogging services, social recommender systems, and CAMRa2010: Movie recommendation in context
An ontology-driven framework towards building enterprise semantic information layer
Advanced Engineering Informatics
A versatile tool for privacy-enhanced web search
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Class-Based language models for chinese-english parallel corpus
CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 2
Detecting concept relations in clinical text: Insights from a state-of-the-art model
Journal of Biomedical Informatics
Introducing baselines for russian named entity recognition
CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
A computational model of logical metonymy
ACM Transactions on Speech and Language Processing (TSLP) - Special issue on multiword expressions: From theory to practice and use, part 2
Representing objects, relations, and sequences
Neural Computation
Personalized next-song recommendation in online karaokes
Proceedings of the 7th ACM conference on Recommender systems
Universal schema for entity type prediction
Proceedings of the 2013 workshop on Automated knowledge base construction
Exploring the effectiveness of medical entity recognition for clinical information retrieval
Proceedings of the 7th international workshop on Data and text mining in biomedical informatics
Using regression for spectral estimation of HMMs
SLSP'13 Proceedings of the First international conference on Statistical Language and Speech Processing
Semantic spaces for improving language modeling
Computer Speech and Language
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)
Max-Margin Early Event Detectors
International Journal of Computer Vision
Hi-index | 0.01 |
We address the problem of predicting a word from previous words in a sample of text. In particular, we discuss n-gram models based on classes of words. We also discuss several statistical algorithms for assigning words to classes based on the frequency of their co-occurrence with other words. We find that we are able to extract classes that have the flavor of either syntactically based groupings or semantically based groupings, depending on the nature of the underlying statistics.