In this paper we study a paradigm for generalizing online classification algorithms for binary problems to multiclass problems. The hypotheses we investigate maintain one prototype vector per class. Given an input instance, a multiclass hypothesis computes a similarity score between each prototype and the instance and sets the predicted label to the index of the prototype achieving the highest score. To design and analyze the learning algorithms in this paper, we introduce the notion of ultraconservativeness. Ultraconservative algorithms update only the prototypes whose similarity scores are higher than the score of the correct label's prototype. We start by describing a family of additive ultraconservative algorithms, each of which updates its prototypes by finding a feasible solution to a set of linear constraints that depend on the instantaneous similarity scores. We then discuss a specific online algorithm that seeks a set of prototypes with a small norm. The resulting algorithm, which we term MIRA (for Margin Infused Relaxed Algorithm), is ultraconservative as well. We derive mistake bounds for all the algorithms and provide further analysis of MIRA using a generalized notion of the margin for multiclass problems. We discuss the form the algorithms take in the binary case and show that all the algorithms from the first family reduce to the Perceptron algorithm, while MIRA provides a new Perceptron-like algorithm with a margin-dependent learning rate. We then return to multiclass problems and describe an analogous multiplicative family of algorithms with corresponding mistake bounds. We end the formal part by deriving and analyzing a multiclass version of Li and Long's ROMMA algorithm. We conclude with a discussion of experimental results that demonstrate the merits of our algorithms.
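To make the prototype-based prediction rule and the ultraconservative update concrete, the following is a minimal Python sketch of one possible additive member of the family. It rests on several assumptions that the abstract does not state: the similarity score is taken to be an inner product, the update is split uniformly over the offending prototypes, ties with the correct label's score count as errors, and the correct label's prototype is also moved toward the instance (as in the Perceptron). The function names `predict` and `uniform_update` are hypothetical.

```python
def predict(prototypes, x):
    """Return (predicted label, scores), scoring each prototype by inner product."""
    scores = [sum(w * xi for w, xi in zip(proto, x)) for proto in prototypes]
    # max() returns the first maximal index, so ties go to the lowest label.
    label = max(range(len(scores)), key=scores.__getitem__)
    return label, scores


def uniform_update(prototypes, x, y):
    """One online round; modifies `prototypes` in place, returns True on a mistake."""
    pred, scores = predict(prototypes, x)
    if pred == y:
        return False  # no mistake, no update

    # Error set: wrong labels scoring at least as high as the correct label.
    # (Using >= rather than > is an assumption of this sketch.)
    error_set = [r for r in range(len(prototypes))
                 if r != y and scores[r] >= scores[y]]

    # Assumed step: pull the correct prototype toward x ...
    prototypes[y] = [w + xi for w, xi in zip(prototypes[y], x)]
    # ... and push each offending prototype away, sharing the step equally.
    share = 1.0 / len(error_set)
    for r in error_set:
        prototypes[r] = [w - share * xi for w, xi in zip(prototypes[r], x)]
    return True


if __name__ == "__main__":
    protos = [[0.0, 0.0] for _ in range(3)]  # three classes, two features
    made_mistake = uniform_update(protos, [1.0, 0.0], y=2)
    print(made_mistake, predict(protos, [1.0, 0.0])[0])
```

Prototypes outside the error set (other than the correct one) are left untouched, which is the property the term ultraconservative refers to. Other members of the family would distribute the step over the error set differently.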