Structured sparsity in structured prediction

  • Authors:
  • André F. T. Martins (Carnegie Mellon University, Pittsburgh, PA, and Instituto de Telecomunicações, Instituto Superior Técnico, Lisboa, Portugal)
  • Noah A. Smith (Carnegie Mellon University, Pittsburgh, PA)
  • Pedro M. Q. Aguiar (Instituto de Sistemas e Robótica, Instituto Superior Técnico, Lisboa, Portugal)
  • Mário A. T. Figueiredo (Instituto de Telecomunicações, Instituto Superior Técnico, Lisboa, Portugal)

  • Venue:
  • EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
  • Year:
  • 2011

Abstract

Linear models have enjoyed great success in structured prediction in NLP. While much progress has been made on efficient training with several loss functions, the problem of endowing learners with a mechanism for feature selection is still unsolved. Common approaches employ ad hoc filtering or L1-regularization; both ignore the structure of the feature space, preventing practitioners from encoding structural prior knowledge. We fill this gap by adopting regularizers that promote structured sparsity, along with efficient algorithms to handle them. Experiments on three tasks (chunking, entity recognition, and dependency parsing) show gains in performance, compactness, and model interpretability.
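
The structured-sparsity regularizers the abstract refers to are group-Lasso-type penalties, which zero out entire groups of features at once rather than individual weights. As a minimal illustrative sketch (not the authors' implementation; the function name `prox_group_lasso` and the toy data are assumptions for illustration), the Python snippet below shows the proximal operator of a group-Lasso penalty, i.e., block soft-thresholding, which is the core step in proximal-gradient methods for such regularizers:

```python
import numpy as np

def prox_group_lasso(w, groups, lam):
    """Proximal operator of the group-Lasso penalty
    lam * sum_g ||w_g||_2 (block soft-thresholding).

    `groups` is a list of index arrays, one per feature group.
    Groups whose L2 norm falls below `lam` are zeroed out entirely;
    the rest are shrunk toward zero, group by group.
    """
    w = w.copy()
    for g in groups:
        norm = np.linalg.norm(w[g])
        if norm <= lam:
            w[g] = 0.0                 # discard the whole group
        else:
            w[g] *= 1.0 - lam / norm   # shrink the group uniformly
    return w

# Toy usage: two groups of three weights each.
w = np.array([0.1, -0.2, 0.05, 2.0, -1.5, 0.7])
groups = [np.arange(0, 3), np.arange(3, 6)]
print(prox_group_lasso(w, groups, lam=0.5))
# The first (small-norm) group is zeroed out entirely, while the
# second is only shrunk -- this whole-group selection is what makes
# the sparsity "structured", in contrast to plain L1-regularization.
```

This group-level behavior is what lets structural prior knowledge about the feature space (e.g., feature templates forming natural groups) be encoded directly in the regularizer.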