Adapting ADtrees for high arity features

Authors:
Robert Van Dam;Irene Langkilde-Geary;Dan Ventura
Affiliations:
Computer Science Department, Brigham Young University;Computer Science Department, Brigham Young University;Computer Science Department, Brigham Young University
Venue:
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Year:
2008


Abstract

ADtrees, a data structure for caching sufficient statistics, have been successfully adapted to grow lazily when memory is limited and to update sequentially as a dataset grows incrementally. For low-arity symbolic features, ADtrees trade a slight increase in query time for a reduction in overall tree size. Unfortunately, for high-arity features, the same technique can often result in a very large increase in query time and a nearly negligible reduction in tree size. In the dynamic (lazy) version of the tree, both query time and tree size can increase for some applications. Here we present two modifications to the ADtree that can be used separately or in combination to achieve the originally intended space-time tradeoff when the ADtree is applied to datasets containing very high-arity features.
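
The space-time tradeoff mentioned in the abstract can be illustrated with a small sketch of a basic static ADtree. It assumes the usual formulation in which each node caches a count and the subtree for the most common value (MCV) of each attribute is omitted, so that counts involving the MCV are recovered by subtraction. This is not the paper's method: the two proposed modifications are not described in the abstract and are not shown here. The names `ADNode`, `count`, and `size` and the toy dataset are hypothetical, chosen only for illustration.

```python
from collections import defaultdict


class ADNode:
    """Caches the number of rows matching one conjunction of attribute=value tests."""

    def __init__(self, data, rows, start):
        self.count = len(rows)
        self.vary = {}  # attribute index -> (mcv, {value: ADNode})
        n_attrs = len(data[0]) if data else 0
        for a in range(start, n_attrs):
            groups = defaultdict(list)
            for r in rows:
                groups[data[r][a]].append(r)
            # Assumed MCV rule: omit the child for the most common value.
            mcv = max(groups, key=lambda v: len(groups[v]))
            children = {v: ADNode(data, rs, a + 1)
                        for v, rs in groups.items() if v != mcv}
            self.vary[a] = (mcv, children)


def _count(node, items):
    if not items:
        return node.count
    (a, v), rest = items[0], items[1:]
    mcv, children = node.vary[a]
    if v == mcv:
        # Omitted subtree: subtract every sibling from the parent's count.
        # This loop visits (arity - 1) children, which is the source of the
        # query-time cost for high-arity attributes.
        return _count(node, rest) - sum(_count(c, rest) for c in children.values())
    child = children.get(v)
    return _count(child, rest) if child is not None else 0


def count(root, query):
    """Count rows matching query, a dict {attribute index: value}."""
    return _count(root, sorted(query.items()))


def size(node):
    """Number of count nodes actually stored in the tree."""
    return 1 + sum(size(c) for _, kids in node.vary.values() for c in kids.values())


# Toy dataset (hypothetical): attribute 0 is binary, attribute 1 has three values.
data = [(0, 'a'), (0, 'b'), (1, 'a'), (0, 'a'), (1, 'c'), (0, 'a')]
root = ADNode(data, range(len(data)), 0)
print(count(root, {0: 0, 1: 'a'}), size(root))
```

In this sketch, a query on an omitted value costs one recursive call per stored sibling. When an attribute has many values and none of them dominates, the omitted subtree is a small fraction of the tree while the subtraction loop grows with the arity, which is consistent with the behavior the abstract describes for high-arity features.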