Learning of Finite Unions of Tree Patterns with Internal Structured Variables from Queries

  • Authors:
  • Satoshi Matsumoto;Takayoshi Shoudai;Tetsuhiro Miyahara;Tomoyuki Uchida

  • Affiliations:
  • -;-;-;-

  • Venue:
  • AI '02 Proceedings of the 15th Australian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

We consider the polynomial time learnability of finite unions of ordered tree patterns with internal structured variables, in the query learning model of Angluin (1988). An ordered tree pattern with internal structured variables, called a term tree, is a rooted tree pattern which consists of tree structures with ordered children and internal structured variables. A term tree is suited for representing structural features in semistructured or tree structured data such as HTML/XML files. The language L(t) of a term tree t is the set of all trees which are obtained from t by substituting arbitrary trees for all variables in t. Moreover, for a finite set H of term trees, L(H) = 驴t驴H L(t). Let H*, which is a target of learning, be a finite set of term trees. An oracle for restricted subset queries answers "yes" for an input set H if L(H) 驴 L(H*), and answers "no", otherwise. An oracle for equivalence queries returns "yes" for an input set H if L(H) = L(H*), and returns a counterexample in L(H)驴L(H*)-L(H)驴L(H*), otherwise. We show that any finite union of languages defined by m term trees is exactly identifiable in polynomial time using at most 2mn2 restricted subset queries and at most m + 1 equivalence queries, where n is the maximum size of counterexamples.