The minimum consistent DFA problem cannot be approximated within any polynomial
Journal of the ACM (JACM)
Foundations of statistical natural language processing
Foundations of statistical natural language processing
Elements of the Theory of Computation
Elements of the Theory of Computation
A general regression technique for learning transductions
ICML '05 Proceedings of the 22nd international conference on Machine learning
Finite automata for testing composition-based reconstructibility of sequences
Journal of Computer and System Sciences
Deciding unique decodability of bigram counts via finite automata
Journal of Computer and System Sciences
Hi-index | 5.23 |
We define the family of n-gram embeddings from strings over a finite alphabet into the semimodule NK. We classify all ξ ∈ NK that are valid images of strings under such embeddings, as well as all ξ whose inverse image consists of exactly 1 string (we call such ξ uniquely decodable). We prove that for a fixed alphabet, the set of all strings whose image is uniquely decodable is a regular language.