TY - GEN
T1 - On the learnability of shuffle ideals
AU - Angluin, Dana
AU - Aspnes, James
AU - Kontorovich, Aryeh
PY - 2012/10/30
Y1 - 2012/10/30
N2 - Although PAC learning unrestricted regular languages is long known to be a very difficult problem, one might suppose the existence (and even an abundance) of natural efficiently learnable sub-families. When our literature search for a natural efficiently learnable regular family came up empty, we proposed the shuffle ideals as a prime candidate. A shuffle ideal generated by a string u is simply the collection of all strings containing u as a (discontiguous) subsequence. This fundamental language family is of theoretical interest in its own right and also provides the building blocks for other important language families. Somewhat surprisingly, we discovered that even a class as simple as the shuffle ideals is not properly PAC learnable, unless RP=NP. In the positive direction, we give an efficient algorithm for properly learning shuffle ideals in the statistical query (and therefore also PAC) model under the uniform distribution.
AB - Although PAC learning unrestricted regular languages is long known to be a very difficult problem, one might suppose the existence (and even an abundance) of natural efficiently learnable sub-families. When our literature search for a natural efficiently learnable regular family came up empty, we proposed the shuffle ideals as a prime candidate. A shuffle ideal generated by a string u is simply the collection of all strings containing u as a (discontiguous) subsequence. This fundamental language family is of theoretical interest in its own right and also provides the building blocks for other important language families. Somewhat surprisingly, we discovered that even a class as simple as the shuffle ideals is not properly PAC learnable, unless RP=NP. In the positive direction, we give an efficient algorithm for properly learning shuffle ideals in the statistical query (and therefore also PAC) model under the uniform distribution.
UR - http://www.scopus.com/inward/record.url?scp=84867855580&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-34106-9_12
DO - 10.1007/978-3-642-34106-9_12
M3 - Conference contribution
AN - SCOPUS:84867855580
SN - 9783642341052
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 111
EP - 123
BT - Algorithmic Learning Theory - 23rd International Conference, ALT 2012, Proceedings
T2 - 23rd International Conference on Algorithmic Learning Theory, ALT 2012
Y2 - 29 October 2012 through 31 October 2012
ER -