## Abstract

We investigate a population of binary mistake sequences that result from learning with parametric models of different order. We obtain estimates of their error, algorithmic complexity and divergence from a purely random Bernoulli sequence. We study the relationship of these variables to the learner's information density parameter which is defined as the ratio between the lengths of the compressed to uncompressed files that contain the learner's decision rule. The results indicate that good learners have a low information density ρ while bad learners have a high ρ Bad learners generate mistake sequences that are atypically complex or diverge stochastically from a purely random Bernoulli sequence. Good learners generate typically complex sequences with low divergence from Bernoulli sequences and they include mistake sequences generated by the Bayes optimal predictor. Based on the static algorithmic interference model of [18] the learner here acts as a static structure which " scatters" the bits of an input sequence (to be predicted) in proportion to its information density ρ thereby deforming its randomness characteristics.

Original language | English |
---|---|

Pages (from-to) | 2832-2844 |

Number of pages | 13 |

Journal | Communications in Nonlinear Science and Numerical Simulation |

Volume | 16 |

Issue number | 7 |

DOIs | |

State | Published - 1 Jul 2011 |

Externally published | Yes |

## Keywords

- Algorithmic complexity
- Binary sequences
- Chaotic scattering
- Description complexity
- Information theory
- Prediction
- Statistical learning

## ASJC Scopus subject areas

- Numerical Analysis
- Modeling and Simulation
- Applied Mathematics