Minimax Optimal Bayes Mixture for Memoryless Sources over Large Alphabets

The normalized maximum likelihood (NML) distribution achieves minimax log loss and coding regret for the multinomial model. In practice, other nearly minimax distributions are used instead, since computing the sequential predictions needed for coding and prediction takes exponential time under NML. The Bayes mixture obtained with the Dirichlet prior Dir(1/2,…,1/2), and asymptotically minimax modifications of it, have been widely studied in the context of large sample sizes. Recently, however, there has been interest in minimax optimal coding distributions for large alphabets. In this paper we investigate Dirichlet priors that achieve minimax coding regret when the alphabet size $m$ is finite but large in comparison to the sample size $n$. We prove that a Bayes mixture with the Dirichlet prior Dir(1/3,…,1/3) is optimal in this regime (in particular, when $m > 5n/2 + 4/(n-2) + 3/2$). The regret of the resulting distribution approaches the NML regret as the alphabet size grows.
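To make the quantities in the abstract concrete, here is a minimal numerical sketch (not code from the paper; names such as `worst_case_bayes_regret` are illustrative). It computes the NML regret log C(n, m) with the standard Kontkanen–Myllymäki linear-time recurrence and compares it against the worst-case regret of symmetric Dirichlet Bayes mixtures, for a pair (n, m) inside the large-alphabet regime stated above.

```python
import math

def log_nml_regret(n, m):
    """log of the Shtarkov sum C(n, m) for the multinomial model, via the
    Kontkanen-Myllymaki linear-time recurrence
    C(n, m) = C(n, m-1) + n/(m-2) * C(n, m-2) for m >= 3."""
    if m == 1:
        return 0.0  # single-symbol alphabet: C(n, 1) = 1
    # base cases: C(n, 1) = 1 and the binomial Shtarkov sum C(n, 2)
    c_prev = 1.0
    c_curr = sum(math.comb(n, k) * (k / n) ** k * ((n - k) / n) ** (n - k)
                 for k in range(n + 1))
    for j in range(3, m + 1):
        c_prev, c_curr = c_curr, c_curr + n * c_prev / (j - 2)
    return math.log(c_curr)

def partitions(n, max_parts, largest=None):
    """Yield the integer partitions of n with at most max_parts parts."""
    if n == 0:
        yield ()
        return
    if max_parts == 0:
        return
    largest = n if largest is None else largest
    for head in range(min(n, largest), 0, -1):
        for tail in partitions(n - head, max_parts - 1, head):
            yield (head,) + tail

def worst_case_bayes_regret(n, m, alpha):
    """Worst-case regret max_x [log p_ML(x) - log p_Bayes(x)] of the Bayes
    mixture under a symmetric Dirichlet(alpha) prior. Both terms depend only
    on the multiset of symbol counts, so it suffices to search over integer
    partitions of n into at most m parts (zero counts contribute nothing)."""
    worst = -math.inf
    for counts in partitions(n, m):
        log_ml = sum(k * math.log(k / n) for k in counts)
        log_bayes = (math.lgamma(m * alpha) - math.lgamma(n + m * alpha)
                     + sum(math.lgamma(k + alpha) - math.lgamma(alpha)
                           for k in counts))
        worst = max(worst, log_ml - log_bayes)
    return worst

n, m = 10, 30  # here m > 5n/2 + 4/(n-2) + 3/2 = 27, the regime in the abstract
nml = log_nml_regret(n, m)
print(f"NML regret log C(n, m) = {nml:.4f}")
for alpha in (1/2, 1/3):
    r = worst_case_bayes_regret(n, m, alpha)
    print(f"Dir({alpha:.3f}): worst-case regret = {r:.4f}, excess over NML = {r - nml:.4f}")
```

For such (n, m), the Dir(1/3,…,1/3) mixture should show a smaller worst-case excess over the NML regret than the classical Dir(1/2,…,1/2) mixture, in line with the optimality claim above.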

NAVER LABS Europe