Matching bandit

Citation

@InProceedings{pmlr-v139-sentenac21a,
  title = 	 {Pure Exploration and Regret Minimization in Matching Bandits},
  author =       {Sentenac, Flore and Yi, Jialin and Calauzenes, Clement and Perchet, Vianney and Vojnovic, Milan},
  booktitle = 	 {Proceedings of the 38th International Conference on Machine Learning},
  pages = 	 {9434--9442},
  year = 	 {2021},
  editor = 	 {Meila, Marina and Zhang, Tong},
  volume = 	 {139},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {18--24 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v139/sentenac21a/sentenac21a.pdf},
  url = 	 {https://proceedings.mlr.press/v139/sentenac21a.html},
  abstract = 	 {Finding an optimal matching in a weighted graph is a standard combinatorial problem. We consider its semi-bandit version where either a pair or a full matching is sampled sequentially. We prove that it is possible to leverage a rank-1 assumption on the adjacency matrix to reduce the sample complexity and the regret of off-the-shelf algorithms up to reaching a linear dependency in the number of vertices (up to to poly-log terms).}
}

List of algorithms in the experiments section

	Pair selection	Matching selection
Regret minimisation	STOA: `Rank1Elim` in Katariya et al 2017; Ours: `Pair1Elim`	STOA: `ESCB` in Combes et al 2015 ; Ours: `MATCHING DIVIDE CONQUER`

Installation

After downloading the source code, install the dependencies with:

cd matching-bandit
pip install -e .

Basic Usage

You can test the environment by running the SIMPLE-ADAPTIVE-MATCHING agents:

python matching_bandit/agents/simple_adaptive_matching.py --n_pairs 11 --Delta 0.1 --horizon 200000

The code will generate a MatchingSelectionBandit environment with 11 pairs of items and demonstrate the performance of the SIMPLE-ADAPTIVE-MATCHING algorithm. The simulation will last 200000 iterations (horizon).

A window will be opened and show the agent's performance and the environment's state:

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
docs		docs
experiments		experiments
figures		figures
matching_bandit		matching_bandit
tests/agents		tests/agents
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Matching bandit

Citation

List of algorithms in the experiments section

Installation

Basic Usage

About

Releases

Packages

Languages

License

jialinyi94/matching-bandit

Folders and files

Latest commit

History

Repository files navigation

Matching bandit

Citation

List of algorithms in the experiments section

Installation

Basic Usage

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages