Skip to content

rmitsuboshi/bandit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Bandit

A small collection of Bandit algorithms, written in Rust🦀.

This crate provides a generic framework for bandit protocol and some bandit algorithms. The algorithms are named & implemented based on this book.

Test run

One can reproduce the following result by cargo test -- --nocapture command. demo image

How To Use

Write the following line to Cargo.toml.

bandit = { git = "https://github.com/rmitsuboshi/bandit" }

You can find code examples in tests/small-tests.rs. I'll write some documents to every algorithms in this crate.

Currently, the following algorithms are implemented.

Stochastic Bandits

  • ETC (Explore-Then-Commit),
  • UCB (Upper-Confidence-Bound),
  • Asymptotically Optimal UCB (Asymptotically Optimal UCB),

Adversarial Bandits

  • Exp3 (Exponential-weight algorithm for Exploration and Exploitation)
  • Exp3-IX (Exp3 + Implicit eXploration)

Environments

  • Sub-Gaussian environment,
  • A worst-case environment for the ETC algorithm.

Releases

No releases published

Packages

No packages published

Languages

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy