Implementations of known algorithms for the bandit problem

wesbz/BanditAgents

Bandit algorithms.

For a Reinforcement Learning class, I worked on a few popular algorithms for the bandit problem, including:

  • Epsilon-greedy bandit
  • BESA
  • Softmax
  • UCB1
  • Thompson sampling
  • KL-UCB

The bandits are implemented in agent.py.
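
To make two of the listed algorithms concrete, here is a minimal sketch of epsilon-greedy and UCB1 on a Bernoulli bandit. It is not the code from agent.py: the function names (`epsilon_greedy_choice`, `ucb1_choice`, `run`), the arm probabilities, and the default `epsilon` are all assumptions chosen for illustration, and the UCB1 index uses the textbook form, mean plus sqrt(2 ln t / n).

```python
import math
import random


def epsilon_greedy_choice(values, epsilon, rng):
    """Explore uniformly with probability epsilon, otherwise pick the best mean."""
    if rng.random() < epsilon:
        return rng.randrange(len(values))
    return max(range(len(values)), key=lambda a: values[a])


def ucb1_choice(counts, values, t):
    """Play each arm once, then maximize mean + sqrt(2 ln t / n)."""
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    return max(
        range(len(counts)),
        key=lambda a: values[a] + math.sqrt(2.0 * math.log(t) / counts[a]),
    )


def run(policy, probs, n_rounds, seed=0, epsilon=0.1):
    """Simulate one agent on Bernoulli arms (hypothetical helper, not from the repo)."""
    rng = random.Random(seed)
    counts = [0] * len(probs)    # number of pulls per arm
    values = [0.0] * len(probs)  # running mean reward per arm
    total = 0
    for t in range(1, n_rounds + 1):
        if policy == "ucb1":
            arm = ucb1_choice(counts, values, t)
        else:
            arm = epsilon_greedy_choice(values, epsilon, rng)
        reward = 1 if rng.random() < probs[arm] else 0
        counts[arm] += 1
        # Incremental update of the running mean for the pulled arm.
        values[arm] += (reward - values[arm]) / counts[arm]
        total += reward
    return counts, total


counts, total = run("ucb1", [0.1, 0.9], n_rounds=1000)
print("pulls per arm:", counts, "total reward:", total)
```

Both policies share the same running-mean bookkeeping; only the arm-selection rule differs, which is why the choice functions are kept separate from the simulation loop.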

How to use

For the purpose of the class, each agent was tested on one specific configuration, 1000 rounds for 2000 agents in parallel:

    python main.py --niter 1000 --batch 2000

Run python main.py -h to see the available options.
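
As a rough picture of what such an experiment involves, the sketch below parses the two flags named above and averages the reward of many independent Thompson sampling agents at each round. It is not the repository's main.py: only `--niter` and `--batch` come from this README, while the defaults, help strings, helper names (`build_parser`, `thompson_run`, `mean_reward_per_round`), and arm probabilities are assumptions. The agents are also simulated one after another here rather than in parallel.

```python
import argparse
import random


def build_parser():
    # Only the flag names come from the README; defaults and help text are assumed.
    parser = argparse.ArgumentParser(description="Bandit experiment (illustrative sketch)")
    parser.add_argument("--niter", type=int, default=1000, help="rounds per agent")
    parser.add_argument("--batch", type=int, default=2000, help="number of independent agents")
    return parser


def thompson_run(probs, niter, rng):
    """Run one Thompson sampling agent on Bernoulli arms; return its 0/1 rewards."""
    wins = [0] * len(probs)
    losses = [0] * len(probs)
    rewards = []
    for _ in range(niter):
        # Sample a plausible mean for each arm from its Beta(wins+1, losses+1) posterior.
        samples = [rng.betavariate(wins[a] + 1, losses[a] + 1) for a in range(len(probs))]
        arm = max(range(len(probs)), key=lambda a: samples[a])
        reward = 1 if rng.random() < probs[arm] else 0
        if reward:
            wins[arm] += 1
        else:
            losses[arm] += 1
        rewards.append(reward)
    return rewards


def mean_reward_per_round(probs, niter, batch, seed=0):
    """Average the reward obtained at each round over `batch` independent agents."""
    rng = random.Random(seed)
    totals = [0] * niter
    for _ in range(batch):
        for t, r in enumerate(thompson_run(probs, niter, rng)):
            totals[t] += r
    return [s / batch for s in totals]


# Small values keep this pure-Python sketch fast; the README uses 1000 and 2000.
args = build_parser().parse_args(["--niter", "200", "--batch", "50"])
curve = mean_reward_per_round([0.2, 0.5, 0.8], args.niter, args.batch)
print(f"mean reward at first round: {curve[0]:.3f}, at last round: {curve[-1]:.3f}")
```

Averaging over a batch of agents smooths out the randomness of any single run, which is presumably why the class configuration uses a large batch.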
