TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning. (Python; updated Jul 25, 2024)
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
Thompson Sampling for Bandits using UCB policy
A benchmark to test decision-making algorithms for contextual bandits. The library implements a variety of algorithms (many of them based on approximate Bayesian Neural Networks and Thompson sampling), and a number of real and synthetic data problems exhibiting a diverse set of properties.
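As a rough sketch of the Thompson-sampling idea behind benchmarks like this one (the arm probabilities and function names here are illustrative, not taken from any listed repo): each arm keeps a Beta posterior over its Bernoulli success rate, and on every round we sample from each posterior and pull the arm with the highest draw.

```python
import random

def thompson_step(successes, failures):
    """Sample each arm's Beta(s+1, f+1) posterior and pick the argmax."""
    samples = [random.betavariate(s + 1, f + 1)
               for s, f in zip(successes, failures)]
    return max(range(len(samples)), key=lambda a: samples[a])

random.seed(1)
probs = [0.3, 0.6]        # hidden Bernoulli success rates (illustrative)
successes = [0, 0]
failures = [0, 0]

for _ in range(1000):
    arm = thompson_step(successes, failures)
    if random.random() < probs[arm]:
        successes[arm] += 1
    else:
        failures[arm] += 1

pulls = [s + f for s, f in zip(successes, failures)]
```

After enough rounds the posterior for the better arm concentrates above the other, so most pulls go to it; real benchmark suites swap the Beta-Bernoulli model for Bayesian neural networks over contextual features.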
Python implementation of common RL algorithms using OpenAI gym environments
Python library of bandits and RL agents in different real-world environments
Code for our PRICAI 2022 paper: "Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior".
🐯REPLICA of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"
This project provides a simulation of multi-armed bandit problems. The implementation is based on the paper https://arxiv.org/abs/2308.14350.
Repository for the course project done as part of CS-747 (Foundations of Intelligent & Learning Agents) course at IIT Bombay in Autumn 2022.
Play Rock, Paper, Scissors (Kaggle competition) with Reinforcement Learning: bandits, tabular Q-learning and PPO with LSTM.
Code and data for the paper "A Combinatorial Multi-Armed Bandit Approach to Correlation Clustering", DAMI 2023
Foundations of Intelligent and Learning Agents
Implementation of the prophet inequalities
Implementation of Multi-Armed Bandit (MAB) algorithms UCB and Epsilon-Greedy. MAB is a class of problems in reinforcement learning where an agent learns to choose actions from a set of arms, each associated with an unknown reward distribution. UCB and Epsilon-Greedy are popular algorithms for solving MAB problems.
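A minimal sketch of the two policies this description names, epsilon-greedy and UCB, on Bernoulli arms (the reward probabilities and helper names are illustrative, not from the repo itself):

```python
import math
import random

def epsilon_greedy(values, epsilon=0.1):
    """With probability epsilon explore a random arm, else exploit the best estimate."""
    if random.random() < epsilon:
        return random.randrange(len(values))
    return max(range(len(values)), key=lambda a: values[a])

def ucb(counts, values, t):
    """Pull each arm once, then maximize mean + sqrt(2 ln t / n) (UCB1)."""
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    return max(range(len(values)),
               key=lambda a: values[a] + math.sqrt(2 * math.log(t) / counts[a]))

def update(counts, values, arm, reward):
    """Incremental update of the chosen arm's empirical mean reward."""
    counts[arm] += 1
    values[arm] += (reward - values[arm]) / counts[arm]

random.seed(0)
probs = [0.2, 0.5, 0.8]   # hidden success rates of three Bernoulli arms
counts = [0, 0, 0]
values = [0.0, 0.0, 0.0]

for t in range(1, 2001):
    arm = ucb(counts, values, t)       # swap in epsilon_greedy(values) to compare
    reward = 1.0 if random.random() < probs[arm] else 0.0
    update(counts, values, arm, reward)
```

With the UCB1 bonus shrinking as an arm accumulates pulls, most of the 2000 rounds end up on the best arm, while epsilon-greedy keeps exploring at a fixed rate forever.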
Study the interplay between communication and feedback in a cooperative online learning setting.