Multi-armed bandit

The thompson package is a Python library for evaluating the multi-armed bandit problem. It implements three algorithms for solving it:

Thompson Sampling: A Bayesian approach that maintains probability distributions over the expected rewards of each arm and samples from these distributions to select the next arm to pull.
Upper Confidence Bound (UCB): A deterministic algorithm that selects arms based on their estimated rewards and the uncertainty in those estimates.
Randomized Sampling: A baseline method that randomly selects arms without considering their past performance.

The multi-armed bandit problem is a classic reinforcement learning problem that exemplifies the exploration-exploitation tradeoff. A fixed, limited set of resources must be allocated among competing choices so as to maximize expected gain, while each choice's properties are only partially known at the time of allocation.
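To make the allocation problem concrete, here is a minimal sketch that models each choice as a Bernoulli arm and compares two fixed allocations of a limited budget. The success probabilities, the budget, and the helper names `pull` and `expected_gain` are assumptions made for illustration; they are not part of the thompson package.

```python
import random


def pull(arm_probs, arm, rng):
    """Return reward 1 with probability arm_probs[arm], else 0 (a Bernoulli arm)."""
    return 1 if rng.random() < arm_probs[arm] else 0


def expected_gain(arm_probs, allocation):
    """Expected total reward when allocation[i] pulls are spent on arm i."""
    return sum(p * n for p, n in zip(arm_probs, allocation))


# Hypothetical success probabilities; in a real problem the player does not know them.
arm_probs = [0.2, 0.5, 0.7]
budget = 300  # fixed, limited number of pulls

uniform = expected_gain(arm_probs, [100, 100, 100])  # spread the budget evenly
oracle = expected_gain(arm_probs, [0, 0, budget])    # spend everything on the best arm

print(f"uniform allocation: {uniform:.0f} expected rewards")
print(f"oracle allocation:  {oracle:.0f} expected rewards")
```

Under these assumed probabilities the even split expects 140 rewards, against 210 for a player who already knows the best arm. A bandit algorithm tries to close that gap while still learning which arm is best.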

⭐️ Star this repo if you like it ⭐️

Install thompson from PyPI

pip install thompson

Import thompson package

import thompson as th

Documentation pages

The documentation pages describe in detail how thompson works, with examples.

Examples

Example: Compute multi-armed bandit using Thompson Sampling
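As a rough illustration of the idea behind Thompson Sampling, the sketch below keeps a Beta posterior per arm and pulls the arm whose posterior sample is largest. It uses only the Python standard library and does not call the thompson package; the function name `thompson_sampling`, the simulated Bernoulli arms, and the uniform Beta(1, 1) prior are assumptions for this example, and the package's own implementation may differ.

```python
import random


def thompson_sampling(arm_probs, n_rounds, seed=0):
    """Simulate Beta-Bernoulli Thompson Sampling on hypothetical arms.

    arm_probs holds the true success probabilities, which the algorithm
    never reads directly; it only observes the sampled rewards.
    """
    rng = random.Random(seed)
    n_arms = len(arm_probs)
    successes = [0] * n_arms
    failures = [0] * n_arms
    pulls = [0] * n_arms
    total_reward = 0

    for _ in range(n_rounds):
        # Draw one sample per arm from its Beta(successes + 1, failures + 1) posterior.
        samples = [
            rng.betavariate(successes[i] + 1, failures[i] + 1)
            for i in range(n_arms)
        ]
        # Pull the arm with the largest sampled success rate.
        arm = max(range(n_arms), key=samples.__getitem__)
        reward = 1 if rng.random() < arm_probs[arm] else 0

        # Update the posterior counts of the pulled arm only.
        successes[arm] += reward
        failures[arm] += 1 - reward
        pulls[arm] += 1
        total_reward += reward

    return pulls, total_reward


pulls, total_reward = thompson_sampling([0.1, 0.5, 0.9], n_rounds=2000)
print("pulls per arm:", pulls)
print("total reward:", total_reward)
```

Arms with few observations have wide posteriors, so they are still sampled occasionally; as evidence accumulates, pulls concentrate on the arm with the highest estimated reward.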

Example: Compute multi-armed bandit using Upper Confidence Bound (UCB)
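The Upper Confidence Bound idea can be sketched with the classic UCB1 rule: pull the arm with the largest sum of mean reward and an uncertainty bonus of sqrt(2 ln t / n_i). This is a standalone standard-library sketch, not the thompson package's API; the function name `ucb1`, the simulated Bernoulli arms, and the choice of the UCB1 variant are assumptions, and the package may use a different formulation.

```python
import math
import random


def ucb1(arm_probs, n_rounds, seed=0):
    """Simulate the UCB1 rule on hypothetical Bernoulli arms."""
    rng = random.Random(seed)  # randomness is used only to simulate rewards
    n_arms = len(arm_probs)
    pulls = [0] * n_arms
    reward_sums = [0.0] * n_arms
    total_reward = 0

    for t in range(1, n_rounds + 1):
        if t <= n_arms:
            # Pull every arm once so each has an initial estimate.
            arm = t - 1
        else:
            # Score = estimated mean reward + bonus that shrinks with more pulls.
            scores = [
                reward_sums[i] / pulls[i]
                + math.sqrt(2.0 * math.log(t) / pulls[i])
                for i in range(n_arms)
            ]
            arm = max(range(n_arms), key=scores.__getitem__)

        reward = 1 if rng.random() < arm_probs[arm] else 0
        pulls[arm] += 1
        reward_sums[arm] += reward
        total_reward += reward

    return pulls, total_reward


pulls, total_reward = ucb1([0.1, 0.5, 0.9], n_rounds=2000)
print("pulls per arm:", pulls)
print("total reward:", total_reward)
```

Given the same reward history, the arm choice is fully determined, which is what makes UCB deterministic in contrast to Thompson Sampling.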

Example: Compute multi-armed bandit using randomized data
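For comparison, a randomized baseline can be sketched as picking an arm uniformly at random in every round and ignoring all past rewards. As with the other sketches, this is standard-library Python rather than the thompson package's API, and the function name `random_selection` and the simulated Bernoulli arms are assumptions.

```python
import random


def random_selection(arm_probs, n_rounds, seed=0):
    """Simulate a baseline that picks arms uniformly at random."""
    rng = random.Random(seed)
    n_arms = len(arm_probs)
    pulls = [0] * n_arms
    total_reward = 0

    for _ in range(n_rounds):
        arm = rng.randrange(n_arms)  # past performance is never consulted
        reward = 1 if rng.random() < arm_probs[arm] else 0
        pulls[arm] += 1
        total_reward += reward

    return pulls, total_reward


pulls, total_reward = random_selection([0.1, 0.5, 0.9], n_rounds=2000)
print("pulls per arm:", pulls)
print("total reward:", total_reward)
```

Because pulls are spread roughly evenly across arms, the expected reward per round is close to the average of the arm probabilities, which gives a floor against which the other two algorithms can be judged.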

References

https://en.wikipedia.org/wiki/Multi-armed_bandit

erdogant/thompson
