added multi armed bandit problem with three strategies to solve it #12668

sephml · 2025-04-11T18:31:07Z

Describe your change:

Add an algorithm?
Fix a bug or typo in an existing algorithm?
Add or change doctests? -- Note: Please avoid changing both code and tests in a single pull request.
Documentation change?

Checklist:

What is added?

Multi-armed bandits (MAB) represent a class of sequential decision-making problems, where an agent chooses from multiple actions (or "arms") with uncertain rewards, aiming to maximize cumulative reward through balancing exploration (gathering information about each arm) and exploitation (leveraging known rewarding arms). It's one of the foundational algorithms in reinforcement learning and optimization contexts, as it models fundamental exploration-exploitation trade-offs that underpin decision-making processes. MAB algorithms, such as the epsilon-greedy, Upper Confidence Bound (UCB), and Thompson Sampling, find widespread applications across recommendation systems, adaptive clinical trials, online advertising, and resource allocation, effectively optimizing real-world decisions under uncertainty with minimal data collection.

for more information, see https://pre-commit.ci

algorithms-keeper

Click here to look at the relevant links ⬇️

🔗 Relevant Links

Repository:

Contributing guidelines

Project Euler solution guidelines

Python:

Formatted string literals (f-strings)

Type hints

doctest

unittest

pytest

Automated review generated by algorithms-keeper. If there's any problem regarding this review, please open an issue about it.

algorithms-keeper commands and options

algorithms-keeper actions can be triggered by commenting on this PR:

@algorithms-keeper review to trigger the checks for only added pull request files

@algorithms-keeper review-all to trigger the checks for all the pull request files, including the modified files. As we cannot post review comments on lines not part of the diff, this command will post all the messages in one comment.

NOTE: Commands are in beta and so this feature is restricted only to a member or owner of the organization.

machine_learning/mab.py

for more information, see https://pre-commit.ci

algorithms-keeper

Click here to look at the relevant links ⬇️

🔗 Relevant Links

Repository:

Contributing guidelines

Project Euler solution guidelines

Python:

Formatted string literals (f-strings)

Type hints

doctest

unittest

pytest

Automated review generated by algorithms-keeper. If there's any problem regarding this review, please open an issue about it.

algorithms-keeper commands and options

algorithms-keeper actions can be triggered by commenting on this PR:

@algorithms-keeper review to trigger the checks for only added pull request files

@algorithms-keeper review-all to trigger the checks for all the pull request files, including the modified files. As we cannot post review comments on lines not part of the diff, this command will post all the messages in one comment.

NOTE: Commands are in beta and so this feature is restricted only to a member or owner of the organization.

machine_learning/mab.py

for more information, see https://pre-commit.ci

sephml · 2025-04-16T07:09:09Z

@algorithms-keeper review

sephml · 2025-04-16T07:22:49Z

@algorithms-keeper review

sephml · 2025-04-23T12:19:58Z

@MaximSmolskiy, Hi, hope you are well. Can you please review this PR? It seems you are the most recent active maintainer.

sephml and others added 3 commits April 11, 2025 19:03

added multi arm bandit alg with three strategies to solve it

c1ed3c0

added doctest tests

ddbce91

[pre-commit.ci] auto fixes from pre-commit.com hooks

46fdb1b

for more information, see https://pre-commit.ci

algorithms-keeper bot added the tests are failing Do not merge until tests pass label Apr 11, 2025

sephml added 2 commits April 13, 2025 07:44

corrected test cases

9fdf39f

Merge branch 'master' of https://github.com/sephml/Python

81d197d

algorithms-keeper bot removed tests are failing Do not merge until tests pass labels Apr 13, 2025

sephml marked this pull request as draft April 15, 2025 11:45

sephml marked this pull request as ready for review April 15, 2025 11:46

algorithms-keeper bot added awaiting reviews This PR is ready to be reviewed require descriptive names This PR needs descriptive function and/or variable names require type hints https://docs.python.org/3/library/typing.html labels Apr 15, 2025

algorithms-keeper bot reviewed Apr 15, 2025

View reviewed changes

sephml and others added 4 commits April 15, 2025 19:12

added return type hinting

f80b843

[pre-commit.ci] auto fixes from pre-commit.com hooks

f2d9038

for more information, see https://pre-commit.ci

return typehint for test func updated

9d7a028

Merge branch 'master' of https://github.com/sephml/Python

a824511

algorithms-keeper bot reviewed Apr 15, 2025

View reviewed changes

fixed variable name k

7343268

algorithms-keeper bot removed require descriptive names This PR needs descriptive function and/or variable names require type hints https://docs.python.org/3/library/typing.html labels Apr 15, 2025

pre-commit-ci bot and others added 4 commits April 15, 2025 18:23

[pre-commit.ci] auto fixes from pre-commit.com hooks

d0b6719

for more information, see https://pre-commit.ci

fixed formatting

ef11ca4

Merge branch 'master' of https://github.com/sephml/Python

4167ddb

fix1

c34feff

fixed issues with mypy, ruff

c243cd8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

added multi armed bandit problem with three strategies to solve it #12668

added multi armed bandit problem with three strategies to solve it #12668

Uh oh!

sephml commented Apr 11, 2025

Uh oh!

algorithms-keeper bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

algorithms-keeper bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sephml commented Apr 16, 2025

Uh oh!

sephml commented Apr 16, 2025

Uh oh!

sephml commented Apr 23, 2025

Uh oh!

Uh oh!

Uh oh!

added multi armed bandit problem with three strategies to solve it #12668

Are you sure you want to change the base?

added multi armed bandit problem with three strategies to solve it #12668

Uh oh!

Conversation

sephml commented Apr 11, 2025

Describe your change:

Checklist:

What is added?

Uh oh!

algorithms-keeper bot left a comment

Choose a reason for hiding this comment

🔗 Relevant Links

Repository:

Python:

Automated review generated by algorithms-keeper. If there's any problem regarding this review, please open an issue about it.

algorithms-keeper actions can be triggered by commenting on this PR:

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

algorithms-keeper bot left a comment

Choose a reason for hiding this comment

🔗 Relevant Links

Repository:

Python:

Automated review generated by algorithms-keeper. If there's any problem regarding this review, please open an issue about it.

algorithms-keeper actions can be triggered by commenting on this PR:

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sephml commented Apr 16, 2025

Uh oh!

sephml commented Apr 16, 2025

Uh oh!

sephml commented Apr 23, 2025

Uh oh!

Uh oh!