Statistics > Machine Learning

arXiv:2203.10214 (stat)

[Submitted on 19 Mar 2022 (v1), last revised 25 Mar 2022 (this version, v2)]

Title:Thompson Sampling on Asymmetric $α$-Stable Bandits

Authors:Zhendong Shi, Ercan E. Kuruoglu, Xiaoli Wei

View PDF

Abstract:In algorithm optimization in reinforcement learning, how to deal with the exploration-exploitation dilemma is particularly important. Multi-armed bandit problem can optimize the proposed solutions by changing the reward distribution to realize the dynamic balance between exploration and exploitation. Thompson Sampling is a common method for solving multi-armed bandit problem and has been used to explore data that conform to various laws. In this paper, we consider the Thompson Sampling approach for multi-armed bandit problem, in which rewards conform to unknown asymmetric $\alpha$-stable distributions and explore their applications in modelling financial and wireless data.

Comments:	8 pages, 4 figures
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2203.10214 [stat.ML]
	(or arXiv:2203.10214v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2203.10214

Submission history

From: Zhendong Shi [view email]
[v1] Sat, 19 Mar 2022 01:55:08 UTC (272 KB)
[v2] Fri, 25 Mar 2022 13:59:55 UTC (273 KB)

Full-text links:

Access Paper:

view license

Current browse context:

stat.ML

< prev | next >

new | recent | 2022-03

Change to browse by:

cs
cs.LG
stat

References & Citations

export BibTeX citation

Statistics > Machine Learning

Title:Thompson Sampling on Asymmetric $α$-Stable Bandits

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Thompson Sampling on Asymmetric $α$-Stable Bandits

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators