Venske S, de Almeida C and Delgado M.
(2024). Metaheuristics and machine learning: an approach with reinforcement learning assisting neural architecture search. Journal of Heuristics. 30:3-4. (199-224). Online publication date: 1-Aug-2024.
Du B, Qian K, Claudel C and Sun D. Multiagent Online Source Seeking Using Bandit Algorithm. IEEE Transactions on Automatic Control. 10.1109/TAC.2022.3232190. 68:5. (3147-3154).
Liu X, Zuo J, Wang S, Joe-Wong C, Lui J and Chen W. Batch-size independent regret bounds for combinatorial semi-bandits with probabilistically triggered arms or independent arms. Proceedings of the 36th International Conference on Neural Information Processing Systems. (14904-14916).
Pesquerel F, Saber H and Maillard O. Stochastic bandits with groups of similar arms. Proceedings of the 35th International Conference on Neural Information Processing Systems. (19461-19472).
Zhang R and Combes R. On the suboptimality of thompson sampling in high dimensions. Proceedings of the 35th International Conference on Neural Information Processing Systems. (8345-8354).