PAMBUDI, Pandu Dwi Luhur. Adaptive Fuel Subsidy Optimization Using Deep Q-Learning and Bandit-Based Policy Selection: A Simulation Study. Engineering, MAthematics and Computer Science Journal (EMACS), [S. l.], v. 7, n. 2, p. 191–200, 2025. DOI: 10.21512/emacsjournal.v7i2.13419. Available at: https://journal.binus.ac.id/index.php/EMACS/article/view/13419. Accessed: 4 Aug. 2025.