Pambudi, Pandu Dwi Luhur. “Adaptive Fuel Subsidy Optimization Using Deep Q-Learning and Bandit-Based Policy Selection: A Simulation Study”. Engineering, MAthematics and Computer Science Journal (EMACS) 7, no. 2 (May 31, 2025): 191–200. Accessed August 4, 2025. https://journal.binus.ac.id/index.php/EMACS/article/view/13419.