This is the complete code for running algorithms on Risk-Sensitive Q-Value Iteration, Risk-Sensitive Q Learning and Dual Q-Learning (Risk-Sensitive + Classical). It has been written for a queuing system with buffer length N=20 and desirable length M=18.