python (12.9k questions)
javascript (9.2k questions)
reactjs (4.7k questions)
java (4.2k questions)
java (4.2k questions)
c# (3.5k questions)
c# (3.5k questions)
html (3.3k questions)
How should I code the Gambler's Problem with Q-learning (without any reinforcement learning packages)?
I would like to solve the Gambler's problem as an MDP (Markov Decision Process).
Gambler's problem: A gambler has the opportunity to make bets on the outcomes of a sequence of coin flips. If the coin ...
Dalma Tóth-Lakits
Votes: 0
Answers: 1