python (12.9k questions)
javascript (9.2k questions)
reactjs (4.7k questions)
java (4.2k questions)
c# (3.5k questions)
html (3.3k questions)
Shaping theorem for MDPs
I need help understanding the shaping theorem for MDPs. Here's the relevant paper: https://people.eecs.berkeley.edu/~pabbeel/cs287-fa09/readings/NgHaradaRussell-shaping-ICML1999.pdf. It basically ...
Garrett Baker
Votes: 0
Answers: 1
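For orientation, here is a brief statement of the main result from the linked paper (Ng, Harada and Russell, 1999), since the excerpt above is cut off. The symbols M, M', \Phi and F follow the usual presentation of the theorem and are not taken from the question text itself, so treat this as a summary sketch rather than what the asker wrote.

```latex
% Original MDP M = (S, A, T, \gamma, R); shaped MDP M' = (S, A, T, \gamma, R + F).
% F is a potential-based shaping function if, for some \Phi : S \to \mathbb{R},
\[
  F(s, a, s') = \gamma\,\Phi(s') - \Phi(s).
\]
% Then every optimal policy of M' is optimal in M (and vice versa), and
\[
  Q^{*}_{M'}(s, a) = Q^{*}_{M}(s, a) - \Phi(s).
\]
```

Because the shift \Phi(s) does not depend on the action, the argmax over actions is unchanged in every state, which is why the optimal policy is preserved.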
How should I code the Gambler's Problem with Q-learning (without any reinforcement learning packages)?
I would like to solve the Gambler's problem as an MDP (Markov Decision Process).
Gambler's problem: A gambler has the opportunity to make bets on the outcomes of a sequence of coin flips. If the coin ...
Dalma Tóth-Lakits
Votes: 0
Answers: 1
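A minimal sketch of tabular Q-learning for this kind of problem, using only the standard library. The excerpt above is truncated, so the details here are assumptions borrowed from the usual textbook formulation: the gambler wins on reaching `goal`, loses on reaching 0, stakes between 1 and `min(s, goal - s)`, and receives reward 1 only on reaching the goal. The names `q_learning_gambler` and `greedy_policy` and all default parameter values are hypothetical.

```python
import random


def q_learning_gambler(goal=100, p_heads=0.4, episodes=20000,
                       alpha=0.1, gamma=1.0, epsilon=0.1, seed=0):
    """Tabular Q-learning for an assumed version of the Gambler's problem."""
    rng = random.Random(seed)
    # q[s][a]: capital s in 1..goal-1, stake a in 1..min(s, goal - s).
    q = {s: {a: 0.0 for a in range(1, min(s, goal - s) + 1)}
         for s in range(1, goal)}

    for _ in range(episodes):
        s = rng.randint(1, goal - 1)  # random starting capital
        while 0 < s < goal:
            actions = q[s]
            # Epsilon-greedy choice of stake.
            if rng.random() < epsilon:
                a = rng.choice(list(actions))
            else:
                a = max(actions, key=actions.get)

            # Coin flip: win the stake on heads, lose it on tails.
            s_next = s + a if rng.random() < p_heads else s - a
            reward = 1.0 if s_next == goal else 0.0

            # Terminal states (0 and goal) have value 0.
            best_next = max(q[s_next].values()) if 0 < s_next < goal else 0.0

            # Q-learning update.
            actions[a] += alpha * (reward + gamma * best_next - actions[a])
            s = s_next
    return q


def greedy_policy(q):
    """Stake with the highest learned Q-value for each capital level."""
    return {s: max(acts, key=acts.get) for s, acts in q.items()}
```

With reward 1 only at the goal and `gamma=1.0`, each learned Q-value approximates the probability of eventually reaching the goal, so calling `greedy_policy(q_learning_gambler(goal=10, episodes=5000))` on a small instance is a quick way to inspect the learned stakes.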