python (12.9k questions)
javascript (9.2k questions)
reactjs (4.7k questions)
java (4.2k questions)
java (4.2k questions)
c# (3.5k questions)
c# (3.5k questions)
html (3.3k questions)
Compile time distribution strategy issue
i have following code- which tries to implement simple Reinforcement learning environment with keras
import gym
from gym import Env
import numpy as np
from gym.spaces import Discrete,Box
import random...

user466534
Votes: 0
Answers: 1
Shaping theorem for MDPs
I need help with understanding the shaping theorem for MDPs. Here's the relevant paper: https://people.eecs.berkeley.edu/~pabbeel/cs287-fa09/readings/NgHaradaRussell-shaping-ICML1999.pdf it basically ...
Garrett Baker
Votes: 0
Answers: 1
PPO Model not loading
I have trained an agent on colab using openAI gym and stable_baselines(PPO) but when I downloaded the model to local computer I can't load the model and it is throwing error.
model = PPO.load(TRAINED_...
Ashish Kumar
Votes: 0
Answers: 0
Keras GradientType: Calculating gradients with respect to the output node
For startes: this question does not ask for help regarding reinforcement learning (RL), RL is only used as an example.
The Keras documentation contains an example actor-critic reinforcement learning i...
CLRW97
Votes: 0
Answers: 1