Puzzle2048
Puzzle2048(
size
)
A 2048 sliding-tile puzzle environment.
Arguments
size: Board size. Judging by the parameter name, presumably the number of cells along each side of the square board.
Methods:
.close
.close()
Close the environment and release any resources it holds, such as a render window.
Takes no arguments.
.render
.render(
mode = 'human'
)
Render the current state of the board.
Arguments
mode (str): Rendering mode. Defaults to 'human'.
.reset
.reset()
Reset the environment to a fresh starting board.
Takes no arguments. Under the Gym convention, this returns the initial observation.
.seed
.seed(
seed = None
)
Seed the environment's random number generator.
Arguments
seed: Seed value. Defaults to None.
.step
.step(
action
)
Apply one action and advance the environment by one step.
Under the Gym convention, the result supplies the next state, reward, and terminal flag of the transition.
Arguments
action: The move to apply to the board.
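The method names listed above (seed, reset, step, render, close) match the classic Gym Env interface, so an episode is usually driven by a seed / reset / step loop. The sketch below illustrates that call pattern only. ToyPuzzle2048 is a hypothetical stand-in written for this example, not the library's Puzzle2048: it models a single "slide left" move, ignores the action value, ends the episode when the board is full, and assumes that step returns the older Gym 4-tuple (observation, reward, done, info).

```python
import random


class ToyPuzzle2048:
    """Hypothetical stand-in that mimics the method names of Puzzle2048."""

    def __init__(self, size):
        self.size = size
        self.rng = random.Random()
        self.board = [[0] * size for _ in range(size)]

    def seed(self, seed=None):
        # Re-create the generator so that runs with the same seed match.
        self.rng = random.Random(seed)
        return [seed]

    def _spawn(self):
        # Place a 2 on a randomly chosen empty cell, if any cell is empty.
        empty = [(r, c) for r in range(self.size)
                 for c in range(self.size) if self.board[r][c] == 0]
        if empty:
            r, c = self.rng.choice(empty)
            self.board[r][c] = 2

    def reset(self):
        self.board = [[0] * self.size for _ in range(self.size)]
        self._spawn()
        self._spawn()
        return [row[:] for row in self.board]

    def step(self, action):
        # Assumption: only "slide left" is modelled; `action` is ignored.
        reward = 0
        new_board = []
        for row in self.board:
            tiles = [v for v in row if v]
            merged, i = [], 0
            while i < len(tiles):
                if i + 1 < len(tiles) and tiles[i] == tiles[i + 1]:
                    merged.append(tiles[i] * 2)
                    reward += tiles[i] * 2
                    i += 2
                else:
                    merged.append(tiles[i])
                    i += 1
            new_board.append(merged + [0] * (self.size - len(merged)))
        self.board = new_board
        self._spawn()
        # Simplified termination: the episode ends once the board is full.
        done = all(v != 0 for row in self.board for v in row)
        return [row[:] for row in self.board], reward, done, {}

    def render(self, mode='human'):
        for row in self.board:
            print(' '.join(str(v).rjust(4) for v in row))

    def close(self):
        pass


def run_episode(env, max_steps=200):
    """Drive one episode and return (total_reward, steps_taken)."""
    env.reset()
    total, steps = 0, 0
    for _ in range(max_steps):
        _, reward, done, _ = env.step(0)
        total += reward
        steps += 1
        if done:
            break
    env.close()
    return total, steps
```

With the real class, the same loop would construct Puzzle2048(size) in place of the stand-in and choose action from whatever action encoding the environment defines; that encoding is not documented here.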