Puzzle2048

Puzzle2048(
   size
)

2048 puzzle environment.

Arguments

size: Size of the puzzle board.

Methods:

.close

.close()

Closes the environment.

Takes no arguments.

.render

.render(
   mode = 'human'
)

Renders the current state of the puzzle.

Arguments

mode (str): Rendering mode. Defaults to 'human'.

.reset

.reset()

Resets the puzzle to its initial state.

Takes no arguments.

.seed

.seed(
   seed = None
)

Sets the seed for the environment's random number generator.

Arguments

seed: Seed value. Defaults to None.

.step

.step(
   action
)

Applies an action to the puzzle and advances it by one step.

Arguments

action: The action to apply.
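
The methods listed above follow the familiar seed / reset / step / close pattern, so a typical interaction loop might look like the sketch below. This page gives neither the import path of Puzzle2048 nor its return values, so the sketch uses a hypothetical stand-in class, StandIn2048, that mirrors only the documented method names and signatures. Its toy dynamics (one tile spawned per step, the action ignored) and the (state, reward, done, info) return tuple of step are assumptions borrowed from the classic Gym convention, not confirmed behaviour of Puzzle2048. The helper run_episode is likewise illustrative.

```python
import random


class StandIn2048:
    """Hypothetical stand-in; mirrors the documented method names only."""

    def __init__(self, size=4):
        self.size = size
        self._rng = random.Random()
        self.board = [[0] * size for _ in range(size)]

    def seed(self, seed=None):
        # Re-create the private RNG so tile placement is reproducible.
        self._rng = random.Random(seed)
        return [seed]

    def _spawn(self):
        # Place a 2 on a randomly chosen empty cell, if any remain.
        empty = [(r, c) for r in range(self.size)
                 for c in range(self.size) if self.board[r][c] == 0]
        if empty:
            r, c = self._rng.choice(empty)
            self.board[r][c] = 2

    def reset(self):
        self.board = [[0] * self.size for _ in range(self.size)]
        self._spawn()
        return [row[:] for row in self.board]

    def step(self, action):
        # Toy dynamics (assumption): the action is ignored, one tile spawns.
        self._spawn()
        done = all(v != 0 for row in self.board for v in row)
        # Assumed return shape: (state, reward, done, info).
        return [row[:] for row in self.board], 0.0, done, {}

    def render(self, mode='human'):
        # Return the board as text; a real environment may draw it instead.
        return "\n".join(" ".join(str(v) for v in row) for row in self.board)

    def close(self):
        pass


def run_episode(env, seed=0, max_steps=100):
    """Seed, reset, step until done (or max_steps), then close."""
    env.seed(seed)
    state = env.reset()
    steps = 0
    done = False
    while not done and steps < max_steps:
        action = steps % 4  # placeholder policy: cycle through four moves
        state, reward, done, info = env.step(action)
        steps += 1
    env.close()
    return state, steps


if __name__ == "__main__":
    final_state, n_steps = run_episode(StandIn2048(size=2))
    print(n_steps)
```

If the real class follows the same convention, replacing StandIn2048(size=2) with Puzzle2048(size) should be the only change needed; check the actual return value of step before relying on the tuple unpacking.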