Address
Address(
value, process, default_type = 'uint'
)
Deep Deterministic Policy Gradient
Arguments
actor_model (keras.nn.Model instance): See Model for details.
critic_model (keras.nn.Model instance): See Model for details.
optimizer (keras.optimizers.Optimizer instance):
See Optimizer for details.
action_inp (keras.layers.Input / keras.layers.InputLayer instance):
See Input for details.
tau (float): tau.
gamma (float): gamma.
Methods:
.read
.read(
of_type = None, max_length = None, errors = 'raise'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
.write
.write(
data, of_type = None
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
.symbol
.symbol()
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
.get_instruction
.get_instruction()
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
.dump
.dump(
of_type = 'bytes', size = 512, before = 32
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.