wrapped_partial
.wrapped_partial(
func, *args, **kwargs
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
_check_tensor_devices
._check_tensor_devices(
*tensors: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
_check_tensors
._check_tensors(
*tensors: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
_get_engine
._get_engine(
*_: Union[Tensor, str]
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
_set_grad
._set_grad(
tensor: Tensor, data
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
_create_tensor
._create_tensor(
*tensors: Tensor, data, func
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
is_tensor
.is_tensor(
obj: object
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
concat_backward
.concat_backward(
gradient: Tensor, tensors: List[Tensor], axis: int = 0
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
stack_backward
.stack_backward(
gradient: Tensor, tensors: List[Tensor], axis: int = 0
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
chunk_backward
.chunk_backward(
gradient: Tensor, tensor: Tensor, chunks: int
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
view_backward
.view_backward(
gradient: Tensor, inp: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
index_select_backward
.index_select_backward(
gradient: Tensor, inp: Tensor, index: Tensor, dim: int
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
squeeze_backward
.squeeze_backward(
gradient: Tensor, inp: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
expand_dim_backward
.expand_dim_backward(
gradient: Tensor, inp: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
transpose_backward
.transpose_backward(
gradient: Tensor, inp: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
absolute_backward
.absolute_backward(
gradient: Tensor, inp: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
around_backward
.around_backward(
gradient: Tensor, inp: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
floor_backward
.floor_backward(
gradient: Tensor, inp: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
ceil_backward
.ceil_backward(
gradient: Tensor, inp: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
clip_backward
.clip_backward(
gradient: Tensor, inp: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
negative_backward
.negative_backward(
gradient: Tensor, inp: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
summation_backward
.summation_backward(
gradient: Tensor, inp: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
mean_backward
.mean_backward(
gradient: Tensor, inp: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
std_backward
.std_backward(
gradient: Tensor, inp: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
var_backward
.var_backward(
gradient: Tensor, inp: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
add_backward
.add_backward(
gradient: Tensor, inp1: Tensor, inp2: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
sub_backward
.sub_backward(
gradient: Tensor, inp1: Tensor, inp2: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
mul_backward
.mul_backward(
gradient: Tensor, inp1: Tensor, inp2: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
div_backward
.div_backward(
gradient: Tensor, inp1: Tensor, inp2: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
power_backward
.power_backward(
gradient: Tensor, inp: Tensor, p: int
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
clone_backward
.clone_backward(
gradient: Tensor, inp: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
relu_backward
.relu_backward(
gradient: Tensor, inp: Tensor, alpha: float
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
sigmoid_backward
.sigmoid_backward(
gradient: Tensor, inp: Tensor, out
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
softmax_backward
.softmax_backward(
gradient: Tensor, inp: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
tanh_backward
.tanh_backward(
gradient: Tensor, inp: Tensor, out: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
dense_backward
.dense_backward(
gradient: Tensor, inp: Tensor, weight: Tensor, bias: Tensor
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
conv_backward
.conv_backward(
gradient: Tensor, inp: Tensor, weight: Tensor, bias: Tensor, stride: int,
padding: Union[List[int], Tuple[int]]
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
dropout_backward
.dropout_backward(
gradient: Tensor, inp: Tensor, mask, keep_prob: float
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
batch_norm_backward
.batch_norm_backward(
gradient: Tensor, inp: Tensor, weight: Tensor, bias: Tensor, training: bool,
**kwargs
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
max_pool_backward
.max_pool_backward(
gradient: Tensor, inp: Tensor, kernel_size: Union[List[int], Tuple[int]],
stride: int, padding: Union[List[int], Tuple[int]], cache: dict
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
avg_pool_backward
.avg_pool_backward(
gradient: Tensor, inp: Tensor, kernel_size: Union[List[int], Tuple[int]],
stride: int, padding: Union[List[int], Tuple[int]]
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
lstm_cell_backward
.lstm_cell_backward(
gradient, inp, all_weights, cache
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
lstm_backward
.lstm_backward(
gradient, inp, all_weights, cache
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
concat
.concat(
tensors: List[Tensor], axis: int = 0
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
stack
.stack(
tensors: List[Tensor], axis: int = 0
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
chunk
.chunk(
tensor: Tensor, chunks: int, dim: int = 0
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
view
.view(
inp, size = None
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
index_select
.index_select(
inp, dim, index
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
zero
.zero(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
one
.one(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
fill
.fill(
inp, value
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
squeeze
.squeeze(
inp, axis = None
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
expand_dim
.expand_dim(
inp, axis = None
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
transpose
.transpose(
inp, axes = None
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
absolute
.absolute(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
around
.around(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
floor
.floor(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
ceil
.ceil(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
clip
.clip(
inp, min_val, max_val
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
negative
.negative(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
summation
.summation(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
mean
.mean(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
std
.std(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
var
.var(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
add
.add(
inp1, inp2
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
sub
.sub(
inp1, inp2
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
mul
.mul(
inp1, inp2
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
div
.div(
inp1, inp2
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
power
.power(
inp, p
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
clone
.clone(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
detach
.detach(
inp, inplace = True
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
arange
.arange(
start = 0, stop = 0, step = 1, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
linspace
.linspace(
start, end, steps, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
normal
.normal(
loc = 0.0, scale = 1.0, size = None, requires_grad = False, device = 'cpu',
dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
uniform
.uniform(
low = -1.0, high = 1.0, size = None, requires_grad = False, device = 'cpu',
dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
rand
.rand(
size, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
randint
.randint(
low = 0, high = 0, size = None, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
randn
.randn(
size, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
eye
.eye(
rows, columns = None, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
empty
.empty(
size, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
full
.full(
size, fill_value, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
zeros
.zeros(
size, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
ones
.ones(
size, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
normal_like
.normal_like(
tensor, loc = 0.0, scale = 1.0, requires_grad = False, device = 'cpu',
dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
uniform_like
.uniform_like(
tensor, low = -1.0, high = 1.0, requires_grad = False, device = 'cpu',
dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
rand_like
.rand_like(
tensor, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
randint_like
.randint_like(
tensor, low = 0, high = 0, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
randn_like
.randn_like(
tensor, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
eye_like
.eye_like(
tensor, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
empty_like
.empty_like(
tensor, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
full_like
.full_like(
tensor, fill_value, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
zeros_like
.zeros_like(
tensor, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
ones_like
.ones_like(
tensor, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
from_array
.from_array(
data, requires_grad = False, device = 'cpu', dtype = 'float32'
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
to_array
.to_array(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
half
.half(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
single
.single(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
double
.double(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
cpu
.cpu(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
gpu
.gpu(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
relu
.relu(
inp, alpha
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
sigmoid
.sigmoid(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
softmax
.softmax(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
tanh
.tanh(
inp
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
dense
.dense(
inp, weight, bias
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
conv
.conv(
inp, weight, bias, stride, padding
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
dropout
.dropout(
inp, keep_prob
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
batch_norm
.batch_norm(
inp, weight, bias, running_mean, running_var, momentum, eps, training
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
max_pool
.max_pool(
inp, kernel_size, stride, padding
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
avg_pool
.avg_pool(
inp, kernel_size, stride, padding
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
lstm_cell
.lstm_cell(
inp, all_weights
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
lstm
.lstm(
inp, all_weights
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
adam
.adam(
params, grads, exp_avgs, exp_avg_sqs, max_exp_avg_sqs, state_steps, amsgrad,
beta1, beta2, lr, weight_decay, eps
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.
rmsprop
.rmsprop(
params, grads, square_avgs, alphas, momentum_buffers, grad_avgs, momentum,
centered, lr, weight_decay, eps
)
Remember the transaction.
Accepts a state, action, reward, next_state, terminal transaction.
Arguments
transaction (abstract): state, action, reward, next_state, terminal transaction.