Struct renforce::trainer::LSPolicyIteration [] [src]

pub struct LSPolicyIteration<F: Float> { /* fields omitted */ }

Least-squares Policy Iteration method

Methods

impl<F: Float> LSPolicyIteration<F>
[src]

Constructs a new LSPolicyIteration with randomly initialized mean and deviation

Updates gamma field of self

Trait Implementations

impl<F: Debug + Float> Debug for LSPolicyIteration<F>
[src]

Formats the value using the given formatter.

impl<F: Float + 'static, S: Space, A: Space, T> BatchTrainer<S, A, T> for LSPolicyIteration<F> where T: Agent<S, A> + ParameterizedFunc<F> + FeatureExtractor<S, A, F>
[src]

Trains agent based on the observed transitions

impl<F: Float> Default for LSPolicyIteration<F>
[src]

Creates a new LSPolicyIteration with gamma = 0.99