Struct renforce::trainer::LSPolicyIteration [−] [src]

pub struct LSPolicyIteration<F: Float> { /* fields omitted */ }

Least-squares Policy Iteration method

Uses LSTD-Q for calculating the Q-function associated with a policy
Only trains linear Q-functions (not currently enforced by library)

Methods

`impl<F: Float> LSPolicyIteration<F>`
[src]

`fn new(gamma: F) -> LSPolicyIteration<F>`

Constructs a new LSPolicyIteration with randomly initialized mean and deviation

`fn gamma(self, gamma: F) -> LSPolicyIteration<F>`

Updates gamma field of self

Trait Implementations

`impl<F: Debug + Float> Debug for LSPolicyIteration<F>`
[src]

`fn fmt(&self, __arg_0: &mut Formatter) -> Result`

Formats the value using the given formatter.

`impl<F: Float + 'static, S: Space, A: Space, T> BatchTrainer<S, A, T> for LSPolicyIteration<F> where T: Agent<S, A> + ParameterizedFunc<F> + FeatureExtractor<S, A, F>`
[src]

`fn train(&mut self, agent: &mut T, transitions: Vec<Transition<S, A>>)`

Trains agent based on the observed transitions

`impl<F: Float> Default for LSPolicyIteration<F>`
[src]

`fn default() -> LSPolicyIteration<F>`

Creates a new LSPolicyIteration with gamma = 0.99

Keyboard Shortcuts

?: Show this help dialog
S: Focus the search field
⇤: Move up in search results
⇥: Move down in search results
⏎: Go to active search result
+: Collapse/expand all sections

Search Tricks

Prefix searches with a type followed by a colon (e.g. fn:) to restrict the search to a given type.

Accepted types are: fn, mod, struct, enum, trait, type, macro, and const.

Search functions by type signature (e.g. vec -> usize or * -> vec)