Trait renforce::model::Model [] [src]

pub trait Model<S: Space, A: Space> {
    fn transition(&self,
                  curr: &S::Element,
                  action: &A::Element,
                  next: &S::Element)
                  -> f64; fn reward(&self,
              curr: &S::Element,
              action: &A::Element,
              next: &S::Element)
              -> f64; fn update(&mut self, transition: Transition<S, A>); }

Represents a (nondeterministic) model of an environment The model itself is composed of the transition and reward functions

Required Methods

Returns the probabilty of moving from curr to next when performing action

Returns the reward received when moving from curr to next when performing action

Updates the model using information from the given transition

Implementors