Releases: whitemech/temprl
Releases · whitemech/temprl
Release v0.3.0
- Simplify APIs of
TemporalWrapper
: removefeature_extractor
andcombine
parameters, as well as reward shaping support. The reason is that these functionalities, in the OpenAI Gym "philosophy", should be delegated to other Gym wrappers, e.g.ObservationWrapper
for combining the features and the automata states. - Remove
flloat
dependency. SinceTemporalGoal
now only requires apythomata.DFA
object, it is up to the user to decide how to generate the reward automaton. - Update dependencies to their latest version, e.g.
pythomata
. - The
reset()
method of the temporal wrapper now first resets the temporal goals, and then makes a step on each of them according to the fluents extracted from the environment's initial state. This is needed because otherwise the initial state of the wrapped environment is ignored. - The support for terminating conditions from the temporal goals is removed. Again, this is because the only job of the DFAs is to provide rewards according to the history of the episode; any other customization of the underlying envrionment, or the wrapper, must be done by using other wrappers.
Release 0.1.2.post1
Merge branch 'release-0.1.2' for postfix 1