ReImproveJS
A framework using TensorFlow.js for Deep Reinforcement Learning
Documentation | NPM | Wiki | Changelog
ReImproveJS
is a little library to create Reinforcement Learning environments with Javascript.
It currently implements DQN algorithm, but aims to allow users to change easily algorithms, like for instance A3C or Sarsa.
The library is using TensorFlow.js as a computing background, enabling the use of WebGL to empower computations.
Getting started
Installation
ReImproveJS is available as a standalone or as a NPM package. As usual, you can use the CDN
or if you have your local version
You can also install it through NPM.
$ npm install reimprovejs
Usage
With ReImproveJS, you have an environment organized as if your agents were part of a "school". The idea is that you are managing
an Academy
, possessing Teachers
and Agents
(Students). You add Teachers
and assign Agents
to them. At each step of
your world, you just need to give the Academy
each Teacher
's input, which will handle everything concerning learning.
Because you are in Reinforcement Learning, you need a Neural Network model in order for your agents to learn. TFJS's Model
is
embedded into a wrapper, and you just need to precise what type of layers you need, and that's all !
For instance :
const modelFitConfig = // Exactly the same idea here by using tfjs's model's epochs: 1 // fit config. stepsPerEpoch: 16; const numActions = 2; // The number of actions your agent can choose to doconst inputSize = 100; // Inputs size (10x10 image for instance)const temporalWindow = 1; // The window of data which will be sent yo your agent // For instance the x previous inputs, and what actions the agent took const totalInputSize = inputSize * temporalWindow + numActions * temporalWindow + inputSize; const network = ;networkInputShape = totalInputSize;network;// Now we initialize our model, and start adding layersconst model = network modelFitConfig; // Finally compile the model, we also exactly use tfjs's optimizers and loss functions// (So feel free to choose one among tfjs's)model
Now that our model is ready, let's create an agent...
// Every single field here is optionnal, and has a default value. Be careful, it may not// fit your needs ... const teacherConfig = lessonsQuantity: 10 // Number of training lessons before only testing agent lessonsLength: 100 // The length of each lesson (in quantity of updates) lessonsWithRandom: 2 // How many random lessons before updating epsilon's value epsilon: 1 // Q-Learning values and so on ... epsilonDecay: 0995 // (Random factor epsilon, decaying over time) epsilonMin: 005 gamma: 08 // (Gamma = 1 : agent cares really much about future rewards); const agentConfig = memorySize: 5000 // The size of the agent's memory (Q-Learning) batchSize: 128 // How many tensors will be given to the network when fit temporalWindow: temporalWindow // The temporal window giving previous inputs & actions; const academy = ; // First we need an academy to host everythingconst teacher = academy;const agent = academy; academy;
And that's it ! Now you just need to update during your world emulation if the agent gets rewards, and feed inputs to it.
// Nice event occuring during world emulation { academy // Give a nice reward if the agent did something nice !} // Bad event { academy // Give a bad reward to the agent if he did something wrong} // Animation loop, update loop, whatever loop you want { let inputs = // Need to give a number[] of your inputs for one teacher. await academy; } // Start your loop (/!\ for your environment, not specific to ReImproveJS).;
Rewards are reset to 0 at each new step.