biwako commited on
Commit
a9a7094
·
1 Parent(s): a896e35

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -5
README.md CHANGED
@@ -19,6 +19,7 @@ model-index:
19
  value: 261.64 +/- 17.88
20
  name: mean_reward
21
  verified: false
 
22
  ---
23
 
24
  # **PPO** Agent playing **LunarLander-v2**
@@ -26,12 +27,20 @@ This is a trained model of a **PPO** agent playing **LunarLander-v2**
26
  using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
27
 
28
  ## Usage (with Stable-baselines3)
29
- TODO: Add your code
30
-
31
 
32
  ```python
33
- from stable_baselines3 import ...
34
- from huggingface_sb3 import load_from_hub
35
-
 
 
 
 
 
 
 
 
 
36
  ...
37
  ```
 
19
  value: 261.64 +/- 17.88
20
  name: mean_reward
21
  verified: false
22
+
23
  ---
24
 
25
  # **PPO** Agent playing **LunarLander-v2**
 
27
  using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
28
 
29
  ## Usage (with Stable-baselines3)
30
+ Configurations/Parameters
 
31
 
32
  ```python
33
+ model = PPO(
34
+ policy="MlpPolicy",
35
+ env=env,
36
+ n_steps=2048,
37
+ batch_size=64,
38
+ n_epochs=6,
39
+ gamma=0.999,
40
+ gae_lambda=0.98,
41
+ ent_coef=0.01,
42
+ verbose=1,
43
+ )
44
+ total_timesteps=750000
45
  ...
46
  ```