Skip to content

Commit 21058fb

Browse files
authored
Merge pull request #80 from pockerman/refactor_actor_critic_agent
Refactor actor critic agent
2 parents 904a76e + 34e2e08 commit 21058fb

31 files changed

+721
-290
lines changed

docs/source/API/a2c.rst

Lines changed: 0 additions & 13 deletions
This file was deleted.

docs/source/API/algorithms/a2c.rst

Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,11 @@
1+
a2c
2+
===
3+
.. automodule:: a2c
4+
:members: create_discounts_array, calculate_discounted_returns
5+
6+
.. autoclass:: A2CConfig
7+
8+
.. autoclass:: _ActResult
9+
10+
.. autoclass:: A2C
11+
:members: __init__, share_memory, parameters, on_episode, default_action_sampler, from_path, _do_train
File renamed without changes.

docs/source/API/epsilon_greedy_q_estimator.rst

Lines changed: 0 additions & 9 deletions
This file was deleted.
File renamed without changes.
Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,10 @@
1+
pytorch\_trainer
2+
================
3+
4+
.. automodule:: pytorch_trainer
5+
:members: worker
6+
7+
.. autoclass:: PyTorchTrainerConfig
8+
9+
.. autoclass:: PyTorchTrainer
10+
:members: __init__, avg_rewards, avg_distortion, actions_before_training, actions_before_episode_begins, actions_after_episode_ends, train, actions_after_training
File renamed without changes.

docs/source/conf.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,7 @@
2222
sys.path.append(os.path.abspath("../../src/utils/"))
2323
sys.path.append(os.path.abspath("../../src/datasets/"))
2424
sys.path.append(os.path.abspath("../../src/networks/"))
25+
sys.path.append(os.path.abspath("../../src/trainers/"))
2526
print(sys.path)
2627

2728

docs/source/modules.rst

Lines changed: 7 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -4,23 +4,25 @@ API
44
.. toctree::
55
:maxdepth: 4
66

7-
API/epsilon_greedy_policy
8-
API/epsilon_greedy_q_estimator
9-
API/q_learning
10-
API/trainer
7+
API/algorithms/epsilon_greedy_q_estimator
8+
API/algorithms/a2c
9+
API/algorithms/q_learning
10+
API/trainers/trainer
11+
API/trainers/pytorch_trainer
1112
API/datasets/column_type
1213
API/exceptions/exceptions
1314
API/maths/optimizer_type
1415
API/maths/pytorch_optimizer_builder
1516
API/networks/a2c_networks
17+
API/policies/epsilon_greedy_policy
1618
API/spaces/actions
1719
API/spaces/action_space
1820
API/spaces/state
1921
API/spaces/discrete_state_environment
2022
API/spaces/tiled_environment
2123
API/spaces/time_step
2224
API/replay_buffer
23-
API/a2c
25+
2426

2527

2628

0 commit comments

Comments
 (0)