Proto commits in cogment/cogment-verse

These 46 commits are when the Protocol Buffers files have changed:

2023-08-22

Commit:	ebff5bf
Author:	William Duguay	2023-08-22 10:12:40 -0400
Committer:	GitHub	2023-08-22 10:12:40 -0400

Bump and clean dependencies (#187) * update requirements * update requirements and test experiments * remove supersuit installation instructions * correct PettingZoo references * gitlab test * full gymnasium support * self review * linters

The documentation is generated from this commit.

2023-08-09

Commit:	38f35d0
Author:	wduguay-air	2023-08-09 17:59:55 -0400

merge 142-bump-dependencies

Commit:	ce70763
Author:	wduguay-air	2023-08-09 16:05:55 -0400

merge main

2023-07-21

Commit:	f68ca7f
Author:	wduguay-air	2023-07-20 21:03:59 -0400

full gymnasium support

The documentation is generated from this commit.

2023-07-14

Commit:	bd95827
Author:	Clodéric Mars	2023-07-14 12:51:49 -0400
Committer:	GitHub	2023-07-14 12:51:49 -0400

Centralize a bunch of boilerplate code in shared [environment / actor / sample producer] session (#178) * Introduce the session helper in the behavior cloning code * Transparent addition of the session helpers as mixin * Classic pettingzoo environment using the helper * tutorials * updated tutorial doc * simple_dqn * td3 * sac * appo * ppo_atari * isaac gym and overcooked_ai * linters * test hydra config composition before full runs * linters * Naming + Documentation * Fix tutorials * remove web proto files * revert gitignore --------- Co-authored-by: wduguay-air <william@ai-r.com>

2023-07-10

Commit:	b8eaba2
Author:	William Duguay	2023-07-10 13:55:56 -0400
Committer:	GitHub	2023-07-10 13:55:56 -0400

Add Typescript and Rebuild web app (#185) * add typescript and rebuild web app * remove untracked files accidentaly tracked * gitignore * removed tracked files to untrack * update gitignore

2023-06-14

Commit:	607a35b
Author:	wduguay-air	2023-06-14 09:40:51 -0400

fix multi env spec support

2023-06-12

Commit:	7651033
Author:	wduguay-air	2023-06-12 08:45:59 -0400

fix web app

2023-06-08

Commit:	dd604d1
Author:	Clodéric Mars	2023-06-08 11:56:01 -0400
Committer:	GitHub	2023-06-08 11:56:01 -0400

Extract environments frontend from the main SDK (#174)

2023-05-26

Commit:	91c4380
Author:	wduguay-air	2023-05-26 11:19:11 -0400

base integration of hf hub actor

2023-05-24

Commit:	33acefa
Author:	Clodéric Mars	2023-05-24 14:52:12 -0400
Committer:	GitHub	2023-05-24 14:52:12 -0400

Update readme & license (#169) * Update the doc and citation file * Update license

2023-04-29

Commit:	922556f
Author:	Clodéric Mars	2023-04-29 14:35:32 -0400
Committer:	Clodéric Mars	2023-04-29 14:37:48 -0400

WIP

2023-04-21

Commit:	64d6e6d
Author:	William Duguay	2023-04-21 16:13:46 -0400
Committer:	GitHub	2023-04-21 16:13:46 -0400

Migrate to Model Registry V2 (#144) * space serialization * web app * linters * linters * linters * gitignore * model_registry_v2 and smoke tests * remove smoke tests * remove changes from Smoke test PR * remove python-version * self-review * self-review * self-review * linters * linters * use pysdk model_registry_v2 * remove old model registry * fix config * fix config * self review * adapt new actors * lintersd * fix pong impl * soft-actor critic test config * rename model_version to model_iteration * renaming * track latest model changes * address PR comments * latest model * fix model tracking. Bump cogment. Fix dependencies with SuperSuit

2023-04-13

Commit:	f03b619
Author:	Luong-Ha Nguyen	2023-04-13 09:44:41 -0400
Committer:	GitHub	2023-04-13 09:44:41 -0400

Multi-agent RL for Petting Zoo (#135) * Single python entry point & hydra based configuration * Introducing DQN * Add support for petting zoo classic environment - action mask - turn based play - connect four UI * Self play and HILL DQN training for connect four * Update readme with missing dependencies * Fix issue in the mountain car bc experiment conf * Fix bug for linux * Using cogment 2.5.0 * add debugger for docs in the next branch (#82) * add debugger for docs in the next branch * correct the typo * Pytorch multiproc fix (#81) * Fix config bug * Fix pytorch multiproccessing bug * Move torch specific multiprocessing config to torch imports * Add space to pass license test Co-authored-by: saikrishnagv_1996 <saikrishnagv1996@gmail.com> * fix SimpleQueue issue (#83) * fix SimpleQueue issue * add a2c run config and reference in experiment * ToDos * black formatting * fix override_run in simple_a2c/cartpole * fix log_metric bug (#86) * Upgrading Cogment and Gym (#87) * Dev ppo (#91) * add ppo for continuous actions * fix lint * remove debug folder * fix lint * fix lint * refactor gym env wrapper * latest code * multi envs with a single agent for ppo * remove debugger * latest code * clean code * fixed lint * remove time profiler results * uncomment the render option for gym * revised version according to the reviewers feedback * fixe bugs in gyms (#98) * remove duplicates in gym adapter * Hot fix (#100) * remove duplicates in gym adapter * remove duplicates in gym adapter * fix lint * TD3 (#94) * TD3 * add exploration noise, black format * remove unused imports * new test * hacky fix to sample_space * add random action * current * fix shape bug in critic loss, add extra model params * current * converrging rewards afrer 400K time steps * black formatting * fix pylint issues * chill out pylint * disable pylint in td3.py * Isaac gym integration (#84) * isaac adapter, configs * add isaacgym requirements * arrange imports * add isaacgym installation instructions in README * fix import issue * edit requirements, readme, mlflow port * fixes * hydra config * add extra instructions for isaac gym * add full readme instructions * black format * Update config.yaml * Update import_class.py * Update environment.py * remove unused imports * disable pylint * pylint * pylint test * fixed some issues for isaac gym * black format * fixed some bugs * fix vugs (#102) * fix vugs * fix gym adapter Co-authored-by: Luong-Ha Nguyen <ha@Luong-Has-MBP.air> * black formatting * fix pylint * pylint * handle both discrete and continous actions * no cuda imposition in simple_a2c * config.environment_specs * pylint fix in TD3 * modified random action size * put replaybuffer back * fixed lint Co-authored-by: Luong-Ha Nguyen <ha@Luong-Has-MacBook-Pro.local> Co-authored-by: Luong-Ha Nguyen <ha@Luong-Has-MBP.air> Co-authored-by: Luong-Ha Nguyen <luongha.nguyen@gmail.com> * add soft actor critic * add sac actor * add SAC optimizer * add working pipeline * fixed sac policy formulations * fixed bugs in SAC actor * fixed license * set separate learning rate for policy and value networks * add new set of hyperparameters for SAC * Adapt PPO to petting zoo atari * Add training part for petting zoo pong * Add UI test * Resolve conflits * Add selfplay petting zoo * Add working UI for pong * Add hill ppo for petting zoo * Add control keyboards for petting zoo pong * Add feedback icons * Add colored buttons for UI * Refactor UI for human feedback * Add data buffer for human data and update README for petting zoo * Fix typos in README * Fix lint * Add player name for human feedback UI * Add neuralfeedback and feedback frequency to UI * Defaulting training device to CPU * Take all observations of an episode for training (pz pong) * Fix typo * Fix bugs in loglikehood * Add dqn for petting zoo * Fit UI to screen size and Modify data collection for ppo * simple PPO * hyperparams * Add new sets of hyperparameters for pong * Add model registry for videos * Modify hyperparameter values * Adapt to new main branch * Remove component folders * Adapt PPO to gym spaces * Adapt petting zoo to cogverse * Remove merged file * Fix lint * Fix license * fix lint * fix black * delete trained model for pong * Modify unitest * Fix serialization format * Speccify the serialization format for unitest * Modify the code based on the review * Fix typos in readme * change run command to console * Fix typo developpment setup * Add hydra in requirements * Remove duplicated hydra * Remove redundant comments * Add evaluator class and handling random seed * Remove unsed import * fix: modify config experiment for ppo * refactor: modify the config file for improving unittest * refactor: unittest config file --------- Co-authored-by: Clodéric Mars <cloderic@ai-r.com> Co-authored-by: Josh <josh@ai-r.com> Co-authored-by: joshair <109359509+joshair@users.noreply.github.com> Co-authored-by: saikrishnagv_1996 <saikrishnagv1996@gmail.com> Co-authored-by: vabdollahi <vahid@ai-r.com> Co-authored-by: Luong-Ha Nguyen <luong.ha.nguyen@notostechnologies.com> Co-authored-by: Luong-Ha Nguyen <ha@Luong-Has-MacBook-Pro.local> Co-authored-by: Luong-Ha Nguyen <ha@Luong-Has-MBP.air>

2023-03-01

Commit:	f338c29
Author:	wduguay-air	2023-03-01 17:22:01 -0500
Committer:	GitHub	2023-03-01 17:22:01 -0500

Support Gym MultiDiscrete MultiBinary (#130) * space serialization * web app * linters * linters * linters * js space serialization and inspector

2023-02-26

Commit:	24d80ee
Author:	wduguay-air	2023-02-26 10:04:51 -0500
Committer:	GitHub	2023-02-26 10:04:51 -0500

117 Add int32 to serialization data types (#118) * fix * fix * test * address review comments * linter * rebuild web app * address review comments --------- Co-authored-by: Clodéric Mars <cloderic@ai-r.com>

2023-01-24

Commit:	a1483cf
Author:	Clodéric Mars	2023-01-24 13:52:33 -0500
Committer:	Clodéric Mars	2023-01-24 14:04:15 -0500

Directly using and serializing gym.spaces and their value (#116) * Directly using and serializing gym.spaces and their value * Introducing debug inspector of received observation on the web side * Take into account review

Commit:	cc4343b
Author:	Clodéric Mars	2022-06-12 17:01:06 -0400
Committer:	Clodéric Mars	2023-01-24 14:04:15 -0500

[BREAKING] Single python entry point & hydra based configuration

Commit:	a2311bf
Author:	Clodéric Mars	2022-12-07 10:18:40 -0500
Committer:	Clodéric Mars	2023-01-24 14:04:15 -0500

Introduce a lobby + fix multiuser joining + instruction for deploying using a tunnelling system (#109) * Fix missing constructor parameters * Introduce a trial lobby * Fix copyright notices * Adding documentation for ngrok based tunnel * Take into account review

2022-12-07

Commit:	6a166a1
Author:	Clodéric Mars	2022-12-07 10:18:40 -0500
Committer:	GitHub	2022-12-07 10:18:40 -0500

2022-07-15

Commit:	4212224
Author:	Clodéric Mars	2022-06-19 23:01:22 -0400
Committer:	Clodéric Mars	2022-07-15 11:21:45 -0400

Add support for petting zoo classic environment - action mask - turn based play - connect four UI

Commit:	68953eb
Author:	Clodéric Mars	2022-06-17 12:59:50 -0400
Committer:	Clodéric Mars	2022-07-15 11:21:38 -0400

Introducing DQN

Commit:	d949a1c
Author:	Clodéric Mars	2022-06-12 17:01:06 -0400
Committer:	Clodéric Mars	2022-07-15 11:21:04 -0400

Single python entry point & hydra based configuration

2022-04-28

Commit:	46e1a9d
Author:	Clodéric Mars	2022-04-28 09:30:57 -0400
Committer:	GitHub	2022-04-28 09:30:57 -0400

Add continuous action space web client for lunar lander (#65) * Introduce `Space` definitions, use in environment specs * Streamline the human player implementation in the web client * Implementing visual Joystick and DPad for lunar lander - Would be easy to add support for other games * Add some styling powered by tailwind * Fix lint issues * Fix remaining formatting issues * Update environment_adapter.py * flatten dimensions * black * Revert "black" This reverts commit 5be497969f72f9c75e13d1dff82babcf51cec5d3. * format agent adapter * Fix format and improve readme Co-authored-by: saikrishnagv_1996 <saikrishnagv1996@gmail.com>

2022-04-18

Commit:	2fd1f99
Author:	saikrishna-1996	2022-04-18 11:36:05 -0400

no contrainer hive

Commit:	6eb691d
Author:	Clodéric Mars	2022-04-18 10:48:14 -0400
Committer:	GitHub	2022-04-18 10:48:14 -0400

Migration to cogment 2.2 - No container (#61)

2022-04-05

Commit:	c23958a
Author:	saikrishnagv_1996	2022-04-05 18:59:24 -0400
Committer:	GitHub	2022-04-05 18:59:24 -0400

Dev sb3 (#54) * Add hugging face - stable baselines 3

2022-03-18

Commit:	7cc93e8
Author:	saikrishnagv_1996	2022-03-17 22:54:18 -0400

add data proto

2022-03-16

Commit:	eec3326
Author:	Sagar Kurandwad	2022-03-16 13:32:17 -0400
Committer:	GitHub	2022-03-16 13:32:17 -0400

Selfplay RL (#45) * Defining multiple environment implementations * add procgen dependency * basic procgen wrapper * Example run config for procgen * add documentation * remove unnecessary whitespace * procgen webclient support * add controls for remaining procgen environments * fix control description * added basic adapter for pybullet driving environment * resolving merge conflicts * added pybullet, resolved import error * required changes to data.proto and training_run.py * solved parameters problem, error while setting joint control in car.py * check git creds * fixed parallel runs issue * selfplayRL * training run changes * reun params * data.proto update * run_params * class name changes * sample_producer * selfplay_agent * refactor * add selfplay_td3 to env * bug fixes * refactoring * change num_players * env cahnges: * pybullet env integration * env switch turns b/w bob and alice; end of trial flag * switch turns bob and alice * switch turns bob and alice * adding turns b/w agents * adding turns b/w agents * turn based agent actions * cleanup * sample producer * get SARSD for both agents * sample producer * debugging * alie rewards * agent implementation * model implementation * add action_dims * replaybuffer * training * learning * agent training and test * rebase * change mlflow port * environment mode * adding exception * updae bob and alice order in trials * hyperparameters and port updates * readme update * cleanup * cleanup * cleanup * cleanup * Licenses * cleanup and pylynt fixes * pylynt changes * pylint changes * license checker * license checker * pylint changes Co-authored-by: Clodéric Mars <cloderic@ai-r.com> Co-authored-by: Jonathan Fisher <jonathan@ai-r.com> Co-authored-by: Kharyal <chaitanyajee@gmail.com> Co-authored-by: saikrishna-1996 <saikrishnagv1996@gmail.com>

2022-03-15

Commit:	2f754c6
Author:	Jonathan Fisher	2022-03-15 16:45:17 -0400

rework muzero networks

2022-03-14

Commit:	e0a13df
Author:	Jonathan Fisher	2022-03-14 13:22:18 -0400
Committer:	GitHub	2022-03-14 13:22:18 -0400

Muzero (#32) * Basic MuZero implementation * Update PyTorch version * Improve test coverage

2022-02-08

Commit:	f54979a
Author:	Clodéric Mars	2022-02-08 14:59:32 -0500
Committer:	GitHub	2022-02-08 14:59:32 -0500

Introduce a `play` run to execute (and observe) a few trials using any actor implementation (#41) * Extract environment spec configuration to its own message * Using environment params in all run configs * Introduce a 'play' run and a 'random' agent implementations * Factorizing the environment specs * Moving the number of player in an environment to the specs * Fix docker compose * Further linter fix * Make sure client containers are removed after each call * Make better usage of model and version user data when saving/loading the models * Add the notion of role to the human actor * Fix python base test * Add documentation

2022-02-01

Commit:	5a42655
Author:	air-sara	2022-02-01 14:45:59 -0500
Committer:	GitHub	2022-02-01 14:45:59 -0500

2.0 (#38) Update to Cogment 2.0

2022-01-27

Commit:	df637e6
Author:	Jonathan Fisher	2022-01-27 16:58:20 -0500
Committer:	GitHub	2022-01-27 16:58:20 -0500

Behavior Cloning Tutorial + Development Mode (#36) * simple BC functional * Lander, cartpole, & mountaincar working * add doc for simple BC * fix lint issue * Change the default mlflow port to 3000 macOS Monterey nos uses port 5000 by default for its AirPlay server * Introduce dev version of the web_client - with sync with local files - + proper production version * Minor refactors * Tutorial steps * Update and unify the poetry installations * Introduce a 'development' mode * Add instructions to build the client when needed * Fix HILL environment rendering * Update simple_bc.md * docker tzdata noninteractive fix * Update README.md * Making the dev mode more resilient * Fix the retrieval of the atari roms * Pinning grpcwebproxy version * Make sure that environment and tf_agents are interrupted properly Co-authored-by: Clodéric Mars <cloderic@ai-r.com> Co-authored-by: saikrishnagv_1996 <sai@ai-r.com> Co-authored-by: saikrishnagv_1996 <saikrishnagv1996@gmail.com>

2022-01-25

Commit:	0dbf54c
Author:	Air-sara	2022-01-25 12:22:25 -0500
Committer:	Air-sara	2022-01-25 12:25:01 -0500

2.0!

2022-01-10

Commit:	bb0591c
Author:	saikrishnagv_1996	2022-01-10 15:18:37 -0500
Committer:	GitHub	2022-01-10 15:18:37 -0500

Merge branch 'main' into dev-hive

2021-12-20

Commit:	7eefa64
Author:	saikrishnagv_1996	2021-12-20 13:16:16 -0500

current version

2021-12-17

Commit:	c070a1c
Author:	Jonathan Fisher	2021-12-17 08:10:05 -0500
Committer:	GitHub	2021-12-17 08:10:05 -0500

Add procgen environments (#28) * Defining multiple environment implementations * add procgen dependency * basic procgen wrapper * Example run config for procgen * add documentation * procgen webclient support Co-authored-by: Clodéric Mars <cloderic@ai-r.com> * add link to details * fix typo * add link to README * Run black linter * fix linter complaints Co-authored-by: Clodéric Mars <cloderic@ai-r.com>

2021-12-11

Commit:	cc57725
Author:	saikrishnagv_1996	2021-12-10 19:03:20 -0500

update atari config, data proto, fixes in td3, training run

2021-11-29

Commit:	96f7f2b
Author:	Clodéric Mars	2021-11-18 11:24:52 -0500
Committer:	Clodéric Mars	2021-11-29 17:46:24 -0500

Defining multiple environment implementations

2021-11-24

Commit:	be12619
Author:	Sagar Kurandwad	2021-11-24 12:53:21 -0500
Committer:	GitHub	2021-11-24 12:53:21 -0500

Reinforce Refactor (#18) * tune hyperparameters * remove probability regularizer * refactor learning * refactor learning * refactor init_model and remove epsilon and epsilon schedule * refactor reinforce and agent adapter * cleanup * cleanup * cleanup * cleanup * refactoring * cleanup * removing double initialization of model * rollout cleanup * cleanup * rollout cleanup * sample producer cleanup * sample producer cleanup * sample producer cleanup * sample producer cleanup * remove torch * cleaning load * removing lr schedule * remove legal_moves * replaybuffer cleanup * remove model_params * replay buffer cleanup * removing third party * cleanup * reinforce docs * remove equation * remove equation * remove equation * apche liceses * new run results * Update REINFORCE.md * Update index.md Co-authored-by: Clodéric Mars <cloderic@ai-r.com>

2021-11-23

Commit:	0486fef
Author:	Vincent ROBERT	2021-11-23 14:13:10 -0500

Support pipe world in the cogver environment

Commit:	e8a5474
Author:	Clodéric Mars	2021-11-22 20:44:51 -0500
Committer:	GitHub	2021-11-22 20:44:51 -0500

Upgrade to cogment-py-sdk 1.3.1 (#21)

2021-11-22

Commit:	542a736
Author:	Vincent ROBERT	2021-11-22 16:51:17 -0500

Inital work to conect with web-client

2021-11-10

Commit:	5dd665d
Author:	Clodéric Mars	2021-11-10 16:02:37 -0500
Committer:	GitHub	2021-11-10 16:02:37 -0500

Introduce very simple A2C implementation (#1) * Add support for providing seed to the environment * Remove unused parts of the configuration * Use the same MLFLOW_TRACKING_URI inside and outside the docker container * Add support for multiple run config message types * Add ability to log full dict or protobuf messages in xp tracker * Improve manual interruption handling * Introduce simple A2C implementation

2021-11-05

Commit:	52f2bc2
Author:	Clodéric Mars	2021-11-05 08:52:59 -0400

Initial implementation including several agents and environments