Граф коммитов

3529 Коммитов

Автор SHA1 Сообщение Дата
Miguel Alonso Jr 63b71a8d17
Updated yamato pytest GPU tests for torch 1.13.1 (#5985) 2023-10-05 20:23:13 -04:00
Miguel Alonso Jr fb24597b16
Develop sentis upgrade (#5979)
* Commiting changes.

* Initial barracuda 4 upgrade.

* Play mode tests passing.

* Edit mode tests passing.

* Training fixes.

* Fixed performance issue with stacking sensor.

* Fixed failing tests and issue with stacking sensor.

* Updated examples for barracuda 4 upgrade.

* Fixed issue with attention ONNX export w.r.t. dimensions.

* Fixed issue with Buffer Sensor and Recurrent In/Out.

* Retrained old policies and updated with ONNX policies. Deprecated old policy versions.

* Saving work.

* Saving work.

* Updating to Sentis 1.1.1-exp.2

* Fixed more errors with Sentis upgrade.

* Fixed tensor allocation issue in TensorUtils.ResizeTensor. Inference is working for 3DBall with Sentis.

* Fixed broken Sentis model links for some example environments.

* Fixed some broken edit mode tests.

* Fixed some failing tests.

* Fixing bugs with GPU inference on Sentis.

* Updated packages lock and onnx meta files.

* Refactoring all Barracuda related naming to Sentis.

* Python max version bump.

* Precommit fixes.

* Pinned tensorboard version

* Revert tensorboard version.

* Fixed rpc tests.

* Fixed failing python tests.

* Fixed some more failing tests. Added six as an explicit dependency due to tensorboard requirements.

* gha fix.

* Updated environment registry for Sentis.

* Fixed texture sensor test.

* Develop python 3.10 (#5981)

* Deprecated python 3.8.x and 3.9.x.

* Updated colab gha test to 3.10.12

* Updated colabs for Sentis and python 3.10.

* Test fix.

* Minor update to colabs.

* Develop torch 1.13.1 (#5982)

* Bumped PyTorch version to 1.13.1

* Added potential fixes to model overrider TBD at a later date.

* Updated changelog.

* Updated protobufs. (#5983)

* Updated training init tests to remove inference test temporarily. (#5984)
2023-10-05 18:28:39 -04:00
Miguel Alonso Jr 8bcedabd80
Updated DevProject physics settings to turn on autosync transforms. (#5960) 2023-08-07 18:32:02 -07:00
Miguel Alonso Jr 4ea89950c4
Update to 2022.3 LTS and batched raycasts (#5950)
* Updated to Unity 2022.3.

* Added support for batched raycasting in ray perception sensor. Only for 3D (batched raycasting is only supported in 3D).

* Added batched raycast tests and docs.

* Updated meta file validation exclude list.

* Added unity physics module dependency to core package.
2023-07-25 11:50:49 -07:00
Miguel Alonso Jr 4cfe3c02aa
Fixed ws on contributing.md. (#5953) 2023-07-24 11:06:19 -04:00
omartheironman 10a3e07055
validate catalog-info.yaml (#5880)
Co-authored-by: Sourcegraph <batch-changes@sourcegraph.com>
2023-07-24 10:54:07 -04:00
Jesse Talavera-Greenberg 47d0785623
Update System.Interactive.Async.dll.meta (#5951) 2023-07-24 10:49:11 -04:00
Yoldaş Ulaş 1f43265ea4
Contributing md update (#5933)
* Update Contributing.md

* Update CONTRIBUTING.md

* Update CONTRIBUTING.md
2023-07-24 10:47:49 -04:00
Koki Mitsunami 3fd3fdf157
Updated document on DecisionStep feature in DecisionRequester (#5940)
* Updated docs/Learning-Environment-Design-Agents.md
* Fixed a typo in comments in com.unity.ml-agents/Runtime/DecisioRequester.cs
2023-06-23 11:15:41 -04:00
Koki Mitsunami 8c6629f4a8
Added DecisionStep parameter to the decision requester (#5939)
* Added DecisionStep parameter to the decision requester
* Updated com.unity.ml-agents/Runtime/DecisionRequester.cs

* Added DecisionStep parameter to the decision requester
- Updated the changelog
2023-06-23 09:50:50 -04:00
Thomas Simonini e2e0768fd7
Update ML-Agents to Python 3.10.12 (Colab) (#5929)
* Update setup.py

*python_requires from <= 3.10.11 to 3.10.12

* Update setup.py

*python_requires from <= 3.10.11 to 3.10.12
2023-06-09 13:45:43 -04:00
Thomas Simonini 24f6f34200
Added ELO Rating System Doc (#4685)
* *added ELO rating system doc
*added some illustrations for ELO rating system doc (elo_example,
elo_expected_score_formula, elo_score_update_formula)

* Update ELO-Rating-System.md

* Remove blank space

* Run pre-commit

* Remove tennis illustration link

* Update ELO-Rating-System.md

* Add ELO link

* Pre-commit
2023-06-06 16:41:56 -04:00
Thomas Simonini 06968765ba
Integrating Hugging Face Hub 🤗 (updated) (#5856)
* Add Hugging Face Integration

* Update setup.py

* Update push_to_hf.py

* Update push_to_hf.py

* Update push_to_hf.py

* Update push_to_hf.py

Remove use_auth_token

* Update push_to_hf.py

* Update push_to_hf.py

* Create Huggy

* Update load_from_hf.py

*Change loading to snapshot_download (able to use cache)

* Update push_to_hf.py

* Use create_repo and upload_folder instead of git

* Apply suggestions from code review

Co-authored-by: Lucain <lucainp@gmail.com>
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Delete Huggy + update load and push to hf

* Delete Huggy config file
* Update load_from_hf
* Update push_to_hf

* Apply suggestions from code review

Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
Co-authored-by: Lucain <lucainp@gmail.com>

* Update push_to_hf.py

* Update with Omar Feedback

* Black style formatter

* Create __init__.py

* Updates based on flake8 review

* Change logging to MLAgents logger

* Update python version

* Update Python Version

* Update setup.py

* Update logger

* Update logger

* Pre-commit

* Update Tuple

* Update metadata generation

* Ignore mypy error

* Create Hugging-Face-Integration.md

* Update ML-Agents-Toolkit-Documentation.md

* Update style of Hugging-Face-Integration.md

* Typo

* Remove spaces

---------

Co-authored-by: Lucain <lucainp@gmail.com>
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
2023-06-06 13:10:27 -04:00
Julian Rogawski 9c27e73fac
Remove misleading comment in docs (#5860)
Comment probably a copy/paste mistake. Removing comment as there is no "PPO-specific hyperparameters" section above.
2023-06-05 07:07:54 -04:00
Tyler Lindberg 51f97fb44d
Update to release_20 in installation instructions (#5864)
Based on the README, release_20 seems to be the new latest stable release.
2023-06-05 07:05:42 -04:00
teamclouday 26743c61a3
Update publish_docs.yaml (#5886) 2023-06-05 06:58:46 -04:00
Stefano Probst 6e1b1181e4
Update citation sources in Readme.md (#5909) 2023-06-05 06:57:06 -04:00
Armando Fandango 3f33f65caf
Update setup.py to remove duplicate numpy entry (#5924) 2023-06-05 06:53:40 -04:00
Hunter-Unity 9ee80bf490
Retrain Walker (#5911)
* reduce hidden nodes to 256 and retrain 30M steps

* update demo file
2023-04-27 13:10:41 -04:00
Hunter-Unity 8a4c5d86ee
Normal Sized Walker (#5871)
* Add final trained model

* add better nn model

* decrease sun brightness, remove unused walker prefab

* remove unused meta files

* remove misc files

* Pinned protobuf version to <3.20

* Update max python version to 3.10.11

---------

Co-authored-by: Miguel Alonso Jr <miguel.alonso@unity3d.com>
2023-04-26 11:03:03 -04:00
Jason Rupert cc1a7501c4
Fix bug in visual3dball example. (#5867)
* Revert Max Steps on first agent of visual 3d ball back to 5000.  It was inadvertently changed to 500.

* Revert Max Steps on first agent of visual 3d ball back to 5000.  It was inadvertently changed to 500.
2023-01-25 09:00:12 -08:00
Maryam Honari 5b6cb9878c
update secret (#5846)
* update secret

* update secret yamato format

* echos

* echos

* remove echos
2022-12-15 09:53:29 -05:00
Miguel Alonso Jr 9ee3ff0aee
Convenience Features (#5842)
* Added feature to training area replicator to optionally specify replications in builds only.

* Add ModelCarousel component that cycles through a list of models to show training progress.

* Add usage notes and fix default values

* Added new config feature to distribute checkpointing evenly throughout training.

* Dotnet formatting.

* Updated docs.

* Added tests.

Co-authored-by: Jason Rupert <jason.rupert@unity3d.com>
2022-12-14 13:16:48 -08:00
Miguel Alonso Jr 50d891e127
Fixed python version pinning issue on colab.yml gha. (#5843) 2022-12-08 10:43:04 -08:00
Miguel Alonso Jr 316b6d6ede
Nit to core package, update package manifest, update build provider to address this bug: https://unity.slack.com/archives/C8FECS6L9/p1669646048318809 (#5841) 2022-12-08 12:48:23 -05:00
Miguel Alonso Jr 6bb711f1b0
Fixed gha for move back to public repo. (#5838) 2022-12-05 11:49:42 -05:00
Miguel Alonso Jr 93d3a51439
Changed to material mkdocs theme. Minor doc updates. (#5835)
* Changed to material mkdocs theme. Minor doc updates.

* Updated colors and home banner.
2022-12-02 16:44:59 -05:00
Miguel Alonso Jr 2a888f2363 Merged main into develop 2022-12-02 14:47:44 -05:00
Maryam Honari 28ec36a743
bump release name readme (#5833) 2022-11-28 09:57:47 -05:00
Maryam Honari 23aefec3b4
Dev rel20 post (#5827)
* update relase tag references

* add table in readme (#5816)

* nit updates (#5823)

* prepare main for dev

* update python to v0.31.0-dev0, add unreleased section
2022-11-22 16:40:09 -08:00
Maryam Honari 2335e6fd5f pull development changes to `main` branch (#81)
* yamato promotion min editor bumped to 2020.3 (#34)

* Fix pypi publish github action (#35)

* update logo (#39)

* Move SB3 dependencies to colab (#29)

* Basic example running with multiple envs in sb3

* Merge & cleanup

* trains 3dball

* move SB3 helpers into colab

* remove sb3/supersuit hard reqs

* remove sb3 funcs

* remove unused imports

* remove sb3 example directory and script

* remove vec env tests

* clean colab

* fix link

Co-authored-by: Henry Peteet <henry.peteet@unity3d.com>
Co-authored-by: mahon94 <maryam.honari@unity3d.com>

* bump to barracuda 3.0.0 (#30)

* bump to barracuda 3.0.0 & Updated submodule com.unity.ml-agents

* Feature/mla 2394 update web docs (#37)

* Added mkdocs and refactored docs.

* Updated docs.

* Added gh action to manually public docs. Updated mkdocs.yml.

* fix markdown docs linking issue (#40)

* Updated submodule com.unity.ml-agents (#41)

* Add sonarqube yamato job (#38)

* Pinning markdown link check to 3.8.7 on nightly gha. (#43)

* Update submodules

* pin black version to 22.3.0 (#46)

* fix-sonar-bugs (#44)

* Update submodules

* bump version to 0.29.0 (#42)

* bump version to 0.29.0

* remove dev tag

* add checks for pytorch-gpu install

* pin black version to 22.3.0

* Update submodules

* Update automated trigger references to support develop branch (#36)

* Update automated trigger references to support develop branch

* Update pull request template links to refer to develop branch.

* Fix release branch name for github actions

* Update pre-commit pattern

* Removing test that no longer applys to this repo

* Removed dead links in pull request template. (#48)

* Use pinned version of package-ci/mac image (#50)

* Add shared critic configurability for PPO (#45)

* Update submodules

* Updated Unity version to 2020.3.33f1. (#53)

* Update submodules (#56)

Co-authored-by: GitHub Actions - update submodules <bot@noreply.github.com>

* Add poca reference and update to latex refs (#57)

* add poca ref in docs (#58)

* Update submodules

* Update Training-Configuration-File.md (#59)

* Update Training-Configuration-File.md

Doc says "behavior_cloning" when it should say "behavioral_cloning".

* Update Learning-Environment-Design-Agents.md

Minor correction to wording.  (https://github.com/Unity-Technologies/ml-agents/issues/5745)

* rename ml-agents/trainers/torch to torch_entities (#55)

* rename ml-agents/trainers/torch to torch_entities
* fix mock patch

* Update submodules

* Update docs to explain Alternating Ray Order switch (mla-2537) (#61)

* Update docs to explain Alternating Ray Order switch (mla-2537)

* Update submodules

* Update submodules

* Updated validate meta files. (#65)

* Fix failing gha pytest (#66)

* Updated pytest.

* Updated SURVEY.md to remove dead survey link.

* Updated bug_report.md to update dead link.

* Update submodules

* Fixed upm ci bug in yamato tests. (#67)

* Update submodules

* remove torch shadow in trainers tests (#69)

* extend stale bot to 90 days (#70)

* Update submodules

* Dev bump min ver (#74)

* Bumped minimum version of Unity.

* Bumped minimum version of python and pytorch. fixed issues with deprecation warnings for x.T (tensor transpose).

* Fixed np.bool deprecation warning. This is a port of an external PR: https://github.com/Unity-Technologies/ml-agents/pull/5795. (#71)

* Updated submodule.

* Bumped max ver of python.

* Bumped to 2021.3 in yamato tests.

* Fixed version issue with torch in test constraints.

* update sysroot version

* update toolchain version

Co-authored-by: zhuo <zhuo@unity3d.com>

* Removed old backwards compatibility test since tensorflow/2019.4 are no longer s upported. (#76)

* Develop custom trainers (#73)

* Make create_policy more generic (#54)

* add on/off policy classes and inherit from

* trainers as plugins


* remove swap files

* clean up registration debug

* clean up all pre-commit

* a2c plugin pass precommit

* move gae to trainer utils

* move lambda return to trainer util

* add validator for num_epoch

* add types for settings/type methods

* move create policy into highest level api

* move update_reward_signal into optimizer

* move get_policy into Trainer

* remove get settings type

* dummy_config settings

* move all stats from actor into dict, enables arbitrary actor data

* remove shared_critic flag, cleanups

* refactor create_policy

* remove sample_actions, evaluate_actions, update_norm from policy

* remove comments

* fix return type get stat

* update poca create_policy

* clean up policy init

* remove conftest

* add sharedecritic to settings

* fix test_networks

* fix test_policy

* fix test network

* fix some ppo/sac tests

* add back conftest.py

* improve specification of trainer type

* add defaults fpr trainer_type/hyperparam

* fix test_saver

* fix reward providers

* add settings check utility for tests

* fix some settings tests

* add trainer types to run_experiment

* type check for arbitary actor data

* cherrypick rename ml-agents/trainers/torch to torch_entities (#55)

* make all trainers types and setting visible at module level

* remove settings from run_experiment console script

* fix test_settings and upgrade config scripts

* remove need of trainer_type argument up to trainefactory

* fix gohst trainer behavior id in policy Queue

* fix torch shadow in tests

* update trainers, rl trainers tests

* update tests to match the refactors

* fixing behavior name in ghost trainer

* update ml-agents-envs test configs

* separating the plugin package changes

* bring get_policy back for sake of ghost trainer

* add return types and remove unused returns

* remove duplicate methods in poca (_update_policy, add_policy)

Co-authored-by: mahon94 <maryam.honari@unity3d.com>

* Online/offline custom trainer examples with plugin system (#52)

* add on/off policy classes and inherit from

* trainers as plugins

* a2c trains

* remove swap files

* clean up registration debug

* clean up all pre-commit

* a2c plugin pass precommit

* move gae to trainer utils

* move lambda return to trainer util

* add validator for num_epoch

* add types for settings/type methods

* move create policy into highest level api

* move update_reward_signal into optimizer

* move get_policy into Trainer

* remove get settings type

* dummy_config settings

* move all stats from actor into dict, enables arbitrary actor data

* remove shared_critic flag, cleanups

* refactor create_policy

* remove sample_actions, evaluate_actions, update_norm from policy

* remove comments

* fix return type get stat

* update poca create_policy

* clean up policy init

* remove conftest

* add sharedecritic to settings

* fix test_networks

* fix test_policy

* fix test network

* fix some ppo/sac tests

* add back conftest.py

* improve specification of trainer type

* add defaults fpr trainer_type/hyperparam

* fix test_saver

* fix reward providers

* add settings check utility for tests

* fix some settings tests

* add trainer types to run_experiment

* type check for arbitary actor data

* cherrypick rename ml-agents/trainers/torch to torch_entities (#55)

* make all trainers types and setting visible at module level

* remove settings from run_experiment console script

* fix test_settings and upgrade config scripts

* remove need of trainer_type argument up to trainefactory

* fix gohst trainer behavior id in policy Queue

* fix torch shadow in tests

* update trainers, rl trainers tests

* update tests to match the refactors

* fixing behavior name in ghost trainer

* update ml-agents-envs test configs

* fix precommit

* separating the plugin package changes

* bring get_policy back for sake of ghost trainer

* add return types and remove unused returns

* remove duplicate methods in poca (_update_policy, add_policy)

* add a2c trainer back

* Add DQN cleaned up trainer/optimizer

* nit naming

* fix logprob/entropy types in torch_policy.py

* clean up DQN/SAC

* add docs for custom trainers,TODO: refrence tutorial

* add docs for custom trainers,TODO: refrence tutorial

* add clipping to loss function

* set old importlim-metadata version

* bump precomit hook env to 3.8.x

* use smooth l1 loss

Co-authored-by: mahon94 <maryam.honari@unity3d.com>

* add tutorial for validation

* fix formatting errors

* clean up

* minor changes

Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com>
Co-authored-by: zhuo <zhuo@unity3d.com>

* Trainer qa fix (#78)

* grammer fixes

* fix nit comments from QA

* add info about on/off policy

* add more context to the code block

* more context and minor fix

* pip3

Co-authored-by: zhuo <zhuo@unity3d.com>

* Develop merge submodule (#77)

* Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Arthur Juliani <awjuliani@gmail.com>

* Update changelog

* [MLA-1762] reduce memory allocations from DiscreteActionOutputApplier (#4922)

* add option for Burst inference (#4925)

* surface specific GRPC errors more visibly (#4930)

* Add additional logic to avoid load being called on every advance (#4934)

* [MLA-1767] Refactor communicator connection exceptions (#4935)

* Update changelog for release 13. (#4938)

* Update master versions for release 13. (#4945)

* Release 13 versions. (#4946)

- updated release tag validation script to automate the updating of files with release tags that need to be changed as part of the pre-commit operation.

* Update docs to pass doc validation. (#4953)

* update defines, compile out Initialize body on non-desktop (#4957)

* Adding references to the Extensions package to help promote it. (#4967)

* Adding references to the Extensions package to help promote it.

* Updating header formatting to match the rest of the page

* Adding references to the Extensions package to help promote it. (#4967) (#4968)

Co-authored-by: Marwan Mattar <marwan@unity3d.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* InputActuatorComponent to allow the generation of an action space from an InputActionAsset (#4881)

* InputActuatorComponent to allow the generation of an action space from an InputActionAsset (#4881) (#4974)

* pass sensor dimension flags to analytics (#4954)

* Communicator factory (#4965)

* Update barracuda, swtich Agents in Sorter use Burst. (#4979)

* Update barracuda, swtich Agents in Sorter use Burst. (#4979) (#4981)

* Set ignore done=False in GAIL (#4971)

* MultiAgentGroup Interface (#4923)

* add SimpleMultiAgentGroup

* add group reward field to agent and proto

* Make TrainingAnalyticsSideChannel internal (#4999)

* [MLA-1783] built-in actuator type (#4950)

* Add component menues for some sensors and actuators. (#5001)

* Add component menues for some sensors and actuators. (#5001) (#5002)

* Merge master -> release_13_branch-to-master

* Fix RpcCommunicator merge.

* master -> main. (#5010)

* Adding a name field to BufferSensorComponent

* Adding a note to the CHANGELOG about var len obs

* Adding a helper method for creating observation placeholder names and removed the _h and _c placeholders

* Adding a custom editor for BufferSensorComponent

* adding inheritdoc

* Update cattrs dependencies to support python3.9 (#4821)

* Fix issue with queuing input events that stomp on others. (#5034)

* Update cattrs dependencies to support python3.9 (#4821)

* Fix issue with queuing input events that stomp on others. (#5034)

* Update versions for release 14 hotfix. (#5040)

* master -> main. (#5010) (#5044)

* Update changelog. (#5045)

* [MLA-1809] catch mismatched observation sizes (#5030)

* Update changelog. (#5055)

* Fix xml docs. (#5057)

* pass sensor name through to ObservationSpec (#5036)

* Remove unused allocation (#5068)

* Automatically generate samples based on placement of mlagents-sample.json files in our examples. (#5077)

* Update barracuda to 1.3.2-preview. (#5084)

* POCA trainer (#5005)

Co-authored-by: Ervin Teng <ervin@unity3d.com>
Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>

* [docs] Documentation for POCA and cooperative behaviors (#5056)

Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>

* [docs] Update changelog (#5095)

* Add multiAgentGroup capabilities flag (#5096)

* Add multiAgentGroup capabilities flag

* Add proto

* Fix compiler error

* Add warning for multiagent group

* Add comment

* Fix spelling mistake

* Fix ghost curriculum and make steps private (#5098)

* use get step to determine curriculum

* add to CHANGELOG

* Make step in trainer private (#5099)

Co-authored-by: Ervin T <ervin@unity3d.com>

* [cherry-pick] Fix ghost curriculum and make steps private (#5098)

* use get step to determine curriculum

* add to CHANGELOG

* Make step in trainer private (#5099)

Co-authored-by: Ervin T <ervin@unity3d.com>

* Update changelog for samples. (#5103)

* Update changelog for samples. (#5103) (#5106)

* Update versions on main (#5102)

Increment versions after release 15 branch split

* [release_15] Release 15 update versions (#5101)

* Update versions

* Fix for validate release links

* Update release tag and docs

* [docs] Add missing package docs (#5108)

* Add missing package docs

* Fix grammar

* Update comment on Dispose

* Fix typo in doc

* [docs] Update readme and changelog (#5107)

* Make analytics module an optional dependency. (#5109)

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* Redo dotnet format (#5119)

* [docs] Add links to example envs for Buffer Sensor and MultiAgentGroup (#5116)

* Update changelog with links to example envs

* Cooperative push block link update

* [cherry-pick] Integrate Group Manager to soccer/retrain with POCA (#5115) (#5121)

* Integrate Group Manager to soccer/retrain with POCA (#5115)

* Add Soccer env to changelog

Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>

* [cherry-pick][docs] Add Dungeon Escape Environment (#5133)

* Add DungeonEscape POCA Environment (#5128)

* Add DungeonEscape assets from working branch

* Add Dungeon Escape docs

* Create dungeon_escape.png

* Add to docs

Co-authored-by: Hunter-Unity <hunter@unity3d.com>

* update pre-commit instructions for CONTRIBUTING.md (#5130)

* Remove duplicated changelog

* Fix another duplicated line in the changelog...

* Removing Obsolete methods from the package (#5024)

* Removing Obsolete methods from the package

* Missing depecration and modified changelog

* Readding the obsolete BrainParameter methods, will need a larger discussion on these

* Removing Action Masker, readding the warining when using a non-implemented Heuristic, Removing NumAction from Brain Parameters

* removing documentation and some calls to deprecated methods in the extensions package

* Editing the Changelog to put the unreleased on top

* non-IEnumerable interface for action masking (#5060)

* V2 staging new model version (#5080)

* Make modelCheck have flavors of error messages

* ONNX exporter v3

* Using a better CheckType and a switch statement

* Removing unused message

* More tests

* Use an enum for valid versions and use GetVersion on model directly

* Maybe the model export version a static constant in Python

* Use static constructor for FailedCheck

* Use static constructor for FailedCheck

* Modifying the docstrings

* renaming LegacyDiscreteActionOutputApplier

* removing testing code

* better warning message

* Nest the CheckTypeEnum into the FailedCheck class

* Update com.unity.ml-agents/Runtime/Inference/BarracudaModelParamLoader.cs

* Adding a line explaining that legacy tensor checks are for versions 1.X only

* Modifying the changelog

* Exporting all the branches size instead of omly the sum (#5092)

* addressing comments

* Update com.unity.ml-agents/Runtime/Inference/BarracudaModelParamLoader.cs

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* readding tests

* Adding a comment around the new DiscreteOutputSize method

* Clearer warning : Model contains unexpected input > Model requires unknown input

* Fixing a bug in the case where the discrete action tensor does not exist

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* Make IActuator implement IHeuristicProvider. (#5110)

* Adding more tests for v2 models (#5138)

* Adding more tests for v2 models

* formatting

* fixing formatting

* [MLA-1634] Add ObservationSpec and update ISensor interfaces (#5127)

* Add custom package settings (#5027)

* cleanup: yamato triggers, changelog, docstrings

* clean up merge

* fix compile error

* update c# version to 2.0.0-exp.1 (#5160)

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* Move physics/sensor tests that fail in edit mode depending on what's in the scene to playmode tests. (#5162)

* update package version

* remove defines that are now always true

* changelog

* Remove all old analytics defines. (#5168)

* [MLA-1634] Compression spec (#5164)

* Low hanging fruit tests for coverage. (#5170)

* Update Policies to conditionally compile their analytics method. (#5171)

* Turns physics modules into optional dependencies. (#5112)

* [MLA-1634] Remove SensorComponent.GetObservationShape() (#5172)

* PR Feedback. (#5173)

* change default barracuda behavior (#5175)

* [MLA-1824] make SensorComponent return ISensor[] (#5181)

* Make SensorComponent return an array

* split match3 sensors, partial retrain

* docstrings, migration, changelog, cleanup

* Adding the goal conditioning sensors with the new observation specs (#5159)

* Fixing networks.py for the merge

* fix compile error

* Adding the goal conditioning sensors with the new observation specs

* addressing feedback

* I forgot to change the m_observationType

* Renaming Goal to GoalSignal (#5190)

* Renaming GOAL to GOAL_SIGNAL

* VectorSensorComponent to use new API

* Adding docstrings

* verbose pytest on github action

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* [MLA-1879] culture-invariant sorting for sensors and actuators (#5194)

* Goal conditioning grid world : Example of goal conditioning (#5193)

* Aded the Goal conditioned GridWorld to replace regular gridworld

* adding missing files

* Code improvements

* Documentation change on gridworld

* resolving conflicts

* new model

* Addressing comments

* comments and renames

* Update docs/Learning-Environment-Examples.md

Co-authored-by: Ervin T. <ervin@unity3d.com>

* adding reference to gridworld in docs about goal signal

Co-authored-by: Chris Elion <chris.elion@unity3d.com>
Co-authored-by: Ervin T. <ervin@unity3d.com>

* [MLA-1584] Match3 variable board size (#5189)

* [bug-fix] When agent isn't training, don't clear update buffer (#5205)

* Don't clear update buffer, but don't append to it either

* Update changelog

* Address comments

* Make experience replay buffer saving more verbose

* Changelog for goal conditioning (#5195)

* Changelog for goal conditioning

* adding a line about the changes to the gridworld example

* Addressing comments

* Change capitalization

* [bug fix] Fix warning using demo recorder (#5216)

* Making VectorActionSize and VectorActionSpaceType internal (#5214)

Made sure the editor does not complain;

* Turning some logger.info into logger.debug and remove some logging overhead when not using debug (#5211)

* turning some logger.info into logger.debug and remove some logging overhead when not using debug

* Addressing comments

* Adding to changelog

* rearrange match3 docs (#5215)

* rearrange match3 docs

* changelog and migration from previous PR

* [bug-fix] Fix POCA LSTM, pad sequences in the back (#5206)

* Pad buffer at the end

* Fix padding in optimizer value estimate

* Fix additional bugs and POCA

* Fix groupmate obs, add tests

* Update changelog

* Improve tests

* Address comments

* Fix poca test

* Fix buffer test

* Increase entropy for Hallway

* Add EOF newline

* Fix Behavior Name

* Address comments

* [MLA-1880] Raycast sensor interface improvements (#5222)

* WIP

* remove debug info struct

* cleanup + add to test

* changelog

* fix unit tests

* PR feedback

* Load individual elements if state dict load fails (#5213)


Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
Co-authored-by: Ervin T. <ervin@unity3d.com>

* [MLA-1159] Add virtual methods to DecisionRequester (#5223)

* [Bug Fix] Serialization of vectorActionSpace (#5220)

* [Bug Fix] Serialization of vectorActionSpace

* adding a test

* Clear ActionBuffers before Heuristic calls (#5227)

* Add ObservationType to analytics ObservationSpec (#5218)

* [MLA-1909] Match3 and Camera/RenderTexture sensor GC improvements (#5233)

* avoid empty set iteration, avoid Debug.AssertFormat (#5246)

* avoid empty set iteration, avoid Debug.AssertFormat

* changelog

* Update to barracuda 1.3.3 and changes to the model inputs and outputs for LSTM (#5236)

* Initial commit

* making it work with hallway. Added a new model version

* addressing feedback

* Adding a comment

* formatting

* modifying the changelog

* Adding descriptions on the model version descriptions

* Update com.unity.ml-agents/Runtime/Inference/GeneratorImpl.cs

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* addressing comments

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* [Release 16] Release 16 Merge Back to Main (#5255)

Update versions and documentation for Release 16. 

Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* fix link, run link check in quiet mode (#5257)

* [MLA-1912] Move Match3 to com.unity.ml-agents (#5259)

* Move GridSensor into main package (#5256)

* move OneHotGridSensor into main package

* changelog and migration guide

* remove old doc

* check if physics module presents

* Add advanced installation guide for com.unity.ml-agents (#5260)

* Fix errors when creating new RayCast sensor (#5261)

* check if tag and angle is null

* add test

* changelog

* fix all PVS and doc generation warnings (#5262)

* Fix --results-dir (#5269)

* Fix GridSensorComponent bug (#5270)

* LSTM models from 1.x will be incompatible with MLA 2.x (#5254)

* LSTM models from 1.x will be incompatible with MLA 2.x

* Adding a test and a new v2 model

* Make the Model Runner raise an error if using 1.0 model with LSTM

* adding a new model for hallway trained with 2.0

* reword error messages

* Only raise if error, not if warning

* Addressing comments: The legacy Barrauda memory generator and applier were removed. All code that checked for (memories + v1.X) have been removed since these will no longer be supported

* Modifying the changelog and the migrating guide with this change

* Fixing the merge issues

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* [MLA-1939] Match3 Custom Editor (#5263)

* Rider suggested cleanup, part 1  (#5265)

* Fix the attention module embedding size (#5272)

* Fix the attention module embedding size

* editing the changelog

* [bug-fix] Fix null ref exception in the demo recorder (#5274)

* Fix null ref exception in the demo recorder

* Update changelog

* Clearer logic

* Modifying the changelog for release 17 (#5277)

* Modifying the changelog for release 17

* Change version to 2.0.0-exp.1

* Removing the [Unreleased] section

* check for missing AbstractBoard, display warning (#5276)

* Updating the barracuda 1.4.0 (#5291)

* [debug] Require all behavior names to have a matching YAML entry (#5210)

* Add strict check to settings.py

* Remove warning from trainer factory, add test

* Add changelog

* Fix test

* Update changelog

* Remove strict CLI options

* Remove strict option, rename, make strict default

* Remove newline

* Update comments

* Set default dict to actually default to a default dict

* Fix tests

* Fix tests again

* Default trainer dict to requiring all fields

* Fix settings typing

* Use logger

* Add default_settings to error

* Version bump main (#5279)

* Version bump for the main branch following branching of release_17

* Messed up the extensions package version

* Bring back root reference in grid sensor (#5300)

* [MLA-1952] Add optional seed for gym action spaces (#5303)

* add optional seed for action spaces

* add optional seed for action spaces

* changelog

* undo packages-lock.json change

* Release 17 version bumps and docs version bumps (#5280)

* Migration guide - list removed APIs with replacements (#5307)

* list removed APIs with replacements

* fixes

* [docs] Update changelog (#5308)

* [MLA-1952] Add optional seed for gym action spaces (#5303) (#5315)

* Fix stacked grid sensor (#5335)

* Update to Unity Package licenses (#5340)

* Update to Unity Package licenses

Per Dan

* Updating the main repo license dates.

* Adding a fully connected visual encoder for super small visual input + tests (#5351)

* initial commit for a fully connected visual encoder

* adding a test

* addressing comments

* Fixing error with minimal size of fully connected network

* adding documentation and changelog

* Make OverlapChecker an interface (#5324)

* Add stacking option to VectorSensorComponent (#5376)

* [WIP] [Fix] Fixing collect observation called on done (#5375)

* [WIP] [Fix] Fixing collect observation called on done

* Update com.unity.ml-agents/Runtime/Agent.cs

* ⚠️ Modifying the test of stacking sensor when the agent is done

* modifying the documentation for BufferSensor to specify to call AddObservation in the CollectObservations method

* Fix NullReferenceException for Behavior Parameters without Agent (#5382)

* check agent existence. add warning box

* changelog

* [WIP] 2.0 verified to main (#5385)

* Warn if no joints on Root Body (#5387)

* Modifying the changelog for the continuous action SAC target entropy fix (#5394)

* Editing the GridSensor documentation for 2D use case (#5396)

* Editing the GridSensor documentation for 2D use case

* changing chagelog

* [MLA-2017] Move colab notebooks to github (#5399)

* Fix the reporting of histogram stats and adding a test (#5410)

* Fix the reporting of histogram stats and adding a test

* Appending to the Changelog

* Update changelog before Release 18 branch (#5412)

* Update package versions for main branch (#5413)

* [Release 18] Update versions and links (#5414)

* add changelog entries for PRs that were missing them before. (#5416)

* fix cref error found by doc validation (#5421)

* fix release versions and changelog section

* [bug-fix] Fix when group terminal steps are deleted, robust test (#5441)

* Fix when terminal steps are deleted, robust test

* Update changelog

* Fix test comment

* Fix torch device option in argparser (#5467)

* [FIX] prioritize cli options over yaml (#5495)

* prioritize cli over yaml in checkpointSettings

* prioritize resume if both set in one place

* fixed test_commandline_args

* addressing comments:renaming, commenting

* more tests and referenced in change_log

* Only update the native library and the c# calls that need to be made to clean up channels. (#5283)

* Fix Mac backcompat test (#5519)

* Initialize-from custom  checkpoints  (#5525)

* init from any checkpoint including older ones
* moving init_path logic ahead to learn.py
* fixing pytest to take the full path
* doc & changelog

* Update gym version to 0.20.0 (#5540)

* Fixed failing test due to gym 0.20.0 release. Updated gym-unity package to fix gym version to 0.20.0.

* Updated CHANGELOG.

* Fix VAIL (#5546)

* Custom trainer editor analytics (#5511)

* Custom trainer editor analytics
* inherit the default_training_analytics_side_channel

* Feature/mla 2205 separate schedule lr beta epsilon (#5538)

* Added initial (untested) support for separating schedule for beta, LR, and epsilon.

* Added support for beta and epsilon schedule in test config conversion

* Updated docs. Set epsilon to be linear by default.

* Updated configs to add beta and epsilon schedule where appropriate.

* Addressed CR change requests.

* Minor update.

* Updated docs.

* Added changes to CHANGELOG.

* Updated CHANGELOG.

* Fixed typo in Training-Configuration-File.md

* Reverted config changes to remove beta and epsilon schedules.

* mypy error fixed from pre-commit.

* Fixed a bug that was causing a settings test to fail.

* Addressed CR comments.

Co-authored-by: Miguel Alonso Jr <miguel.alonso@miguel.alonso-00MM>
Co-authored-by: Miguel Alonso Jr <miguelalonsojr>

* Harden user PII protection logic and extend TrainingAnalytics to expose detailed configuration parameters. (#5512)

* Hash128 is not a cryptographic hash, replace with HMAC-SHA256.

* Extend TrainingAnalytics side channel to expose configuration details

* Change member function scopes and hash demo_paths

* Extract tbiEvent hashing method and add test coverage

* Restart crashed Unity environments (#5553)

### Change(s)

Update the SubprocessEnvManager to restart workers when the underlying Unity environments crash. 
When a worker receives an ENV_EXITED signal it will now:

1. Record all failures coming through the step queue and drop all other messages.
2. Purge any pending trajectories as they may belong to a crashed worker or be corrupted.
3. Restart all failed workers (up to a configurable limit)

This behavior can be limited via a rate limit, max lifetime limit, or both. The configuration options for both are shown below with their default values. 

⚠️  Each of these options applies to a single environment, if num_envs > 1 then the limit will apply separately to each replica (num_envs = 2 will spawn 2 Unity environments which can each be restarted 10 times).

```yaml
env_settings:
  # Can restart 10 times over the lifetime of the experiment.
  max_lifetime_restarts: 10
  # Rate limit of 1 failure per 60s
  restarts_rate_limit_n: 1
  restarts_rate_limit_period_s: 60
```

They can of course be passed via CLI arguments as well

```bash
--max-lifetime-restarts
--restarts-rate-limit-n
--restarts-rate-limit-period-s
```

### Disabling this feature
* Rate limiting can be turned off by setting `--restarts-rate-limit-n=-1`
* Lifetime limiting can be turned off by setting `--max-lifetime-restarts=-1`

* Develop add training area replicator (#5568)

* Added training area replicator to com.unity.ml-agents package.

* Added num_areas to Unity RL Initialization proto. Added cli and config file support for num_areas.

* Changed training area replicator to size grid automatically from number of areas.

* Added tests for the training area replicator.

* Added setup for tests for the training area replicator.

* Added comments and updated create tutorial for training area replicator.

* Updated CHANGELOG.

* Fixed some failing tests.

* Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Henry Peteet <henry.peteet@unity3d.com>

* Update docs/Learning-Environment-Create-New.md

Co-authored-by: Henry Peteet <henry.peteet@unity3d.com>

* Update com.unity.ml-agents/Runtime/Areas/TrainingAreaReplicator.cs

Co-authored-by: Henry Peteet <henry.peteet@unity3d.com>

* Addressed CR comments.

Co-authored-by: Miguel Alonso Jr <miguelalonsojr>
Co-authored-by: Henry Peteet <henry.peteet@unity3d.com>

* Fixing the analytics side-channel for curriculum learning. (#5586)

* Fixing the analytics side-channel for curriculum learning.

* Made a more robust test.

* Update the changelog.

* Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

* Method to return stacked observations (#5547)

* Method to return stacked observations

* Added testing and returning staked observation flat.

* Update the comment lines.

* Remove brainstorm commits.

* Upgrade Barracuda to 2.3.1-preview (#5591)

* Upgrade to 2.3.1-preview

* Fix for critic normalization bug (#5595)

* Added normalization to critic during training for poca, ppo, and sac.

* Fixed critic normalization bug. Added new tests to cover.

* Updated CHANGELOG.

* Fixed typo in test_trainers.py

* Deterministic actions python training (#5619)

* Progress on propagating the setting to the action model.

* Added the _sample_action logic and tests.

* Add information to the changelog.

* Prioritize the CLI over the configuration file.

* Update documentation for config file.

* CR refactor.

* Update docs/Training-Configuration-File.md

Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Maryam Honari <honari.m94@gmail.com>
Update ml-agents/mlagents/trainers/settings.py

Co-authored-by: Maryam Honari <honari.m94@gmail.com>
Update ml-agents/mlagents/trainers/cli_utils.py

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

* Fix CR requests

* Add tests for discrete.

* Update ml-agents/mlagents/trainers/torch/distributions.py

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

* Added more stable test.

* Return deterministic actions for training (#5615)

* Added more stable test.

* Fix the tests.

* Fix pre-commit

* Fix help line to pass precommit.

* support for deterministic inference in onnx (#5593)

* Init: actor.forward outputs separate deterministic actions

* changelog

* Renaming

* Add more tests

* Package changes to support deterministic inference (#5599)

* Init: actor.forward outputs separate deterministic actions

* fix tensor shape for discrete actions

* Add test and editor flag

- Add tests for deterministic sampling
- update editor and tooltips

* Reverting to "Deterministic Inference"

* dissect tests

* Update docs

* Update CHANGELOG.md

Co-authored-by: Chingiz Mardanov <chingiz.mardanov@unity3d.com>
Co-authored-by: cmard <87716492+cmard@users.noreply.github.com>

* Revert "Deterministic actions python training (#5619)" (#5622)

This reverts commit 9ea962027d4c581de7c45a07d669f96453236c5c.

* Deterministic actions python training (#5626)

* Progress on propagating the setting to the action model.

* Added the _sample_action logic and tests.

* Add information to the changelog.

* Prioritize the CLI over the configuration file.

* Update documentation for config file.

* CR refactor.

* Update docs/Training-Configuration-File.md

Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Maryam Honari <honari.m94@gmail.com>
Update ml-agents/mlagents/trainers/settings.py

Co-authored-by: Maryam Honari <honari.m94@gmail.com>
Update ml-agents/mlagents/trainers/cli_utils.py

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

* Fix CR requests

* Add tests for discrete.

* Update ml-agents/mlagents/trainers/torch/distributions.py

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

* Added more stable test.

* Return deterministic actions for training (#5615)

* Added more stable test.

* Fix the tests.

* Fix pre-commit

* Fix help line to pass precommit.

* support for deterministic inference in onnx (#5593)

* Init: actor.forward outputs separate deterministic actions

* changelog

* Renaming

* Add more tests

* Package changes to support deterministic inference (#5599)

* Init: actor.forward outputs separate deterministic actions

* fix tensor shape for discrete actions

* Add test and editor flag

- Add tests for deterministic sampling
- update editor and tooltips

* Reverting to "Deterministic Inference"

* dissect tests

* Update docs

* Update CHANGELOG.md

* Fix the deterministic showing up all the tiime (#5621)

Co-authored-by: Chingiz Mardanov <chingiz.mardanov@unity3d.com>
Co-authored-by: cmard <87716492+cmard@users.noreply.github.com>

* Changelog cleanup. (#5627)

* Release cleanup for changelog

* Update CHANGELOG.md

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

* Fix analytics error when compiling XBOX and PS5 (#5628)

* Add ENABLE_CLOUD_SERVICES_ANALYTICS Guards

To use analytics safely you must wrap usage with 
`#if ENABLE_CLOUD_SERVICES_ANALYTICS` 
See https://docs.unity3d.com/ScriptReference/Analytics.Analytics.html

* Update DQN Colab example to support export for Barracuda inference (#5643)

* Update DQN GridWorld colab example to be capable of outputting an ONNX model that works for inference.
-Allow GridWorld to work without a GoalSensor.  Defaults to the GreenPlus target in that case.
-Add GridWorldColab.onnx file, trained and generated with the colab.

* Upgrade DQN GridWorld Colab to produce model that can work with (modified) GridWorld Scene.
-permute observation images from NHWC to NCHW right after env_step.  Everything following is NCHW.
-update input_shape indexes and remove equivalent permute from the VisualQNetwork
-fix reference to global qnet inside generate_trajectories (use the passed in version)
-create wrapper network to add required constants and dummy input masks required by the
 runtime inference (Barracuda)
-export the wrapped network to ONNX format compatible with Barracuda

* Add additional GridWorld scene called GridWorldColab that has the goal sensors removed (target is always green plus) and masks disabled.  The purpose of this is to have a scene compatible with the DQN GridWorld Colab example.

* Minor edit in Colab docs.

* Delete unnecessary .lighting file.

* Update changelog.

* Update colab/Colab_UnityEnvironment_2_Train.ipynb

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

* Fix error in colab metadata name.

* Add suggested comments and fix incorrect parameter ordering.

* Update docs for Elo curriculum completion measure.

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

* Changelog prepped for release (#5629)

* Changelog prepped for release

* Added missing PRs to the changelog.

* Added missing CHANGELOG item.

* enable 3D Ball sample import button in package manager for unity sdk (#5665)

* enable 3D Ball sample import button in package manager for unity sdk

* add Samples folder

* add meta file

* fix git ignore config

* Updated versions for main. Also updated validate versions script to r… (#5668)

* Updated versions for main. Also updated validate versions script to remove exp.1 from package version.

* Reverted validate versions.

* Updated versions.

* Bumped package version for patch.

* Removed unreleased section. (#5667)

* Updated release versions and release links. (#5669)

* Updates for package validation errors. (#5671)

* Release 19 branch lts deprecation (#5673)

* Updated Project and DevProject to 2020.3 LTS.

* Updated supported LTS version to 2020.3.

* Updated changelog.

* Bumped up CI Unity version from 2020.2 to 2020.3.

* Update CHANGELOG.md

* Removing VS Studio dependency from manifest.json.

* Update packages-lock.json

* Update CHANGELOG.md

* Bumped package version for patch.

* Fixed build-docker and updated CHANGELOG to add unreleased section. (#5682)

* Fixed build-docker and updated CHANGELOG to add unreleased section.

* Updated backward compat mac training yamato test. (#5685)

* Develop deprecate python 3.6 (#5)

* Dropped support for python 3.6

* Pinning python 3.9.9 for tests due to typing issues with 3.9.10

* Testing new bokken image.

* Testing new bokken image.

* Updated yamato standalone build test.

* Updated yamato standalone build test.

* Updated standalone build test.

* Updated yamato configs to use mla bokken vm.

* Bug fixes for yamato yml files.

* Fixed com.unity.ml-agents-test.yml

* Bumped min python version to 3.7.2

* Updated failing yamato jobs.

* Updated github workflow for colab tests.

* Updated github workflow for colab tests.

* Updated github workflow for colab tests.

* Develop python api ga (#6)

* Dropped support for python 3.6

* Pinning python 3.9.9 for tests due to typing issues with 3.9.10

* Testing new bokken image.

* Testing new bokken image.

* Updated yamato standalone build test.

* Updated yamato standalone build test.

* Updated standalone build test.

* Updated yamato configs to use mla bokken vm.

* Bug fixes for yamato yml files.

* Fixed com.unity.ml-agents-test.yml

* Bumped min python version to 3.7.2

* pettingzoo api prototype

* add example

* update file names

* support multiple behavior names

* fix multi behavior action index

* add install in colab

* add setup

* update colab

* fix __init__

* clone single branch

* import tags only

* import in init

* catch import error

* update colab

* move colab and add readme

* handle agent dying

* add tests

* update doc

* add info

* add action mask

* fix action mask

* update action masks in colab

* change default env

* set version

* fix hybrid action

* fix colab for hybrid actions

* add note on auto reset

* Updated colab name.

* Update README.md

* Following petting_zoo registry API (#5557)

* init petting_zoo registry

* cherrypick Custom trainer editor analytics (#5511)

* cherrypick "Update dotnet-format to address breaking changes introduced by upstream changes (#5528)"

* Update colab to match pettingZoo import api

* ToRevert: pull exp-petting-registry branch

* Add init file to tests

* Install pettingzoo-unity requirements for pytest

* update pytest command

* Add docstrings and comments

* update coverage to pettingzoo folder

* unset log level

* update env string

* Two small bugfixes (#5589)

1. Add the missing `_cumulative_rewards` property
2. Update `agent_selection` to not error out when an agent finishes an episode.

* Updated gym to 0.21.0 and petting zoo to 1.13.1, fixed bugs with AEC wrapper for gym and PZ updates. API tests are passing.

* Some refactoring.

* Finished inital implementation of parallel. Tests not passing.

* Finished parallel API implementation and refactor. All PZ tests passing.

* Cleanup.

* Refactoring.

* Pinning numpy version.

* add metadata and behavior_specs initialization

* addressing behaviour_spec issues

* Bumped PZ version to 1.14.0. Fixed failing tests.

* Refactored gym-unity and petting-zoo into ml-agents-envs

* Added TODO to pydoc-config.yaml

* Refactored gym and pz to be under a subpackage in mlagents_env package

* Refactored ml-agents-envs docs.

* Minor update to PZ API doc.

* Updated mlagents_envs docs and colab.

* Updated pytest gh workflow to remove ref to gym and pz.

* Refactored to remove some test coupling between trainers and envs.

* Updated installation doc.

* Update ml-agents-envs/README.md

Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com>

* Updated failing yamato jobs.

* pettingzoo api prototype

* add example

* update file names

* support multiple behavior names

* fix multi behavior action index

* add install in colab

* add setup

* update colab

* fix __init__

* clone single branch

* import tags only

* import in init

* catch import error

* update colab

* move colab and add readme

* handle agent dying

* add tests

* update doc

* add info

* add action mask

* fix action mask

* update action masks in colab

* change default env

* set version

* fix hybrid action

* fix colab for hybrid actions

* add note on auto reset

* Updated colab name.

* Update README.md

* Following petting_zoo registry API (#5557)

* init petting_zoo registry

* cherrypick Custom trainer editor analytics (#5511)

* cherrypick "Update dotnet-format to address breaking changes introduced by upstream changes (#5528)"

* Update colab to match pettingZoo import api

* ToRevert: pull exp-petting-registry branch

* Add init file to tests

* Install pettingzoo-unity requirements for pytest

* update pytest command

* Add docstrings and comments

* update coverage to pettingzoo folder

* unset log level

* update env string

* Two small bugfixes (#5589)

1. Add the missing `_cumulative_rewards` property
2. Update `agent_selection` to not error out when an agent finishes an episode.

* Updated gym to 0.21.0 and petting zoo to 1.13.1, fixed bugs with AEC wrapper for gym and PZ updates. API tests are passing.

* Some refactoring.

* Finished inital implementation of parallel. Tests not passing.

* Finished parallel API implementation and refactor. All PZ tests passing.

* Cleanup.

* Refactoring.

* Pinning numpy version.

* add metadata and behavior_specs initialization

* addressing behaviour_spec issues

* Bumped PZ version to 1.14.0. Fixed failing tests.

* Refactored gym-unity and petting-zoo into ml-agents-envs

* Added TODO to pydoc-config.yaml

* Refactored gym and pz to be under a subpackage in mlagents_env package

* Refactored ml-agents-envs docs.

* Minor update to PZ API doc.

* Updated mlagents_envs docs and colab.

* Updated pytest gh workflow to remove ref to gym and pz.

* Refactored to remove some test coupling between trainers and envs.

* Updated installation doc.

* Update ml-agents-envs/README.md

Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com>

* Updated CHANGELOG.

* Updated Migration guide.

* Doc updates based on CR.

* Updated github workflow for colab tests.

* Updated github workflow for colab tests.

* Updated github workflow for colab tests.

* Fixed yamato import error.

Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>
Co-authored-by: Miguel Alonso Jr <miguelalonsojr>
Co-authored-by: jmercado1985 <75792879+jmercado1985@users.noreply.github.com>
Co-authored-by: Maryam Honari <honari.m94@gmail.com>
Co-authored-by: Henry Peteet <henry.peteet@unity3d.com>
Co-authored-by: mahon94 <maryam.honari@unity3d.com>
Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com>

* Refactoring for extraction to separate repo.

* removing unused github workflows.

* Updates for pre-commit and github actions.

* Fixed bug with validate_meta_files.py missing a file to exclude.

* Updated precommit gh action.

* Refactored yamato pack.

* Added yamato test yaml.

* Updated to CHANGELOG and VectorSensor from mono repo before swap.

* Pin dotnet version for GHA formatting.

* Refactor for MLA pro support. (#1)

* Refactored for MLA pro support.

* Fixed bug in yamato test.

* Updated the CHANGELOG.

* Fixed trigger logic in yamato test.

* Fixed failing tests. (#2)

* Fixed failing tests.

* Adding meta files.

* Fix execution order (#4)

* Fixed execution order.

* Fixed bug with CRT build.

* precommit formatting update.

* Updated agent Awake docstring.

* Set execution order to be negative to make sure scripts run before default time. (#5)

* Added #if to exclude communicator registration in the Agent awake script (#6)

* Add visibility to pro package (#7)

* add coverage publish promotion yamato jobs (#3)

* bump barracuda to 3.0.0 (#9)

upgrade to barracuda 3.0.0

* Updated README. (#8)

* add validation test dep. to publish job (#12)

* add validation test dep. to publish job
* fix package path

* Update automated trigger rules to support oneflow develop branch (#11)

* Update automated trigger rules to support oneflow develop branch

* Fix release branch name for github actions

* Release 2.3.0 exp.1 (#13)

* upgrade package to 2.3.0-exp.1

* rebase on develop branch

* update changelog

* remove unreleased section (#16)

* Updated experimental prerelease version and changelog. (#18)

* Added submodule update gha. (#19)

* fix bugs from sonar-scanner (#21)

* add ml-agents-pro as a upstream for submodule update (#22)

* Added support for standalone use of Academy with refactored communicator factory. (#24)

* clean commandline Arguments (#25)

* clean commandline Arguments
* fix precommit

* update unrelease section in changelog (#27)

* MLA-2537 Reorder RayPerceptionSensor output to reflect its spatial structure. (#26)

* Changes to support in order rays.  GUI for switching to this.  Updated tests.

* Updated README. (#29)

* Fixed samples. (#30)

* Added camera sensor component checkbox for runtime enablement. update… (#31)

* Added camera sensor component checkbox for runtime enablement. updated camera sensor to only be active when enabled for runtime.

* Updated yamato job.

* Updated CHANGELOG.

* Fixed issue with Academy not building for webgl builds. (#32)

* Dev bump min ver (#33)

* Bumped min version of Unity to 2021.3.

* Updated min version of python in pre-commit. Updated changelog.

* Moving files into packages/com.unity.ml-agents directoy.

* Remove com.unity.ml-agents submodule.

* Added back in ml-agents package.

* Removed packages temp folder.

* Updated yamato protobuf gen.

* Updated yamato to remove submodule update.

* Updated linux tool chain.

* Updated Project package dependencies.

* Updated flake8 repo link in precommit.

Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
Co-authored-by: Arthur Juliani <awjuliani@gmail.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>
Co-authored-by: Chris Goy <christopherg@unity3d.com>
Co-authored-by: Marwan Mattar <marwan@unity3d.com>
Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>
Co-authored-by: Ervin Teng <ervin@unity3d.com>
Co-authored-by: Hunter-Unity <hunter@unity3d.com>
Co-authored-by: Maryam Honari <honari.m94@gmail.com>
Co-authored-by: Miguel Alonso Jr <76960110+miguelalonsojr@users.noreply.github.com>
Co-authored-by: Miguel Alonso Jr <miguel.alonso@miguel.alonso-00MM>
Co-authored-by: Jason Bowman <jasonb@unity3d.com>
Co-authored-by: Henry Peteet <henry.peteet@unity3d.com>
Co-authored-by: cmard <87716492+cmard@users.noreply.github.com>
Co-authored-by: Chingiz Mardanov <chingiz.mardanov@unity3d.com>
Co-authored-by: Jason Rupert <92821409+jrupert-unity@users.noreply.github.com>
Co-authored-by: Xun Cao <xun.cao@outlook.com>
Co-authored-by: jmercado1985 <75792879+jmercado1985@users.noreply.github.com>
Co-authored-by: mahon94 <maryam.honari@unity3d.com>
Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com>
Co-authored-by: Xun Cao <xun.cao@unity3d.com>
Co-authored-by: Jason Rupert <jason.rupert@unity3d.com>

* Release 20 changelog (#80)

* clean changelog
* match extention package dependency
* fix typos

Co-authored-by: Xun Cao <xun.cao@unity3d.com>
Co-authored-by: Brandon Henry <brandonh@unity3d.com>
Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com>
Co-authored-by: Henry Peteet <henry.peteet@unity3d.com>
Co-authored-by: Miguel Alonso Jr <miguel.alonso@unity3d.com>
Co-authored-by: GitHub Actions - update submodules <bot@noreply.github.com>
Co-authored-by: Jason Bowman <jasonb@unity3d.com>
Co-authored-by: Florence Rolland <florence@unity3d.com>
Co-authored-by: Jason Rupert <jason.rupert@unity3d.com>
Co-authored-by: zhuo <zhuo@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
Co-authored-by: Arthur Juliani <awjuliani@gmail.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>
Co-authored-by: Chris Goy <christopherg@unity3d.com>
Co-authored-by: Marwan Mattar <marwan@unity3d.com>
Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>
Co-authored-by: Ervin Teng <ervin@unity3d.com>
Co-authored-by: Hunter-Unity <hunter@unity3d.com>
Co-authored-by: Maryam Honari <honari.m94@gmail.com>
Co-authored-by: Miguel Alonso Jr <76960110+miguelalonsojr@users.noreply.github.com>
Co-authored-by: Miguel Alonso Jr <miguel.alonso@miguel.alonso-00MM>
Co-authored-by: cmard <87716492+cmard@users.noreply.github.com>
Co-authored-by: Chingiz Mardanov <chingiz.mardanov@unity3d.com>
Co-authored-by: Jason Rupert <92821409+jrupert-unity@users.noreply.github.com>
Co-authored-by: Xun Cao <xun.cao@outlook.com>
Co-authored-by: jmercado1985 <75792879+jmercado1985@users.noreply.github.com>
2022-11-18 13:21:12 -08:00
Maryam Honari 7fbb284aee Release 20 changelog (#80)
* clean changelog
* match extention package dependency
* fix typos
2022-11-18 11:12:33 -08:00
Miguel Alonso Jr a78132bcad Develop merge submodule (#77)
* Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Arthur Juliani <awjuliani@gmail.com>

* Update changelog

* [MLA-1762] reduce memory allocations from DiscreteActionOutputApplier (#4922)

* add option for Burst inference (#4925)

* surface specific GRPC errors more visibly (#4930)

* Add additional logic to avoid load being called on every advance (#4934)

* [MLA-1767] Refactor communicator connection exceptions (#4935)

* Update changelog for release 13. (#4938)

* Update master versions for release 13. (#4945)

* Release 13 versions. (#4946)

- updated release tag validation script to automate the updating of files with release tags that need to be changed as part of the pre-commit operation.

* Update docs to pass doc validation. (#4953)

* update defines, compile out Initialize body on non-desktop (#4957)

* Adding references to the Extensions package to help promote it. (#4967)

* Adding references to the Extensions package to help promote it.

* Updating header formatting to match the rest of the page

* Adding references to the Extensions package to help promote it. (#4967) (#4968)

Co-authored-by: Marwan Mattar <marwan@unity3d.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* InputActuatorComponent to allow the generation of an action space from an InputActionAsset (#4881)

* InputActuatorComponent to allow the generation of an action space from an InputActionAsset (#4881) (#4974)

* pass sensor dimension flags to analytics (#4954)

* Communicator factory (#4965)

* Update barracuda, swtich Agents in Sorter use Burst. (#4979)

* Update barracuda, swtich Agents in Sorter use Burst. (#4979) (#4981)

* Set ignore done=False in GAIL (#4971)

* MultiAgentGroup Interface (#4923)

* add SimpleMultiAgentGroup

* add group reward field to agent and proto

* Make TrainingAnalyticsSideChannel internal (#4999)

* [MLA-1783] built-in actuator type (#4950)

* Add component menues for some sensors and actuators. (#5001)

* Add component menues for some sensors and actuators. (#5001) (#5002)

* Merge master -> release_13_branch-to-master

* Fix RpcCommunicator merge.

* master -> main. (#5010)

* Adding a name field to BufferSensorComponent

* Adding a note to the CHANGELOG about var len obs

* Adding a helper method for creating observation placeholder names and removed the _h and _c placeholders

* Adding a custom editor for BufferSensorComponent

* adding inheritdoc

* Update cattrs dependencies to support python3.9 (#4821)

* Fix issue with queuing input events that stomp on others. (#5034)

* Update cattrs dependencies to support python3.9 (#4821)

* Fix issue with queuing input events that stomp on others. (#5034)

* Update versions for release 14 hotfix. (#5040)

* master -> main. (#5010) (#5044)

* Update changelog. (#5045)

* [MLA-1809] catch mismatched observation sizes (#5030)

* Update changelog. (#5055)

* Fix xml docs. (#5057)

* pass sensor name through to ObservationSpec (#5036)

* Remove unused allocation (#5068)

* Automatically generate samples based on placement of mlagents-sample.json files in our examples. (#5077)

* Update barracuda to 1.3.2-preview. (#5084)

* POCA trainer (#5005)

Co-authored-by: Ervin Teng <ervin@unity3d.com>
Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>

* [docs] Documentation for POCA and cooperative behaviors (#5056)

Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>

* [docs] Update changelog (#5095)

* Add multiAgentGroup capabilities flag (#5096)

* Add multiAgentGroup capabilities flag

* Add proto

* Fix compiler error

* Add warning for multiagent group

* Add comment

* Fix spelling mistake

* Fix ghost curriculum and make steps private (#5098)

* use get step to determine curriculum

* add to CHANGELOG

* Make step in trainer private (#5099)

Co-authored-by: Ervin T <ervin@unity3d.com>

* [cherry-pick] Fix ghost curriculum and make steps private (#5098)

* use get step to determine curriculum

* add to CHANGELOG

* Make step in trainer private (#5099)

Co-authored-by: Ervin T <ervin@unity3d.com>

* Update changelog for samples. (#5103)

* Update changelog for samples. (#5103) (#5106)

* Update versions on main (#5102)

Increment versions after release 15 branch split

* [release_15] Release 15 update versions (#5101)

* Update versions

* Fix for validate release links

* Update release tag and docs

* [docs] Add missing package docs (#5108)

* Add missing package docs

* Fix grammar

* Update comment on Dispose

* Fix typo in doc

* [docs] Update readme and changelog (#5107)

* Make analytics module an optional dependency. (#5109)

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* Redo dotnet format (#5119)

* [docs] Add links to example envs for Buffer Sensor and MultiAgentGroup (#5116)

* Update changelog with links to example envs

* Cooperative push block link update

* [cherry-pick] Integrate Group Manager to soccer/retrain with POCA (#5115) (#5121)

* Integrate Group Manager to soccer/retrain with POCA (#5115)

* Add Soccer env to changelog

Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>

* [cherry-pick][docs] Add Dungeon Escape Environment (#5133)

* Add DungeonEscape POCA Environment (#5128)

* Add DungeonEscape assets from working branch

* Add Dungeon Escape docs

* Create dungeon_escape.png

* Add to docs

Co-authored-by: Hunter-Unity <hunter@unity3d.com>

* update pre-commit instructions for CONTRIBUTING.md (#5130)

* Remove duplicated changelog

* Fix another duplicated line in the changelog...

* Removing Obsolete methods from the package (#5024)

* Removing Obsolete methods from the package

* Missing depecration and modified changelog

* Readding the obsolete BrainParameter methods, will need a larger discussion on these

* Removing Action Masker, readding the warining when using a non-implemented Heuristic, Removing NumAction from Brain Parameters

* removing documentation and some calls to deprecated methods in the extensions package

* Editing the Changelog to put the unreleased on top

* non-IEnumerable interface for action masking (#5060)

* V2 staging new model version (#5080)

* Make modelCheck have flavors of error messages

* ONNX exporter v3

* Using a better CheckType and a switch statement

* Removing unused message

* More tests

* Use an enum for valid versions and use GetVersion on model directly

* Maybe the model export version a static constant in Python

* Use static constructor for FailedCheck

* Use static constructor for FailedCheck

* Modifying the docstrings

* renaming LegacyDiscreteActionOutputApplier

* removing testing code

* better warning message

* Nest the CheckTypeEnum into the FailedCheck class

* Update com.unity.ml-agents/Runtime/Inference/BarracudaModelParamLoader.cs

* Adding a line explaining that legacy tensor checks are for versions 1.X only

* Modifying the changelog

* Exporting all the branches size instead of omly the sum (#5092)

* addressing comments

* Update com.unity.ml-agents/Runtime/Inference/BarracudaModelParamLoader.cs

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* readding tests

* Adding a comment around the new DiscreteOutputSize method

* Clearer warning : Model contains unexpected input > Model requires unknown input

* Fixing a bug in the case where the discrete action tensor does not exist

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* Make IActuator implement IHeuristicProvider. (#5110)

* Adding more tests for v2 models (#5138)

* Adding more tests for v2 models

* formatting

* fixing formatting

* [MLA-1634] Add ObservationSpec and update ISensor interfaces (#5127)

* Add custom package settings (#5027)

* cleanup: yamato triggers, changelog, docstrings

* clean up merge

* fix compile error

* update c# version to 2.0.0-exp.1 (#5160)

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* Move physics/sensor tests that fail in edit mode depending on what's in the scene to playmode tests. (#5162)

* update package version

* remove defines that are now always true

* changelog

* Remove all old analytics defines. (#5168)

* [MLA-1634] Compression spec (#5164)

* Low hanging fruit tests for coverage. (#5170)

* Update Policies to conditionally compile their analytics method. (#5171)

* Turns physics modules into optional dependencies. (#5112)

* [MLA-1634] Remove SensorComponent.GetObservationShape() (#5172)

* PR Feedback. (#5173)

* change default barracuda behavior (#5175)

* [MLA-1824] make SensorComponent return ISensor[] (#5181)

* Make SensorComponent return an array

* split match3 sensors, partial retrain

* docstrings, migration, changelog, cleanup

* Adding the goal conditioning sensors with the new observation specs (#5159)

* Fixing networks.py for the merge

* fix compile error

* Adding the goal conditioning sensors with the new observation specs

* addressing feedback

* I forgot to change the m_observationType

* Renaming Goal to GoalSignal (#5190)

* Renaming GOAL to GOAL_SIGNAL

* VectorSensorComponent to use new API

* Adding docstrings

* verbose pytest on github action

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* [MLA-1879] culture-invariant sorting for sensors and actuators (#5194)

* Goal conditioning grid world : Example of goal conditioning (#5193)

* Aded the Goal conditioned GridWorld to replace regular gridworld

* adding missing files

* Code improvements

* Documentation change on gridworld

* resolving conflicts

* new model

* Addressing comments

* comments and renames

* Update docs/Learning-Environment-Examples.md

Co-authored-by: Ervin T. <ervin@unity3d.com>

* adding reference to gridworld in docs about goal signal

Co-authored-by: Chris Elion <chris.elion@unity3d.com>
Co-authored-by: Ervin T. <ervin@unity3d.com>

* [MLA-1584] Match3 variable board size (#5189)

* [bug-fix] When agent isn't training, don't clear update buffer (#5205)

* Don't clear update buffer, but don't append to it either

* Update changelog

* Address comments

* Make experience replay buffer saving more verbose

* Changelog for goal conditioning (#5195)

* Changelog for goal conditioning

* adding a line about the changes to the gridworld example

* Addressing comments

* Change capitalization

* [bug fix] Fix warning using demo recorder (#5216)

* Making VectorActionSize and VectorActionSpaceType internal (#5214)

Made sure the editor does not complain;

* Turning some logger.info into logger.debug and remove some logging overhead when not using debug (#5211)

* turning some logger.info into logger.debug and remove some logging overhead when not using debug

* Addressing comments

* Adding to changelog

* rearrange match3 docs (#5215)

* rearrange match3 docs

* changelog and migration from previous PR

* [bug-fix] Fix POCA LSTM, pad sequences in the back (#5206)

* Pad buffer at the end

* Fix padding in optimizer value estimate

* Fix additional bugs and POCA

* Fix groupmate obs, add tests

* Update changelog

* Improve tests

* Address comments

* Fix poca test

* Fix buffer test

* Increase entropy for Hallway

* Add EOF newline

* Fix Behavior Name

* Address comments

* [MLA-1880] Raycast sensor interface improvements (#5222)

* WIP

* remove debug info struct

* cleanup + add to test

* changelog

* fix unit tests

* PR feedback

* Load individual elements if state dict load fails (#5213)


Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
Co-authored-by: Ervin T. <ervin@unity3d.com>

* [MLA-1159] Add virtual methods to DecisionRequester (#5223)

* [Bug Fix] Serialization of vectorActionSpace (#5220)

* [Bug Fix] Serialization of vectorActionSpace

* adding a test

* Clear ActionBuffers before Heuristic calls (#5227)

* Add ObservationType to analytics ObservationSpec (#5218)

* [MLA-1909] Match3 and Camera/RenderTexture sensor GC improvements (#5233)

* avoid empty set iteration, avoid Debug.AssertFormat (#5246)

* avoid empty set iteration, avoid Debug.AssertFormat

* changelog

* Update to barracuda 1.3.3 and changes to the model inputs and outputs for LSTM (#5236)

* Initial commit

* making it work with hallway. Added a new model version

* addressing feedback

* Adding a comment

* formatting

* modifying the changelog

* Adding descriptions on the model version descriptions

* Update com.unity.ml-agents/Runtime/Inference/GeneratorImpl.cs

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* addressing comments

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* [Release 16] Release 16 Merge Back to Main (#5255)

Update versions and documentation for Release 16. 

Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* fix link, run link check in quiet mode (#5257)

* [MLA-1912] Move Match3 to com.unity.ml-agents (#5259)

* Move GridSensor into main package (#5256)

* move OneHotGridSensor into main package

* changelog and migration guide

* remove old doc

* check if physics module presents

* Add advanced installation guide for com.unity.ml-agents (#5260)

* Fix errors when creating new RayCast sensor (#5261)

* check if tag and angle is null

* add test

* changelog

* fix all PVS and doc generation warnings (#5262)

* Fix --results-dir (#5269)

* Fix GridSensorComponent bug (#5270)

* LSTM models from 1.x will be incompatible with MLA 2.x (#5254)

* LSTM models from 1.x will be incompatible with MLA 2.x

* Adding a test and a new v2 model

* Make the Model Runner raise an error if using 1.0 model with LSTM

* adding a new model for hallway trained with 2.0

* reword error messages

* Only raise if error, not if warning

* Addressing comments: The legacy Barrauda memory generator and applier were removed. All code that checked for (memories + v1.X) have been removed since these will no longer be supported

* Modifying the changelog and the migrating guide with this change

* Fixing the merge issues

Co-authored-by: Chris Elion <chris.elion@unity3d.com>

* [MLA-1939] Match3 Custom Editor (#5263)

* Rider suggested cleanup, part 1  (#5265)

* Fix the attention module embedding size (#5272)

* Fix the attention module embedding size

* editing the changelog

* [bug-fix] Fix null ref exception in the demo recorder (#5274)

* Fix null ref exception in the demo recorder

* Update changelog

* Clearer logic

* Modifying the changelog for release 17 (#5277)

* Modifying the changelog for release 17

* Change version to 2.0.0-exp.1

* Removing the [Unreleased] section

* check for missing AbstractBoard, display warning (#5276)

* Updating the barracuda 1.4.0 (#5291)

* [debug] Require all behavior names to have a matching YAML entry (#5210)

* Add strict check to settings.py

* Remove warning from trainer factory, add test

* Add changelog

* Fix test

* Update changelog

* Remove strict CLI options

* Remove strict option, rename, make strict default

* Remove newline

* Update comments

* Set default dict to actually default to a default dict

* Fix tests

* Fix tests again

* Default trainer dict to requiring all fields

* Fix settings typing

* Use logger

* Add default_settings to error

* Version bump main (#5279)

* Version bump for the main branch following branching of release_17

* Messed up the extensions package version

* Bring back root reference in grid sensor (#5300)

* [MLA-1952] Add optional seed for gym action spaces (#5303)

* add optional seed for action spaces

* add optional seed for action spaces

* changelog

* undo packages-lock.json change

* Release 17 version bumps and docs version bumps (#5280)

* Migration guide - list removed APIs with replacements (#5307)

* list removed APIs with replacements

* fixes

* [docs] Update changelog (#5308)

* [MLA-1952] Add optional seed for gym action spaces (#5303) (#5315)

* Fix stacked grid sensor (#5335)

* Update to Unity Package licenses (#5340)

* Update to Unity Package licenses

Per Dan

* Updating the main repo license dates.

* Adding a fully connected visual encoder for super small visual input + tests (#5351)

* initial commit for a fully connected visual encoder

* adding a test

* addressing comments

* Fixing error with minimal size of fully connected network

* adding documentation and changelog

* Make OverlapChecker an interface (#5324)

* Add stacking option to VectorSensorComponent (#5376)

* [WIP] [Fix] Fixing collect observation called on done (#5375)

* [WIP] [Fix] Fixing collect observation called on done

* Update com.unity.ml-agents/Runtime/Agent.cs

* ⚠️ Modifying the test of stacking sensor when the agent is done

* modifying the documentation for BufferSensor to specify to call AddObservation in the CollectObservations method

* Fix NullReferenceException for Behavior Parameters without Agent (#5382)

* check agent existence. add warning box

* changelog

* [WIP] 2.0 verified to main (#5385)

* Warn if no joints on Root Body (#5387)

* Modifying the changelog for the continuous action SAC target entropy fix (#5394)

* Editing the GridSensor documentation for 2D use case (#5396)

* Editing the GridSensor documentation for 2D use case

* changing chagelog

* [MLA-2017] Move colab notebooks to github (#5399)

* Fix the reporting of histogram stats and adding a test (#5410)

* Fix the reporting of histogram stats and adding a test

* Appending to the Changelog

* Update changelog before Release 18 branch (#5412)

* Update package versions for main branch (#5413)

* [Release 18] Update versions and links (#5414)

* add changelog entries for PRs that were missing them before. (#5416)

* fix cref error found by doc validation (#5421)

* fix release versions and changelog section

* [bug-fix] Fix when group terminal steps are deleted, robust test (#5441)

* Fix when terminal steps are deleted, robust test

* Update changelog

* Fix test comment

* Fix torch device option in argparser (#5467)

* [FIX] prioritize cli options over yaml (#5495)

* prioritize cli over yaml in checkpointSettings

* prioritize resume if both set in one place

* fixed test_commandline_args

* addressing comments:renaming, commenting

* more tests and referenced in change_log

* Only update the native library and the c# calls that need to be made to clean up channels. (#5283)

* Fix Mac backcompat test (#5519)

* Initialize-from custom  checkpoints  (#5525)

* init from any checkpoint including older ones
* moving init_path logic ahead to learn.py
* fixing pytest to take the full path
* doc & changelog

* Update gym version to 0.20.0 (#5540)

* Fixed failing test due to gym 0.20.0 release. Updated gym-unity package to fix gym version to 0.20.0.

* Updated CHANGELOG.

* Fix VAIL (#5546)

* Custom trainer editor analytics (#5511)

* Custom trainer editor analytics
* inherit the default_training_analytics_side_channel

* Feature/mla 2205 separate schedule lr beta epsilon (#5538)

* Added initial (untested) support for separating schedule for beta, LR, and epsilon.

* Added support for beta and epsilon schedule in test config conversion

* Updated docs. Set epsilon to be linear by default.

* Updated configs to add beta and epsilon schedule where appropriate.

* Addressed CR change requests.

* Minor update.

* Updated docs.

* Added changes to CHANGELOG.

* Updated CHANGELOG.

* Fixed typo in Training-Configuration-File.md

* Reverted config changes to remove beta and epsilon schedules.

* mypy error fixed from pre-commit.

* Fixed a bug that was causing a settings test to fail.

* Addressed CR comments.

Co-authored-by: Miguel Alonso Jr <miguel.alonso@miguel.alonso-00MM>
Co-authored-by: Miguel Alonso Jr <miguelalonsojr>

* Harden user PII protection logic and extend TrainingAnalytics to expose detailed configuration parameters. (#5512)

* Hash128 is not a cryptographic hash, replace with HMAC-SHA256.

* Extend TrainingAnalytics side channel to expose configuration details

* Change member function scopes and hash demo_paths

* Extract tbiEvent hashing method and add test coverage

* Restart crashed Unity environments (#5553)

### Change(s)

Update the SubprocessEnvManager to restart workers when the underlying Unity environments crash. 
When a worker receives an ENV_EXITED signal it will now:

1. Record all failures coming through the step queue and drop all other messages.
2. Purge any pending trajectories as they may belong to a crashed worker or be corrupted.
3. Restart all failed workers (up to a configurable limit)

This behavior can be limited via a rate limit, max lifetime limit, or both. The configuration options for both are shown below with their default values. 

⚠️  Each of these options applies to a single environment, if num_envs > 1 then the limit will apply separately to each replica (num_envs = 2 will spawn 2 Unity environments which can each be restarted 10 times).

```yaml
env_settings:
  # Can restart 10 times over the lifetime of the experiment.
  max_lifetime_restarts: 10
  # Rate limit of 1 failure per 60s
  restarts_rate_limit_n: 1
  restarts_rate_limit_period_s: 60
```

They can of course be passed via CLI arguments as well

```bash
--max-lifetime-restarts
--restarts-rate-limit-n
--restarts-rate-limit-period-s
```

### Disabling this feature
* Rate limiting can be turned off by setting `--restarts-rate-limit-n=-1`
* Lifetime limiting can be turned off by setting `--max-lifetime-restarts=-1`

* Develop add training area replicator (#5568)

* Added training area replicator to com.unity.ml-agents package.

* Added num_areas to Unity RL Initialization proto. Added cli and config file support for num_areas.

* Changed training area replicator to size grid automatically from number of areas.

* Added tests for the training area replicator.

* Added setup for tests for the training area replicator.

* Added comments and updated create tutorial for training area replicator.

* Updated CHANGELOG.

* Fixed some failing tests.

* Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Henry Peteet <henry.peteet@unity3d.com>

* Update docs/Learning-Environment-Create-New.md

Co-authored-by: Henry Peteet <henry.peteet@unity3d.com>

* Update com.unity.ml-agents/Runtime/Areas/TrainingAreaReplicator.cs

Co-authored-by: Henry Peteet <henry.peteet@unity3d.com>

* Addressed CR comments.

Co-authored-by: Miguel Alonso Jr <miguelalonsojr>
Co-authored-by: Henry Peteet <henry.peteet@unity3d.com>

* Fixing the analytics side-channel for curriculum learning. (#5586)

* Fixing the analytics side-channel for curriculum learning.

* Made a more robust test.

* Update the changelog.

* Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

* Method to return stacked observations (#5547)

* Method to return stacked observations

* Added testing and returning staked observation flat.

* Update the comment lines.

* Remove brainstorm commits.

* Upgrade Barracuda to 2.3.1-preview (#5591)

* Upgrade to 2.3.1-preview

* Fix for critic normalization bug (#5595)

* Added normalization to critic during training for poca, ppo, and sac.

* Fixed critic normalization bug. Added new tests to cover.

* Updated CHANGELOG.

* Fixed typo in test_trainers.py

* Deterministic actions python training (#5619)

* Progress on propagating the setting to the action model.

* Added the _sample_action logic and tests.

* Add information to the changelog.

* Prioritize the CLI over the configuration file.

* Update documentation for config file.

* CR refactor.

* Update docs/Training-Configuration-File.md

Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Maryam Honari <honari.m94@gmail.com>
Update ml-agents/mlagents/trainers/settings.py

Co-authored-by: Maryam Honari <honari.m94@gmail.com>
Update ml-agents/mlagents/trainers/cli_utils.py

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

* Fix CR requests

* Add tests for discrete.

* Update ml-agents/mlagents/trainers/torch/distributions.py

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

* Added more stable test.

* Return deterministic actions for training (#5615)

* Added more stable test.

* Fix the tests.

* Fix pre-commit

* Fix help line to pass precommit.

* support for deterministic inference in onnx (#5593)

* Init: actor.forward outputs separate deterministic actions

* changelog

* Renaming

* Add more tests

* Package changes to support deterministic inference (#5599)

* Init: actor.forward outputs separate deterministic actions

* fix tensor shape for discrete actions

* Add test and editor flag

- Add tests for deterministic sampling
- update editor and tooltips

* Reverting to "Deterministic Inference"

* dissect tests

* Update docs

* Update CHANGELOG.md

Co-authored-by: Chingiz Mardanov <chingiz.mardanov@unity3d.com>
Co-authored-by: cmard <87716492+cmard@users.noreply.github.com>

* Revert "Deterministic actions python training (#5619)" (#5622)

This reverts commit 9ea962027d4c581de7c45a07d669f96453236c5c.

* Deterministic actions python training (#5626)

* Progress on propagating the setting to the action model.

* Added the _sample_action logic and tests.

* Add information to the changelog.

* Prioritize the CLI over the configuration file.

* Update documentation for config file.

* CR refactor.

* Update docs/Training-Configuration-File.md

Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Miguel Alonso Jr. <76960110+miguelalonsojr@users.noreply.github.com>
Update com.unity.ml-agents/CHANGELOG.md

Co-authored-by: Maryam Honari <honari.m94@gmail.com>
Update ml-agents/mlagents/trainers/settings.py

Co-authored-by: Maryam Honari <honari.m94@gmail.com>
Update ml-agents/mlagents/trainers/cli_utils.py

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

* Fix CR requests

* Add tests for discrete.

* Update ml-agents/mlagents/trainers/torch/distributions.py

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

* Added more stable test.

* Return deterministic actions for training (#5615)

* Added more stable test.

* Fix the tests.

* Fix pre-commit

* Fix help line to pass precommit.

* support for deterministic inference in onnx (#5593)

* Init: actor.forward outputs separate deterministic actions

* changelog

* Renaming

* Add more tests

* Package changes to support deterministic inference (#5599)

* Init: actor.forward outputs separate deterministic actions

* fix tensor shape for discrete actions

* Add test and editor flag

- Add tests for deterministic sampling
- update editor and tooltips

* Reverting to "Deterministic Inference"

* dissect tests

* Update docs

* Update CHANGELOG.md

* Fix the deterministic showing up all the tiime (#5621)

Co-authored-by: Chingiz Mardanov <chingiz.mardanov@unity3d.com>
Co-authored-by: cmard <87716492+cmard@users.noreply.github.com>

* Changelog cleanup. (#5627)

* Release cleanup for changelog

* Update CHANGELOG.md

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

* Fix analytics error when compiling XBOX and PS5 (#5628)

* Add ENABLE_CLOUD_SERVICES_ANALYTICS Guards

To use analytics safely you must wrap usage with 
`#if ENABLE_CLOUD_SERVICES_ANALYTICS` 
See https://docs.unity3d.com/ScriptReference/Analytics.Analytics.html

* Update DQN Colab example to support export for Barracuda inference (#5643)

* Update DQN GridWorld colab example to be capable of outputting an ONNX model that works for inference.
-Allow GridWorld to work without a GoalSensor.  Defaults to the GreenPlus target in that case.
-Add GridWorldColab.onnx file, trained and generated with the colab.

* Upgrade DQN GridWorld Colab to produce model that can work with (modified) GridWorld Scene.
-permute observation images from NHWC to NCHW right after env_step.  Everything following is NCHW.
-update input_shape indexes and remove equivalent permute from the VisualQNetwork
-fix reference to global qnet inside generate_trajectories (use the passed in version)
-create wrapper network to add required constants and dummy input masks required by the
 runtime inference (Barracuda)
-export the wrapped network to ONNX format compatible with Barracuda

* Add additional GridWorld scene called GridWorldColab that has the goal sensors removed (target is always green plus) and masks disabled.  The purpose of this is to have a scene compatible with the DQN GridWorld Colab example.

* Minor edit in Colab docs.

* Delete unnecessary .lighting file.

* Update changelog.

* Update colab/Colab_UnityEnvironment_2_Train.ipynb

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

* Fix error in colab metadata name.

* Add suggested comments and fix incorrect parameter ordering.

* Update docs for Elo curriculum completion measure.

Co-authored-by: Maryam Honari <honari.m94@gmail.com>

* Changelog prepped for release (#5629)

* Changelog prepped for release

* Added missing PRs to the changelog.

* Added missing CHANGELOG item.

* enable 3D Ball sample import button in package manager for unity sdk (#5665)

* enable 3D Ball sample import button in package manager for unity sdk

* add Samples folder

* add meta file

* fix git ignore config

* Updated versions for main. Also updated validate versions script to r… (#5668)

* Updated versions for main. Also updated validate versions script to remove exp.1 from package version.

* Reverted validate versions.

* Updated versions.

* Bumped package version for patch.

* Removed unreleased section. (#5667)

* Updated release versions and release links. (#5669)

* Updates for package validation errors. (#5671)

* Release 19 branch lts deprecation (#5673)

* Updated Project and DevProject to 2020.3 LTS.

* Updated supported LTS version to 2020.3.

* Updated changelog.

* Bumped up CI Unity version from 2020.2 to 2020.3.

* Update CHANGELOG.md

* Removing VS Studio dependency from manifest.json.

* Update packages-lock.json

* Update CHANGELOG.md

* Bumped package version for patch.

* Fixed build-docker and updated CHANGELOG to add unreleased section. (#5682)

* Fixed build-docker and updated CHANGELOG to add unreleased section.

* Updated backward compat mac training yamato test. (#5685)

* Develop deprecate python 3.6 (#5)

* Dropped support for python 3.6

* Pinning python 3.9.9 for tests due to typing issues with 3.9.10

* Testing new bokken image.

* Testing new bokken image.

* Updated yamato standalone build test.

* Updated yamato standalone build test.

* Updated standalone build test.

* Updated yamato configs to use mla bokken vm.

* Bug fixes for yamato yml files.

* Fixed com.unity.ml-agents-test.yml

* Bumped min python version to 3.7.2

* Updated failing yamato jobs.

* Updated github workflow for colab tests.

* Updated github workflow for colab tests.

* Updated github workflow for colab tests.

* Develop python api ga (#6)

* Dropped support for python 3.6

* Pinning python 3.9.9 for tests due to typing issues with 3.9.10

* Testing new bokken image.

* Testing new bokken image.

* Updated yamato standalone build test.

* Updated yamato standalone build test.

* Updated standalone build test.

* Updated yamato configs to use mla bokken vm.

* Bug fixes for yamato yml files.

* Fixed com.unity.ml-agents-test.yml

* Bumped min python version to 3.7.2

* pettingzoo api prototype

* add example

* update file names

* support multiple behavior names

* fix multi behavior action index

* add install in colab

* add setup

* update colab

* fix __init__

* clone single branch

* import tags only

* import in init

* catch import error

* update colab

* move colab and add readme

* handle agent dying

* add tests

* update doc

* add info

* add action mask

* fix action mask

* update action masks in colab

* change default env

* set version

* fix hybrid action

* fix colab for hybrid actions

* add note on auto reset

* Updated colab name.

* Update README.md

* Following petting_zoo registry API (#5557)

* init petting_zoo registry

* cherrypick Custom trainer editor analytics (#5511)

* cherrypick "Update dotnet-format to address breaking changes introduced by upstream changes (#5528)"

* Update colab to match pettingZoo import api

* ToRevert: pull exp-petting-registry branch

* Add init file to tests

* Install pettingzoo-unity requirements for pytest

* update pytest command

* Add docstrings and comments

* update coverage to pettingzoo folder

* unset log level

* update env string

* Two small bugfixes (#5589)

1. Add the missing `_cumulative_rewards` property
2. Update `agent_selection` to not error out when an agent finishes an episode.

* Updated gym to 0.21.0 and petting zoo to 1.13.1, fixed bugs with AEC wrapper for gym and PZ updates. API tests are passing.

* Some refactoring.

* Finished inital implementation of parallel. Tests not passing.

* Finished parallel API implementation and refactor. All PZ tests passing.

* Cleanup.

* Refactoring.

* Pinning numpy version.

* add metadata and behavior_specs initialization

* addressing behaviour_spec issues

* Bumped PZ version to 1.14.0. Fixed failing tests.

* Refactored gym-unity and petting-zoo into ml-agents-envs

* Added TODO to pydoc-config.yaml

* Refactored gym and pz to be under a subpackage in mlagents_env package

* Refactored ml-agents-envs docs.

* Minor update to PZ API doc.

* Updated mlagents_envs docs and colab.

* Updated pytest gh workflow to remove ref to gym and pz.

* Refactored to remove some test coupling between trainers and envs.

* Updated installation doc.

* Update ml-agents-envs/README.md

Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com>

* Updated failing yamato jobs.

* pettingzoo api prototype

* add example

* update file names

* support multiple behavior names

* fix multi behavior action index

* add install in colab

* add setup

* update colab

* fix __init__

* clone single branch

* import tags only

* import in init

* catch import error

* update colab

* move colab and add readme

* handle agent dying

* add tests

* update doc

* add info

* add action mask

* fix action mask

* update action masks in colab

* change default env

* set version

* fix hybrid action

* fix colab for hybrid actions

* add note on auto reset

* Updated colab name.

* Update README.md

* Following petting_zoo registry API (#5557)

* init petting_zoo registry

* cherrypick Custom trainer editor analytics (#5511)

* cherrypick "Update dotnet-format to address breaking changes introduced by upstream changes (#5528)"

* Update colab to match pettingZoo import api

* ToRevert: pull exp-petting-registry branch

* Add init file to tests

* Install pettingzoo-unity requirements for pytest

* update pytest command

* Add docstrings and comments

* update coverage to pettingzoo folder

* unset log level

* update env string

* Two small bugfixes (#5589)

1. Add the missing `_cumulative_rewards` property
2. Update `agent_selection` to not error out when an agent finishes an episode.

* Updated gym to 0.21.0 and petting zoo to 1.13.1, fixed bugs with AEC wrapper for gym and PZ updates. API tests are passing.

* Some refactoring.

* Finished inital implementation of parallel. Tests not passing.

* Finished parallel API implementation and refactor. All PZ tests passing.

* Cleanup.

* Refactoring.

* Pinning numpy version.

* add metadata and behavior_specs initialization

* addressing behaviour_spec issues

* Bumped PZ version to 1.14.0. Fixed failing tests.

* Refactored gym-unity and petting-zoo into ml-agents-envs

* Added TODO to pydoc-config.yaml

* Refactored gym and pz to be under a subpackage in mlagents_env package

* Refactored ml-agents-envs docs.

* Minor update to PZ API doc.

* Updated mlagents_envs docs and colab.

* Updated pytest gh workflow to remove ref to gym and pz.

* Refactored to remove some test coupling between trainers and envs.

* Updated installation doc.

* Update ml-agents-envs/README.md

Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com>

* Updated CHANGELOG.

* Updated Migration guide.

* Doc updates based on CR.

* Updated github workflow for colab tests.

* Updated github workflow for colab tests.

* Updated github workflow for colab tests.

* Fixed yamato import error.

Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>
Co-authored-by: Miguel Alonso Jr <miguelalonsojr>
Co-authored-by: jmercado1985 <75792879+jmercado1985@users.noreply.github.com>
Co-authored-by: Maryam Honari <honari.m94@gmail.com>
Co-authored-by: Henry Peteet <henry.peteet@unity3d.com>
Co-authored-by: mahon94 <maryam.honari@unity3d.com>
Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com>

* Refactoring for extraction to separate repo.

* removing unused github workflows.

* Updates for pre-commit and github actions.

* Fixed bug with validate_meta_files.py missing a file to exclude.

* Updated precommit gh action.

* Refactored yamato pack.

* Added yamato test yaml.

* Updated to CHANGELOG and VectorSensor from mono repo before swap.

* Pin dotnet version for GHA formatting.

* Refactor for MLA pro support. (#1)

* Refactored for MLA pro support.

* Fixed bug in yamato test.

* Updated the CHANGELOG.

* Fixed trigger logic in yamato test.

* Fixed failing tests. (#2)

* Fixed failing tests.

* Adding meta files.

* Fix execution order (#4)

* Fixed execution order.

* Fixed bug with CRT build.

* precommit formatting update.

* Updated agent Awake docstring.

* Set execution order to be negative to make sure scripts run before default time. (#5)

* Added #if to exclude communicator registration in the Agent awake script (#6)

* Add visibility to pro package (#7)

* add coverage publish promotion yamato jobs (#3)

* bump barracuda to 3.0.0 (#9)

upgrade to barracuda 3.0.0

* Updated README. (#8)

* add validation test dep. to publish job (#12)

* add validation test dep. to publish job
* fix package path

* Update automated trigger rules to support oneflow develop branch (#11)

* Update automated trigger rules to support oneflow develop branch

* Fix release branch name for github actions

* Release 2.3.0 exp.1 (#13)

* upgrade package to 2.3.0-exp.1

* rebase on develop branch

* update changelog

* remove unreleased section (#16)

* Updated experimental prerelease version and changelog. (#18)

* Added submodule update gha. (#19)

* fix bugs from sonar-scanner (#21)

* add ml-agents-pro as a upstream for submodule update (#22)

* Added support for standalone use of Academy with refactored communicator factory. (#24)

* clean commandline Arguments (#25)

* clean commandline Arguments
* fix precommit

* update unrelease section in changelog (#27)

* MLA-2537 Reorder RayPerceptionSensor output to reflect its spatial structure. (#26)

* Changes to support in order rays.  GUI for switching to this.  Updated tests.

* Updated README. (#29)

* Fixed samples. (#30)

* Added camera sensor component checkbox for runtime enablement. update… (#31)

* Added camera sensor component checkbox for runtime enablement. updated camera sensor to only be active when enabled for runtime.

* Updated yamato job.

* Updated CHANGELOG.

* Fixed issue with Academy not building for webgl builds. (#32)

* Dev bump min ver (#33)

* Bumped min version of Unity to 2021.3.

* Updated min version of python in pre-commit. Updated changelog.

* Moving files into packages/com.unity.ml-agents directoy.

* Remove com.unity.ml-agents submodule.

* Added back in ml-agents package.

* Removed packages temp folder.

* Updated yamato protobuf gen.

* Updated yamato to remove submodule update.

* Updated linux tool chain.

* Updated Project package dependencies.

* Updated flake8 repo link in precommit.

Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
Co-authored-by: Arthur Juliani <awjuliani@gmail.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>
Co-authored-by: Chris Goy <christopherg@unity3d.com>
Co-authored-by: Marwan Mattar <marwan@unity3d.com>
Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>
Co-authored-by: Ervin Teng <ervin@unity3d.com>
Co-authored-by: Hunter-Unity <hunter@unity3d.com>
Co-authored-by: Maryam Honari <honari.m94@gmail.com>
Co-authored-by: Miguel Alonso Jr <76960110+miguelalonsojr@users.noreply.github.com>
Co-authored-by: Miguel Alonso Jr <miguel.alonso@miguel.alonso-00MM>
Co-authored-by: Jason Bowman <jasonb@unity3d.com>
Co-authored-by: Henry Peteet <henry.peteet@unity3d.com>
Co-authored-by: cmard <87716492+cmard@users.noreply.github.com>
Co-authored-by: Chingiz Mardanov <chingiz.mardanov@unity3d.com>
Co-authored-by: Jason Rupert <92821409+jrupert-unity@users.noreply.github.com>
Co-authored-by: Xun Cao <xun.cao@outlook.com>
Co-authored-by: jmercado1985 <75792879+jmercado1985@users.noreply.github.com>
Co-authored-by: mahon94 <maryam.honari@unity3d.com>
Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com>
Co-authored-by: Xun Cao <xun.cao@unity3d.com>
Co-authored-by: Jason Rupert <jason.rupert@unity3d.com>
2022-11-16 14:39:59 -05:00
Maryam Honari 547acbed23 Trainer qa fix (#78)
* grammer fixes

* fix nit comments from QA

* add info about on/off policy

* add more context to the code block

* more context and minor fix

* pip3

Co-authored-by: zhuo <zhuo@unity3d.com>
2022-11-10 17:15:20 -08:00
Maryam Honari df96d5c835 Develop custom trainers (#73)
* Make create_policy more generic (#54)

* add on/off policy classes and inherit from

* trainers as plugins


* remove swap files

* clean up registration debug

* clean up all pre-commit

* a2c plugin pass precommit

* move gae to trainer utils

* move lambda return to trainer util

* add validator for num_epoch

* add types for settings/type methods

* move create policy into highest level api

* move update_reward_signal into optimizer

* move get_policy into Trainer

* remove get settings type

* dummy_config settings

* move all stats from actor into dict, enables arbitrary actor data

* remove shared_critic flag, cleanups

* refactor create_policy

* remove sample_actions, evaluate_actions, update_norm from policy

* remove comments

* fix return type get stat

* update poca create_policy

* clean up policy init

* remove conftest

* add sharedecritic to settings

* fix test_networks

* fix test_policy

* fix test network

* fix some ppo/sac tests

* add back conftest.py

* improve specification of trainer type

* add defaults fpr trainer_type/hyperparam

* fix test_saver

* fix reward providers

* add settings check utility for tests

* fix some settings tests

* add trainer types to run_experiment

* type check for arbitary actor data

* cherrypick rename ml-agents/trainers/torch to torch_entities (#55)

* make all trainers types and setting visible at module level

* remove settings from run_experiment console script

* fix test_settings and upgrade config scripts

* remove need of trainer_type argument up to trainefactory

* fix gohst trainer behavior id in policy Queue

* fix torch shadow in tests

* update trainers, rl trainers tests

* update tests to match the refactors

* fixing behavior name in ghost trainer

* update ml-agents-envs test configs

* separating the plugin package changes

* bring get_policy back for sake of ghost trainer

* add return types and remove unused returns

* remove duplicate methods in poca (_update_policy, add_policy)

Co-authored-by: mahon94 <maryam.honari@unity3d.com>

* Online/offline custom trainer examples with plugin system (#52)

* add on/off policy classes and inherit from

* trainers as plugins

* a2c trains

* remove swap files

* clean up registration debug

* clean up all pre-commit

* a2c plugin pass precommit

* move gae to trainer utils

* move lambda return to trainer util

* add validator for num_epoch

* add types for settings/type methods

* move create policy into highest level api

* move update_reward_signal into optimizer

* move get_policy into Trainer

* remove get settings type

* dummy_config settings

* move all stats from actor into dict, enables arbitrary actor data

* remove shared_critic flag, cleanups

* refactor create_policy

* remove sample_actions, evaluate_actions, update_norm from policy

* remove comments

* fix return type get stat

* update poca create_policy

* clean up policy init

* remove conftest

* add sharedecritic to settings

* fix test_networks

* fix test_policy

* fix test network

* fix some ppo/sac tests

* add back conftest.py

* improve specification of trainer type

* add defaults fpr trainer_type/hyperparam

* fix test_saver

* fix reward providers

* add settings check utility for tests

* fix some settings tests

* add trainer types to run_experiment

* type check for arbitary actor data

* cherrypick rename ml-agents/trainers/torch to torch_entities (#55)

* make all trainers types and setting visible at module level

* remove settings from run_experiment console script

* fix test_settings and upgrade config scripts

* remove need of trainer_type argument up to trainefactory

* fix gohst trainer behavior id in policy Queue

* fix torch shadow in tests

* update trainers, rl trainers tests

* update tests to match the refactors

* fixing behavior name in ghost trainer

* update ml-agents-envs test configs

* fix precommit

* separating the plugin package changes

* bring get_policy back for sake of ghost trainer

* add return types and remove unused returns

* remove duplicate methods in poca (_update_policy, add_policy)

* add a2c trainer back

* Add DQN cleaned up trainer/optimizer

* nit naming

* fix logprob/entropy types in torch_policy.py

* clean up DQN/SAC

* add docs for custom trainers,TODO: refrence tutorial

* add docs for custom trainers,TODO: refrence tutorial

* add clipping to loss function

* set old importlim-metadata version

* bump precomit hook env to 3.8.x

* use smooth l1 loss

Co-authored-by: mahon94 <maryam.honari@unity3d.com>

* add tutorial for validation

* fix formatting errors

* clean up

* minor changes

Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com>
Co-authored-by: zhuo <zhuo@unity3d.com>
2022-10-20 16:06:58 -04:00
Miguel Alonso Jr 7e0b5113ce Removed old backwards compatibility test since tensorflow/2019.4 are no longer s upported. (#76) 2022-10-19 10:22:28 -04:00
Miguel Alonso Jr f9b7399338 Dev bump min ver (#74)
* Bumped minimum version of Unity.

* Bumped minimum version of python and pytorch. fixed issues with deprecation warnings for x.T (tensor transpose).

* Fixed np.bool deprecation warning. This is a port of an external PR: https://github.com/Unity-Technologies/ml-agents/pull/5795. (#71)

* Updated submodule.

* Bumped max ver of python.

* Bumped to 2021.3 in yamato tests.

* Fixed version issue with torch in test constraints.

* update sysroot version

* update toolchain version

Co-authored-by: zhuo <zhuo@unity3d.com>
2022-10-18 21:27:43 -04:00
GitHub Actions - update submodules 02f5621205 Update submodules 2022-10-18 18:38:08 +00:00
Maryam Honari 04591804d8 extend stale bot to 90 days (#70) 2022-09-22 12:17:42 -07:00
Maryam Honari dfa5cfd938 remove torch shadow in trainers tests (#69) 2022-08-22 11:07:29 -07:00
GitHub Actions - update submodules bd5c8eae7e Update submodules 2022-08-22 16:26:02 +00:00
Miguel Alonso Jr 296979f033 Fixed upm ci bug in yamato tests. (#67) 2022-08-09 14:15:52 -04:00
GitHub Actions - update submodules 0848b2c317 Update submodules 2022-08-08 20:09:13 +00:00
Miguel Alonso Jr bbc7d59012 Fix failing gha pytest (#66)
* Updated pytest.

* Updated SURVEY.md to remove dead survey link.

* Updated bug_report.md to update dead link.
2022-07-28 12:19:27 -04:00
Miguel Alonso Jr 363fbe4ac7 Updated validate meta files. (#65) 2022-07-26 10:20:18 -04:00
GitHub Actions - update submodules 7941c08b90 Update submodules 2022-07-22 14:41:02 +00:00
GitHub Actions - update submodules b0ba56992e Update submodules 2022-07-20 14:19:29 +00:00
Jason Rupert ef2fc6a95d Update docs to explain Alternating Ray Order switch (mla-2537) (#61)
* Update docs to explain Alternating Ray Order switch (mla-2537)
2022-06-14 09:33:20 -07:00
GitHub Actions - update submodules c9fe570dc2 Update submodules 2022-05-26 19:13:57 +00:00
Maryam Honari e7c566ab84 rename ml-agents/trainers/torch to torch_entities (#55)
* rename ml-agents/trainers/torch to torch_entities
* fix mock patch
2022-05-25 12:09:01 -07:00