The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
unity
machine-learning
unity3d
deep-learning
reinforcement-learning
neural-networks
deep-reinforcement-learning
Обновлено 2024-10-28 14:18:34 +03:00
TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.
Обновлено 2024-10-18 16:33:49 +03:00
A learning environment for man-made Interactive Fiction games.
Обновлено 2024-10-15 20:11:05 +03:00
Multi-Agent Resource Optimization (MARO) platform is an instance of Reinforcement Learning as a Service (RaaS) for real-world resource optimization problems.
docker
reinforcement-learning
simulator
finance
inventory-management
logistics
maro
multi-agent
multi-agent-reinforcement-learning
operations-research
raas
resource-optimization
rl-algorithms
transportation
agent
citi-bike
Обновлено 2024-02-23 11:45:58 +03:00
Debugging, monitoring and visualization for Python Machine Learning and Data Science
machine-learning
python
deep-learning
data-science
ai
monitoring
reinforcement-learning
jupyter
jupyter-notebook
debugging
deeplearning
debug
machinelearning
explainable-ai
explainable-ml
saliency
debugging-tool
model-visualization
Обновлено 2023-08-30 10:47:36 +03:00
Visuomotor policies from event-based cameras through representation learning and reinforcement learning. Accompanies our paper: https://arxiv.org/abs/2103.00806
Обновлено 2023-08-15 00:50:21 +03:00
Logarithmic Reinforcement Learning
reinforcement-learning
algorithms
algorithm
discount
discount-factor
dqn
gap
log-rl
logrl
rl
action-gap
Обновлено 2023-03-25 04:36:23 +03:00
Showcase environment for ML-Agents
unity
unity3d
machine-learning
reinforcement-learning
game-development
ml-agents
reinforcement-learning-environments
Обновлено 2022-05-05 01:10:53 +03:00
DOTS compatible version of ML-Agents
Обновлено 2021-10-04 23:09:24 +03:00
Obstacle Tower Environment
Обновлено 2020-07-29 04:08:41 +03:00
Demo project using tabular Q-learning algorithm
Обновлено 2018-01-12 05:11:34 +03:00