From 8d31fc3c79fb0e123ba8073a4f80361837745717 Mon Sep 17 00:00:00 2001 From: Nico Melone Date: Fri, 3 Aug 2018 14:50:55 -0500 Subject: [PATCH 1/2] Create README.md --- README.md | 4 ++++ 1 file changed, 4 insertions(+) create mode 100644 README.md diff --git a/README.md b/README.md new file mode 100644 index 0000000..61d04ea --- /dev/null +++ b/README.md @@ -0,0 +1,4 @@ +# reinforcement-learning-quad-copter +A reinforcement learning task to have a simulated quad copter learn to achieve a particular goal + +This project was part of the Machine Learning Engineer nanodegree from Udacity. The assignment was to create a task and agent to train a quad copter in simulation. The task was left open ended as well was the model but the simulation environment was provided. I chose to have the quad copter try to learn to fly to a designated location and hover there. This is seen in the task.py file under the get_reward method. I employed a DDPG agent through the Keras-RL infrastructure. DDPG is a model that uses two networks that cooperate to acheive a task. One is the Actor and performs a task while the other is a Critic that judges how well the actor did a task. Both networks start out untrained and learn in tandum. From 0de85d29ab866640c5a470a3f0bc8ce2912fe880 Mon Sep 17 00:00:00 2001 From: Nico Melone Date: Mon, 27 Aug 2018 14:07:26 -0500 Subject: [PATCH 2/2] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 61d04ea..d6d5b2b 100644 --- a/README.md +++ b/README.md @@ -1,4 +1,4 @@ # reinforcement-learning-quad-copter A reinforcement learning task to have a simulated quad copter learn to achieve a particular goal -This project was part of the Machine Learning Engineer nanodegree from Udacity. The assignment was to create a task and agent to train a quad copter in simulation. The task was left open ended as well was the model but the simulation environment was provided. I chose to have the quad copter try to learn to fly to a designated location and hover there. This is seen in the task.py file under the get_reward method. I employed a DDPG agent through the Keras-RL infrastructure. DDPG is a model that uses two networks that cooperate to acheive a task. One is the Actor and performs a task while the other is a Critic that judges how well the actor did a task. Both networks start out untrained and learn in tandum. +This project was part of the Machine Learning Engineer nano-degree from Udacity. The assignment was to create a task and agent to train a quad copter in simulation. The task was left open ended as well was the model but the simulation environment was provided. I chose to have the quad copter try to learn to fly to a designated location and hover there. This is seen in the task.py file under the get_reward method. I employed a DDPG agent through the Keras-RL infrastructure. DDPG is a model that uses two networks that cooperate to achieve a task. One is the Actor and performs a task while the other is a Critic that judges how well the actor did a task. Both networks start out untrained and learn in tandem.