1 19 5g assignment details ds 402 section 001 trend in data sci 22411
Search for question
Question
1:19
5G
Assignment details
DS 402, Section 001: TREND IN DATA SCI (22411--UP---P-D...
Objective
This lab is meant for you to work with a neural
network that solves MDPs the same way the table-
based Q-learning agent did (but we switched the
function representation and approximation).
Recipe Ingredients
Add the following files to your project from Lab3:
agent/DeepQNetworkAgent.py
testLab4.py
Arrange the DeepQNetworkAgent.py in the agent
folder. The test file should be in the top level
directory of your code project.
This project has a dependency on the sklearn's
MLPRegressor. To add this package to your virtual
machine in PyCharm, go to Preferences -> Project -
> Python Interpreter. There you can press the +
button and search for "scikit-learn" ("sklearn" will
also work, but it is deprecated).
Submit assignment
◄ Previous
Next ▸
12
51
Dashboard Calendar
To-do
Notifications
Inbox 1:19A
5G
Assignment details
DS 402, Section 001: TREND IN DATA SCI (22411--UP---P-D...
Your Tasks
Before you begin the graded tasks, feel free to take
some time reviewing all the new code and output of
lab 4's tests 1+2, which follow "the movie" format
we have seen in previous labs. As you do this, feel
free to insert print statements anywhere you think
might clarify things for you.
1. Task 1: Understand DQN training:
A. Run lab4test3 to observe how our basic
framework trains up a neural network.
B. (TURN THIS IN) Prepare a learning
curve for this training run. What do you
see in it? In particular, what do you think
is causing it to not improve consistently
over time?
2. Task 2: DQNs and greed
A. Run lab4test4 to observe three training
sessions varying the probability of
selecting a greedy action.
B. (TURN THIS IN) Prepare learning
curves for each of the three trainings.
What do you see in them?
3. Task 3: DQNs and learning rate
A. Run lab4test5 to observe three training
sessions varying the initial learning rate
Submit assignment
◄ Previous
Next ▸
12
51
Dashboard Calendar
To-do
Notifications
Inbox 1:19A
5G
Assignment details
DS 402, Section 001: TREND IN DATA SCI (22411--UP---P-D...
curves for each of the three trainings.
What do you see in them?
3. Task 3: DQNs and learning rate
A. Run lab4test5 to observe three training
sessions varying the initial learning rate
of the ADAM optimizer.
B. (TURN THIS IN) Prepare learning
curves for these training sessions. What
do you see in them?
4. Task 4: DQNs and network depth
A. Run lab4test7 (note that we skipped 6,
feel free to check that test out, but it is
not required) to observe three training
sessions varying the depth of the
network while keeping the neuron count
fixed.
B. (TURN THIS IN) Prepare learning
curves for these training sessions. What
do you see in them?
5. Task 5: DQNs and network width
A. Run lab4test8 to observe three training
sessions varying the width of the
network.
B. (TURN THIS IN) Prepare learning
curves for these training sessions. What
do you see in them?
6. Task 6: DQNs and regularization
A. Run lab4test9 to observe three training
Submit assignment
◄ Previous
Next ▸
12
51
Dashboard Calendar
To-do
Notifications
Inbox 1:19A
5G
Assignment details
DS 402, Section 001: TREND IN DATA SCI (22411--UP---P-D...
do you see in them?
5. Task 5: DQNs and network width
A. Run lab4test8 to observe three training
sessions varying the width of the
network.
B. (TURN THIS IN) Prepare learning
curves for these training sessions. What
do you see in them?
6. Task 6: DQNs and regularization
A. Run lab4test9 to observe three training
sessions varying the amount of
regularization we apply to the network.
B. (TURN THIS IN) Prepare learning
curves for these training sessions. What
do you see in them?
7. Task 7: Compare DQN with table-based Q-
learning.
A. (TURN THIS IN) Refer back to your
answers from lab 3. Compare and
contrast your answers to this lab with
the analogous questions there.
B. (TURN THIS IN) Examine lab11test10.
Using the framework there, create the
best DQN you can to solve that
specified MDP (feel free to change the
source in the DQNAgent itself too).
Provide any modified source, a learning
curve for your agent, and a reflection
Submit assignment
◄ Previous
Next ▸
12
51
Dashboard Calendar
To-do
Notifications
Inbox 1:19
5G
Assignment details
DS 402, Section 001: TREND IN DATA SCI (22411--UP---P-D...
B. (TURN THIS IN) Prepare learning
curves for these training sessions. What
do you see in them?
7. Task 7: Compare DQN with table-based Q-
learning.
Submit
A. (TURN THIS IN) Refer back to your
answers from lab 3. Compare and
contrast your answers to this lab with
the analogous questions there.
B. (TURN THIS IN) Examine lab11test10.
Using the framework there, create the
best DQN you can to solve that
specified MDP (feel free to change the
source in the DQNAgent itself too).
Provide any modified source, a learning
curve for your agent, and a reflection
about approaches you tried as you
settled on a particular network
configuration and why you settled on
what you did.
A file that is readable (pdf, docx, etc) containing
your charts, explanations, and neural network
creation function(s).
◄ Previous
Submit assignment
Next ▸
12
51
Dashboard Calendar
To-do
Notifications
Inbox