Reinforcement Learning | B.E. Semester VII Winter 2024 | GTU Papers

GTU Reinforcement Learning BE Winter 2024 question paper. Subject code: 3174208.

Questions (25)

Summarize the brief history of reinforcement learning.

[3 marks]

Illustrate the general idea of reinforcement learning using the Tic-Tac-Toe example.

[4 marks]

Describe the Elements of Reinforcement Learning in detail.

[7 marks]

Define and formulate the probability density function (PDF).

[3 marks]

Explain the concept of random variables in probability with a suitable example.

[4 marks]

Discuss the concepts of joint and conditional probability with their equations.

[7 marks]

Define Probability Mass Function (PMF). Explain with a suitable example.

[7 marks]

Illustrate a Markov Reward Process (MRP) with a suitable example.

[3 marks]

What are the roles of Optimal Value Functions in Markov decision process (MDP)?

[4 marks]

Formulate Policy Evaluation to compute the state-value function in dynamic programming.

[7 marks]

What is the Markov Property? List the criteria to identify the Markov Property.

[3 marks]

Solve the Bellman equation for v∗ for the simple Gridworld problem.

[4 marks]

Explain Policy Iteration in detail.

[7 marks]

Give an overview of Monte Carlo (MC) methods for model-free reinforcement learning.

[3 marks]

Illustrate on-policy and off-policy learning with an example.

[4 marks]

Write an algorithm for the first-visit MC method for estimating v_π.

[7 marks]

Explain how to apply Importance Sampling in an off-policy technique.

[3 marks]

Write down the value iteration algorithm.

[4 marks]

Write an algorithm for the every-visit MC method for estimating v_π.

[7 marks]

Define Temporal-Difference (TD) Learning. Give an overview of TD(0).

[3 marks]

What are the advantages of TD Prediction Methods?

[4 marks]

Explain Q-Learning as an off-policy TD control algorithm.

[7 marks]

What is meant by the term “Eligibility Traces”? What are the two ways to view eligibility traces?

[3 marks]

Differentiate between State-action-reward-state-action (SARSA) and Q-learning.

[4 marks]

Describe the n-step TD prediction technique in detail.

[7 marks]
