Prioritized Sweeping, Expected vs. Sample Updates, Trajectory Sampling, Real-time Dynamic Programming, Planning at Decision Time, Heuristic Search, Rollout Algorithms
#Exemplar/Case Studies
Multiagent Reinforcement Learning: Rollout and Policy Iteration, IEEE/CAA Journal of Automatica Sinica ( Volume: 8, Issue: 2, February 2021)
*Mapping of Course Outcomes for Unit VI
CO6
Learning Resources
Text Books:
1. Reinforcement Learning with TensorFlow: A beginner's guide to designing self-learning systems with TensorFlow and OpenAI Gym, Sayon Dutta, Packt Publishing (24 April 2018) 2. Reinforcement Learning: An Introduction, second edition Richard S. Sutton and Andrew G.
Barto, The MIT Press Cambridge, Massachusetts
Reference Books:
1. Sudharsan Ravichandiran, “Hands-On Reinforcement Learning with Python: Master Reinforcement and Deep Reinforcement Learning Using OpenAI Gym and TensorFlow”
2. Maxim Lapan, “Deep Reinforcement Learning Hands-On: Apply Modern RL Methods, with Deep Q-networks, Value Iteration, Policy Gradients, TRPO, AlphaGo Zero and More”
e-Resources:
1. Jason Gauci, “Horizon: Facebook's Open Source Applied Reinforcement Learning Platform”, https://doi.org/10.48550/arXiv.1811.00260
2. G. Tesauro, "Reinforcement Learning in Autonomic Computing: A Manifesto and Case Studies," in IEEE Internet Computing, vol. 11, no. 1, pp. 22-30, Jan.-Feb. 2007, doi:
10.1109/MIC.2007.21
3. D. Bertsekas, "Multiagent Reinforcement Learning: Rollout and Policy Iteration," in IEEE/CAA Journal of Automatica Sinica, vol. 8, no. 2, pp. 249-272, February 2021, doi:
10.1109/JAS.2021.1003814 MOOC Courses:
1. Reinforcement Learning Specialization: https://www.coursera.org/specializations/reinforcement- learning
2. Reinforcement Learning Lecture Series 2021: https://www.deepmind.com/learning- resources/reinforcement-learning-lecture-series-2021
3. Introduction to Reinforcement Learning with David Silver: https://www.deepmind.com/learning- resources/introduction-to-reinforcement-learning-with-david-silver
4. Deep RL Bootcamp: https://sites.google.com/view/deep-rl-bootcamp/lectures
The CO-PO Mapping Matrix
CO/
PO PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 3 - - - - - - - - - - -
CO2 3 2 3 - 3 - - - - - - -
CO3 3 3 3 - 3 - - - - - - -
CO4 3 3 3 3 3 - - - 2 - - -
CO5 3 3 3 3 3 - - - - - - -
CO6 3 - - 3 - - - - - - - 2
Syllabus for Fourth Year of Artificial Intelligence and Data Science (2020 Course) 112/129
Savitribai Phule Pune University
Fourth Year of Artificial Intelligence and Data Science (2020 Course) 417534: Computer Laboratory III
Teaching Scheme:
PR: 02 Hours/Week
Credit 01
Examination Scheme and Marks Term Work (TW): 50 Marks Practical (PR): 25 Marks Prerequisites Courses: Computer Network Laboratory (317527), Software Laboratory-III (317536)
Companion Course: Computational Intelligence (417529), Distributed Computing (417530) Course Objectives:
To understand the fundamentals of a distributed environment in complex application
To introduce the concepts inspired by the human immune system and their application in problem-solving and optimization
To make students aware about security issues and protection mechanisms for distributed environments
To familiarize with various evolutionary algorithms and optimization techniques inspired by natural evolution processes
Course Outcomes:
After completion of the course, learners should be able to-
CO1: Apply the principles on which the internet and other distributed systems are based
CO2: Understand and apply the basic theoretical concepts and algorithms of distributed systems in problem solving
CO3: Apply fuzzy logic techniques to model and solve problems
CO4: Design and implement evolutionary algorithms to solve optimization and search problems in diverse domains
CO5: Design and implement artificial immune system algorithms to solve complex problems in different domains
Guidelines for Instructor's Manual
The instructor‘s manual is to be developed as a reference and hands-on resource. It should include prologue (about University/program/ institute/ department/foreword/ preface), curriculum of the course, conduction and Assessment guidelines, topics under consideration, concept, objectives, outcomes, set of typical applications/assignments/ guidelines, and references.
Guidelines for Student's Laboratory Journal
The laboratory assignments are to be submitted by students in the form of Journal. Journal consists of Certificate, table of contents, and handwritten write-up of each assignment (Title, Date of Completion, Objectives, Problem Statement, Software and Hardware requirements, Assessment grade/marks and assessor's sign, Theory- Concept in brief, algorithm, flowchart, test cases, Test Data Set(if applicable), mathematical model (if applicable), conclusion/analysis. Program codes with sample output of all performed assignments are to be submitted as softcopy. As a conscious effort and little contribution towards Green IT and environment awareness, attaching printed papers as part of write-ups and program listing to Journal must be avoided. Use of DVD/Softcopy containing student programs maintained by Laboratory In-charge is highly encouraged. For reference one or two journals may be maintained with program prints in the Laboratory.
Guidelines for Laboratory /Term Work Assessment
Continuous assessment of laboratory work should be based on overall performance of Laboratory assignments by a student. Each Laboratory assignment assessment will assign grade/marks based on parameters, such as timely completion, performance, innovation, efficient codes, and punctuality.
Guidelines for Practical Examination
Problem statements must be decided jointly by the internal examiner and external examiner. During practical assessment, maximum weightage should be given to satisfactory implementation of the problem statement. Relevant questions may be asked at the time of evaluation to test the student’s understanding of the fundamentals, effective and efficient implementation. This will encourage, transparent evaluation and fair approach, and hence will not create any uncertainty or doubt in the minds of the students. So, adhering to these principles will consummate our team efforts to the promising start of student's academics.
Guidelines for Laboratory Conduction
The instructor is expected to frame the assignments by understanding the prerequisites, technological aspects, utility and recent trends related to the topic. The assignment framing policy needs to address the average students and inclusive of an element to attract and promote the intelligent students. Use of open source software is encouraged. Based on the concepts learned. Instructors may also set one assignment or mini-project that is suitable to the AI & DS branch beyond the scope of the syllabus.
Recommended Programming Languages: Python or Java