CS 294-277, Robots That Learn (Fall 2024)

Logistics

UC Berkeley Course Number 34334

Time: Mondays 3-5PM

~ Welcome from Jitendra ~

Robotics has been late to the deep learning revolution compared to computer vision and natural language processing, mainly because “big data” is not so readily available. However significant advances have been made in the last few years, and the purpose of this class is to present a coherent framework for studying these advances.

My goal is to build machines that can emulate the remarkable capabilities of humans and other animals at motor control, defined as connecting perception to action in the physical world. Hence there will be a distinct preference for legs over wheels, multi-finger hands over parallel jaw grippers, rich visual and tactile perception over minimal sensing, humanoid robots in the home over specialist robots on the factory floor. The course itself will have three main parts (1) biological motor control basics for inspiration (2) main paradigms for robot motor skill acquisition (3) case studies of locomotion, navigation and manipulation.

Prerequisites

This is a graduate level class, and prerequisite knowledge at the level of Deep Learning (Bishop & Bishop), and Reinforcement Learning (Sutton & Barto) will be assumed.

We have now closed class enrollment. Here is a list of self-evaluation questions for interested students to see whether they are ready for the class.

Course Format

Each session consists of a two 1-hour lectures with a 10 minute break in between, i.e. 3:10-4 PM and 4:10-5PM. We will use notations A/B to denote the sub-sessions: e.g. Lecture 1A, 1B, 2A, 2B, …

Schedule

The following schedule is a tentative assignment and will evolve in real time. Weekly materials will be added.

Lecture 1A (9/9) Introduction [slides]
Lecture 1B (9/9) Biomechanics of walking and running [slides]
Lecture 2A (9/16) Robot mechanisms - kinematics and dynamics [slides] [video]
Lecture 2B (9/16) The human hand and dexterous object manipulation
Lecture 3A (9/23) Robot hands [slides] [video]
Lecture 3B (9/23) Proprioception and tactile perception [slides] [video]
Lecture 4A (9/30) Vision for action [slides] [video]
Lecture 4B (9/30) The developmental perspective on motor control [slides] [video]
Lecture 5A (10/7) Robot dynamics, control, and motion planning [slides] [video]
Lecture 5B (10/7) Computational neuroscience perspective on prediction and control [slides] [video]
Lecture 6AB (10/14) Reinforcement Learning [slides] [videoA] [videoB]
Lecture 7AB (10/21) Behavior cloning [slides] [video]
Lecture 8AB (10/28) Visual Imitation [slideA] [slideB] [video]
Lecture 9AB (11/04) Case Studies in Locomotion [slideA] [slideB] [video]
Veterans Day (11/11)
Lecture 10AB (11/18) Case Studies in Navigation [slideA] [slideB] [video]
Lecture 11AB (11/25) Case Studies in Dexterous Manipulation [slides] [videoA] [videoB]
Lecture 12AB (12/2) Long horizon planning and the role of language [slides] [videoA]
RRR Week (12/9) Final Project Presentations [Sign-Up Sheet]

Please see below (“Reading Materials”) for link to reading assignment submission form.

Coursework

10% Weekly Reading Assignment
10% Lecture Scribing
30% Midterm (11/18, first hour of the class)
50% Final Project

Weekly Reading Assignment: For every weekly reading, each student should come up with 2 multiple choice questions, and supply with answers. We will send out a Google form for submission each week.

Lecture Scribing: For each lecture, two student scribes will organize lecture notes in LaTeX. The students can decide to submit a single note together or individually; grades will be assigned based on note quality. The lecture notes should be ready by the same Friday. Sign-up sheet here.

Note: since the lecture content does not necessarily align with lecture title, each scribe is only required to cover the time slot they have signed up for.

Midterm: During the first hour of our 11/18 class, we will have a mid-term exam based on questions sourced from the weekly reading assignments so far. A total of about 30 multiple-choice questions will be given. One 8.5”x11”, double-sided cheat sheet will be permitted.

Final Project: The goal of the final project is to explore and push the boundaries of robot learning, choosing from topics presented in this course. Here are a few examples of possible project formats: proposal and evaluation of new algorithms, investigation of a robotic application, benchmarking a range of existing methods, etc. Ideally, the project covers interesting new ground and could be the foundation for a future conference paper submission or product. The project can be done in groups of 1-4 people. Note that our expectations will scale linearly with the number of people in the group.

❗Final Project Logistics

Project Proposal: To keep track of your final project progress, please submit a 1 page project proposal using this template by EOD 11/19 (the day after midterm). Please submit through a Google doc shared with the course instructor and TA, so we can give feedback and suggestions. Note that this proposal will not be graded.

Project Presentation: We will have the final project presentations on 12/9 during our regular class time (starting at 3:10pm). Due to the large number of projects we have, every group will have 5 minutes to present (no Q&A). This will serve as the most important basis for grading.

Project Report: The final project report will be due by EOD 12/13. Please use CoRL 2024 format for the project report, with a maximum of 4 pages. We would like you to focus on the problem setting, why it matters and what’s interesting/novel about it, your approach, your results, analysis of results, limitations, and future directions. Cite and briefly survey prior work as appropriate but don’t re-write prior work when not directly relevant to understanding your approach. References will not be counted against the 4 pages.

❗Assignment Deadlines

Weekly Reading: 23:59PM PT, the Friday after assignment release
Lecture Scribing: 23:59PM PT, the Friday after scribed lecture

Background Materials

Reading materials

Lecture 1

Lecture 1B

T. K. Uchida and S. L. Delp. Biomechanics of movement: the science of sports, robotics, and rehabilitation. Mit Press, 2021.
P. Ramdya and A. J. Ijspeert. The neuromechanics of animal locomotion: From biology to robotics and back. Science Robotics, 8(78):eadg0279, 2023. [PDF]

(It is not expected to read the Uchida-Delp book, but we will cover a couple of chapters from it.)

Lecture 2

[Reading Assignment Submission Form]

Lecture 2A

In advance of lecture 2A, students should try to familiarize themselves with how 3D rotations and translations are represented. We would like students to learn about “exponential coordinates” - how a rotation matrix is the exponential of a skew-symmetric matrix corresponding to the axis of rotation, and when rotation is accompanied by translation, we use the exponential of a twist. This formalism results in an elegant way to specify the forward kinematics of a robot using the product of matrix exponentials. The Li-Murray-Sastry textbook and the Lynch-Park textbook are good sources. You can find lectures on YouTube for Lynch-Park. I recommend the ones corresponding to Chapter 3. [link]

Lecture 2B

Video on human hand anatomy [link]

Lecture 3

[Reading Assignment Submission Form]

Lecture 3A

Piazza, Cristina, et al. “A century of robotic hands.” Annual Review of Control, Robotics, and Autonomous Systems 2.1 (2019): 1-32. [PDF]
LEAP Hand, LEAP Hand v2

Lecture 3B

Jones, L. A. “Human hand function.” (2006). [PDF]
Esther P Gardner, “Touch” (2010). [PDF]

Lecture 4

[Reading Assignment Submission Form]

Lecture 4A

Land, M et al. “The roles of vision and eye movements in the control of activities of daily living.” Perception vol. 28,11 (1999): 1311-28. doi:10.1068/p2935 [PDF]

Lecture 4B

Loquercio, Antonio, Ashish Kumar, and Jitendra Malik. “Learning visual locomotion with cross-modal supervision.” 2023 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2023. [PDF]
Smith, Linda B. and Michael Gasser. “The Development of Embodied Cognition: Six Lessons from Babies.” Artificial Life 11 (2005): 13-29. [PDF]

Lecture 5

[Reading Assignment Submission Form]

Kawato M. Internal models for motor control and trajectory planning. Curr Opin Neurobiol. 1999 Dec;9(6):718-27. doi: 10.1016/s0959-4388(99)00028-8. PMID: 10607637. [PDF]
Flanagan JR, Bowman MC, Johansson RS. Control strategies in object manipulation tasks. Curr Opin Neurobiol. 2006 Dec;16(6):650-9. doi: 10.1016/j.conb.2006.10.005. Epub 2006 Nov 3. PMID: 17084619. [PDF]

Lecture 6

[Reading Assignment Submission Form]

“Learning to Walk via Deep Reinforcement Learning.” RSS 2019. Tuomas Haarnoja, Sehoon Ha, Aurick Zhou, Jie Tan, George Tucker, Sergey Levine
Learning Dexterous In-Hand Manipulation. IJRR 2019. OpenAI et al.

Lecture 7

[Reading Assignment Submission Form]

Diffusion Policy [Website] [PDF]

Lecture 8

[Reading Assignment Submission Form]

Shaw K, Bahl S, Sivakumar A, Kannan A, Pathak D. Learning dexterity from human hand motion in internet videos. The International Journal of Robotics Research. 2024;43(4):513-532. doi:10.1177/02783649241227559 [PDF]
Goyal, Mohit, et al. “Human hands as probes for interactive object understanding.” Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022. [PDF]
Kumar, Ashish, Saurabh Gupta, and Jitendra Malik. “Learning navigation subroutines from egocentric videos.” Conference on Robot Learning. PMLR, 2020. [PDF]

Lecture 9

[Reading Assignment Submission Form]

Scott Kuindersma’s talk on BD MPC [Link]
Robin Deit’s recent RSS talk on BD MPC to supplement Scott’s talk [Link]
Russ Tedrake’s lecture on humanoids covering ZMP [Link]

Lecture 10

[Reading Assignment Submission Form]

Chang, Matthew, et al. “Goat: Go to any thing.” arXiv preprint arXiv:2311.06430 (2023). [PDF]

Lecture 11

[Reading Assignment Submission Form]

Zhao, Tony Z., et al. “Learning fine-grained bimanual manipulation with low-cost hardware.” arXiv preprint arXiv:2304.13705 (2023). [PDF]
Zhao, Tony Z., et al. “Aloha unleashed: A simple recipe for robot dexterity.” arXiv preprint arXiv:2410.13126 (2024). [PDF]
Dalal, Murtaza, et al. “Local Policies Enable Zero-shot Long-horizon Manipulation.” arXiv preprint arXiv:2410.22332 (2024). [PDF]

CS 294-277, Robots That Learn (Fall 2024)

Logistics

[YouTube Playlist]

[Course Notes]

~ Welcome from Jitendra ~

Prerequisites

Course Format

Schedule

Coursework

❗Final Project Logistics

❗Assignment Deadlines

Background Materials

Reading materials

Lecture 1

Lecture 1B

Lecture 2

Lecture 2A

Lecture 2B

Lecture 3

Lecture 3A

Lecture 3B

Lecture 4

Lecture 4A

Lecture 4B

Lecture 5

Lecture 6

Lecture 7

Lecture 8

Lecture 9

Lecture 10

Lecture 11