Instructor info
Course description
What makes something a good explanation? This graduate seminar examines explanation from three complementary perspectives: philosophy of science, cognitive psychology, and artificial intelligence. In the first sessions we look at classical accounts from the philosophy of science that attempt to define normative criteria for what counts as a good scientific explanation. A second part of the seminar explores the cognitive science of explanations, which tries to capture the criteria underlying people's common sense explanations. We look into the analogies between those and the normative principles discussed by philosophers. Finally, the last part of the course applies some of these insights to the emerging challenges of AI interpretability. We will see how many of the strategies deployed to make black box models amenable to human understanding can be understood as applications of the criteria for good explanations studied by philosophers and psychologists.
Schedule
Theories of Scientific Explanation
Sessions 1–5 · Weeks 1–3Cognitive Science of Explanation
Sessions 6–12 · Weeks 3–6Interpretable AI
Sessions 13–17 · Weeks 7–9Student Presentations
Sessions 18–19 · Week 10General information
What to expect?
What you can expect from me
I will …
- Provide an introduction and context for the papers discussed during the seminar
- Facilitate class discussions
- Provide feedback on final papers
- Be available during office hours
What I expect from you
You will …
- Attend class and participate in discussions
- Lead one discussion session
- Submit reaction posts (due 8pm before class)
- Write a final paper
- Present your work in Sessions 18–19
Grading
- 1/3 class participation
- 1/3 reaction posts
- 1/3 final paper
Reaction posts
Reaction posts should be submitted by 8pm the night before class. They should be 1-2 paragraphs and engage critically with the assigned readings. You should express your opinion rather than summarize the contents of the paper. You may raise questions, identify connections to other material, or offer a brief argument.
Final paper
The final paper should be 1000-2000 words and engage substantively with topics covered in the course. It may be one of the following three:
- An empirical project proposal (can include computational modelling projects)
- A literature review based on one of the class topics
- A theoretical essay
You will also be expected to give a short presentation of your project during the last two sessions of the course, which will be part of the grading for the final paper.
Policies
Please familiarize yourself with Stanford’s honor code. We will adhere to it and follow through on its penalty guidelines.
Access and accommodations
Stanford is committed to providing equal educational opportunities for disabled students. Disabled students are a valued and essential part of the Stanford community. We welcome you to our class. If you experience disability, please register with the Office of Accessible Education (OAE). Professional staff will evaluate your needs, support appropriate and reasonable accommodations, and prepare an Academic Accommodation Letter for faculty. To get started, or to re-initiate services, please visit oae.stanford.edu. If you already have an Academic Accommodation Letter, we invite you to share your letter with us. Academic Accommodation Letters should be shared at the earliest possible opportunity so we may partner with you and OAE to identify any barriers to access and inclusion that might be encountered in your experience of this course.