Stanford CS 329X | Human-Centered LLMs

Class Schedule

Note: This schedule is tentative and subject to change.

Week	Date	Theme	Course Material
1	Sept 24 Tuesday	Introduction to Human Centered NLP [slides]
1	Sept 26 Thursday	The Ultimate Crash into NLP and LLMs Prompting [slides]	Readings: Schulhoff, Sander, Michael Ilie, Nishant Balepur, Konstantine Kahadze, Amanda Liu, Chenglei Si, Yinheng Li et al. "The Prompt Report: A Systematic Survey of Prompting Techniques." arXiv preprint arXiv:2406.06608 (2024).
2	Oct 1 Tuesday	Learning from Human Preferences [slides]	Readings: Stiennon, Nisan, Long Ouyang, Jeffrey Wu, Daniel Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, and Paul F. Christiano. "Learning to summarize with human feedback." Advances in Neural Information Processing Systems 33 (2020): 3008-3021. Ouyang, Long, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang et al. "Training language models to follow instructions with human feedback." arXiv preprint arXiv:2203.02155 (2022). Rafailov, Rafael, Archit Sharma, Eric Mitchell, Christopher D. Manning, Stefano Ermon, and Chelsea Finn. "Direct preference optimization: Your language model is secretly a reward model." Advances in Neural Information Processing Systems 36 (2024).
2	Oct 3 Thursday	Personalization vs. Collective Opinion in Preference Tuning [slides]	Readings: Huang, Saffron, Divya Siddarth, Liane Lovitt, Thomas I. Liao, Esin Durmus, Alex Tamkin, and Deep Ganguli. "Collective Constitutional AI: Aligning a Language Model with Public Input." In The 2024 ACM Conference on Fairness, Accountability, and Transparency, pp. 1395-1417. 2024. Bergman, Stevie, Nahema Marchal, John Mellor, Shakir Mohamed, Iason Gabriel, and William Isaac. "STELA: a community-centered approach to norm elicitation for AI alignment." Scientific Reports 14, no. 1 (2024): 6616. Shaikh, Omar, Michelle Lam, Joey Hejna, Yijia Shao, Michael Bernstein, and Diyi Yang. "Show, Don't Tell: Aligning Language Models with Demonstrated Feedback." arXiv preprint arXiv:2406.00888 (2024). Ahmadian, Arash, Beyza Ermis, Seraphina Goldfarb-Tarrant, Julia Kreutzer, Marzieh Fadaee, and Sara Hooker. "The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm." arXiv preprint arXiv:2406.18682 (2024). Sorensen, Taylor, Jared Moore, Jillian Fisher, Mitchell L. Gordon, Niloofar Mireshghallah, Christopher Michael Rytting, Andre Ye et al. "Position: A Roadmap to Pluralistic Alignment." In Forty-first International Conference on Machine Learning.
3	Oct 8 Tuesday	Data, Data and Data [slides]	Readings: Longpre, Shayne, Robert Mahari, Naana Obeng-Marnu, William Brannon, Tobin South, Jad Kabbara, and Sandy Pentland. "Data Authenticity, Consent, and Provenance for AI Are All Broken: What Will It Take to Fix Them?." (2024). Gururangan, Suchin, Dallas Card, Sarah K. Dreier, Emily K. Gade, Leroy Z. Wang, Zeyu Wang, Luke Zettlemoyer, and Noah A. Smith. "Whose language counts as high quality? measuring language ideologies in text data selection." arXiv preprint arXiv:2201.10474 (2022). Zhao, Dora, Jerone TA Andrews, Orestis Papakyriakopoulos, and Alice Xiang. "Position: Measure Dataset Diversity, Don't Just Claim It." arXiv preprint arXiv:2407.08188 (2024). Lucy, Li, Suchin Gururangan, Luca Soldaini, Emma Strubell, David Bamman, Lauren Klein, and Jesse Dodge. "AboutMe: Using self-descriptions in webpages to document the effects of english pretraining data filters." In ACL (2024). Klein, Lauren, and Catherine D'Ignazio. "Data Feminism for AI." In The 2024 ACM Conference on Fairness, Accountability, and Transparency, pp. 100-112.
3	Oct 10 Thursday	Design Thinking + Natural Language as the New User Interface [slides]	Readings: Pea, Roy D. "User centered system design: new perspectives on human-computer interaction." Journal of Educational Computing Research 3, no. 1 (1987): 129-134. Friedman, Batya, David G. Hendry, and Alan Borning. "A survey of value sensitive design methods." Foundations and Trends in Human–Computer Interaction 11, no. 2 (2017): 63-125. Birhane, Abeba, William Isaac, Vinodkumar Prabhakaran, Mark Díaz, Madeleine Clare Elish, Iason Gabriel, and Shakir Mohamed. "Power to the People? Opportunities and Challenges for Participatory AI." Equity and Access in Algorithms, Mechanisms, and Optimization (2022).
4	Oct 15 Tuesday	Enabling Human-AI Interaction [slides]	Readings: Gao, Jie, Simret Araya Gebreegziabher, Kenny Tsu Wei Choo, Toby Jia-Jun Li, Simon Tangi Perrault, and Thomas W. Malone. "A Taxonomy for Human-LLM Interaction Modes: An Initial Exploration." In Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, pp. 1-11. 2024. Petridis, Savvas, Ben Wedin, James Wexler, Aaron Donsbach, Mahima Pushkarna, Nitesh Goyal, Carrie J. Cai, and Michael Terry. "ConstitutionMaker: Interactively Critiquing Large Language Models by Converting Feedback into Principles." arXiv preprint arXiv:2310.15428 (2023). Zhao, Siyan, John Dang, and Aditya Grover. "Group preference optimization: Few-shot alignment of large language models." arXiv preprint arXiv:2310.11523 (2023). Shaikh, Omar, Valentino Chai, Michele J. Gelfand, Diyi Yang, and Michael S. Bernstein. "Rehearsal: Simulating conflict to teach conflict resolution." arXiv preprint arXiv:2309.12309 (2023). Yang, Diyi, Caleb Ziems, William Held, Omar Shaikh, Michael S. Bernstein, and John Mitchell. "Social skill training with large language models." arXiv preprint arXiv:2404.04204 (2024).
4	Oct 17 Thursday	Evaluating Human-AI interaction [slides]	Readings: Mozannar, Hussein, Valerie Chen, Mohammed Alsobay, Subhro Das, Sebastian Zhao, Dennis Wei, Manish Nagireddy, Prasanna Sattigeri, Ameet Talwalkar, and David Sontag. "The RealHumanEval: Evaluating Large Language Models' Abilities to Support Programmers." arXiv preprint arXiv:2404.02806 (2024). Long, Tao, Katy Ilonka Gero, and Lydia B. Chilton. "Not Just Novelty: A Longitudinal Study on Utility and Customization of AI Workflows." arXiv preprint arXiv:2402.09894 (2024). Lee, Mina, Megha Srivastava, Amelia Hardy, John Thickstun, Esin Durmus, Ashwin Paranjape, Ines Gerard-Ursin et al. "Evaluating Human-Language Model Interaction." Transactions on Machine Learning Research.
5	Oct 22 Tuesday	📺 Guest Lecture: Human-AI Interaction in Education (Rose E. Wang) and Mental Health (Ryan Louie)
5	Oct 24 Thursday	Evaluating Human-AI Interaction++ [slides]	Readings: Naous, Tarek, Michael J. Ryan, Alan Ritter, and Wei Xu. "Having beer after prayer? measuring cultural bias in large language models." arXiv preprint arXiv:2305.14456 (2023). Bhutani, Mukul, Kevin Robinson, Vinodkumar Prabhakaran, Shachi Dave, and Sunipa Dev. "Seegull multilingual: a dataset of geo-culturally situated stereotypes." arXiv preprint arXiv:2403.05696 (2024). Birhane, Abeba, Pratyusha Kalluri, Dallas Card, William Agnew, Ravit Dotan, and Michelle Bao. "The values encoded in machine learning research." In 2022 ACM Conference on Fairness, Accountability, and Transparency, pp. 173-184. 2022. Hershcovich, Daniel, Stella Frank, Heather Lent, Miryam de Lhoneux, Mostafa Abdou, Stephanie Brandl, Emanuele Bugliarello et al. "Challenges and Strategies in Cross-Cultural NLP." ACL 2022. Kirk, Hannah Rose, Alexander Whitefield, Paul Röttger, Andrew Bean, Katerina Margatina, Juan Ciro, Rafael Mosquera et al. "The PRISM Alignment Project: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models." arXiv preprint arXiv:2404.16019 (2024).
6	Oct 29 Tuesday	Midway Project Showcase
6	Oct 31 Thursday	Guest Lecture: Megha Srivastava.
7	Nov 5 Tuesday	Democracy Day - no class
7	Nov 7 Thursday	Culture and Values in LLMs [slides]	Readings: Park, Joon Sung, Lindsay Popowski, Carrie Cai, Meredith Ringel Morris, Percy Liang, and Michael S. Bernstein. "Social Simulacra: Creating Populated Prototypes for Social Computing Systems." In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology, pp. 1-18. 2022. Aher, Gati V., Rosa I. Arriaga, and Adam Tauman Kalai. "Using large language models to simulate multiple humans and replicate human subject studies." In International Conference on Machine Learning, pp. 337-371. PMLR, 2023. Argyle, Lisa P., Christopher A. Bail, Ethan C. Busby, Joshua R. Gubler, Thomas Howe, Christopher Rytting, Taylor Sorensen, and David Wingate. "Leveraging AI for democratic discourse: Chat interventions can improve online political conversations at scale." Proceedings of the National Academy of Sciences 120, no. 41 (2023): e2311627120. Zhou, Xuhui, Hao Zhu, Leena Mathur, Ruohong Zhang, Haofei Yu, Zhengyang Qi, Louis-Philippe Morency et al. "SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents." In The Twelfth International Conference on Learning Representations (2024).
8	Nov 12 Tuesday	Risks, Trust and Safety [slides]	Readings: Jacovi, Alon, Ana Marasović, Tim Miller, and Yoav Goldberg. "Formalizing trust in artificial intelligence: Prerequisites, causes and goals of human trust in AI." In Proceedings of the 2021 ACM conference on fairness, accountability, and transparency, pp. 624-635. 2021. Cheng, Myra, Kristina Gligoric, Tiziano Piccardi, and Dan Jurafsky. "AnthroScore: A Computational Linguistic Measure of Anthropomorphism." arXiv preprint arXiv:2402.02056 (2024). Reeves, Byron, and Clifford Nass. "The media equation: How people treat computers, television, and new media like real people." Cambridge, UK 10, no. 10 (1996): 19-36. Don, Abbe, Susan Brennan, Brenda Laurel, and Ben Shneiderman. "Anthropomorphism: from ELIZA to Terminator 2." In Proceedings of the SIGCHI conference on Human factors in computing systems, pp. 67-70. 1992. Weidinger, Laura, Jonathan Uesato, Maribeth Rauh, Conor Griffin, Po-Sen Huang, John Mellor, Amelia Glaese et al. "Taxonomy of risks posed by language models." In Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency, pp. 214-229. 2022. "On AI Anthropomorphism." Ryan, Michael J., William Held, and Diyi Yang. "Unintended impacts of LLM alignment on global representation." arXiv preprint arXiv:2402.15018 (2024). Ruan, Yangjun, Honghua Dong, Andrew Wang, Silviu Pitis, Yongchao Zhou, Jimmy Ba, Yann Dubois, Chris J. Maddison, and Tatsunori Hashimoto. "Identifying the risks of lm agents with an lm-emulated sandbox." arXiv preprint arXiv:2309.15817 (2023).
8	Nov 14 Thursday	📺 Guest Lecture: Hao Zhu and Caleb Ziems.
9	Nov 19 Tuesday	Creativity and Productivity [slides]	Readings: Chakrabarty, Tuhin, Philippe Laban, Divyansh Agarwal, Smaranda Muresan, and Chien-Sheng Wu. "Art or artifice? large language models and the false promise of creativity." In Proceedings of the CHI Conference on Human Factors in Computing Systems, pp. 1-34. 2024. Eloundou, Tyna, Sam Manning, Pamela Mishkin, and Daniel Rock. "GPTs are GPTs: Labor market impact potential of LLMs." Science 384, no. 6702 (2024): 1306-1308. Brynjolfsson, Erik, Danielle Li, and Lindsey Raymond. "Generative AI at Work." NBER 2023. Anderson, Barrett R., Jash Hemant Shah, and Max Kreminski. "Homogenization effects of large language models on human creative ideation." In Proceedings of the 16th Conference on Creativity & Cognition, pp. 413-425. 2024. Liu, Yiren, Si Chen, Haocong Cheng, Mengxia Yu, Xiao Ran, Andrew Mo, Yiliu Tang, and Yun Huang. "How ai processing delays foster creativity: Exploring research question co-creation with an llm-based agent." In Proceedings of the CHI Conference on Human Factors in Computing Systems, pp. 1-25. 2024. Padmakumar, Vishakh, and He He. "Does Writing with Language Models Reduce Content Diversity?." In The Twelfth International Conference on Learning Representations.
9	Nov 21 Thursday	📺 Guest Lecture: Julia Kreutzer, "Aya and beyond - challenges in multilingual modeling and evaluation"
10	Nov 26 Tuesday	Thanksgiving Holiday - No Class
10	Nov 28 Thursday	Thanksgiving Holiday - No Class
11	Dec 3 Tuesday	Class-Level Report Discussion/Hangout [slides]
11	Dec 5 Thursday	Final Project Presentation in Class

Deadlines

Week	Deadline	Date	Time
2	Homework 1 Released	Thursday October 3
3	Project Proposal Due	Thursday October 10	11:59 PM PT
4	Homework 1 Due	Wednesday October 16	11:59 PM PT
6	Midway Project Showcase Due	Tuesday October 29	4:30 PM PT
7	Homework 2 Released	Friday November 8
8	Midway Report Due	Sunday November 10	11:59 PM PT
8	Homework 2 Due	Friday November 15	11:59 PM PT
9	Midway Report Reviews Due	Sunday November 17	11:59 PM PT
9	Class-Level Report Section Due	Tuesday November 19	11:59 PM PT
11	Final Project Presentation Due	Thursday December 5	4:30 PM PT
12	Final Project Report Due	Tuesday December 10	11:59 PM PT

CS 329X: Human-Centered LLMs

Stanford / Fall 2024
