Hi, I’m Michael Ryan and I’m an incoming PhD student studying Artificial Intelligence at Stanford University. I’m fortunate to be doing NLP research as a member of Dr. Diyi Yang’s SALT Lab! I’m also a core contributor to StanfordNLP/DSPy – the library for programming, not prompting, LLMs. I’m on the DSPy optimizer team and am the co-creator of the MIPROv2 optimizer.
My research interest is human-centered NLP, which I pursue through two directions: (1) LLM personalization for diverse cultures, languages, and individuals [1] [2] [3], and (2) leveraging humans for system design and feedback to build better AI systems [4]. Previously, I was an undergraduate researcher in Dr. Wei Xu’s NLP X Lab at Georgia Tech and a research intern at Snowflake.
Have a look at my CV, or if you’re in a hurry, check out my resume!
PhD in Computer Science, 202X
Stanford University
MS in Computer Science, 2025
Stanford University
BSc in Computer Science (Intelligence & Systems/Architecture), 2023
Georgia Institute of Technology
Recent calls for pluralistic alignment of Large Language Models (LLMs) encourage adapting models to diverse user preferences. However, most prior work on personalized reward models relies heavily on additional identity information, such as demographic details or a predefined set of preference categories. To address this, we introduce SynthesizeMe, an approach for inducing synthetic user personas from user interactions for personalized reward modeling. SynthesizeMe first generates and verifies reasoning to explain user preferences, then induces synthetic user personas from that reasoning, and finally filters to informative prior user interactions in order to build personalized prompts for a particular user. We show that using SynthesizeMe-induced prompts improves personalized LLM-as-a-judge accuracy by 4.4% on Chatbot Arena. Combining SynthesizeMe-derived prompts with a reward model achieves top performance on PersonalRewardBench, a new curation of user-stratified interactions with chatbots collected from 854 users of Chatbot Arena and PRISM.
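To give a flavor of the three-stage pipeline described above, here is a rough, hypothetical Python sketch. None of the function names, prompt templates, or the `llm` callable below come from the actual SynthesizeMe implementation; they are illustrative stand-ins for the generate-and-verify, persona-induction, and prompt-building steps.

```python
from typing import Callable, List, Tuple

# Hypothetical sketch of a SynthesizeMe-style pipeline. `llm` is any callable
# mapping a prompt string to a completion string; prompts and names are
# illustrative only.

Interaction = Tuple[str, str, str]  # (prompt, chosen_response, rejected_response)


def generate_verified_reasoning(llm: Callable[[str], str],
                                interactions: List[Interaction]) -> List[str]:
    """Step 1: draft reasoning that explains each observed preference, keeping
    only reasoning under which a judge recovers the user's actual choice."""
    reasonings = []
    for prompt, chosen, rejected in interactions:
        reasoning = llm(
            f"A user preferred response A over response B.\n"
            f"Prompt: {prompt}\nA: {chosen}\nB: {rejected}\n"
            f"Explain what this suggests about the user's preferences."
        )
        verdict = llm(
            f"Given this description of a user's preferences:\n{reasoning}\n"
            f"Which response would they prefer?\nA: {chosen}\nB: {rejected}\n"
            f"Answer A or B."
        )
        if verdict.strip().upper().startswith("A"):
            reasonings.append(reasoning)
    return reasonings


def induce_persona(llm: Callable[[str], str], reasonings: List[str]) -> str:
    """Step 2: distill the verified reasoning into a synthetic user persona."""
    joined = "\n".join(reasonings)
    return llm(f"Summarize this user's preferences as a short persona:\n{joined}")


def build_personalized_prompt(persona: str,
                              informative: List[Interaction]) -> str:
    """Step 3: combine the persona with filtered, informative prior
    interactions to form a personalized LLM-as-a-judge prompt."""
    examples = "\n".join(
        f"Prompt: {p}\nPreferred: {c}\nNot preferred: {r}"
        for p, c, r in informative
    )
    return (f"You are judging responses for the following user.\n"
            f"Persona: {persona}\nPast preferences:\n{examples}")
```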
We present MIPROv2, a language model program optimizer that improves both the prompts and the few-shot demonstrations of multi-stage language model programs. Our strategies include (i) program- and data-aware techniques for proposing effective instructions, (ii) a stochastic mini-batch evaluation function for learning a surrogate model of our objective, and (iii) a meta-optimization procedure in which we refine how LMs construct proposals over time. MIPROv2 outperforms baseline optimizers on five of seven diverse multi-stage LM programs using a best-in-class open-source model (Llama-3-8B), by up to 13% accuracy.
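Since MIPROv2 ships as an optimizer in DSPy, here is a minimal sketch of how one might invoke it on a toy two-stage program. The program, metric, tiny trainset, and model name are placeholders of my choosing, and exact constructor/compile arguments can differ across DSPy versions.

```python
import dspy
from dspy.teleprompt import MIPROv2

# Configure the underlying LM (model name is illustrative).
dspy.configure(lm=dspy.LM("openai/gpt-4o-mini"))


class AnswerQuestion(dspy.Module):
    """A toy two-stage program: generate a search query, then answer."""

    def __init__(self):
        super().__init__()
        self.generate_query = dspy.ChainOfThought("question -> search_query")
        self.generate_answer = dspy.ChainOfThought("question, search_query -> answer")

    def forward(self, question):
        query = self.generate_query(question=question).search_query
        return self.generate_answer(question=question, search_query=query)


def my_metric(example, prediction, trace=None):
    # Placeholder metric: exact match against the gold answer.
    return example.answer.lower() == prediction.answer.lower()


# A tiny illustrative trainset; a real run would use many more examples.
trainset = [
    dspy.Example(question="What is the capital of France?", answer="Paris").with_inputs("question"),
    dspy.Example(question="Who wrote Hamlet?", answer="William Shakespeare").with_inputs("question"),
]

# MIPROv2 jointly optimizes the instructions and few-shot demos of both stages.
optimizer = MIPROv2(metric=my_metric, auto="light")
optimized_program = optimizer.compile(AnswerQuestion(), trainset=trainset)
```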
We explore how alignment impacts performance along three axes of global representation: English dialects, multilingualism, and opinions from and about countries worldwide. Our results show that current alignment procedures create disparities across English dialects and global opinions. We find that alignment improves capabilities in several languages. We conclude by discussing the design decisions that led to these unintended impacts and recommendations for more equitable preference tuning.