Join us for the reading group on ML and cryptography organized by Shafi Goldwasser, Yael Kalai, Jonathan Shafer, Neekon Vafa and Vinod Vaikuntanathan. For questions, feel free to contact Jonathan or Neekon.
We initiate an investigation of learning tasks in a setting where the learner is given access to two
competing provers, only one of which is honest. Specifically, we consider the power of such learners
in assessing purported properties of opaque models. Following prior work in complexity theory that
considers the power of competing provers in various settings, we call this setting refereed learning.
After formulating a general definition of refereed learning tasks, we show refereed learning protocols
that obtain a level of accuracy that far exceeds what is obtainable at comparable cost without provers,
or even with a single prover. We concentrate on the task of choosing the better of two black-box
models with respect to some ground truth. While we consider a range of parameters, perhaps our most
notable result is in the high-precision range: For all $\epsilon > 0$ and ambient dimension $d$,
our learner makes only one query to the ground truth function, communicates only $(1 + 1/\epsilon^2) \cdot \mathsf{poly}(d)$
bits with the provers, and outputs a model whose loss is within a multiplicative factor of $(1 + \epsilon)$
of the best model's loss. Obtaining comparable loss with a single prover would require the learner to access the
ground truth at almost all of the points in the domain.
We also present lower bounds that demonstrate the optimality of our protocols in a number of respects,
including prover complexity, number of samples, and need for query access.
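The abstract does not spell out the protocol, but the basic leverage a second, competing prover gives a frugal learner can be sketched in a toy setting. Everything below (the models, the provers, the one-sided dispute resolution) is illustrative and not the paper's construction, which achieves a $(1+\epsilon)$ multiplicative loss guarantee with $\mathsf{poly}(d)$ communication:

```python
# Toy refereed comparison (illustrative only): two provers each advocate a
# model; when they disagree, the learner spends its single expensive
# ground-truth query on a witness point named by one prover.
DOMAIN = range(100)
f = lambda x: x % 7        # ground truth; queries to f are the scarce resource
model_a = lambda x: x % 7  # perfect model
model_b = lambda x: 0      # bad model, wrong on most inputs

def honest_prover():
    # Advocates model_a and names a witness where model_b disagrees with
    # the ground truth while model_a agrees.
    for x in DOMAIN:
        if model_b(x) != f(x) and model_a(x) == f(x):
            return "a", x
    return "a", None

def dishonest_prover():
    # Advocates model_b with a fabricated witness.  (A full protocol would
    # handle the symmetric case too; this sketch checks one side only.)
    return "b", 0

def refereed_choice():
    claim_1, witness_1 = honest_prover()
    claim_2, _ = dishonest_prover()
    if claim_1 == claim_2:      # provers agree: no ground-truth query needed
        return claim_1
    truth = f(witness_1)        # the single ground-truth query
    if model_a(witness_1) == truth and model_b(witness_1) != truth:
        return "a"              # witness checks out: prover 1 was honest
    return "b"

print(refereed_choice())  # -> a
```

The point of the toy: with only one prover, the learner has no way to know *where* to spend its lone query; a competing prover volunteers the incriminating point.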
Many approaches to AI safety rely on inspecting model outputs or activations, yet certain risks are inherently undetectable by inspection alone. We propose a complementary, architecture-agnostic approach that enhances safety through the aggregation of multiple generative models, with the aggregated model inheriting its safety from the safest subset of a given size among them.
Specifically, we present a consensus sampling algorithm that, given k models and a prompt, achieves risk competitive with the average risk of the safest s of the k models, where s is a chosen parameter, while abstaining when there is insufficient agreement among them. The approach leverages the models' ability to compute output probabilities, and we bound the probability of abstention when sufficiently many models are safe and exhibit adequate agreement. The algorithm is inspired by the provable copyright protection algorithm of Vyas et al. (ICML 2023).
Our algorithm requires some overlap among safe models, offers no protection when all models are unsafe, and may accumulate risk over repeated use. Nonetheless, our results provide a new, model-agnostic approach for AI safety by amplifying safety guarantees from an unknown subset of models within a collection to that of a single reliable model.
This is joint work with Adam Tauman Kalai and Or Zamir.
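One way to make the "output only where enough models agree" idea concrete is rejection sampling against a consensus density. The sketch below is an illustration under our own simplifications (discrete token distributions given as dicts, consensus density taken as the s-th largest model probability), not the algorithm from the talk:

```python
import random
random.seed(1)

def sth_largest(vals, s):
    return sorted(vals, reverse=True)[s - 1]

def consensus_sample(models, s, max_tries=100):
    """Rejection-sample from the (unnormalized) consensus density
    q(y) = s-th largest of {p_i(y)}: q puts mass only where at least
    s of the k models each assign probability >= q(y).
    models: list of dicts mapping token -> probability."""
    k = len(models)
    tokens = list(models[0])
    avg = {y: sum(m[y] for m in models) / k for y in tokens}  # proposal
    for _ in range(max_tries):
        y = random.choices(tokens, weights=[avg[t] for t in tokens])[0]
        q = sth_largest([m[y] for m in models], s)
        # q(y) <= k * avg(y), so the acceptance probability is <= 1
        if random.random() < q / (k * avg[y]):
            return y
    return None  # abstain: not enough agreement among the models

m_safe1 = {"a": 0.9, "b": 0.1, "c": 0.0}
m_safe2 = {"a": 0.8, "b": 0.2, "c": 0.0}
m_rogue = {"a": 0.0, "b": 0.0, "c": 1.0}
# With s = 2, token "c" has consensus density 0 and is never emitted,
# even though the rogue model puts all of its mass there.
print(consensus_sample([m_safe1, m_safe2, m_rogue], s=2))
```

The rogue model can still trigger abstention (by dragging the proposal away from the consensus), but it cannot force an unsafe output on its own.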
Fall 2025
Sequences of Logits and the Low Rank Structure of Language Models
Noah Golowich (Microsoft Research NYC)
November 18, 2025
A major problem in the study of large language models is to understand their inherent low-dimensional structure. We introduce an approach to study the low-dimensional structure of language models at a model-agnostic level: as sequential probabilistic models. We first empirically demonstrate that a wide range of modern language models exhibit low-rank structure: in particular, matrices built from the model’s logits for varying sets of prompts and responses have low approximate rank. Taking a theoretical perspective, we then show that any distribution over sequences with such structure of low approximate logit rank can be provably learned using polynomially many queries to the model's logits and polynomial time. Finally, we show that insights resulting from this perspective of low rank can be leveraged for generation: in particular, we can generate a response to a target prompt using a linear combination of the model’s outputs on unrelated, or even nonsensical, prompts. This new generation procedure may have applications in AI alignment, as it could, for instance, yield new approaches for constructing jailbreaks.
Based on joint work with Allen Liu and Abhishek Shetty.
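The notion of "low approximate rank" of a logit matrix can be illustrated on synthetic data: build a matrix whose entry (i, j) plays the role of a logit for response j given prompt i, with planted rank-r structure plus noise, and read the approximate rank off the singular values. The dimensions, rank, and threshold below are all arbitrary choices for illustration:

```python
import numpy as np
rng = np.random.default_rng(0)

n_prompts, n_responses, r = 200, 300, 8
# Synthetic stand-in for a logit matrix: exact rank r plus small noise.
U = rng.normal(size=(n_prompts, r))
V = rng.normal(size=(r, n_responses))
logits = U @ V + 0.01 * rng.normal(size=(n_prompts, n_responses))

# Approximate rank: number of singular values above a small fraction
# of the largest one.  The spectrum collapses after r directions.
sv = np.linalg.svd(logits, compute_uv=False)
approx_rank = int(np.sum(sv > 0.05 * sv[0]))
print(approx_rank)  # -> 8
```

The empirical claim in the talk is that matrices built from real model logits behave like this synthetic one: a few directions carry almost all of the spectrum.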
Large language models (LLMs) sometimes generate statements that are plausible but factually incorrect—a phenomenon commonly called “hallucination.” We argue that these errors are not mysterious failures of architecture or reasoning, but rather predictable consequences of standard training and evaluation incentives.
We show (i) that hallucinations can be viewed as classification errors: when pretrained models cannot reliably distinguish a false statement from a true one, they may produce the false option rather than saying “I don’t know”; (ii) that optimizing for benchmark performance encourages guessing rather than abstaining, since most evaluation metrics penalize expressing uncertainty; and (iii) that a possible mitigation path lies in revising existing benchmarks to reward calibrated abstention, thus realigning incentives in model development.
Joint work with Santosh Vempala (Georgia Tech) and Ofir Nachum & Edwin Zhang (OpenAI).
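The incentive argument in (ii) and (iii) is a two-line expected-value computation. Under accuracy-only grading, a guess with any positive confidence beats abstaining; adding a wrong-answer penalty flips the incentive for low-confidence guesses (the scoring rule below is a generic illustration, not a specific benchmark's):

```python
def expected_score(p_correct, wrong_penalty):
    # 1 point if right, -wrong_penalty if wrong; abstaining scores 0.
    return p_correct * 1 + (1 - p_correct) * (-wrong_penalty)

p = 0.3  # the model is only 30% confident in its answer
print(expected_score(p, wrong_penalty=0))  # -> 0.3: guessing beats abstaining
print(expected_score(p, wrong_penalty=1))  # -> -0.4: abstention is now optimal
```

With accuracy-only metrics (wrong_penalty = 0), guessing strictly dominates abstention at every confidence level, which is exactly the incentive (iii) proposes to remove.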
Statistically Undetectable Backdoors in Deep Neural Networks
Neekon Vafa (MIT)
October 21, 2025
In this talk, I will show how an adversarial model trainer can plant backdoors in a large class of deep, feedforward neural networks. These backdoors are statistically undetectable in the white-box setting, meaning that the backdoored and honestly trained models are close in total variation distance, even given the full descriptions of the models (e.g., all of the weights). The backdoor provides access to (invariance-based) adversarial examples for every input. However, without the backdoor, no one can generate any such adversarial examples, assuming the worst-case hardness of shortest vector problems on lattices. Our main technical tool relies on a cryptographic perspective on the ubiquitous Johnson-Lindenstrauss lemma.
This talk is based on upcoming work with Andrej Bogdanov and Alon Rosen.
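The Johnson-Lindenstrauss lemma mentioned at the end says that a random projection to far fewer dimensions approximately preserves pairwise distances. Here is a standard numerical demonstration with a scaled Gaussian matrix (the cryptographic twist in the talk, replacing the truly random matrix with a pseudorandom one, is not captured here):

```python
import numpy as np
rng = np.random.default_rng(0)

d, k, n = 10_000, 1_000, 20
X = rng.normal(size=(n, d))               # n points in high dimension d
A = rng.normal(size=(k, d)) / np.sqrt(k)  # JL map: scaled Gaussian matrix
Y = X @ A.T                               # project down to k dimensions

# All pairwise distances survive up to a small multiplicative error.
orig = np.linalg.norm(X[:, None] - X[None, :], axis=-1)
proj = np.linalg.norm(Y[:, None] - Y[None, :], axis=-1)
mask = ~np.eye(n, dtype=bool)
ratios = proj[mask] / orig[mask]
print(ratios.min(), ratios.max())  # both close to 1
```

The distortion shrinks as k grows; the point of the backdoor construction is that whether A was chosen honestly at random or adversarially can be made statistically undetectable.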
What Can Cryptography Tell Us About AI?
Greg Gluch (Simons Institute, UC Berkeley)
October 14, 2025
I will present three results that use cryptographic assumptions to characterize both the limits and possibilities of AI safety. First, we show that AI alignment cannot be achieved using only black-box filters of harmful content. Second, we prove a separation between mitigation and detection at inference time, where mitigation uses additional inference-time computation to refine an LLM’s output into a safer or more accurate result. Third, we conduct a meta-analysis of watermarks, adversarial defenses, and transferable attacks, showing that for every learning task, at least one of these three schemes must exist.
Each result carries a broader message: the first argues for the necessity of weight access in AI auditing; the second provides a rule of thumb for allocating inference-time resources when safety is the goal; and the third offers an explanation for why adversarial examples often transfer across different LLMs.
Bio. Greg Gluch is a postdoctoral researcher at the Simons Institute, UC Berkeley, working with Shafi Goldwasser, and currently a long-term visitor at MIT. He received his PhD from EPFL, advised by Rüdiger Urbanke and Michael Kapralov. His research spans AI safety—covering topics such as adversarial examples, alignment, and verifiability—and quantum interactive proofs and their links to physics.
Suppose an untrusted but powerful data analyst claims to have drawn many samples from an unknown distribution, run some complicated analysis over the samples, and arrived at a conclusion, which they reveal to us. Say that we are granted a few samples from the same distribution. Can we verify that the results of the analysis are approximately correct?
In this talk we review a recent line of work showing that the answer to this question is positive, and present constructions of proof systems that allow a probabilistic verifier to ascertain that the results of an analysis are approximately correct, while drawing fewer samples and using fewer computational resources than would be needed to replicate the analysis. We focus on distribution testing problems: verifying that an unknown distribution is close to having a claimed property.
Moving from data science to AI: Can similar tools and interactive proofs provide a theoretical framework for addressing questions of verification in the context of AI systems? We will open the floor for discussion and ideas.
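For a feel of what "distribution testing" means, here is the classic collision-based uniformity tester, one of the simplest examples in the area (this is the standalone tester, without a prover; the talk's proof systems let the verifier get away with even fewer samples):

```python
import random
from itertools import combinations
random.seed(0)

def collision_rate(samples):
    # Fraction of sample pairs that collide.  Uniform over n elements has
    # collision probability exactly 1/n; anything far from uniform
    # collides noticeably more often.
    m = len(samples)
    hits = sum(x == y for x, y in combinations(samples, 2))
    return hits / (m * (m - 1) / 2)

n, m = 50, 1000
uniform = [random.randrange(n) for _ in range(m)]
skewed = [random.randrange(n // 4) for _ in range(m)]  # only n/4 elements

print(collision_rate(uniform))  # ~ 1/50 = 0.02
print(collision_rate(skewed))   # ~ 1/12 ≈ 0.083: far from uniform
```

Thresholding the collision rate distinguishes the two cases with far fewer samples than would be needed to estimate the distribution itself.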
A Survey of Cryptographic Watermarks for AI-Generated Content
Generative AI watermarks are hidden patterns embedded in AI-generated content to facilitate its detection. A recent line of work draws on cryptography to define desired properties of watermarks and realize these properties with surprisingly simple constructions. In this talk, I will survey common definitions and approaches, focusing on those from the cryptography community. I'll end with some open questions that I find interesting.
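To fix intuition for "hidden patterns embedded in AI-generated content", here is a toy in the spirit of red/green-list watermarks: a keyed PRF splits the vocabulary into a pseudorandom "green" half at every step, generation favors green tokens, and detection counts the green fraction. This is a deliberately crude sketch (a uniform stand-in for the language model, hard rather than soft green bias); the cryptographic schemes surveyed in the talk achieve much stronger guarantees, such as leaving the output distribution unchanged:

```python
import hashlib
import hmac
import random
random.seed(0)

KEY = b"secret watermark key"
VOCAB = [f"tok{i}" for i in range(1000)]

def is_green(prev_token, token):
    # Keyed PRF of (prev_token, token): roughly half the vocabulary is
    # "green" at each step, and only key-holders can tell which half.
    tag = hmac.new(KEY, f"{prev_token}|{token}".encode(), hashlib.sha256)
    return tag.digest()[0] % 2 == 0

def generate(length):
    out = ["<s>"]
    for _ in range(length):
        # Stand-in for a language model: sample uniformly, but only from
        # the green half (a real scheme softly boosts green logits).
        greens = [t for t in VOCAB if is_green(out[-1], t)]
        out.append(random.choice(greens))
    return out[1:]

def green_fraction(tokens):
    prev, hits = "<s>", 0
    for t in tokens:
        hits += is_green(prev, t)
        prev = t
    return hits / len(tokens)

text = generate(100)
print(green_fraction(text))                          # -> 1.0: watermarked
print(green_fraction(random.choices(VOCAB, k=100)))  # roughly 0.5: no watermark
```

Detection needs only the key, not the model, and the statistical gap between ~1.0 and ~0.5 green fractions makes false positives exponentially unlikely in the text length.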