research
* denotes shared first authorship
2025
- Rethinking Invariance in In-context Learning. In ICLR, 2025. We discovered an expressive invariant in-context learning scheme (InvICL) that achieves permutation invariance over in-context demonstrations while preserving the model's autoregressive nature and full context awareness.
- Tool Decoding: A Plug-and-Play Approach to Enhancing Language Models for Tool Usage. In ICLR, 2025. We proposed a simple training-free, plug-and-play constrained decoding scheme that significantly improves LLM performance at tool use (e.g., a 7B model rivals GPT-4o).
- Can In-context Learning Really Generalize to Out-of-distribution Tasks? In ICLR, 2025. With controlled experiments, we found that in-context learning works only on in-domain tasks and hardly generalizes to novel OOD tasks; in other words, LLMs' in-context abilities are essentially learned from training data containing similar tasks.
- Beyond Interpretability: The Gains of Feature Monosemanticity on Model Robustness. In ICLR, 2025. We found that the merits of feature monosemanticity (as studied in mechanistic interpretability) extend beyond interpretability to improved robustness under challenges such as noisy data and limited training examples.
- Projection Head is Secretly an Information Bottleneck. In ICLR, 2025. We showed that projection heads serve as an information bottleneck that prevents features from collapsing toward the pretraining task (e.g., instance classification).
2024
- A Theoretical Understanding of Self-Correction through In-context Alignment. In NeurIPS, 2024. 🏆 Best Paper Award at ICML 2024 ICL Workshop. We proposed the first theoretical explanation of how LLM self-correction works (as in OpenAI o1) and showed its effectiveness against social bias and jailbreak attacks.
- In-Context Symmetries: Self-Supervised Learning through Contextual World Models. In NeurIPS, 2024. Oral Presentation (top 4) at NeurIPS 2024 SSL Workshop; featured by MIT 📰. We introduced unsupervised test-time adaptation to self-supervised learning through a contextual world model designed for joint-embedding (JEPA) models.
- Reasoning in Reasoning: A Hierarchical Framework for Better and Faster Neural Theorem Proving. In NeurIPS 2024 Workshop on Mathematical Reasoning and AI, 2024.
- The Multi-faceted Monosemanticity in Multimodal Representations. In NeurIPS 2024 Workshop on Responsibly Building the Next Generation of Multimodal Foundational Models, 2024.
- Rethinking Invariance in In-context Learning. In ICML Workshop on Theoretical Foundations of Foundation Models (TF2M), 2024.
- Non-negative Contrastive Learning. In ICLR, 2024. Inspired by NMF, we introduced a simple one-line technique that attains 90% feature sparsity and 10x feature interpretability for self-supervised contrastive learning, with theoretical guarantees on its disentanglement and performance.
2023
- Jailbreak and Guard Aligned Language Models with Only Few In-context Demonstrations. arXiv preprint arXiv:2310.06387, 2023. Cited over 160 times. Featured and scaled up in Anthropic's blog 📰, where the in-context attack successfully jailbroke prominent LLMs including GPT and Claude.
- Equilibrium Image Denoising with Implicit Differentiation. IEEE Transactions on Image Processing (IEEE TIP), 2023.
- Unbiased Stochastic Proximal Solver for Graph Neural Networks with Equilibrium States. In ICLR, 2023.
- On the Connection between Invariant Learning and Adversarial Training for Out-of-Distribution Generalization. In AAAI, 2023. Oral Presentation.
2022
- Chaos is a Ladder: A New Theoretical Understanding of Contrastive Learning via Augmentation Overlap. In ICLR, 2022. Cited over 130 times. We derived tight generalization bounds for contrastive learning under a new, realistic theoretical framework, which also yields unsupervised evaluation metrics with 97% correlation to downstream performance.
- A Unified Contrastive Energy-based Model for Understanding the Generative Ability of Adversarial Training. In ICLR, 2022. 🏆 Silver Best Paper Award at ICML 2021 AdvML Workshop. From an energy-based perspective, we formulated contrastive learning as a generative model and established a connection between adversarial training and maximum likelihood, thus bridging generative and discriminative models.
2021
- Reparameterized Sampling for Generative Adversarial Networks. In ECML-PKDD, 2021. 🏆 Best ML Paper Award (1/685), invited to Machine Learning. We explored using the GAN discriminator (as a good reward model) to bootstrap sample quality through an efficient MCMC algorithm, which not only guarantees theoretical convergence but also improves sample efficiency and quality in practice.