Research | Linjun Zhang

Papers

Check out the latest update at Google Scholar

Technical Reports

(* indicates alphabetical ordering authorship; ** indicates equal contribution)

Sai Li and Linjun Zhang. (2024)
FAIRM: Learning Invariant Representations for Algorithmic Fairness and Domain Generalization with Minimax Optimality.
submitted
Ryumei Nakada, Yichen Xu, Lexin Li, Linjun Zhang. (2024)
Synthetic Oversampling: Theory and A Practical Approach Using LLMs to Address Data Imbalance.
submitted
* Tianxi Cai, Feiqing Huang, Ryumei Nakada, Linjun Zhang, and Doudou Zhou (2024)
Contrastive Learning on Multimodal Analysis of Electronic Health Records.
submitted
Huiying Zhong, Zhun Deng, Weijie J Su, Zhiwei Steven Wu, and Linjun Zhang (2024)
Provable Multi-Party Reinforcement Learning with Diverse Human Feedback.
submitted
Reid McIlroy-Young, Katrina Brown, Conlan Olson, Linjun Zhang, and Cynthia Dwork (2024)
Set-Based Prompting: Provably Solving the Language Model Order Dependency Problem.
submitted
Sai Li and Linjun Zhang. (2023)
Multi-dimensional domain generalization with low-rank structures. .
submitted
* T. Tony Cai, Yichen Wang and Linjun Zhang. (2023)
Score Attack: A Lower Bound Technique for Optimal Differentially Private Learning.
submitted
Peng Wang, Min-Ge Xie and Linjun Zhang. (2022)
Finite-and Large-Sample Inference for Model and Coefficients in High-dimensional Linear Regression with Repro Samples .
submitted
Zhe Zhang and Linjun Zhang. (2021)
High-Dimensional Differentially-Private EM Algorithm: Methods and Near-Optimal Statistical Guarantees .
submitted.
* Maya Burhanpurkar, Zhun Deng, Cynthia Dwork and Linjun Zhang. (2021)
Scaffolding Sets.
submitted.
* T. Tony Cai, Yichen Wang and Linjun Zhang. (2020)
The Cost of Privacy in Generalized Linear Models: Algorithms and Minimax Lower Bounds.
submitted.
Xianli Zeng, Yingcun Xia, and Linjun Zhang. (2019)
Double Cross Validation for The Number of Factors in Orthogonal Factor Models .
submitted
Linjun Zhang, Rong Ma, T. Tony Cai, and Hongzhe Li. (2020)
Estimation, Confidence Intervals, and Large-Scale Hypotheses Testing for High-Dimensional Mixed Linear Regression. .
submitted

Publications

(* indicates alphabetical ordering authorship; ** indicates equal contribution)

Algorithmic Fairness

* Lujing Zhang, Aaron Roth, and Linjun Zhang. (2023)
Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks. .
ITCS 2023.
* Zhun Deng, Cynthia Dwork and Linjun Zhang. (2023)
HappyMap: A Generalized Multicalibration Method .
ITCS 2023.
Shirley Wu, Mert Yuksekgonul, Linjun Zhang and James Zou. (2023)
Discover and Cure: Concept-aware Mitigation of Spurious Correlation .
ICML 2023.
Puheng Li, James Zou, and Linjun Zhang. (2023)
FaiREE: Fair Classification with Finite-Sample and Distribution-Free Guarantee .
ICLR 2023
Zhun Deng, Jiayao Zhang, Linjun Zhang, Ting Ye, Yates Coley, Weijie J. Su, and James Zou. (2023)
FIFA: Making Fairness More Generalizable in Classifiers Trained on Imbalanced Data .
ICLR 2023.
Zhun Deng, He Sun, Zhiwei Steven Wu, Linjun Zhang and David C. Parkes. (2023)
Reinforcement Learning with Stepwise Fairness Constraints .
AISTATS 2023.
Haotian Ye, $James Zou, and $Linjun Zhang. (2023)
Freeze then Train: Towards Provable Representation Learning under Spurious Correlations and Feature Noise .
AISTATS 2023
Huaxiu Yao, Yu Wang, Sai Li, Linjun Zhang, Weixin Liang, James Zou and Chelsea Finn. (2022)
Improving Out-of-Distribution Robustness via Selective Augmentation .
ICML 2022.

Statistical Perspective of Large Language Models

Xinming Tu, James Zou, Weijie Su and Linjun Zhang. (2024)
What Should Data Science Education Do with Large Language Models?.
Harvard Data Science Review
* Lujing Zhang, Aaron Roth, and Linjun Zhang. (2023)
Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks. .
ITCS 2023.
Yiyang Zhou, Chenhang Cui, Jaehong Yoon, Linjun Zhang, Zhun Deng, Chelsea Finn, Mohit Bansal, Huaxiu Yao. (2024)
Analyzing and mitigating object hallucination in large vision-language models. .
ICLR 2024

Private Data Analysis

* T. Tony Cai, Yichen Wang and Linjun Zhang. (2023)
Score Attack: A Lower Bound Technique for Optimal Differentially Private Learning.
* Jinshuo Dong, Weijie Su, and Linjun Zhang. (2021)
A Central Limit Theorem for Differentially Private Query Answering.
NeurIPS 2021, and selected as spotlight (top 3% of submissions)
* T. Tony Cai, Yichen Wang and Linjun Zhang. (2021)
The Cost of Privacy: Optimal Rates of Convergence for Parameter Estimation with Differential Privacy.
Annals of Statistics .
Zhe Zhang and Linjun Zhang. (2020)
Privacy-Preserving Algorithms: the Gain and the Loss .
CHANCE 33 (4), 22-28 .

Deep Learning (with focus on representation learning)

Jianguo Huang, HuaJun Xi, Linjun Zhang, Huaxiu Yao, Yue Qiu, Hongxin Wei. (2024)
Conformal Prediction for Deep Classifier via Label Ranking.
ICML 2024
Wenlong Ji, Zhun Deng, Ryumei Nakada, James Zou and Linjun Zhang. (2023)
The Power of Contrast for Feature Learning: A Theoretical Analysis.
Journal of Machine Learning Research.
Mert Yuksekgonul, Linjun Zhang, James Zou, and Carlos Guestrin. (2023)
Beyond Confidence: Reliable Models Should Also Consider Atypicality.
ICML 2023
Ryumei Nakada, Halil Ibrahim Gulluk, Zhun Deng, Wenlong Ji, James Zou and Linjun Zhang. (2023)
Understanding Multimodal Contrastive Learning and Incorporating Unpaired Data .
AISTATS 2023.
Huaxiu Yao, Linjun Zhang and Chelsea Finn. (2022)
Meta-Learning with Fewer Tasks through Task Interpolation .
ICLR 2022, selected as the oral presentation (top 1.5% of submissions).
Huaxiu Yao, Yiping Wang, Linjun Zhang, James Zou, and Chelsea Finn. (2022)
C-Mixup: Improving Generalization in Regression .
NeurIPS 2022.
Huaxiu Yao, Yu Wang, Sai Li, Linjun Zhang, Weixin Liang, James Zou and Chelsea Finn. (2022)
Improving Out-of-Distribution Robustness via Selective Augmentation .
ICML 2022.
** Linjun Zhang, ** Zhun Deng, Kenji Kawaguchi and James Zou. (2022)
When and How Mixup Improves Calibration .
ICML 2022.
Kenji Kawaguchi, Linjun Zhang and Zhun Deng. (2022)
Understanding Dynamics of Nonlinear Representation Learning and Its Application .
Neural Computation.
** Zhun Deng, ** Linjun Zhang, Kailas Vodrahalli, Kenji Kawaguchi, and James Zou. (2021)
Adversarial Training Helps Transfer Learning via Better Representations .
NeurIPS 2021.
Huaxiu Yao, Longkai Huang, Linjun Zhang, Ying Wei, Li Tian, James Zou, Junzhou Huang and Zhenhui Li. (2021)
Improving Generalization in Meta-learning via Task Augmentation .
ICML 2021.
**Linjun Zhang, **Zhun Deng, **Kenji Kawaguchi, Amirata Ghorbani, and James Zou. (2020)
How Does Mixup Help With Robustness and Generalization? [Slides]
ICLR 2021, and selected as spotlight (top 5% of submissions)
** Zhun Deng, ** Linjun Zhang, Amirata Ghorbani, and James Zou. (2020)
Improving Adversarial Robustness via Unlabeled Out-of-Domain Data .
AISTATS 2021, and selected as the oral presentation (top 3% of submissions)
* Zhun Deng, Cynthia Dwork, Jialiang Wang, and Linjun Zhang. (2020)
Interpreting Robust Optimization via Adversarial Influence Functions .
ICML 2020

High-dimensional Statistics

Sai Li, Linjun Zhang, T. Tony Cai, and Hongzhe Li. (2024)
Estimation and Inference in High-Dimensional Generalized Linear Models with Knowledge Transfer.
Journal of the American Statistical Association.
Ruijia Wu, Linjun Zhang, and T. Tony Cai. (2021)
Sparse Topic Modeling: Computational Efficiency, Near-Optimal Algorithms, and Statistical Inference. .
Journal of the American Statistical Association
* T. Tony Cai, and Linjun Zhang. (2020)
A Convex Optimization Approach to High-dimensional Sparse Quadratic Discriminant Analysis.
Annals of Statistics
* T. Tony Cai, and Linjun Zhang. (2019)
High-dimensional Linear Discriminant Analysis: Optimality, Adaptivity, and Missing Data.
Journal of Royal Statistical Society, B
* T. Tony Cai, Jing Ma and Linjun Zhang. (2018)
CHIME: Clustering of High-Dimensional Gaussian Mixtures with EM Algorithm and Its Optimality.
Annals of Statistics
* T. Tony Cai, Linjun Zhang and Harrison H. Zhou. (2017)
Adaptive Functional Linear Regression.
Statistica Sinica
* T. Tony Cai, and Linjun Zhang. (2017)
High-Dimensional Gaussian Copula Regression: Adaptive Estimation and Statistical Inference.
Statistica Sinica, 2018, Vol. 28, 963-993.
* T. Tony Cai, and Linjun Zhang. (2016)
Discussion: Important feature PCA for high dimensional clustering.
Annals of Statistics, 2016, Vol. 44, No. 6, 2372-2381.

Network Analysis

Linjun Zhang, Michael Small, and Kevin Judd. (2015)
Exactly scale-free scale-free networks.
Physica A, 433: 182-197.
Michael Small, Lvlin Hou, and Linjun Zhang. (2014)
Random complex networks.
The IEEE International Symposium on Circuits and Systems (ISCAS) 2014 invited paper.
Michael Small, Kevin Judd, L. Zhang. (2014)
How is that complex network complex?
National Science Review , 1(3): 357-367.