Tree and Rock AI Lab

News

Older posts…

Featured Research

Safety Neuron

We propose a neuron detection method to identify and tune safety neurons—less than 1% of model parameters—using SN-Tune, which enhances safety without affecting general model performance.

MixEval-X

The first any-to-any evaluations from real-world data mixtures.

Noisy Student

The first method that uses extra unlabeled data to achieve state-of-the-art on ImageNet. Also employed in AlphaFold 2 (Nobel Prize 2024), Google Search and other state-of-the-art AI systems.

RACE Dataset

The first large-scale language understanding dataset collected from exams for human. Currently MMLU and MATH are also collected from exams.

People

Faculty

Michael Qizhe Shieh
Assistant Professor
Tomas Mikolov
Visiting Faculty
Jinjie Ni
Research Fellow

PhD Students

Hannah Brown
Co-advised with Kenji Kawaguchi
Esther Gan E
Esther Gan
Co-advised with Min-Yen Kan
Yiran Zhao
Affiliated Member

Direction

What’s a human experience? Michael had this question one day. The initial answers were all around achievements. But a complementary perspective later arose as “experiencing rock music played in my car, trees outside and everything as it is”.

We work on democratizing effective and efficient AI models and systems in the current paradigm. We are also interested in directions which might fundamentally change how AIs experience and relate to the world.

So we named our lab Tree and Rock AI Lab (TRAIL). Enjoy the trail with us, as a reader, as a member or as a supporter!

Publications

Download BibTeX.

*: equal contribution, †: equal advising

2025
May
PDF Unnatural Languages Are Not Bugs but Features for LLMs
Keyu Duan*, Yiran Zhao*, Zhili Feng, Jinjie Ni, Tianyu Pang, Qian Liu, Tianle Cai, Longxu Dou, Kenji Kawaguchi, Anirudh Goyal, J. Zico Kolter, and Michael Qizhe Shieh
ICML 2025
2025
May
PDF Long-Context Inference with Retrieval-Augmented Speculative Decoding
Guanzheng Chen*, Qilong Feng*, Jinjie Ni, Xin Li, and Michael Qizhe Shieh
ICML 2025 (Spotlight)
2025
January
PDF CODEXGRAPH: Bridging Large Language Models and Code Repositories via Code Graph Databases
Xiangyan Liu*, Bo Lan*, Zhiyuan Hu, Yang Liu, Zhicheng Zhang, Fei Wang, Michael Qizhe Shieh, and Wenmeng Zhou
NAACL 2025
2025
January
PDF MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures
Jinjie Ni, Yiran Song, Deepanway Ghosal, Bo Li, David Junhao Zhang, Xiang Yue, Fuzhao Xue, Zian Zheng, Kaichen Zhang, Mahir Shah, Kabir Jain, Yang You, and Michael Qizhe Shieh
ICLR 2025 (Spotlight)
2025
January
PDF Identifying and Tuning Safety Neurons in Large Language Models
Yiran Zhao, Wenxuan Zhang, Yuxi Xie, Anirudh Goyal, Kenji Kawaguchi, and Michael Qizhe Shieh
ICLR 2025
2025
January
PDF LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
Guanzheng Chen, Xin Li, Michael Qizhe Shieh, and Lidong Bing
ICLR 2025
2024
December
PDF Single Character Perturbations Break LLM Alignment
Leon Lin*, Hannah Brown*, Kenji Kawaguchi, and Michael Shieh
AAAI 2025
2024
September
PDF Accelerating greedy coordinate gradient via probe sampling
Yiran Zhao, Wenyue Zheng, Tianle Cai, Xuan Long Do, Kenji Kawaguchi, Anirudh Goyal, and Michael Qizhe Shieh
NeurIPS 2024
2024
September
PDF Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning
Yuxi Xie, Anirudh Goyal, Wenyue Zheng, Min-Yen Kan, Timothy P Lillicrap, Kenji Kawaguchi, and Michael Shieh
NeurIPS 2024 Workshop
2024
September
PDF Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models
Hongfu Liu*, Yuxi Xie*, Ye Wang, and Michael Shieh
EMNLP 2024
2024
September
PDF Reasoning Robustness of LLMs to Adversarial Typographical Errors
Esther Gan*, Yiran Zhao*, Liying Cheng, Yancan Mao, Anirudh Goyal, Kenji Kawaguchi, Min-Yen Kan, and Michael Shieh
EMNLP 2024
2024
May
PDF Instructcoder: Instruction tuning large language models for code editing
Kaixin Li*, Qisheng Hu*, James Zhao, Hui Chen, Yuxi Xie, Tiedong Liu, Michael Shieh†, and Junxian He†
ACL 2024 Workshop
2024
May
PDF Prompt optimization via adversarial in-context learning
Xuan Long Do*, Yiran Zhao*, Hannah Brown*, Yuxi Xie, James Xu Zhao, Nancy F. Chen, Kenji Kawaguchi, Michael Shieh†, and Junxian He†
ACL 2024 (Oral)
2023
October
PDF Self-evaluation guided beam search for reasoning
Yuxi Xie, Kenji Kawaguchi, Yiran Zhao, James Xu Zhao, Min-Yen Kan†, Junxian He†, and Michael Xie†
NeurIPS 2023
2023
October
PDF Automatic model selection with large language models for reasoning
James Xu Zhao, Yuxi Xie, Kenji Kawaguchi, Junxian He, and Michael Qizhe Xie
EMNLP 2023 Findings
2023
May
PDF Multi-Source Test-Time Adaptation as Dueling Bandits for Extractive Question Answering
Hai Ye, Qizhe Xie, and Hwee Tou Ng
ACL 2023
2021
February
PDF Meta pseudo labels
Hieu Pham, Zihang Dai, Qizhe Xie, and Quoc V. Le
CVPR 2021
2020
October
PDF Unsupervised data augmentation for consistency training
Qizhe Xie, Zihang Dai, Eduard Hovy, Thang Luong, and Quoc V. Le
NeurIPS 2020
2020
February
PDF Self-training with noisy student improves imagenet classification
Qizhe Xie, Minh-Thang Luong, Eduard Hovy, and Quoc V. Le
CVPR 2020
2017
November
PDF RACE: Large-scale ReAding Comprehension Dataset From Examinations
Guokun Lai*, Qizhe Xie*, Hanxiao Liu, Yiming Yang, and Eduard Hovy
EMNLP 2017