Boshi Wang (汪博石)

PhD student at The Ohio State University

-------------------------------------

I'm passionate about building intelligent systems that can help us better understand the world. Currently I'm focused on understanding & improving the reasoning capabilities of (large) language models.

In my spare time, I like reading & thinking about entropy and consciousness. My favorite book: Gödel, Escher, Bach: an Eternal Golden Braid by Douglas Hofstadter.

About Me

Hi! I'm a 4th-year PhD at The Ohio State University, advised by Prof. Huan Sun. Previously, I received a Bachelor degree in Computer Science at ShanghaiTech University.

News

04/2025 | Check out our new work connecting Reversal Curse and the Binding problem.

10/2024 | Our paper studying grokking in Transformer's implicit reasoning is accepted to NeurIPS-24. See you in Vancouver!

05/2024 | Tool learning through simulated trial and error accepted to ACL-24.

07/2023 | Our paper on understanding chain-of-thought prompting got an Honorable Mention at ACL-23!

05/2023 | Started my internship at Semantic Machines.

05/2023 | Check out our paper on evaluating LLM's reasoning via debate.

05/2023 | Understanding Chain-of-Thought prompting accepted to ACL-23. See you in Toronto!

10/2022 | Paper on enhancing multi-step reasoning abilities of pretrained language models accepted to EMNLP-22.

08/2021 | Began my PhD @ OSU! Go Buckeyes!

Selected Publications

Is the Reversal Curse a Binding Problem? Uncovering Limitations of Transformers from a Basic Generalization Failure

B. Wang, H. Sun

arXiv preprint, 2025.

Paper Code Post

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

B. Wang, X. Yue, Y. Su, H. Sun

NeurIPS-24. Short version in the 1st Mechanistic Interpretability Workshop, ICML-24

Paper Code Post

LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error

B. Wang, H. Fang, J. Eisner, B. Van Durme, Y. Su

ACL-24, The 62nd Annual Meeting of the Association for Computational Linguistics

Paper Code Post

Can ChatGPT Defend its Belief in Truth? Evaluting LLM Reasoning via Debate

B. Wang, X. Yue, H. Sun

EMNLP-23 (findings), The 2023 Conference on Empirical Methods in Natural Language Processing

Paper Code Post

Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters

B. Wang, S. Min, X. Deng, J. Shen, W. You, L. Zettlemoyer, H. Sun

ACL-23 (Honorable Mention), The 61st Annual Meeting of the Association for Computational Linguistics

Paper Code Post

Mind2Web: Towards a Generalist Agent for the Web

X. Deng, Y. Gu, B. Zheng, S. Chen, S. Stevens, B. Wang, H. Sun, Y. Su

NeurIPS-23 Dataset Track (Spotlight), Thirty-seventh Conference on Neural Information Processing Systems

Iteratively Prompt Pre-trained Language Models for Chain of Thought

B. Wang, X. Deng, H. Sun

EMNLP-22 (Oral), The 2022 Conference on Empirical Methods in Natural Language Processing

Homomorphic Sensing: Sparsity and Noise

L. Peng, B. Wang and M. C. Tsakiris

ICML-21, Proceedings of the 38th International Conference on Machine Learning

Education

08/2021 - Present | PhD, Ohio State University, USA
06/2019 - 09/2019 | Robotics Institute Summer Scholars, Carnegie Mellon University, USA
09/2016 - 07/2020 | B.S.E in Computer Science, ShanghaiTech University, CHN

Work/Intern Experience

06/2023 - 08/2023

Semantic Machines

Research Intern

01/2021 - 06/2021

Microsoft Research Asia

Research Intern

07/2020 - 12/2020

Institue of Mathematical Sciences, ShanghaiTech University

Research Assistant

Service

Program Committee/Reviewer: NeurIPS (Top Reviewers 2024), ICLR, ACL-ARR, AAAI