Jack (Hao) Bai

haob2 AT illinois DOT edu


Hi there! I’m Jack, a second-year thesis-based master’s student in UIUC CS, advised by Prof. Nan Jiang. I’m also a visiting scholar at BAIR, advised by Sergey Levine.

My recent research focuses on building intelligent agents with foundation models and reinforcement learning (RL). The methodology usually integrates principled representation learning with RL algorithms.

I received my dual undergraduate degrees from UIUC and Zhejiang University. During those wonderful years, I was happy to work with Yi Ma @ BAIR, Chengxiang Zhai @ UIUC, and Shilin He @ Microsoft Research.

A public up-to-date resume can be found here.

News

Sep 28, 2024 Two works I’m proud of, DigiRL (first author) and RL4VLM (second author), have been accepted to NeurIPS 2024! 🎉🎉🎉
Jun 20, 2024 My first first-authored paper, DigiRL, is released and was selected as an Oral at the Foundation Models in the Wild Workshop @ ICML’2024. I’m grateful for the guidance of Prof. Aviral Kumar and Prof. Sergey Levine on this work.
May 15, 2024 My first RL paper, RL4VLM, is released. I’m grateful for the guidance of Prof. Sergey Levine, and extremely honored to also collaborate with Saining Xie and Yann LeCun.

Selected Publications

  1. Preprint Oral @ ICML WS
    DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning
    Hao Bai, Yifei Zhou, Jiayi Pan, Mert Cemri, Alane Suhr, Sergey Levine, and Aviral Kumar
    Jun 2024
  2. Preprint
    Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
    Yuexiang Zhai, Hao Bai, Zipeng Lin, Jiayi Pan, Shengbang Tong, Yifei Zhou, Alane Suhr, Saining Xie, Yann LeCun, Yi Ma, and Sergey Levine
    May 2024
  3. JMLR
    White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?
    Yaodong Yu, Sam Buchanan, Druv Pai, Tianzhe Chu, Ziyang Wu, Shengbang Tong, Hao Bai, Yuexiang Zhai, Benjamin D Haeffele, and Yi Ma
    Apr 2024
  4. WSDM’24
    CharmBana: Progressive Responses with Real-Time Internet Search for Knowledge-Powered Conversations
    Revanth Gangi Reddy, Sharath Chandra, Hao Bai, Wentao Yao, ..., and ChengXiang Zhai
    Feb 2024
  5. EMNLP’23
    Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations
    Revanth Reddy, Hao Bai, Wentao Yao, Sharath Chandra Etagi Suresh, Heng Ji, and ChengXiang Zhai
    Oct 2023