AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents
Ke Yang, Yao Liu, Sapana Chaudhary, Rasool Fakoor, Pratik Chaudhari, George Karypis, Huzefa Rangwala.
arxiv preprint. [Paper]Risk-Averse Finetuning of Large Language Models
Sapana Chaudhary, Ujwal Dinesha, Dileep Kalathil, Srinivas Shakkottai.
To appear in Neural Information Processing Systems (NeurIPS, 2024).Pedagogical Alignment of Large Language Models
Shashank Sonkar*, Kangqi Ni*, Sapana Chaudhary, Richard G. Baraniuk.
To appear in Empirical Methods in Natural Language Processing (EMNLP, 2024).[Paper]Dynamic Regret Analysis of Safe Distributed Online Optimization for Convex and Non-convex Problems
Ting-Jui Chang, Sapana Chaudhary, Dileep Kalathil, Shahin Shahrampour.
Transactions of Machine Learning Research (TMLR, 2023).[Paper]On Safety and Adaptivity in Sequential Decision Making
Sapana Chaudhary.
International Joint Conference on Artificial Intelligence (IJCAI) Doctoral Consortium, 2023.[Paper]Enhanced Meta Reinforcement Learning via Demonstrations in Sparse Reward Environments
Desik Rengarajan*, Sapana Chaudhary*, Jaewon Kim, Dileep Kalathil, Srinivas Shakkottai.
Neural Information Processing Systems (NeurIPS, 2022). (*-Equal contribution) [Paper]Safe Online Convex Optimization with Unknown Linear Safety Constraints
Sapana Chaudhary, Dileep Kalathil.
Conference on Artificial Intelligence (AAAI, 2022). [Paper][Poster]Smooth Imitation Learning via Smooth Costs and Smooth Policies
Sapana Chaudhary, Balaraman Ravindran.
Fifth Joint International Conference on Data Science and Management of Data (CoDS-COMAD, 2022). Research Track. ACM DL. [Paper][Slides][Talk]SILC: Smoother Imitation with Lipschitz Costs
Sapana Chaudhary*, Akshat Dave*, Balaraman Ravindran.
Workshop on Goal Specification in Reinforcement Learning, ICML, 2018. (*-Equal contribution)[Paper]
MS Thesis
On Learning Smooth Policies in Imitation Learning.
Sapana Chaudhary. Indian Institute of Technology Madras, Chennai, India. 2018. [Thesis]