Deception in Reinforced Autonomous Agents: The Unconventional Rabbit Hat Trick in Legislation

Authors
Atharvan Dogra, Krishna Pillutla, Ameet Deshpande, Ananya B Sai, John Nay, Tanmay Rajpurohit, Ashwin Kalyan, Balaraman Ravindran
Published In
ACL 2025

We study how LLMs can engage in subtle deception through strategic phrasing in legislative settings, and show that re-planning and re-sampling can increase deception rates while preserving intent.