Presenter: Evan Welty Assmus (University of Michigan)
Collaborator(s): Qining Zhang, Lei Ying
Title: SP3O: Reinforcement Learning from Segment Preferences without Reward Modeling

Presenter: Nirmit Joshi (Toyota Technological Institute at Chicago)
Collaborator(s): Gene Li, Siddharth Bhandari, Shiva Kasiviswanathan, Cong Ma, Nathan Srebro
Title: Learning to Answer from Correct Demonstrations