List of accepted papers
</table>
Session 1
9:15 am Opening Remarks (5 min)
9:20 am Invited talk - Been Kim Towards Interpretability for Humanity.
10:00 am Invited talk - Zico Kolter LLM Robustness: Recent progress and the challenges ahead.
10:40 am Contributing Talk 1: Failures to Find Transferable Image Jailbreaks Between Vision-Language Models</td> </tr>
10:50 am Contributing Talk 2: Towards Safe Multilingual Frontier AI
11:00 am Poster session & Lunch
12:00 pm Lunch break
Session 2
1:00 pm Invited talk 3 - Rida Qadri AI's Cultural Futures: Designing for a Culturally Rich World
1:40 pm Invited talk 4 - Peter Henderson Aligning Machine Learning and Law for Responsible Real-World Deployments
2:20 pm Invited talk 5 - Hannah Rose Kirk A Tale of Two RCTs: ​ Building a rigorous evidence base on the societal impacts of frontier AI inside the UK Government.
3:00 pm Break (20 min)
3:20 pm Panel: Panelists: Yoshua Bengio, Margaret Mitchell, Jeff Clune, Moderator: Jakob Foerster
4:20 pm Contributing Talk 3: Report Cards: Qualitative Evaluation of LLMs Using Natural Language Summaries
4:30 pm Contributing Talk 4: An Adversarial Perspective on Machine Unlearning for AI Safety
4:40 pm Contributing Talk 5: Targeted Manipulation and Deception Emerge in LLMs Trained on User Feedback
4:50 pm Contributing Talk 6: On Demonstration Selection for Improving Fairness in Language Models
5:00 pm Closing remarks (10 min)