This is a selected list of publications focusses on sociotechnical AI research and AI safety. You can find a full listing of our work on Google Scholar.

2025 / In Press

    • Publication image
    • How will advanced AI systems impact democracy?
    • Summefield et al
    • Nature Human Behaviour (in press)
    • Publication image
    • Deep reinforcement learning can promote sustainable human behaviour in a common-pool resource problem
    • Koster et al
    • Nature Communications (2025)
    • Publication image
    • Deep mechanism design: Learning social and economic policies for human benefit
    • Tachetti et al
    • PNAS (2025)
    • Publication image
    • Reward Model Interpretability via Optimal and Pessimal Tokens
    • Christian et al
    • FacCT Proceedings (2025)
    • Publication image
    • Why human–AI relationships need socioaffective alignment
    • Kirk et al
    • Humanities and Social Sciences Communications (2025)
    • Publication image
    • Conversational AI increases political knowledge as effectively as self-directed internet search
    • Luettgau et al
    • arXiv
    • Publication image
    • Technological folie a deux: Feedback Loops Between AI Chatbots and Mental Illness
    • Nour et al
    • arXiv
    • Publication image
    • The Levers of Political Persuasion with Conversational AI
    • Hackenburg et al
    • arXiv
    • Publication image
    • Lessons from a Chimp: AI 'Scheming' and the Quest for Ape Language
    • Summerfield et al
    • arXiv
    • Publication image
    • How Malicious AI Swarms Can Threaten Democracy
    • Schroeder et al
    • arXiv
    • Publication image
    • Increasing happiness through conversations with artificial intelligence
    • Heffner et al
    • arXiv (2025)
    • Publication image
    • Language agents as digital representatives in collective decision-making
    • Jarrett et al
    • arXiv (2025)
    • Publication image
    • Can AI Mediation Improve Democratic Deliberation?
    • Tessler et al
    • Knight First Amendment Institute (2025)

2024

    • Publication image
    • AI can help humans find common ground in democratic deliberation
    • Tessler et al
    • Science (2024)

2023

    • Publication image
    • Using the Veil of Ignorance to align AI systems with principles of justice
    • Weidinger et al
    • PNAS (2023)

2022

    • Publication image
    • Fine-tuning language models to find agreement among humans with diverse preferences
    • Bakker et al
    • NeurIPS (2022)
    • Publication image
    • Human-centred mechanism design with Democratic AI
    • Koster et al
    • Nature Human Behaviour (2022)
    • Publication image
    • The good shepherd: An oracle agent for mechanism design
    • Balaguer et al
    • arXiv
    • Publication image
    • Hcmd-zero: Learning value aligned mechanisms from data
    • Balaguer et al
    • arXiv