Walter was in the MATS Winter 2023 Cohort under the mentorship of John Wentworth. At Cadenza Labs, he splits his time between being a team lead and research. With other collaborators, his team created an open-source library, where Walter is a main contributor. As part of ASET, he implemented benchmarks for the UK AI Security Institute and supervised others in their implementation. Before starting his PhD in ML, Walter was working as software engineer for a couple of years after graduating in CS.
Sharan first joined Cadenza Labs to work with Walter on Cluster-Normalization for Unsupervised Probing. He is a first-year PhD student at the Language Technology Lab at the University of Cambridge, where he works on interpretability and evals. Additionally, he is a MATS scholar under Evan Hubinger. Sharan has a background in statistics after studying at Imperial College and Edinburgh.
Kieron is currently finishing his thesis for the M.Sc. in AI at the University of Amsterdam in collaboration with Cadenza. In his research, he investigates ways in which supervised and unsupervised probes can fail to predict truthfulness under distributional shifts, with a focus on quantifying and methods to mitigate these failure modes. During his studies he has set up and organized an AI Safety reading group, and has participated in the ML4Good and Talos Fellowships. Before pivoting his career towards technical AI Safety research, he has co-founded two startups and obtained a B.Sc. in Technomathematics.
Jim Chapman has worked in leadership roles for three non-governmental organizations, serving as board vice president, policy analyst, and executive director. He co-managed a multi-million-dollar, multi-year academic research project and currently leads a consulting firm as managing principal. After becoming concerned about AI risks, and a year of learning and upskilling, Jim is joining Cadenza Labs to leverage his 25+ years of operational experience and leadership in non-profit, university, and for-profit consulting settings.
Grégoire's work at Cadenza Labs primarily focused on the project "Cluster-Norm for Unsupervised Probing of Knowledge." His main contribution was developing contrast-pair clustering techniques for CCS-style methods. This research was presented at the ICML MechInterp workshop. In addition to this project, Grégoire has been investigating how structures emerge in the computational graphs of neural networks. Currently, Grégoire is pursuing a Master's degree in Mathematics and Computer Science at ENS Paris-Saclay. His academic interests are centered on mechanistic interpretability and the analysis of neural networks.
We thank our advisors for their regular guidance on our research direction and other topics: