Team

Walter Laurito

At Cadenza Labs, Walter splits his time between being a team lead and doing research. Walter was in the MATS Winter 2023 Cohort under the mentorship of John Wentworth. He was also part of ASET, where he implemented benchmarks for the UK AI Security Institute and supervised others in their implementation. Before starting his PhD at KIT, Walter worked as a software engineer for a couple of years after graduating in CS.

Kieron Kretschmar

Kieron joined Cadenza Labs while completing his M.Sc. at the University of Amsterdam, where he graduated cum laude. His thesis, written in collaboration with Walter Laurito, explores representations of truthfulness in language models. It analyzes how supervised and unsupervised probes can fail to predict truthfulness under distributional shifts, highlighting failure modes and potential mitigation strategies. During his studies he has set up and organized an AI Safety reading group, and has participated in the ML4Good and Talos Fellowships. Before pivoting his career towards technical AI Safety research, he has co-founded two startups.

Sharan Maiya

Sharan first joined Cadenza Labs to work on Cluster-Normalization for Unsupervised Probing. He is a first-year PhD student at the Language Technology Lab at the University of Cambridge, where he works on interpretability and evals. Additionally, he is a MATS scholar under Evan Hubinger. Sharan has a background in statistics after studying at Imperial College and Edinburgh.

Advisors

We thank our advisors for their regular guidance on our research direction and other topics:

Alex Mallen, AI Safety Researcher at Redwood Research
Chris Cundy, AI Safety Researcher at FAR.AI
Erik Jenner, AI Safety Researcher at Deepmind

Other Collaborators

Clément Dumas
Kaarel Hänni
Grégoire Dhimoïla