Kureha Yamaguchi

Hi! I’m Kureha (pronounced cray.ha)- an AI Researcher at UK’s National Institute for Data Science and AI, also known as The Alan Turing Institute. I investigate AI system vulnerabilities and develop secure ML solutions within the Turing’s Defence and National Security programme, working with UK Government stakeholders and academic collaborators.

Prior to research, I was at the University of Cambridge completing my undergraduate and master’s degrees in Information and Computer Engineering, supported by a scholarship from the Institution of Engineering and Technology.

I love coding, and I really care about problem-solving in a socially responsible way that actually benefits humanity in the long-term. Still figuring it all out and enjoying the process- thanks for visiting my page!

selected publications

Adversarial Manipulation of Reasoning Models using Internal Representations

Kureha Yamaguchi, Benjamin Etheridge, and Andy Arditi

2025

DOI
2DSig-Detect: a semi-supervised framework for anomaly detection on image data using 2D-signatures

Xinheng Xie, Kureha Yamaguchi, Margaux Leblanc, Simon Malzard, Varun Chhabra, Victoria Nockles, and Yue Wu

2025

DOI
An AI red team playbook

Anna Raney, Shiri Bendelac, Keith Manville, Mike Tan, and Kureha Yamaguchi

https://doi.org/10.1117/12.3021906, Jun 2024

DOI
An AI blue team playbook

Mike Tan, Kureha Yamaguchi, Anna Raney, Victoria Nockles, Margaux Leblanc, and Shiri Bendelac

https://doi.org/10.1117/12.3021908, Jun 2024

DOI