Education
PhD in Computer Science, Durham University
2019-2024
- Thesis topic: “Reformulation and Decomposition: Multitask learning approaches to Long Document Problems”.
 - Developed methods to apply LLMs to very long documents in both multitask and multimodal settings.
 
MEng in Computer Science, Durham University
2014-2018
- Published my work on Native Language Identification.
 - BCS Chartered Institute for IT Prize - Awarded for being the top performing Masters student in Computer Science
 
Experience
Senior Research Associate, Durham University
2025-
- Working in a multidisciplinary team to apply NLP methods to monitor the quality of hospital discharge notes.
 - Created tools and methods which enable clinicians to more effectively communicate, improving patient outcomes.
 - Independently led a team developing fundamental ML methods - a novel multimodal approach which has been accepted at ICCV 2025
 
Senior Data Scientist, Evergreen Life
2025-
- Working at the intersection of research and industry, building new models to analyse electronic health records.
 - Developed methods to automatically triage patient requests in GP practices.
 
Post Doctoral Research Associate, Durham University
2023-2025
- Worked closely with veterinarians from industry to use LLMs to understand discussions on veterinary message-boards.
 - Developed methods to extract diseases from unstructured case discussions and match them to custom veterinary vocabularies (similar to SNOMED/UMLS).
 - Built tools which enable new trends in diseases and treatments to be identified, and to facilitate more productive discussions between veterinary professionals with problematic cases.
 
Data Scientist, Caspian
2018-2023
- Researching how machine learning techniques can help solve NLP problems in the anti-money laundering (AML) sector, including information extraction from large unstructured documents.
 - Role involved applying research techniques to industry, including developing methods for managing and monitoring models in production.
 
Recent Articles
- Terminal-izing my workflow - My 2019 resolution is to use more terminal tools, with the aim of making my work more efficient and productive....
 - pyWebcamSteg: An annoymising proxy through your webcam - ExploitDB’s Google Hacking Database (GHDB) is a great resource for finding sneaky search queries for Google which lets you find...
 
