Education
PhD in Computer Science, Durham University
2019-2024
- Thesis topic: “Reformulation and Decomposition: Multitask learning approaches to Long Document Problems”.
- Developed methods to apply LLMs to very long documents in both multitask and multimodal settings.
MEng in Computer Science, Durham University
2014-2018
- Published my work on Native Language Identification.
- BCS Chartered Institute for IT Prize - Awarded for being the top performing Masters student in Computer Science
Experience
Senior Research Associate, Durham University
2025-
- Working in a multidisciplinary team to apply NLP methods to monitor the quality of hospital discharge notes.
- Created tools and methods which enable clinicians to more effectively communicate, improving patient outcomes.
- Independently led a team developing fundamental ML methods - a novel multimodal approach which has been accepted at ICCV 2025
Senior Data Scientist, Evergreen Life
2025-
- Working at the intersection of research and industry, building new models to analyse electronic health records.
- Developed methods to automatically triage patient requests in GP practices.
Post Doctoral Research Associate, Durham University
2023-2025
- Worked closely with veterinarians from industry to use LLMs to understand discussions on veterinary message-boards.
- Developed methods to extract diseases from unstructured case discussions and match them to custom veterinary vocabularies (similar to SNOMED/UMLS).
- Built tools which enable new trends in diseases and treatments to be identified, and to facilitate more productive discussions between veterinary professionals with problematic cases.
Data Scientist, Caspian
2018-2023
- Researching how machine learning techniques can help solve NLP problems in the anti-money laundering (AML) sector, including information extraction from large unstructured documents.
- Role involved applying research techniques to industry, including developing methods for managing and monitoring models in production.
Recent Articles
- Terminal-izing my workflow - My 2019 resolution is to use more terminal tools, with the aim of making my work more efficient and productive....
- pyWebcamSteg: An annoymising proxy through your webcam - ExploitDB’s Google Hacking Database (GHDB) is a great resource for finding sneaky search queries for Google which lets you find...
