2024
08/05/2024 | Joined Tom Hosking at the ICLR ‘24 poster session for our Human Feedback is Not Gold Standard work. Thanks for all the interest! |
29/01/2024 | Presented Human Feedback is Not Gold Standard at the Amazon Themis Science Meeting. |
2023
23/11/2023 | Presented Human Feedback is Not Gold Standard at the nPlan Machine Learning Paper Club. |
13/11/2023 | Gave an invited talk on the application of LLMs for Enterprise at the Oracle AI@Molitor event. |
10/05/2023 | Gave a talk on NLP Applications and Large Language Models to the Capital Enterprise startup network. |
13/04/2023 | Honoured to have been nominated by my students for the UCL Inspiring Teaching Delivery award 🙏 |
13/03/2023 | Gave an invited talk on Dynamic Advsersarial Data Collection for Large Language Models at the UCL AI Centre seminar on The Present and Future of Large Language Models in Theory and Practice. |
13/03/2023 | That’s a wrap! Another year of the MSIN0221 Natural Language Processing lectures comes to an end. Exciting to see the growing interest in NLP and its application! |
2022
24/11/2022 | Presented recent work on DADC and GAAs at the King’s College London Distributed Artificial Intelligence group. Thanks for the insightful discussions! |
17/10/2022 | Super excited to announce that I have joined Cohere and will be working on making large language models more useful and robust. |
10/07/2022 | I’m in Seattle for NAACL 2022! I’ll be presenting Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants on Wednesday, 13th July at 10:45 PST. And don’t forget to join us at the DADC workshop on Thursday, 14th July for same amazing keynote talks, a diverse panel, presentations from our Shared Task participants and best paper winners, posters, prizes & much more! |
18/05/2022 | Our work Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity has been selected as an outstanding paper at ACL 2022! |
13/05/2022 | Our work Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants has been accepted as an oral presentation at NAACL 2022! |
09/05/2022 | Excited to announce that I have joined DeepMind as a Research Scientist Intern. |
06/04/2022 | Gave an invited talk on Dynamic Adversarial Data Collection for Question Answering at the Oracle Labs ML Seminar Series. This was a particularly fun and interactive one, thanks for the invite! |
26/03/2022 | The call for participation for the Shared Task at the DADC Workshop co-located with NAACL ‘22 in Seattle is now live! We have three fantastic tracks for you to participate in. Sign up here! |
25/03/2022 | Presented our work on Dynamic Adversarial Data Collection for QA at the University of Oxford. |
19/03/2022 | Additional resources from our work on Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation at EMNLP 2021 are now available! We are releasing a collection of synthetically-generated adversarial QA pairs and related resources as well as the models used to generate the questions. |
14/03/2022 | Just gave the last lecture of the MSIN0221 Natural Language Processing module for this year. Fantastic cohort as always and it was great to be back to in-person teaching! |
20/01/2022 | AdversarialQA is currently the 3rd most downloaded QA dataset on Huggingface 🤗 Datasets right after the benchmark SQuADv1.1 and SQuADv2! |
04/01/2022 | Our proposal for the First Workshop on Dynamic Adversarial Data Collection has been accepted! See you at NAACL ‘22 in Seattle! |
2021
09/11/2021 | Presented our work on Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation at EMNLP 2021. The recording is available here. |
24/09/2021 | Dynabench is 1 year old! To celebrate, we’ve released Dynatask to help researchers host their own tasks. |
10/09/2021 | Presented a live demonstration of Dynamic Benchmarking at the UCL AI Centre 2nd Anniversary Showcase. |
27/08/2021 | Our work Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation has been accepted to the EMNLP 2021 Main Conference! |
26/08/2021 | Our work Contrasting Human-and Machine-Generated Word-Level Adversarial Examples for Text Classification has been accepted to the EMNLP 2021 Main Conference! |
24/08/2021 | ldbd.ly helps you make sense of ever-changing dynamic leaderboards. |
18/04/2021 | Our work Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation is now available on arXiv! |
17/04/2021 | Our work Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity is now available on arXiv! |
07/04/2021 | The Dynabench paper introducing our unified research platform for dynamic benchmarking has been accepted to NAACL 2021! |
06/04/2021 | Excited to announce that I have joined Facebook AI Research as an external research collaborator working on generation-assisted human adversarial annotation. |
13/01/2021 | The AdversarialQA dataset is now available in Huggingface 🤗 Datasets! Usage is as simple as |
2020
12/12/2020 | The HAMLETS NeurIPS 2020 workshop kicks off today. Join us to learn more about Human And Model in the Loop Evaluation and Training Strategies. |
20/11/2020 | Presented Humans-and-Machines in the Loopfor Dynamic Benchmarking and Evaluation at the Annual MURI Review Meeting. |
11/11/2020 | Presented Adversarial Human Annotation for Dynamic Benchmarking and Evaluation at the UCL AI Centre session on AI in science, industry and society at TheAlgo2020. |
25/10/2020 | Our work Beat the AI: Investigating Adversarial Human Annotation for Reading Comprehension will be presented at EMNLP 2020. |
24/09/2020 | Dynabench, in collaboration with Stanford University, the University of North Carolina at Chapel Hill, and Facebook AI, is now live! Can you fool the QA model? |
22/09/2020 | Call for papers for our NeurIPS2020 workshop HAMLETS: Human And Model in the Loop Evaluation & Training Strategies is now live! |
15/09/2020 | Our work Undersensitivity in Neural Reading Comprehension has been accepted in Findings of EMNLP 2020! |
01/09/2020 | Excited to announce that I have joined Facebook AI Research as a research intern working on adversarial benchmarking and robustness. |
17/06/2020 | Our work Beat the AI: Investigating Adversarial Human Annotation for Reading Comprehension has been accepted to Transactions of the Association for Computational Linguistics (TACL)! |
26/04/2020 | Delivered the MSIN0221: Natural Language Processing module to this year’s UCL MSc Business Analytics cohort. |
14/02/2020 | Presented Adversarial Human Annotation for Reading Comprehension at the University of Cambridge NLIP Seminar Series. |
06/02/2020 | Accepted onto Cohort III of the Conception X programme! |
2019
17/06/2019 | Presented Asking Harder Questions at the UCL NLP Inaugural Event followed by a poster session on ShARC. |
15/05/2019 | Delivered a two-part workshop titled Overview of NLP to this year’s UCL MSc Business Analytics cohort. |
15/04/2019 | Co-presented Interpretation of Natural Language Rules in Conversational Machine Reading at the South England Natural Language Processing (SENLP) meetup. |
12/04/2019 | Led a workshop titled Introduction to Python and Machine Learning at the Peking University HSBC Business School (PHBS) in Oxford. |
14/01/2019 | I have started a PhD at UCL under the guidance of Pontus Stenetorp and Sebastian Riedel. |
2018
03/11/2018 | Presented Interpretation of Natural Language Rules in Conversational Machine Reading at EMNLP together with Patrick Lewis and other co-authors. |
25/10/2018 | The ShARC dataset from our EMNLP ‘18 paper is now live! |
31/08/2018 | Bloomsbury AI has joined Facebook to strengthen its efforts in natural language processing research. |
22/08/2018 | Cape (open source) is the new state-of-the-art for open-domain question answering on TriviaQA. |
17/08/2018 | Our large-scale question answering system, Cape, is now available open source! |
10/08/2018 | Interpretation of Natural Language Rules in Conversational Machine Reading has been accepted at EMNLP. |
25/04/2018 | We’ve been accepted into the Allen & Overy Fuse accelerator programme. |
2017
22/11/2017 | Invited presentation of the work we’re doing at Bloomsbury AI at the A Common Language for Intelligence meet-up hosted by Grakn AI. |
12/05/2017 | I have joined NLP-focused startup Bloomsbury AI, working on open-domain question answering. |