I am a Machine Learning Architect at LynxCare, where I build models for clinical natural language processing. Previously, I was a research scientist at Nuance Automotive (now Cerence) and a postdoctoral researcher at IDLab's Text-to-Knowledge Group at Ghent University. My research focused on training and deploying neural NLP models in low-resource settings, with work spanning sequence-to-sequence models, keyphrase extraction, and knowledge base population.
Feel free to reach out by mail or connect on social media.
Publications
Learning to Reuse Distractors to Support Multiple Choice Question Generation in Education
IEEE Transactions on Learning Technologies
Clinical Information Extraction for Preterm Birth Risk Prediction
Journal of Biomedical Informatics
Overly Optimistic Prediction Results on Imbalanced Data: Flaws and Benefits of Applying Over-sampling
Artificial Intelligence in Medicine, 2021
Predicting Psychological Health from Childhood Essays. The UGent-IDLab CLPsych 2018 Shared Task System
CLPsych at NAACL-HLT 2018
Creation and Evaluation of Large Keyphrase Extraction Collections with Multiple Opinions
Language Resources and Evaluation
An Automated End-To-End Pipeline for Fine-Grained Video Annotation using Deep Neural Networks
ICMR 2016 (Demo)
Ghent University-IBCN Participation in TAC-KBP 2015 Cold Start Slot Filling Task
TAC 2015
When Topic Models Disagree: Keyphrase Extraction with Multiple Topic Models
WWW 2015 (Poster Session)
Using Active Learning and Semantic Clustering for Noise Reduction in Distant Supervision
AKBC Workshop at NIPS 2014
Thesis
Methods for Efficient Supervision in Natural Language Processing
Reports
Talks
Information Extraction from Medical Notes for Early Birth Risk Prediction
12th Belgium NLP Meetup — December 2019
Knowledge Base Population from Text and Graphs
Cambridge Language Technology Lab, Seminar — February 2017
Distributed Representations of Relation Paths to Bootstrap Relation Extractors
UCL Machine Reading Lab, Internal Workshop — July 2016