Project dates: 01/11/2021 - 01/02/2023
Title: A tool for the removal of identifying information from free-text fields in questionnaire data
The project addresses the challenge of identifying and obfuscating personal health information (PHI) in open-ended survey responses. It proposes a solution using Named Entity Recognition (NER) methods from Natural Language Processing (NLP). Challenges include the lack of annotated data and context in survey responses. The proposed model demonstrates improved performance compared to the baseline method. Future directions involve obtaining more labelled data, incorporating survey questions for context, and utilizing advanced NLP models. The document emphasizes the ongoing academic inquiry into obfuscating PHI in survey data.
Chief Investigators
Partners
Other Partners
SAX Institute: The SAX Institute is an Australian research organization specializing in public health and healthcare research. Founded in 2006, it collaborates with government, academic, and healthcare partners to conduct innovative research, generate evidence-based insights, and drive improvements in health policy and practice.