This project combines all the tools and techniques covered in the course, applying it to the problem of identifying patients with diabetic complications from clinical text notes.
Specifically, a corpus of anonymized clinical notes from patients who have diabetes is used to identify which notes indicate diabetic complications of neuropathy, nephropathy, and/or retinopathy, and flag these complications accordingly.
We assess the text processing algorithm performance by identifying how many notes were correctly and incorrectly classified based on a reference key.