The Architecture Decision Record (ADR) is to capture every key decision for each part of relevant coding process. It does not describe what, and how, coding executes but rather the abstract of it—the reason of it. For this project, the ADR is to document the whys. These documents are to understand better why the image classification is architected this way.
Pranav Raikote publishes the images released by the University of Montreal in Kaggle platform. This platform is an open-source data. The Kaggle API in IBM Watson Studio Jupyter Notebook is in use to retrieve this dataset. This dataset is 153mb in total and it has two different files, which are training and testing. Both have three different subsets: Covid-19, viral pneumonia and normal lungs.
The IBM Watson Studio Jupyter Notebook is in use to connect Kaggle platform and this dataset also persists in IBM Cloud Pak for Data as an asset. This process is entirely cloud-based.
…/data/train
…/Covid
…/001.png
…/002.jpg
…/Pneumonia
…/001.jpeg
…/002.jpg
…/Normal
…/001.jpg
…/002.png
There are several different explorations that have done, as following: