Medical Image Diagnostics — Ethiraj Srinivasan

Summary

With the deep learning techniques, medical diagnostics could be totally revolutionized. The project aims to develop a deep learning model for reporting mammograms. The project aimes to

To develop a diagnostic capability with medical images on mammogram screening
Identification of computer vision techniques that can preprocess Radiological images (X Ray, Ultrasound, MRI, CT)
Development of deep learning techniques that can identify lesions suspicious of malignancy
Development of mammogram screening as a proof of concept

Data Source

Mammogram images are obtained from the following sources

Pre Processing

In order to remove the noise from the image the biggest contour(largest blob) is found from the image. Mask of the biggest contour is used to clean the images.

Original Image

Contours

Largest Contour

Proprocessed Image

Experimental Setup

From the two publicly available datasets three more datasets are formed by combining the images in both the datasets (ALL), images with only CC views (CC) from DDSM datasets and images with only MLO views from the DDSM and MIAS datasets (MLO). These three newly formed datasets along with the two publicly available datasets form the five basic raw image datasets. These five raw image datasets are then semi-processed and completely processed to generate five semi-processed datasets and five completely processed datasets respectively. These 15 datasets from raw, semi-processed and completely processed datasets are then used in two class and three class classification models on VGG16 and ResNet34. Categorial Cross Entropy is chosen as the loss function and Adam optimizer is picked. Keras is used with Theano as the backend in an Ubuntu 16.04 Operating System with 32 GB RAM, 2 GTX 1080 Ti. Only one GPU was used for training the models. The last layer of the CNN networks were finetuned by replacing the thousand class output with a two class output for two class classification of normal vs abnormal and with a three class outputs for three class classification of normal vs benign vs malignant. The data is split into training(60%), test(20%) and validation(20%) set and classification is done for each image.

Results

Two class Classification

Three class Classification