- Contributors
- Introduction
- Dataset & Preprocessing
- Model Architecture
- Loss Function & Optimizer
- Learning Rate
- Result
- Conclusion
This project is created by the joint efforts of
Malaria caused by the Plasmodium parasites, is a blood disorder, which is transmitted through the bite of a woman Anopheles mosquito. With almost 240 million cases mentioned each year, the sickness puts nearly forty percentage of the global populace at danger. Thus,treatment of this disorder at the inital stages of occurance in the host's body is very neccessary,but for the treatment to take place the first step is the proper diagnosis of the host to detect malaria. Microscopic examinations of thick and thin blood smears for infected RBCs is commonly used for method of malaria diagnosis. Depending on the local protocol, the examination includes:
- classifying and counting the normal and infected erythrocytes in the thin smear images; and/or
- ountingparasites in thick smear images as specified in the WHOguidelines. Thus, the diagnostic accuracy is heavilydependent on manual expertise and can be adversely impacted by the burden posed by large scale analyses that are common in malaria endemic regionsAlternative techniques such as polymerase chain reaction (PCR) and rapid diagnostic tests (RDT) are also widely used. However, PCR tests are limited in their performance while RDTs are less cost-effective in zones with high disease prevalence
We have used the NIH malaria dataset,which contains 27,588 cell images with equal instances of parasitized and uninfected cells. Positive samples contained Plasmodium and negative samples contained no Plasmodium but other types of objects including staining artifacts/impurities. There are total 13,779 paratisized and 13,779 uninfected image samples.
We augmented our data on the fly using the Albumentations library. We applied random flips and rotations with random changes in lighting by increasing/decreasing contrast, gamma & brightness.
We have created three model architecture using different API namely
* Sequential API : We have used keras sequential API to build this architecture along with keras image generators.
* Functional API : We have used funtional API along with custom functional generator for this model architecture.
We have used the same model structure just in funtional paradigm along with custom generator and augmentations.
* Transfer Learning : We have used transfer learning with DenseNet201 along with functional API for this model.
The loss-function that we have used here is binary cross entropy. And, the optimiser that we have used is Nadam.
In case of sequential API we have used a fixed learing rate. But in case of the other two models the learning rate we have used here is not constant throughout the training of the data, instead we have used a learning rate schedular, which increases/decreases the learning rate gradually after every fixed set of epochs such that we can attain the optimum convergence by the end of our training of the data.
We have compared the results mainly on the basis of validation accuracy and validation AUC of three model architectures.
Hence we can clearly see from the three graphs that the model architecture using functional API has performed better w.r.t to the other two models.
Hence we can conlcude that Using transfer learning we extract many unwanted features in our model which stands out to be irrelevant while classifying the real data.Here, Our functional model with a custom generator has succesfully performed the task better than the model that we have created usng transfer learning
keep Coding:-)