ishans24 commited on
Commit
a8cb1f1
·
verified ·
1 Parent(s): 5f89d47

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +73 -1
README.md CHANGED
@@ -10,4 +10,76 @@ tags:
10
  - covid
11
  - covid19
12
  - xray
13
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  - covid
11
  - covid19
12
  - xray
13
+ ---
14
+
15
+
16
+ # COVID-19 Detection using VGG19 and X-ray Images
17
+
18
+ ## Overview
19
+
20
+ This model is able detect COVID-19 from X-ray images using the VGG19 architecture for transfer learning. The dataset used for this project is the COVID-19 Radiography Database available on Kaggle.
21
+
22
+ ## Dataset
23
+
24
+ The dataset used in this project is the [COVID-19 Radiography Database](https://www.kaggle.com/datasets/tawsifurrahman/covid19-radiography-database). It contains X-ray images categorized into three classes: COVID, Normal, and other pneumonia. The dataset is split into training, validation, and test sets to ensure robust evaluation of the model.
25
+
26
+ ## Methodology
27
+
28
+ ### 1. Import Libraries
29
+
30
+ We start by importing the necessary libraries required for data processing, model building, and evaluation. These include TensorFlow for deep learning, matplotlib for visualization, and other essential packages.
31
+
32
+ ### 2. Load Dataset
33
+
34
+ The dataset is loaded from the specified directory. This dataset contains X-ray images categorized into COVID, Normal, and other pneumonia classes. The images are stored in respective folders, which are read and preprocessed.
35
+
36
+ ### 3. Data Preprocessing
37
+
38
+ - **Data Augmentation:** To increase the diversity of our training data, various transformations such as rotation, zoom, and horizontal flip are applied. This helps in making the model robust and prevents overfitting.
39
+ - **Rescaling:** The pixel values are rescaled to the range [0, 1] to standardize the input data, which improves model performance.
40
+
41
+ ### 4. Split Dataset
42
+
43
+ The dataset is split into training, validation, and test sets. This is crucial for evaluating the model's performance on unseen data.
44
+ - **Training Set:** Used to train the model.
45
+ - **Validation Set:** Used to tune hyperparameters and prevent overfitting.
46
+ - **Test Set:** Used to assess the final model's performance.
47
+
48
+ ### 5. Build the Model using VGG19
49
+
50
+ - **Transfer Learning:** The pre-trained VGG19 model, which has been trained on a large dataset (ImageNet), is used to leverage the learned features from a different domain to our specific task of COVID-19 detection.
51
+ - **Model Architecture:** Custom layers are added on top of VGG19 to adapt it to our classification problem. This includes flattening the output, adding dense layers, and a final softmax layer for classification.
52
+
53
+ ### 6. Compile the Model
54
+
55
+ - **Loss Function:** 'binary_crossentropy' is used as the loss function because we have more than two classes.
56
+ - **Optimizer:** The Adam optimizer is used to adjust the learning rate dynamically.
57
+ - **Metrics:** Accuracy is tracked to monitor the performance of the model.
58
+
59
+ ### 7. Train the Model
60
+
61
+ - **Epochs:** The number of times the entire training dataset is passed forward and backward through the neural network.
62
+ - **Batch Size:** The number of training examples utilized in one iteration.
63
+ - **Validation Data:** Helps in monitoring the model's performance on unseen data during training to tune hyperparameters and avoid overfitting.
64
+
65
+ ### 8. Evaluate the Model
66
+
67
+ The model is evaluated on the test set to determine its accuracy, precision, recall, and F1 score. This helps in understanding the model's performance comprehensively.
68
+
69
+ ### 9. Visualize Training Results
70
+
71
+ - **Loss and Accuracy Plots:** Visualize the training and validation loss and accuracy to understand how well the model is learning and if it's overfitting or underfitting.
72
+ - **Confusion Matrix:** Provides a detailed breakdown of true positives, false positives, true negatives, and false negatives, giving insights into where the model is making errors.
73
+
74
+ ### 10. Conclusion
75
+
76
+ The findings and the performance of the model are summarized. Potential improvements or future work such as experimenting with different architectures, more data, or advanced preprocessing techniques are discussed.
77
+
78
+ ## Results
79
+
80
+ The model achieves an accuracy of 98.1% on the test set, indicating its effectiveness in detecting COVID-19 from X-ray images. The high accuracy demonstrates the successful application of data preprocessing, augmentation, and model training techniques.
81
+
82
+ ## Acknowledgements
83
+
84
+ - [COVID-19 Radiography Database](https://www.kaggle.com/datasets/tawsifurrahman/covid19-radiography-database)
85
+ - [VGG19 Model](https://arxiv.org/abs/1409.1556)