
Global Land Cover Mapping using Image Processing #503

Closed
abhisheks008 opened this issue May 5, 2024 · 26 comments · Fixed by #813
Assignees
Labels
gssoc Girlscript Summer of Code 2024 level2 Level 2 for GSSOC Status: Assigned Assigned issue.

Comments

@abhisheks008
Owner

Deep Learning Simplified Repository (Proposing new issue)

🔴 Project Title : Global Land Cover Mapping using Image Processing

🔴 Aim : The aim is to analyze the dataset using image processing and deep learning methods and find the best-fitting model for this dataset.

🔴 Dataset : https://www.kaggle.com/datasets/aletbm/global-land-cover-mapping-openearthmap

🔴 Approach : Try 3-4 algorithms to implement the models and compare them to find the best-fitting algorithm by checking the accuracy scores. Also, do not forget to do an exploratory data analysis before creating any model.
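The comparison step described above can be sketched as follows; the model names and accuracy numbers here are placeholders for illustration, not results from this dataset.

```python
# Hypothetical accuracy scores collected after training each candidate model;
# the values are illustrative placeholders, not real results.
scores = {"YOLOv5": 0.81, "VGG16": 0.74, "ResNet50": 0.78, "InceptionV3": 0.76}

# Pick the best-fitting model by accuracy, as the issue asks.
best_model = max(scores, key=scores.get)
print(best_model, scores[best_model])
```

The same dict-and-`max` pattern works regardless of which metric is reported, as long as a higher score is better.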


📍 Follow the Guidelines to Contribute in the Project :

  • You need to create a separate folder named after the Project Title.
  • Inside that folder, there will be four main components.
    • Images - To store the required images.
    • Dataset - To store the dataset or information/source about the dataset.
    • Model - To store the machine learning model you've created using the dataset.
    • requirements.txt - This file will contain the required packages/libraries to run the project on other machines.
  • Inside the Model folder, the README.md file must be filled in properly, with proper visualizations and conclusions.

🔴🟡 Points to Note :

  • The issues will be assigned on a first-come, first-served basis; 1 Issue == 1 PR.
  • "Issue Title" and "PR Title" should be the same. Include the issue number along with it.
  • Follow the Contributing Guidelines & Code of Conduct before you start contributing.

To be Mentioned while taking the issue :

  • Full name :
  • GitHub Profile Link :
  • Email ID :
  • Participant ID (if applicable):
  • Approach for this Project :
  • What is your participant role? (Mention the Open Source program)

Happy Contributing 🚀

All the best. Enjoy your open source journey ahead. 😎

@abhisheks008 abhisheks008 added the Status: Up for Grabs Up for grabs issue. label May 5, 2024
@tushtithakur

Hi, I'm excited to contribute to this project. Could you please assign it to me? Looking forward to getting started! @abhisheks008

Full name : Tushti Thakur
GitHub Profile Link : https://github.com/tushtithakur
Email ID : [email protected]
Approach for this Project : Implement different deep learning algorithms using the dataset, evaluate it and compare performance.
What is your participant role? GSSoC 2024

@abhisheks008
Owner Author

Hi @tushtithakur, please wait for the induction session to finish this evening; after that, issues will be assigned to the contributors.

@tushtithakur

@abhisheks008 Sure sir, I'll wait for the induction session to be completed. Thank you for the update!

@ArismitaM
Contributor

Hi, I would like to contribute to this project. I have recently been part of a hackathon where I have used image processing.

Full name: Arismita Mukherjee
GitHub Profile Link: https://github.com/ArismitaM
Email ID: [email protected]
Approach for this Project: I will implement different models and datasets to arrive at a better-performing model.
What is your participant role? GSSoC '24

@abhisheks008
Owner Author

Hi @ArismitaM can you clarify more on the algorithms/models you are planning to use here?

@ArismitaM
Contributor

I am planning to use YOLOv5 to train on the land cover data and identify the cover types.

@abhisheks008
Owner Author

I know YOLO is a go-to option for this kind of dataset. Can you share some other approaches along with the YOLO one?

Basically, in this repo we ask our contributors to implement at least 2-3 models for the same dataset, check their accuracy scores, and then conclude that model X is the best fit for this project. I hope you understand my point.

@ArismitaM
Contributor

I can also use VGG16 or ResNet50 as different models to compare against YOLO.

@ArismitaM
Contributor

I will use VGG16, ResNet50, YOLO, and InceptionV3, and then do a comparative study to analyze which model works best and yields the better result for this issue. Please assign this task to me.

@abhisheks008
Owner Author

Cool! Issue assigned to you @ArismitaM

@abhisheks008 abhisheks008 added Status: Assigned Assigned issue. level2 Level 2 for GSSOC gssoc Girlscript Summer of Code 2024 and removed Status: Up for Grabs Up for grabs issue. labels May 14, 2024
@ArismitaM
Contributor

ArismitaM commented May 16, 2024

Hi @abhisheks008, I have downloaded the Kaggle dataset and gone through its contents. The images as well as the labels are .tiff files. The README provided with the Kaggle dataset does not explain the structure of the information in the dataset beyond the hexadecimal number associated with each label. From this, I have the following inference:

  • The image .tiff file is just a single image layer.
  • The .tiff file for the corresponding label contains an image layer marking the areas associated with that label.

I will have to use this information structure to create bounding boxes for each of the land cover types to train YOLO with. It would help me if you could confirm whether my understanding of the information content in the .tiff files is correct.
If I have misunderstood, kindly provide the structure of the information in the .tiff files.
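A quick way to settle this is to load one image/label pair and look at the array shapes. The sketch below fabricates arrays with the shapes under discussion so it is self-contained; a real check would read the .tiff files with a reader such as tifffile, which returns the same kind of NumPy array.

```python
import numpy as np

# Fabricated stand-ins for one sample; a real check would do e.g.
#   image = tifffile.imread("images/train/xyz.tif")
# (tifffile is an assumed dependency, any TIFF reader returning
# a NumPy array behaves the same way).
image = np.zeros((512, 512, 3), dtype=np.uint8)  # multi-layer image
label = np.zeros((512, 512), dtype=np.uint8)     # single-layer label map

def layer_count(arr: np.ndarray) -> int:
    """Number of layers: 1 for a single-band raster, C for an HxWxC stack."""
    return 1 if arr.ndim == 2 else arr.shape[-1]

print(layer_count(image))  # 3
print(layer_count(label))  # 1
```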

@abhisheks008
Owner Author

You are going in the right direction. 🚀

@ArismitaM
Contributor

Hi @abhisheks008, I have analyzed the data set.

This is an image present in the dataset under the directory images/train

Screenshot from 2024-05-18 19-19-43

Each image has 3 layers, so below are the layers of the above image:

Screenshot from 2024-05-18 19-20-01

Screenshot from 2024-05-18 19-20-09

Screenshot from 2024-05-18 19-20-16

There is a corresponding label under labels/train, which has a single layer with coloured labels:

Screenshot from 2024-05-18 19-20-38

@abhisheks008
Owner Author

@ArismitaM looks good to me.

@ArismitaM
Contributor

With reference to my comment above, I have a few queries,

  1. In the images directory as well as the label directory, there is a folder called 'val' which also contains images, but I have not understood where to use them.
  2. There are 3 layers for each image and a corresponding label for it. I want to understand the purpose of the 3 layers for each image.
  3. How is the information organized for each image and corresponding label?

@abhisheks008
Owner Author

Query 1:

The val directory typically stands for 'validation'. In machine learning and deep learning, datasets are often split into three sets:

  • Training set: Used to train the model.
  • Validation set: Used to evaluate the model during training and tune hyperparameters.
  • Test set: Used to evaluate the model's performance after training is complete.

Query 2:

The three layers in each image likely represent the three color channels in the image: Red, Green, and Blue (RGB). Each layer corresponds to the intensity of that color in the image:

  • The first layer shows the intensity of the red color.
  • The second layer shows the intensity of the green color.
  • The third layer shows the intensity of the blue color.

When these layers are combined, they form a full-color image.

Query 3:

Each image in the dataset is organized with its corresponding label, which likely represents the segmentation or classification information. Here's a typical structure:

  • Images Directory: Contains subdirectories for training and validation images.

    • images/train: Contains the training images, each having three layers (RGB channels).
    • images/val: Contains the validation images, each having three layers (RGB channels).
  • Labels Directory: Contains subdirectories for training and validation labels.

    • labels/train: Contains the training labels, usually a single-layer image where each pixel value represents a class label.
    • labels/val: Contains the validation labels, similarly structured to the training labels.

Three layers for each image are the RGB channels that combine to form the full-color image. The label image uses colors to represent different classes or segments corresponding to regions in the input image. The 'val' directory contains data for validation purposes, used to tune and evaluate the model during training.

Example:

  • Image Layers (RGB):

    • Red Channel (Layer 1):
      Screenshot from 2024-05-18 19-20-01

    • Green Channel (Layer 2):
      Screenshot from 2024-05-18 19-20-09

    • Blue Channel (Layer 3):
      Screenshot from 2024-05-18 19-20-16

  • Corresponding Label:
    Screenshot from 2024-05-18 19-20-38

@ArismitaM
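Under the layout described above (matching filenames across images/{split} and labels/{split}), pairing each image with its label can be sketched as below; the function name is illustrative.

```python
from pathlib import Path

def paired_samples(root: Path, split: str):
    """Yield (image, label) path pairs for one split, assuming the
    images/{split} and labels/{split} layout with matching filenames.
    Images that have no corresponding label file are skipped."""
    for img_path in sorted((root / "images" / split).glob("*.tif")):
        lbl_path = root / "labels" / split / img_path.name
        if lbl_path.exists():
            yield img_path, lbl_path
```

Calling `list(paired_samples(Path("dataset"), "train"))` then gives the training pairs, and the same call with `"val"` gives the validation pairs used for tuning.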

@ArismitaM
Contributor

Thank you for the clarification @abhisheks008.
I now understand the structure of the information in the dataset.

@ArismitaM
Contributor

Hi @abhisheks008, I am writing code that will draw bounding boxes around objects of the same colour in the label image (different colours are used for different classes) and generate a .txt file for them.
Do you think this is an optimal method?

@abhisheks008
Owner Author

If it is working for your development, then go for it.
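A minimal sketch of the colour-to-box idea, assuming an RGB label array and a hypothetical colour-to-class-id mapping. It emits one YOLO-format line per colour present; a fuller version would first split each colour into connected components (e.g. with OpenCV) so that separate regions of the same class get separate boxes.

```python
import numpy as np

def colour_boxes_yolo(label: np.ndarray, classes: dict) -> list:
    """One YOLO-format line per colour present in the HxWx3 label image.
    `classes` maps an (R, G, B) tuple to a class id (mapping is assumed,
    not part of the dataset README). Note: a single box per colour is a
    simplification; disjoint regions of one class are not separated here."""
    h, w = label.shape[:2]
    lines = []
    for rgb, cls in classes.items():
        ys, xs = np.where(np.all(label == rgb, axis=-1))
        if xs.size == 0:
            continue  # colour not present in this label image
        x0, x1, y0, y1 = xs.min(), xs.max(), ys.min(), ys.max()
        # YOLO format: class cx cy width height, all normalized to [0, 1]
        cx, cy = (x0 + x1 + 1) / 2 / w, (y0 + y1 + 1) / 2 / h
        bw, bh = (x1 - x0 + 1) / w, (y1 - y0 + 1) / h
        lines.append(f"{cls} {cx:.6f} {cy:.6f} {bw:.6f} {bh:.6f}")
    return lines
```

Writing `"\n".join(lines)` to a .txt file with the same stem as the image gives the annotation file YOLOv5 expects.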

@ArismitaM
Contributor

Hello @abhisheks008,
I have clustered the points of each class using DBSCAN. Attached are the plot of a label and its corresponding clusters. Similar clusters have been created for all the labels in a raster file.

label
Screenshot from 2024-05-31 10-43-17

cluster
Screenshot from 2024-05-31 10-44-34
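The clustering step described above can be sketched with scikit-learn's DBSCAN on the pixel coordinates of one class mask; the `eps`/`min_samples` values below are illustrative and would need tuning per dataset.

```python
import numpy as np
from sklearn.cluster import DBSCAN

def cluster_class_pixels(mask: np.ndarray, eps: float = 2.0, min_samples: int = 4):
    """Group the (row, col) coordinates of one class's pixels into spatial
    clusters with DBSCAN. Returns the coordinates and a cluster label per
    point; label -1 marks noise. eps/min_samples are assumed defaults."""
    coords = np.argwhere(mask)  # N x 2 array of (row, col) points
    if coords.size == 0:
        return coords, np.array([], dtype=int)
    labels = DBSCAN(eps=eps, min_samples=min_samples).fit_predict(coords)
    return coords, labels
```

Running this once per class colour reproduces the per-label clusters shown in the plots.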

@abhisheks008
Owner Author

Looks good to me. Any issues with this?

@ArismitaM
Contributor

No issues with this; I was just giving an update.

@ArismitaM
Contributor

ArismitaM commented Jun 1, 2024

I have drawn bounding boxes around the clusters and this is how it looks:

Screenshot from 2024-06-01 11-01-25

Screenshot from 2024-06-01 11-00-27

Screenshot from 2024-06-01 11-00-44

Screenshot from 2024-06-01 11-01-01
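Given DBSCAN output, the per-cluster boxes shown above reduce to a min/max over each cluster's pixel coordinates; a small sketch (the function name is illustrative):

```python
import numpy as np

def cluster_bounding_box(coords: np.ndarray) -> tuple:
    """Axis-aligned box around one cluster's points, as
    (row_min, col_min, row_max, col_max). `coords` is the N x 2 subset of
    (row, col) points that DBSCAN assigned to a single cluster label."""
    (r0, c0), (r1, c1) = coords.min(axis=0), coords.max(axis=0)
    return int(r0), int(c0), int(r1), int(c1)
```

Applying this to each non-noise cluster label, then normalizing by image size, yields the YOLO training boxes.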

@ArismitaM
Contributor

Hello @abhisheks008,
Here is an update:
I have successfully trained this model with YOLOv5 using the GPU.
I tried using VGG but realized it is not ideal for object detection.
Then I moved on to RetinaNet. The first round of training worked well on the CPU, but it did not work on my GPU due to lack of memory. So I have switched to training on the CPU, which is why the training process is taking time.

@abhisheks008
Owner Author

Looks pretty good. Carry on!


Hello @ArismitaM! Your issue #503 has been closed. Thank you for your contribution!
