Conditional Variational Autoencoder Data Augmentation for Robustness to Instance-Dependent Noise

Research Paper

For detailed information about our study and findings, please refer to our research paper. You can access it here: Report Paper

Abstract

This paper addresses the critical challenge in deep learning of obtaining high-quality, accurately labeled data. In real-world scenarios, label noise is often unavoidable and can significantly hinder model performance. Traditional artificially generated label noise does not adequately represent real-world conditions. We propose a novel approach using instance-dependent noise (IDN) as a benchmark for modeling real-world label noise, coupled with a deep generative method for effective handling of this issue.

Introduction

The effectiveness of deep learning systems heavily relies on the quality of the labeled data. However, accurate and noise-free labels are challenging and time-consuming to obtain. Our research focuses on addressing the gap between artificially generated label noise and real-world label noise, using IDN as a more representative benchmark.

Methodology

We introduce the Conditional Variational Autoencoder (CVAE) as a novel method for data augmentation on desired class labels. This approach aims to enhance the robustness of deep learning models against the detrimental effects of label noise.

Conditional Variational Autoencoder (CVAE)

The CVAE is designed to encode generalized class-conditional features.
It helps in mitigating the influence of noisy labels by enhancing data quality through augmentation.

Dataset and Evaluation

Dataset Used: MNIST dataset.
Evaluation Method: Comparative analysis with several state-of-the-art techniques under less noisy conditions.

Results

The experimental results demonstrate that our CVAE-based method outperforms existing techniques in scenarios with lower levels of noise.
The CVAE-based data augmentation shows robustness to small percentages of noise, underscoring its potential in real-world image classification scenarios.

Conclusion

The study confirms the potential of the CVAE in improving the performance of image classification models, particularly in real-world settings where label noise is a common issue. Our findings highlight the importance of developing techniques that are tailored to the nuances of real-world data.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.idea		.idea
.ipynb_checkpoints		.ipynb_checkpoints
MNIST		MNIST
__pycache__		__pycache__
noisyData		noisyData
README.md		README.md
cvae_data_augmentation.ipynb		cvae_data_augmentation.ipynb
cvae_generator_demo.py		cvae_generator_demo.py
dataVisualization.ipynb		dataVisualization.ipynb
dataset.py		dataset.py
model.py		model.py
modelTraining.ipynb		modelTraining.ipynb
noise_generator.ipynb		noise_generator.ipynb
noise_generator_demo.py		noise_generator_demo.py
report_paper.pdf		report_paper.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Conditional Variational Autoencoder Data Augmentation for Robustness to Instance-Dependent Noise

Research Paper

Abstract

Introduction

Methodology

Conditional Variational Autoencoder (CVAE)

Dataset and Evaluation

Results

Conclusion

About

Releases

Packages

Languages

eugenekim3107/conditional-vae-for-instance-dependent-noise

Folders and files

Latest commit

History

Repository files navigation

Conditional Variational Autoencoder Data Augmentation for Robustness to Instance-Dependent Noise

Research Paper

Abstract

Introduction

Methodology

Conditional Variational Autoencoder (CVAE)

Dataset and Evaluation

Results

Conclusion

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages