首页|Semantic Augmentation in Images using Language

Semantic Augmentation in Images using Language

来源：

英文摘要

Deep Learning models are incredibly data-hungry and require very large labeled datasets for supervised learning. As a consequence, these models often suffer from overfitting, limiting their ability to generalize to real-world examples. Recent advancements in diffusion models have enabled the generation of photorealistic images based on textual inputs. Leveraging the substantial datasets used to train these diffusion models, we propose a technique to utilize generated images to augment existing datasets. This paper explores various strategies for effective data augmentation to improve the out-of-domain generalization capabilities of deep learning models.

作者：Sahiti Yerramilli、Jayant Sravan Tamarapalli、Tanmay Girish Kulkarni、Jonathan Francis、Eric Nyberg

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Sahiti Yerramilli,Jayant Sravan Tamarapalli,Tanmay Girish Kulkarni,Jonathan Francis,Eric Nyberg.Semantic Augmentation in Images using Language[EB/OL].(2025-07-09)[2025-07-25].https://arxiv.org/abs/2404.02353.点此复制

Semantic Augmentation in Images using Language

Semantic Augmentation in Images using Language

评论