Abstract
Deep learning-based food image classification enables precise identification of food categories, which in turn facilitates accurate nutritional analysis. However, real-world food images often follow a skewed distribution, with some food types far more prevalent than others. This class imbalance causes models to favor the majority (head) classes and degrades performance on the less common (tail) classes. Recently, synthetic data augmentation using diffusion-based generative models has emerged as a promising solution to this issue. By generating high-quality synthetic images, these models can help balance the data distribution, potentially improving classification performance. However, existing approaches face challenges: fine-tuning-based methods need a uniformly distributed dataset, while pre-trained model-based approaches often overlook inter-class separation in synthetic data. In this paper, we propose a two-stage synthetic data augmentation framework that leverages pre-trained diffusion models for long-tailed food classification. We first generate a reference set conditioned on a positive prompt for the generation target and then select a class that shares similar features with the generation target as a negative prompt. Subsequently, we generate a synthetic augmentation set conditioned on both the positive and negative prompts, using a combined sampling strategy that promotes intra-class diversity and inter-class separation. We demonstrate the efficacy of the proposed method on two long-tailed food benchmark datasets, achieving higher top-1 accuracy than previous works.
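The abstract does not say how the class with similar features is chosen, so the following is only an illustrative sketch of one plausible reading, not the authors' procedure. It assumes each image is already represented by a feature vector and picks, as the negative prompt, the other class whose centroid has the highest cosine similarity to the centroid of the reference set. The function names, the toy class names, and the three-dimensional features are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def select_negative_class(target, reference_features, class_centroids):
    """Return the non-target class most similar to the reference set.

    Assumption: similarity is measured as cosine similarity between the
    reference-set centroid and each class centroid. The source does not
    specify the actual selection criterion.
    """
    ref_centroid = mean_vector(reference_features)
    scores = {
        name: cosine(ref_centroid, centroid)
        for name, centroid in class_centroids.items()
        if name != target  # a class cannot be its own negative prompt
    }
    return max(scores, key=scores.get)


# Toy data (hypothetical): features of images generated for the target class.
reference = [[0.9, 0.1, 0.0], [0.8, 0.2, 0.1]]
centroids = {
    "ramen": [0.85, 0.15, 0.05],
    "pho": [0.8, 0.3, 0.1],
    "cheesecake": [0.0, 0.1, 0.9],
}
negative = select_negative_class("ramen", reference, centroids)
print(negative)
```

In this toy setup the visually close class is selected over the unrelated one; the chosen class name would then be supplied as the negative prompt when generating the augmentation set.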
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2025 |
| Publisher | IEEE Computer Society |
| Pages | 391-400 |
| Number of pages | 10 |
| ISBN (Electronic) | 9798331599942 |
| Publication status | Published - 2025 |
| Event | 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2025 - Nashville, United States; Duration: 2025 Jun 11 → 2025 Jun 12 |
Publication series
| Name | IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops |
|---|---|
| ISSN (Print) | 2160-7508 |
| ISSN (Electronic) | 2160-7516 |
Conference
| Conference | 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2025 |
|---|---|
| Country/Territory | United States |
| City | Nashville |
| Period | 25/6/11 → 25/6/12 |
Bibliographical note
Publisher Copyright: © 2025 IEEE.
Keywords
- diffusion-based data augmentation
- long-tailed food classification
ASJC Scopus subject areas
- Computer Vision and Pattern Recognition
- Electrical and Electronic Engineering
Fingerprint
Dive into the research topics of 'Synthetic Data Augmentation using Pre-trained Diffusion Models for Long-tailed Food Image Classification'. Together they form a unique fingerprint.