LAION Art EN Improved Captions

LAION Art EN Improved Captions is a dataset of artistic images combined with improved English descriptions via a state-of-the-art model, designed to improve the semantic image-text relationship in image generation tasks.

Download dataset

Size

2.68 million image-caption pairs, 442 MB, Parquet format

Licence

CC-BY 4.0

Description

‍

LAION Art EN Improved Captions contains over 2.6 million image-caption pairs in English, with descriptions generated and refined by an advanced model (Salesforce/blip2-flan-t5-XXL). This dataset makes it easy to fine-tune text-based image-generating models and create powerful prompt databases.

‍

What is this dataset for?

‍

Fine-tuning text-to-image generators (ex: Stable Diffusion)
Creation of searchable prompt databases for image generation
Improving the semantic quality between images and descriptions

‍

Can it be enriched or improved?

‍

The dataset can be enriched by adding captions in other languages, or by manually correcting descriptions for specific cases. Advanced indexing (e.g. Faiss) allows a better search in the prompt database.

‍

🔎 In summary

Criterion	Evaluation
🧩 Ease of use	⭐⭐⭐⭐✩ (Structured dataset, accessible via Hugging Face)
🧼 Need for cleaning	⭐⭐⭐⭐⭐ (Low – captions generated with good quality)
🏷️ Annotation richness	⭐⭐⭐⭐✩ (Good – improved and contextual captions)
📜 Commercial license	✅ Yes (CC-BY 4.0)
👨‍💻 Beginner friendly	⚠️ Moderate – requires knowledge in vision and NLP
🔁 Fine-tuning ready	✅ Perfect for text-to-image and prompt bases
🌍 Cultural diversity	🎨 Wide artistic diversity in English