MidJourney Detailed Prompts

More than 3,000 complex prompts structured on several levels, intended for training or evaluating text-to-image models.

Size

3,053 entries in Parquet format, rich text
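
As a minimal sketch of reading the file, assuming it has been downloaded locally and that pandas plus a Parquet engine such as pyarrow are installed. The file name `midjourney_prompts.parquet` and the column name `prompt` are placeholders, not taken from the dataset; inspect the actual schema before relying on them.

```python
import pandas as pd


def extract_prompts(df: pd.DataFrame, column: str = "prompt") -> list[str]:
    """Return the non-empty prompt strings from one column of a DataFrame.

    The default column name "prompt" is a placeholder; check df.columns.
    """
    if column not in df.columns:
        raise KeyError(f"column {column!r} not found; available: {list(df.columns)}")
    # Drop missing values, coerce to str, strip surrounding whitespace.
    series = df[column].dropna().astype(str).str.strip()
    # Keep only rows that still contain text.
    return series[series != ""].tolist()


def load_prompts(path: str, column: str = "prompt") -> list[str]:
    """Read a local Parquet file (hypothetical path) and return its prompts."""
    df = pd.read_parquet(path)  # needs a Parquet engine such as pyarrow
    return extract_prompts(df, column)
```

With a file of that assumed name and column, `load_prompts("midjourney_prompts.parquet")` would return the prompts as a list of strings.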

Licence

Apache 2.0

Description

This dataset contains over 3,000 detailed text prompts written for image generation with models such as MidJourney or DALL·E. The prompts span several levels of complexity, from short descriptions to richly detailed formulations, which makes the dataset useful for studying how linguistic variation affects image generation.

What is this dataset for?

  • Train or fine-tune models that generate images from text
  • Evaluate how sensitive a model is to the level of detail in a prompt
  • Design tools that assist with or automate the writing of visual prompts
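
One way to approach the sensitivity evaluation is to bucket prompts by length and compare a model's outputs across buckets. The sketch below uses word count as a rough proxy for level of detail. The thresholds (15 and 60 words) and the bucket names are arbitrary assumptions, not properties of the dataset.

```python
from collections import defaultdict


def detail_level(prompt: str, short_max: int = 15, medium_max: int = 60) -> str:
    """Classify a prompt by word count (a rough proxy for level of detail)."""
    n_words = len(prompt.split())
    if n_words <= short_max:
        return "short"
    if n_words <= medium_max:
        return "medium"
    return "long"


def group_by_detail(prompts: list[str]) -> dict[str, list[str]]:
    """Group prompts into short / medium / long buckets, keeping input order."""
    buckets: dict[str, list[str]] = defaultdict(list)
    for prompt in prompts:
        buckets[detail_level(prompt)].append(prompt)
    return dict(buckets)
```

Each bucket can then be sent to the model under test, so that differences in output quality can be compared against prompt length.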

Can it be enriched or improved?

Yes, this dataset can be expanded by adding corresponding generated images, or by ranking prompts according to their artistic style, target object, or composition. It can also be translated into other languages or enriched with metadata.

🔎 In summary

Criterion: Evaluation
🧩 Ease of Use: ⭐⭐⭐⭐⭐ (Very accessible, readable Parquet format)
🧼 Cleaning Required: ⭐⭐⭐☆☆ (Low – some text normalization may be needed)
🏷️ Annotation Richness: ⭐⭐⭐☆☆ (Rich text structure but no extra annotations)
📜 Commercial License: ✅ Yes (Apache 2.0)
👨‍💻 Beginner Friendly: 👍 Yes – no advanced technical skills required
🔁 Reusable for Fine-Tuning: 🎯 Useful for adapting a text-to-image model
🌍 Cultural Diversity: Limited – mainly English, Western style
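
The text normalization mentioned under Cleaning Required could look like the following sketch. Which steps are actually needed depends on the data. The choices here (Unicode NFKC normalization, whitespace collapsing, removal of empty and exactly duplicated prompts) are assumptions, not documented requirements of the dataset.

```python
import re
import unicodedata


def normalize_prompt(text: str) -> str:
    """Apply Unicode NFKC normalization and collapse runs of whitespace."""
    text = unicodedata.normalize("NFKC", text)
    return re.sub(r"\s+", " ", text).strip()


def normalize_all(prompts: list[str]) -> list[str]:
    """Normalize prompts, dropping empties and exact duplicates (order kept)."""
    seen: set[str] = set()
    cleaned: list[str] = []
    for raw in prompts:
        prompt = normalize_prompt(raw)
        if prompt and prompt not in seen:
            seen.add(prompt)
            cleaned.append(prompt)
    return cleaned
```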

🧠 Recommended for

  • Digital artists
  • AI researchers
  • Visual creation tool developers

🔧 Compatible tools

  • Stable Diffusion
  • MidJourney
  • DALL·E
  • ComfyUI

💡 Tip

Combine these prompts with metadata or style classifiers to generate more targeted images.
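
As a rough illustration of this tip, the sketch below tags each prompt with style labels and then filters by label. A simple keyword lookup stands in for a real style classifier, and the style names and keywords are hypothetical examples rather than metadata shipped with the dataset.

```python
# Hypothetical style vocabulary; replace with a trained classifier or real metadata.
STYLE_KEYWORDS: dict[str, tuple[str, ...]] = {
    "photorealistic": ("photorealistic", "35mm", "dslr"),
    "painting": ("oil painting", "watercolor", "brushstrokes"),
    "anime": ("anime", "manga"),
}


def tag_styles(prompt: str) -> list[str]:
    """Return the sorted style labels whose keywords occur in the prompt.

    Matching is a case-insensitive substring test, so it is only approximate.
    """
    lowered = prompt.lower()
    return sorted(
        style
        for style, keywords in STYLE_KEYWORDS.items()
        if any(keyword in lowered for keyword in keywords)
    )


def filter_by_style(prompts: list[str], style: str) -> list[str]:
    """Keep only the prompts tagged with the requested style label."""
    return [prompt for prompt in prompts if style in tag_styles(prompt)]
```

The filtered subset can then be passed to the image model to produce a batch of images in one target style.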

Frequently Asked Questions

Does this dataset contain images generated in addition to prompts?

Yes. Each row pairs one prompt with one generated image.

Can it be used to train a custom text-to-image model?

Yes, it's a great base for fine-tuning or evaluating a text-to-image model with a wide range of prompt styles.

Is this dataset suitable for use in a school or academic environment?

Yes, as long as the use respects the Apache 2.0 license, it can be used for educational projects on image generation or NLP.
