Reddit Memes Dataset
Dataset composed of over 3,300 Reddit meme images, including image URLs, the number of upvotes and downvotes, and other metadata. Collected for computer vision projects and popularity analysis.
3,327 image files (image URLs + associated JSON metadata)
CC0: Public Domain
Description
The dataset Reddit Memes Dataset contains 3,327 meme images from Reddit, along with metadata such as post ID, number of upvotes and downvotes, and other relevant information. This corpus is a good starting point for computer vision projects related to the analysis of humorous and viral content.
What is this dataset for?
- Training computer vision models for the classification of humorous images
- Analyzing the popularity and engagement score of social media memes
- Develop systems for recommending or moderating visual content
Can it be enriched or improved?
Yes, you can add manual annotations to the content of the memes, such as humorous categories, the type of meme, or the cultural context. It is also possible to integrate textual data extracted from images via OCR for multimodal analyses.
🔎 In summary
🧠 Recommended for
- Computer vision researchers
- Social application developers
- Data scientists
🔧 Compatible tools
- PyTorch
- TensorFlow
- OpenCV
- FastAI
💡 Tip
Use OCR tools to exploit the texts in the images.
Frequently Asked Questions
Does this dataset contain images directly or only their URLs?
The dataset provides image URLs, you have to download them separately.
Can this dataset be used to train humorous image recognition models?
Yes, it is suitable for the classification and analysis of computer vision memes.
Does the dataset include manual annotations on the content of the memes?
No, annotations are limited to engagement metadata, but adding annotations is possible and recommended.




