By clicking "Accept", you agree to the storing of cookies on your device to enhance site navigation, analyze site usage, and assist in our marketing efforts. See our Privacy Policy for more information
Open Datasets
Labeled Faces in the Wild (LFW)
Image

Labeled Faces in the Wild (LFW)

Dataset composed of face photos collected on the web, tagged by name of the person. Used for facial recognition under non-binding conditions.

Download dataset
Size

13,000+ JPEG images, original resolution 250x250 px (often resized to 62x47 px), RGB

Licence

GNU Lesser General Public License 3.0 (LGPL-3.0)

Description

The dataset Labeled Faces in the Wild (LFW) includes over 13,000 images of the faces of public figures, with annotations indicating the name of each individual. The images were collected from the Internet and captured under a variety of conditions, making the facial recognition task more realistic and challenging.

What is this dataset for?

  • Training and evaluating facial recognition models
  • Studying biometric identification in uncontrolled conditions
  • Testing facial detection and classification algorithms

Can it be enriched or improved?

The dataset can be completed with more accurate annotations, additional metadata (age, expression, pose), or higher resolution images. The addition of multimodal data (video, audio) could extend uses.

🔎 In summary

Criterion Evaluation
🧩Ease of use ⭐⭐⭐⭐☆ (Ready-to-use images, clear annotations)
🧼Need for cleaning ⭐⭐⭐⭐⭐ (Low – well-organized dataset)
🏷️Richness of annotations ⭐⭐⭐☆☆ (Names of people but few other metadata)
📜Commercial license ✅ Allowed under LGPL-3.0
👨‍💻Ideal for beginners 👍 Very suitable for getting started with facial recognition
🔁Reusable for fine-tuning 🔥 Perfect for CNN fine-tuning and face models
🌍Cultural diversity 🌍 Mainly American and international public figures

🧠 Recommended for

  • Biometric researchers
  • ML developers
  • IT security projects

🔧 Compatible tools

  • OpenCV
  • Dlib
  • TensorFlow
  • PyTorch
  • FaceNet

💡 Tip

Pre-processing images to standardize lighting and pose improves model accuracy.

Frequently Asked Questions

What is the main use of this dataset?

Unconstrained facial recognition, to identify people on various images.

Does this dataset contain sensitive or private data?

No, the images concern public figures and are available on the Internet.

Can I use this dataset for a commercial project?

Yes, subject to compliance with the terms of the LGPL-3.0 license, in particular on redistribution.

Similar datasets

See more
Category

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique.

Category

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique.

Category

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique.