By clicking "Accept", you agree to the storing of cookies on your device to enhance site navigation, analyze site usage, and assist in our marketing efforts. See our Privacy Policy for more information
Open Datasets
UrbanSound8k
Audio

UrbanSound8k

The UrbanSound8K Dataset is a collection of sound recordings captured in various urban environments. This corpus is particularly useful for training models capable of identifying and classifying the sounds typical of cities, such as sirens, horns, construction noises or human voices.

Download dataset
Size

8732 audio clips, WAV format

Licence

Free under a Creative Commons Attribution 4.0 International license (CC BY 4.0)

Description


The dataset offers a complete variety of urban sounds:

  • 8732 individually annotated audio clips
  • Standardized duration of 4 seconds per extract
  • 10 distinct categories: horns, sirens, jobs, children playing, barking, engines, etc.
  • Recordings captured in real and varied contexts, with realistic sound levels

It is frequently used in research on soundscapes and acoustic identification systems in urban environments.

What is this dataset for?


UrbanSound8k is mainly used for:

  • Training audio recognition and classification models in urban environments
  • The development of solutions for noise pollution monitoring
  • The improvement of embedded systems capable of identifying ambient sounds (smart cities, public safety)
  • Research in environmental acoustics and urban sound ecology

Can it be enriched or improved?


Yes, several possibilities exist:

  • Add urban sounds from other geographic or climatic regions
  • Enrich annotations with additional contextual metadata (time, weather, traffic)
  • Mix or layer sounds to simulate complex acoustic environments
  • Test the robustness of models with degraded sounds (noise, distortions, etc.)

🔗 Source: UrbanSound8k Dataset

Frequently Asked Questions

Can UrbanSound8k be used for commercial applications?

Yes, subject to compliance with the terms of the CC BY 4.0 license, including appropriate attribution to authors.

Is there a larger version of the dataset?

Yes, UrbanSound also offers UrbanSound8K and UrbanSoundClassification (an expanded version with other excerpts and annotations).

Are the recordings representative of all major cities?

The sounds are mostly from New York, but are fairly typical of global urban environments. For increased geographic diversity, a combination with other corpora is recommended.

Similar datasets

See more
Category

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique.

Category

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique.

Category

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique.