DOTA (Dataset for Object Detection in Aerial Images)

DOTA (Dataset for Object Detection in Aerial Images) is a reference in the field of satellite imagery analysis. It makes it possible to automatically detect and identify complex objects visible from the sky, such as buildings, vehicles or infrastructures. This dataset is designed to meet the particular challenges of aerial imagery: variation in scale, orientation, density and shape of objects. It is widely used to train and evaluate object detection models in geospatial contexts.

Download dataset

Size

Several thousand satellite images in TIFF format, with approximately 280,000 annotated objects

Licence

Usable for academic research purposes, subject to specific DOTA license conditions

Description

‍
The DOTA dataset includes:

Thousands of high-resolution satellite images in TIFF format
Approximately 280,000 manually annotated objects
15 categories of common objects in urban or port environments (building, plane, ship, vehicle, bridge, etc.)
Annotations in quadrilateral format (not rectangle), better suited to oriented objects

‍

The annotations are accurate and geometrically adapted to the diversity of orientation and scale of objects, making them a realistic benchmark for Computer Vision algorithms applied to remote sensing.

‍

What is this dataset for?

‍
DOTA is used for:

Training object detection models in aerial or satellite images
Recognition of urban structures, infrastructures and vehicles
Environmental monitoring, disaster management, or automated mapping
Evaluating the performance of models on complex scenes with a high density of objects

‍

Can it be enriched or improved?

‍
Yes, DOTA can be enriched in a number of ways:

By adding annotations for specific classes according to the field of use (wind turbines, solar panels, military equipment, etc.)
By combining with multispectral sources to integrate thermal or infrared analysis
By georeferencing annotations for finer integration into GIS (Geographic Information Systems)
By increasing geographic diversity (beyond China and a few test regions)

‍

🔗 Source: DOTA Dataset

‍

Frequently Asked Questions

Why are objects annotated with angled polygons and not rectangles?

In satellite imagery, objects can be seen from a variety of angles and orientations. Quadrilateral annotations better match the real shape and perspective of objects, offering greater precision for models.

Can DOTA be used for tasks other than detection?

Yes, although primarily focused on detection, DOTA can be used for oriented segmentation, object classification, or spatial analysis tasks when combined with other geographic data.

Are there derivative or enhanced versions of DOTA?

Yes, several versions have been released, including DOTA-v1.5 and DOTA-v2.0, with more images, categories, and more accurate annotations. Some versions also incorporate more varied contexts such as night scenes or extreme shooting angles.

Similar datasets

Text

MultiNli (Multi-Genre Natural Language Inference Corpus)

Text

Google Trends Dataset

Text

sQuad (Stanford Question Answering Dataset)