A synthetic texture-invariant dataset for object detection of UAVs

Last update: Aug 13, 2022

Overview

A synthetic dataset for object detection of UAVs

This repository contains the synthetic datasets accompanying the paper Sim2Air - Synthetic aerial dataset for UAV monitoring by Antonella Barisic, Frano Petric and Stjepan Bogdan.

In this paper, we propose to use a texture-invariant representation of objects for aerial object detection. Our approach improves the generalisation and robustness of the object detector. The dataset is created with randomly assigned atypical textures, while all other components such as shape, pose, lighting, scale and background retain sufficient diversity and photorealism. The results also show improved accuracy for distant objects and in difficult lighting conditions.

All datasets from the paper are available for download. If you use these datasets for your research, please cite:

@misc{barisic2021sim2air,
      title={Sim2Air - Synthetic aerial dataset for UAV monitoring}, 
      author={Antonella Barisic and Frano Petric and Stjepan Bogdan},
      year={2021},
      eprint={2110.05145},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Datasets

Name	Description
Synthetic Eagle Baseline (SEB)	The SEB dataset is a synthetic dataset with a single UAV model, the custom aerial platform Eagle. Since this dataset serves as the baseline for testing our hypothesis, it was created with only one texture, identical to the texture of the real-life Eagle. SEB consists of 32 000 images of size 604 x 604 with annotations in YOLO format.
Synthetic Eagle with Textures (SET)	The SET dataset is the main star ⭐ of our work. It is a synthetic dataset of a single model, the custom aerial platform Eagle, with randomly selected atypical textures. The mixture of 32 different textures is applied during the procedural generation of the dataset. SET also consists of 32 000 images of size 604 x 604 with annotations in YOLO format.
Synthetic UAVs with Textures (S-UAV-T)	The S-UAV-T dataset is similar to SET but with many more models of UAVs. The data was created with 10 different multicopter models, 32 atypical textures, and with a variety of poses, backgrounds, viewpoints, etc. S-UAV-T consists of 52 500 images of size 604 x 604 with annotations in YOLO format.
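
All three datasets provide annotations in YOLO format for 604 x 604 images. The sketch below shows one way to turn such a label line into a pixel-space bounding box. It assumes the widely used YOLO/Darknet layout, `class x_center y_center width height` with coordinates normalised to [0, 1]; check a label file from the download to confirm this before relying on it. The function names `parse_yolo_line` and `parse_yolo_labels` are hypothetical helpers and not part of this repository.

```python
from typing import List, Tuple

# Image width and height in pixels, as stated for SEB, SET and S-UAV-T.
IMAGE_SIZE = 604

# (class_id, x_min, y_min, x_max, y_max) in pixels
Box = Tuple[int, float, float, float, float]


def parse_yolo_line(line: str, img_w: int = IMAGE_SIZE, img_h: int = IMAGE_SIZE) -> Box:
    """Convert one label line to a pixel-space box.

    Assumes the common layout: class x_center y_center width height,
    with the four coordinates normalised by the image size.
    """
    parts = line.split()
    if len(parts) != 5:
        raise ValueError(f"expected 5 fields, got {len(parts)}: {line!r}")
    class_id = int(parts[0])
    xc, yc, w, h = (float(v) for v in parts[1:])
    # Move from centre/size to corner coordinates, then scale to pixels.
    x_min = (xc - w / 2) * img_w
    y_min = (yc - h / 2) * img_h
    x_max = (xc + w / 2) * img_w
    y_max = (yc + h / 2) * img_h
    return class_id, x_min, y_min, x_max, y_max


def parse_yolo_labels(text: str) -> List[Box]:
    """Parse the full contents of one label file, skipping blank lines."""
    return [parse_yolo_line(line) for line in text.splitlines() if line.strip()]


if __name__ == "__main__":
    # Made-up label line for illustration only, not taken from the datasets.
    print(parse_yolo_line("0 0.5 0.5 0.25 0.5"))
```

Corner coordinates in pixels are convenient for drawing boxes or computing overlap with detections. The normalised form is usually what YOLO-style training pipelines expect, so the label files can typically be used as-is for training.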

If you want to test your detection results against real data, check out our UAV-Eagle dataset at larics/UAV-Eagle.

Contact

For more information, please contact Antonella Barisic.

Owner

LARICS Lab
