The first dataset of composite images with rationality score indicating whether the object placement in a composite image is reasonable.

Last update: Nov 15, 2022

Overview

Object-Placement-Assessment-Dataset-OPA

Object-Placement-Assessment (OPA) is to verify whether a composite image is plausible in terms of the object placement. The foreground object should be placed at a reasonable location on the background considering location, size, occlusion, semantics, and etc.

Our dataset OPA is a synthesized dataset for Object Placement Assessment based on COCO dataset. We select unoccluded objects from multiple categories as our candidate foreground objects. The foreground objects are pasted on their compatible background images with random sizes and locations to form composite images, which are sent to human annotators for rationality labeling. Finally, we split the collected dataset into training set and test set, in which the background images and foreground objects have no overlap between training set and test set. We show some example positive and negative images in our dataset in the figure below.

Illustration of OPA dataset samples: Some positive and negative samples in our OPA dataset and the inserted foreground objects are marked with red outlines. Top row: positive samples; Bottom rows: negative samples, including objects with inappropriate size (e.g., f, g, h), without supporting force (e.g., i, j, k), appearing in the semantically unreasonable place (e.g., l, m, n), with unreasonable occlusion (e.g., o, p, q), and with inconsistent perspectives (e.g., r, s, t).

Our OPA dataset contains 62,074 training images and 11,396 test images, in which the foregrounds/backgrounds in training set and test set have no overlap. The training (resp., test) set contains 21,351 (resp.,3,566) positive samples and 40,724 (resp., 7,830) negative samples. Besides, the training (resp., test) set contains 2,701 (resp., 1,436) unrepeated foreground objects and1,236 (resp., 153) unrepeated background images. The OPA dataset is provided in Baidu Cloud (access code: qb1r) or Google Drive.

Prerequisites

Python
Pytorch
PIL

Getting Started

Installation

Clone this repo:

git clone https://github.com/bcmi/Object-Placement-Assessment-Dataset-OPA.git
cd Object-Placement-Assessment-Dataset-OPA

Download the OPA dataset. We show the file structure below:
```
├── background: 
     ├── category: 
              ├── imgID.jpg
              ├── ……
     ├── ……
├── foreground: 
     ├── category: 
              ├── imgID.jpg
              ├── mask_imgID.jpg
              ├── ……
     ├── ……
├── composite: 
     ├── train_set: 
              ├── fgimgID_bgimgID_x_y_w_h_scale_label.jpg
              ├── mask_fgimgID_bgimgID_x_y_w_h_scale_label.jpg
              ├── ……
     └── test_set: 
├── train_set.csv
└── test_set.csv
```
All backgrounds and foregrounds have their own IDs for identification. Each category of foregrounds and their compatible backgrounds are placed in one folder. The corresponding masks are placed in the same folder with a mask prefix.

Four values are used to identify the location of a foreground in the background, including x y indicating the upper left corner of the foreground and w h indicating width and height. Scale is the maximum of fg_w/bg_w and fg_h/bg_h. The label (0 or 1) means whether the composite is reasonable in terms of the object placement.

The training set and the test set each has a CSV file to record their information.
We also provide a script in /data_processing/ to generate composite images:
```
python generate_composite.py
```
After running the script, input the foreground ID, background ID, position, label, and storage path to generate your composite image.

Bibtex

If you find this work useful for your research, please cite our paper using the following BibTeX [arxiv]:

@article{liu2021OPA,
  title={OPA: Object Placement Assessment Dataset},
  author={Liu,Liu and Zhang,Bo and Li,Jiangtong and Niu,Li and Liu,Qingyang and Zhang,Liqing},
  journal={arXiv preprint arXiv:2107.01889},
  year={2021}
}

The first dataset of composite images with rationality score indicating whether the object placement in a composite image is reasonable.

Related tags

Overview

Object-Placement-Assessment-Dataset-OPA

Prerequisites

Getting Started

Installation

Bibtex

Owner

BCMI

Supplementary code for SIGGRAPH 2021 paper: Discovering Diverse Athletic Jumping Strategies

Code release for "BoxeR: Box-Attention for 2D and 3D Transformers"

Learning Continuous Image Representation with Local Implicit Image Function

A Real-ESRGAN equipped Colab notebook for CLIP Guided Diffusion

Pca-on-genotypes - Mini bioinformatics project - PCA on genotypes

[CVPR 2021] Monocular depth estimation using wavelets for efficiency

git《Investigating Loss Functions for Extreme Super-Resolution》(CVPR 2020) GitHub:

Rethinking Nearest Neighbors for Visual Classification

The goal of the exercises below is to evaluate the candidate knowledge and problem solving expertise regarding the main development focuses for the iFood ML Platform team: MLOps and Feature Store development.

CoSMA: Convolutional Semi-Regular Mesh Autoencoder. From Paper "Mesh Convolutional Autoencoder for Semi-Regular Meshes of Different Sizes"

Mixup for Supervision, Semi- and Self-Supervision Learning Toolbox and Benchmark

OpenDILab RL Kubernetes Custom Resource and Operator Lib

Rotated Box Is Back : Accurate Box Proposal Network for Scene Text Detection

DROPO: Sim-to-Real Transfer with Offline Domain Randomization

Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network)

Wanli Li and Tieyun Qian: Exploit a Multi-head Reference Graph for Semi-supervised Relation Extraction, IJCNN 2021

PFFDTD is an open-source FDTD simulator for 3D room acoustics

Source code for "UniRE: A Unified Label Space for Entity Relation Extraction.", ACL2021.

Code of our paper "Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning"

Official PyTorch Implementation for InfoSwap: Information Bottleneck Disentanglement for Identity Swapping