This is the code repository for the paper A hierarchical semantic segmentation framework for computer-vision-based bridge column damage detection

Last update: Dec 07, 2022

Overview

Bridge-damage-segmentation

This is the code repository for the paper A hierarchical semantic segmentation framework for computer-vision-based bridge column damage detection submitted to the IC-SHM Challenge 2021. The semantic segmentation framework used in this paper leverages importance sampling, semantic mask, and multi-scale test time augmentation to achieve a 0.836 IoU for scene component segmentation and a 0.467 IoU for concrete damage segmentation on the Tokaido Dataset. The framework was implemented on MMSegmentation using Python.

Highlights

Models used in the framework

Backbones

HRNet
Swin
ResNest

Decoder Heads

PSPNet
UperNet
OCRNet

Performance

The following table reports IoUs for structural component segmentation.

Architecture	Slab	Beam	Column	Non-structural	Rail	Sleeper	Average
Ensemble	0.891	0.880	0.859	0.660	0.623	0.701	0.785
Ensemble + Importance sampling	0.915	0.912	0.958	0.669	0.618	0.892	0.827
Ensemble + Importance sampling + Multi-scale TTA	0.924	0.929	0.965	0.681	0.621	0.894	0.836

The following table reports IoUs for damage segmentation of pure texture images.

Architecture	Concrete damage	Exposed rebar	Average
Ensemble	0.356	0.536	0.446
Ensemble + Importance sampling	0.708	0.714	0.711
Ensemble + Importance sampling + Multi-scale TTA	0.698	0.727	0.712

The following table reports IoUs for damage segmentation of real scene images.

Architecture	Concrete damage	Exposed rebar	Average
Ensemble	0.235	0.365	0.300
Ensemble + Importance sampling	0.340	0.557	0.448
Ensemble + Importance sampling + Multi-scale TTA	0.350	0.583	0.467
Ensemble + Importance sampling + Multi-scale TTA + Mask	0.379	0.587	0.483

Environment

The code is developed under the following configurations.

Hardware: >= 2 GPUs for training, >= 1 GPU for testing. The script supports sbatch training and testing on computer clusters.
Software:
- System: Ubuntu 16.04.3 LTS
- CUDA >= 10.1
Dependencies:
- Conda: This is optional, but we suggest using conda to configure the environment.
- Pytorch >= 1.6.0
- MMCV
- MMSeg
- OpenCV >= 4.5.0
- tqdm

Usage

Environment

Install conda and create a conda environment

$ conda create -n open-mmlab
$ source activate open-mmlab
$ conda install pytorch=1.6.0 torchvision cudatoolkit=10.1 -c pytorch

Install mmcv-full

$ pip install mmcv-full -f https://download.openmmlab.com/mmcv/dist/cu101/torch1.6.0/index.html

Install mmsegmentation

$ pip install git+https://github.com/open-mmlab/mmsegmentation.git

Install other dependencies

$ pip install opencv, tqdm, numpy, scipy

Download the Tokaido dataset from IC-SHM Challenge 2021.

Training

Example single model training using multiple GPUs

$ python3 -m torch.distributed.launch --nproc_per_node=2 --nnodes=2 --master_port=$RANDOM ./apis/train_damage_real.py \
  --nw hrnet \
  --cp $CHECKPOINT_DIR \
  --dr $DATA_ROOT \
  --conf $MODEL_CONFIG \
  --bs 16 \
  --train_split $TRAIN_SPLIT_PATH \
  --val_split $VAL_SPLIT_PATH \
  --width 1920 \
  --height 1080 \
  --distributed \
  --iter 100000 \
  --log_iter 10000 \
  --eval_iter 10000 \
  --checkpoint_iter 10000 \
  --multi_loss \
  --ohem \
  --job_name dmg

Example shell script for preparing the whole dataset and train all models for the whole pipeline.
```
$ ./scripts/main_training_script.sh
```

Evlauation

Eval one model

$ python3 ./test/test.py \
  --nw hrnet \
  --task single \
  --cp $CONFIG_PATH \
  --dr $DATA_ROOT \
  --split_csv $RAW_CSV_PATH \
  --save_path $OUTPOUT_DIR \
  --img_dir $INPUT_IMG_DIR \
  --ann_dir $INPUT_GT_DIR \
  --split $TEST_SPLIT_PATH \
  --type cmp \
  --width 640 \
  --height 360

Example shell script for testing the whole pipeline and generate the output using the IC-SHM Challenge format.
```
$ ./scripts/main_testing_script.sh
```

Visualization (Add the --cmp flag when visualizing components.)

$ ./modules/viz_label.py \
  --input $SEG_DIR
  --output $OUTPUT_DIR
  --raw_input $IMG_DIR
  --cmp

Reference

If you find the code useful, please cite the following paper.

This is the code repository for the paper A hierarchical semantic segmentation framework for computer-vision-based bridge column damage detection

Related tags

Overview

Bridge-damage-segmentation

Highlights

Models used in the framework

Backbones

Decoder Heads

Performance

Environment

Usage

Environment

Training

Evlauation

Reference

Owner

Jingxiao Liu

keyframes-CNN-RNN(action recognition)

[Arxiv preprint] Causality-inspired Single-source Domain Generalization for Medical Image Segmentation (code&data-processing pipeline)

공공장소에서 눈만 돌리면 CCTV가 보인다는 말이 과언이 아닐 정도로 CCTV가 우리 생활에 깊숙이 자리 잡았습니다.

PyTorch Implementations for DeeplabV3 and PSPNet

A vision library for performing sliced inference on large images/small objects

Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.

Python implementation of "Single Image Haze Removal Using Dark Channel Prior"

An Object Oriented Programming (OOP) interface for Ontology Web language (OWL) ontologies.

Implementation for "Manga Filling Style Conversion with Screentone Variational Autoencoder" (SIGGRAPH ASIA 2020 issue)

Music Classification: Beyond Supervised Learning, Towards Real-world Applications

Pixray is an image generation system

Data, notebooks, and articles associated with the RSNA AI Deep Learning Lab at RSNA 2021

Repo for our ICML21 paper Unsupervised Learning of Visual 3D Keypoints for Control

Transfer SemanticKITTI labeles into other dataset/sensor formats.

Pytorch implementation of SimSiam Architecture

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Code for DeepXML: A Deep Extreme Multi-Label Learning Framework Applied to Short Text Documents

Spatial Sparse Convolution Library

PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Reproducibility and Smooth Activations" [arXiv 2022].

BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis