YOLOX Win10 Project

Last update: Jun 08, 2022

Related tags

Deep Learning YOLOX_Win10

Overview

Introduction

这是一个用于Windows训练YOLOX的项目，相比于官方项目，做了一些适配和修改：

1、解决了Windows下import yolox失败，No such file or directory: 'xxx.xml'等路径问题

2、CUDA out of memory等显存不够问题

3、增加eval.txt，可以输出IoU=0.5-0.95的AP值，以及Map50和Map50:95

Benchmark

Model	size	mAP^val 0.5:0.95	mAP^{test 0.5:0.95}	Speed V100 (ms)	Params (M)	FLOPs (G)	weights
YOLOX-s	640	40.5	40.5	9.8	9.0	26.8	github
YOLOX-m	640	46.9	47.2	12.3	25.3	73.8	github
YOLOX-l	640	49.7	50.1	14.5	54.2	155.6	github
YOLOX-x	640	51.1	51.5	17.3	99.1	281.9	github
YOLOX-Darknet53	640	47.7	48.0	11.1	63.7	185.3	github

Training on custom data

1、准备数据集

以VOC数据集为例，数据目录如下图所示，datasets/VOCdevkit/VOC2021/(不建议修改年份，如需要修改，则对应修改yolox_voc_s.py中的年份)，该文件夹下有三个文件夹，分别为Annotations、JPEGImages、ImageSets，特别注意ImageSets文件夹下须新建Main文件夹，运行dataset_cls.py（注意切换到datasets路径下，可以修改训练集和测试集比例）会自动生成训练文件trainval.txt和test.txt

2、修改配置文件

修改exps/example/yolox_voc/yolox_voc_s.py文件 self.num_classes和其他配置变量（自选）

class Exp(MyExp):
    def __init__(self):
        super(Exp, self).__init__()
        self.num_classes = 42         #修改成自己的类别
        self.depth = 0.33
        self.width = 0.50
        self.warmup_epochs = 1

此Exp类体继承MyExp类体，且可以对MyExp的变量重写（因此有更高的优先级），对按住ctrl点击MyExp跳转

class Exp(BaseExp):
    def __init__(self):
        super().__init__()

        # ---------------- model config ---------------- #
        self.num_classes = 80  #因为在yolox_voc_s.py中已经重新赋值，此处不用修改
        self.depth = 1.00
        self.width = 1.00
        self.act = 'silu'

        # ---------------- dataloader config ---------------- #
        # set worker to 4 for shorter dataloader init time
        self.data_num_workers = 1
        self.input_size = (640, 640)  # (height, width)
        # Actual multiscale ranges: [640-5*32, 640+5*32].
        # To disable multiscale training, set the
        # self.multiscale_range to 0.
        self.multiscale_range = 5 #五种输入大小随机调整
        # You can uncomment this line to specify a multiscale range
        # self.random_size = (14, 26)
        self.data_dir = None
        self.train_ann = "instances_train2017.json"
        self.val_ann = "instances_val2017.json"

        # --------------- transform config ----------------- #
        self.mosaic_prob = 1.0   #数据增强概率，可以根据需要调整
        self.mixup_prob = 1.0
        self.hsv_prob = 1.0
        self.flip_prob = 0.5
        self.degrees = 10.0
        self.translate = 0.1
        self.mosaic_scale = (0.1, 2)
        self.mixup_scale = (0.5, 1.5)
        self.shear = 2.0
        self.enable_mixup = True

        # --------------  training config --------------------- #
        self.warmup_epochs = 5
        self.max_epoch = 100  #设置训练轮数
        self.warmup_lr = 0
        self.basic_lr_per_img = 0.01 / 64.0
        self.scheduler = "yoloxwarmcos"
        self.no_aug_epochs = 15 #不适用数据增强轮数
        self.min_lr_ratio = 0.05
        self.ema = True

        self.weight_decay = 5e-4
        self.momentum = 0.9
        self.print_interval = 10 #每隔十步打印输出一次训练信息
        self.eval_interval = 1 #每隔1轮保存一次
        self.exp_name = os.path.split(os.path.realpath(__file__))[1].split(".")[0]

        # -----------------  testing config ------------------ #
        self.test_size = (640, 640)
        self.test_conf = 0.01
        self.nmsthre = 0.65

可以对上述类体变量进行调整，其中关键变量有input_size、max_epoch、eval_interval等

3、开始训练

输入以下命令开始训练，-c 表示加载预训练权重

python tools/train.py  -c /path/to/yolox_s.pth

你也可以对其他参数进行调整，例如：

python tools/train.py  -d 1 -b 8 --fp16 -c /path/to/yolox_s.pth

-d 表示用几块显卡，-b 表示设置batch_size，--fp16 表示半精度训练，-c 表示加载预训练权重，如果在显存不足的情况下，谨慎输入 -o 参数，会占用较多显存

如果训练一半终止后，想继续断点训练，可以输入

python tools/train.py --resume

Evaluation

输入以下代码默认对精度最高模型评估，评估后，可以在YOLOX_outputs/yolox_voc_s/eval.txt中看到IoU=0.5-0.95的AP值，文件最后可以看到Map50和Map50:95

python tools/eval.py

如需对设定其他参数，可以输入以下代码，参数意义同训练

python tools/eval.py -n  yolox-s -c yolox_s.pth -b 8 -d 1 --conf 0.001 
                         yolox-m
                         yolox-l
                         yolox-x

Reference

https://github.com/Megvii-BaseDetection/YOLOX

YOLOX Win10 Project

Related tags

Overview

Introduction

Benchmark

Training on custom data

1、准备数据集

2、修改配置文件

3、开始训练

Evaluation

Reference

Owner

A PyTorch-centric hybrid classical-quantum machine learning framework

Code and data accompanying our SVRHM'21 paper.

This repository contains the database and code used in the paper Embedding Arithmetic for Text-driven Image Transformation

Image reconstruction done with untrained neural networks.

🥈78th place in Riiid Answer Correctness Prediction competition

PSTR: End-to-End One-Step Person Search With Transformers (CVPR2022)

This code is for our paper "VTGAN: Semi-supervised Retinal Image Synthesis and Disease Prediction using Vision Transformers"

PyTorch implementation of saliency map-aided GAN for Auto-demosaic+denosing

GPOEO is a micro-intrusive GPU online energy optimization framework for iterative applications

MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.

ZeroGen: Efficient Zero-shot Learning via Dataset Generation

This program generates a random 12 digit/character password (upper and lowercase) and stores it in a file along with your username and app/website.

Deep Illuminator is a data augmentation tool designed for image relighting. It can be used to easily and efficiently generate a wide range of illumination variants of a single image.

text_recognition_toolbox: The reimplementation of a series of classical scene text recognition papers with Pytorch in a uniform way.

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)

Unofficial pytorch implementation of paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing"

Code for the USENIX 2017 paper: kAFL: Hardware-Assisted Feedback Fuzzing for OS Kernels

Face Recognition and Emotion Detector Device

K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce (EMNLP Founding 2021)

Drone detection using YOLOv5