Computer Vision Paper Reviews with Key Summaries, End to End Code Practice, and Papers Converted to Jupyter Notebooks

Overview

Computer-Vision-Paper-Reviews

Computer Vision Paper Reviews with Key Summaries, alongside Papers & Code.

Jonathan Choi 2021

The repository provides 100+ papers from across computer vision, converted to Jupyter Notebooks, each with a Key Summary and End to End Code Practice.


Contents

The goal of the repository is to provide end to end study scripts for the most-read and most important papers.

The intended readers are not only researchers but also students and engineers, from beginners to professionals in computer vision.

To support a thorough understanding, each paper comes with the following three main components.

Key Summary

Key summaries and terminology of the paper, so that even beginners can study it as thoroughly and easily as possible.

Code Practice

An end to end study script of the code for the paper, so that even beginners can study it as easily and thoroughly as possible.

Jupyter Notebook edited Original Paper

The Original Paper converted into a Jupyter Notebook, for quick and easy modification and understanding.


Category/Paper/

Paper_Review_Practice.ipynb includes

Key Summary according to the flow of Original Paper (Jupyter Notebook Edited) with the End to End Code Practice

Paper.ipynb includes

Original Paper (Jupyter Notebook Edited)

Review.ipynb includes

Key Summary

Practice.ipynb includes

End to End Code Practice
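
The layout above can be sketched as a small helper that builds the four expected notebook paths for one paper. This is only a minimal sketch: the function name `notebook_paths` is hypothetical, and the example category and paper directory names (`Transformer`, `DETR`) are assumptions that may not match the actual directory names in the repository.

```python
from pathlib import PurePosixPath
from typing import List

# The four notebook file names listed in this README.
NOTEBOOKS = (
    "Paper_Review_Practice.ipynb",  # key summary + paper + code practice
    "Paper.ipynb",                  # original paper (Jupyter Notebook edited)
    "Review.ipynb",                 # key summary
    "Practice.ipynb",               # end to end code practice
)


def notebook_paths(category: str, paper: str) -> List[str]:
    """Return the expected notebook paths under Category/Paper/ (hypothetical helper)."""
    base = PurePosixPath(category) / paper
    return [str(base / name) for name in NOTEBOOKS]


if __name__ == "__main__":
    # "Transformer" and "DETR" are assumed directory names, used for illustration only.
    for path in notebook_paths("Transformer", "DETR"):
        print(path)
```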


Index


Working Papers

If you want to see the Road Map and the process, please visit here.


Implicit Neural Representation

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Instance Normalization: The Missing Ingredient for Fast Stylization

Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization

Semantic Image Synthesis with Spatially-Adaptive Normalization

Universal Style Transfer via Feature Transforms

A Neural Algorithm of Artistic Style

Convolutional neural network architecture for geometric matching

Perceptual Losses for Real-Time Style Transfer and Super-Resolution

Geometric Style Transfer


Image to Image Translation

Image-to-Image Translation with Conditional Adversarial Networks (CVPR 2017)

Bi-level Feature Alignment for Versatile Image Translation and Manipulation


Transformer

[DETR] End-to-End Object Detection with Transformers

[Vision Transformer] An Image Is Worth 16x16 Words: Transformers For Image Recognition at Scale

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

[Transformer] Attention Is All You Need

Vision Transformers for Dense Prediction


Object Detection

Feature Pyramid Networks for Object Detection

Selective Search for Object Recognition

R-CNN

Fast R-CNN

Faster R-CNN

Sparse R-CNN

YOLOv4: Optimal Speed and Accuracy of Object Detection


Segmentation

Panoptic Feature Pyramid Networks

Mask R-CNN

PointRend: Image Segmentation as Rendering

Cost Aggregation Is All You Need for Few-Shot Segmentation


Convolutional Neural Network

Deep Residual Learning for Image Recognition

EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks


Representation Learning

Unsupervised Feature Learning via Non-Parametric Instance Discrimination

Momentum Contrast for Unsupervised Visual Representation Learning

A Simple Framework for Contrastive Learning of Visual Representations

Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

Exploring Simple Siamese Representation Learning


Image Generation

Generative Adversarial Networks

A Style-Based Generator Architecture for Generative Adversarial Networks

GAN Dissection: Visualizing and Understanding Generative Adversarial Networks

Semantic Image Synthesis with Spatially-Adaptive Normalization

Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks


Vision and Language


Depth Estimation


Correspondence


Implicit Field

Owner
Jonathan Choi
Researching Computer Vision @ Korea University. To The Infinity, And Beyond!