A CNN implementation using only numpy. Supports multidimensional images, stride, etc.

Last update: Nov 30, 2021

Related tags

Overview

CNN from scratch

The most interesting part is in the folder neural_networks/layers.py: Code for a convolutional neural network, based on only numpy (no PyTorch or TensorFlow). It is therefore very foundational and illustrates how CNNs work mathematically.

The CNNs is compatible with colour images (3-channel rgb), includes pooling layers (class Pool2D) and works with any given (valid) stride.

neural_networks/activations.py contains basic activation functions, like ReLu or SoftMax with the appropriate forward / backward implementations calculating the jacobian, etc., needed for backpropagation.

Many functions make heavy use of slicing, to speed up the training process significantly. See e.g. Conv2D.forward:

for x in range(out_rows):
    for y in range(out_cols):
        out[:,x,y,:] = np.apply_over_axes(np.sum, W[None]*X_pad[:,x*s:x*s+kernel_height,y*s:y*s+kernel_width,:][...,None], [1,2,3])[:,0,0,0,:]

which is the sliced version of a depth-6 nested for loop -- and thus allows for significant speedup (on my computer, more than 20x speedup for the given training data).

In losses.py, CrossEntropy is the most important function. To allow for speed-up, we simplified mathematically as much as possible, yielding

loss = -1.0/m *np.trace(np.matmul(Y,np.log(Y_hat.T)))

for the forward pass and

-1/m*(np.divide(Y,Y_hat))

for the backward pass.

This is based on a project for CS289 at UC Berkeley.

A CNN implementation using only numpy. Supports multidimensional images, stride, etc.

Related tags

Overview

CNN from scratch

Owner

My solution for the 7th place / 245 in the Umoja Hack 2022 challenge

GluonMM is a library of transformer models for computer vision and multi-modality research

Pipeline code for Sequential-GAM(Genome Architecture Mapping).

Memory-Augmented Model Predictive Control

Exploiting a Zoo of Checkpoints for Unseen Tasks

JAXMAPP: JAX-based Library for Multi-Agent Path Planning in Continuous Spaces

Galileo library for large scale graph training by JD

This is a classifier which basically predicts whether there is a gun law in a state or not, depending on various things like murder rates etc.

SwinTrack: A Simple and Strong Baseline for Transformer Tracking

Practical Single-Image Super-Resolution Using Look-Up Table

PyTorch implementation of Algorithm 1 of "On the Anatomy of MCMC-Based Maximum Likelihood Learning of Energy-Based Models"

Code release for NeurIPS 2020 paper "Co-Tuning for Transfer Learning"

Tools for computational pathology

DCSAU-Net: A Deeper and More Compact Split-Attention U-Net for Medical Image Segmentation

On Generating Extended Summaries of Long Documents

Repo público onde postarei meus estudos de Python, buscando aprender por meio do compartilhamento do aprendizado!

Predictive Modeling on Electronic Health Records(EHR) using Pytorch

The code uses SegFormer for Semantic Segmentation on Drone Dataset.

General Virtual Sketching Framework for Vector Line Art (SIGGRAPH 2021)

Official code release for ICCV 2021 paper SNARF: Differentiable Forward Skinning for Animating Non-rigid Neural Implicit Shapes.