Machine Learning University: Accelerated Computer Vision Class

Last update: Dec 28, 2022

Overview

Machine Learning University: Accelerated Computer Vision Class

This repository contains slides, notebooks, and datasets for the Machine Learning University (MLU) Computer Vision class. Our mission is to make Machine Learning accessible to everyone. We have courses available across many topics of machine learning and believe knowledge of ML can be a key enabler for success. This class is designed to help you get started with Computer Vision, learn about widely used Machine Learning techniques, and apply them to real-world problems.

YouTube

Watch all Computer Vision class video recordings in this YouTube playlist from our YouTube channel.

Course Overview

There are three lectures and one final project for this class.

Lecture 1	Lecture 2	Lecture 3
Intro to ML	Image Datasets	Advanced CNNs: VGGNet and ResNet
Intro to Computer Vision	Training Neural Networks	Object Detection
Neural Networks	Modern CNNs: LeNet and AlexNet	Semantic Segmentation
Convolutional Neural Networks	Model fine-tuning

Final Project: Practice working with a "real-world" computer vision dataset for the final project. Final project dataset is in the data/final_project_dataset folder. For more details on the final project, check out this notebook.

Contribute

If you would like to contribute to the project, see CONTRIBUTING for more information.

License

The license for this repository depends on the section. Data set for the course is being provided to you by permission of Amazon and is subject to the terms of the Amazon License and Access. You are expressly prohibited from copying, modifying, selling, exporting or using this data set in any way other than for the purpose of completing this course. The lecture slides are released under the CC-BY-SA-4.0 License. The code examples are released under the MIT-0 License. See each section's LICENSE file for details.

Machine Learning University: Accelerated Computer Vision Class

Related tags

Overview

Machine Learning University: Accelerated Computer Vision Class

YouTube

Course Overview

Contribute

License

Owner

AWS Samples

MISSFormer: An Effective Medical Image Segmentation Transformer

Face Depixelizer based on "PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models" repository.

Code of the paper "Deep Human Dynamics Prior" in ACM MM 2021.

A collection of pre-trained StyleGAN2 models trained on different datasets at different resolution.

Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.

Pca-on-genotypes - Mini bioinformatics project - PCA on genotypes

Automatic learning-rate scheduler

Implementation of the state-of-the-art vision transformers with tensorflow

implement of SwiftNet:Real-time Video Object Segmentation

SweiNet is an uncertainty-quantifying shear wave speed (SWS) estimator for ultrasound shear wave elasticity (SWE) imaging.

[peer review] An Arbitrary Scale Super-Resolution Approach for 3D MR Images using Implicit Neural Representation

PyTorch implementation of PP-LCNet: A Lightweight CPU Convolutional Neural Network

Generative Query Network (GQN) in PyTorch as described in "Neural Scene Representation and Rendering"

Self-Supervised Collision Handling via Generative 3D Garment Models for Virtual Try-On

improvement of CLIP features over the traditional resnet features on the visual question answering, image captioning, navigation and visual entailment tasks.

A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.

Code for Environment Inference for Invariant Learning (ICML 2020 UDL Workshop Paper)

Builds a LoRa radio frequency fingerprint identification (RFFI) system based on deep learning techiniques

Everything you need to know about NumPy( Creating Arrays, Indexing, Math,Statistics,Reshaping).

This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in the Wild"