TensorFlow (v2.7.0) benchmark results on an M1 MacBook Air 2020 laptop (macOS Monterey v12.1).

Last update: Jan 05, 2022

M1-tensorflow-benchmark

I was initially testing whether TensorFlow was installed correctly, i.e. whether code outside any context manager automatically runs on the GPU, by comparing it against the same code run inside a with tf.device('/GPU:0') context manager. It seemed interesting to compare this with free GPU services, so I also included Kaggle and Colab in the tests. I also tested the M1's CPU.
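The placement check described above can be sketched as follows. This is only an illustration, not the code used for the benchmark: it assumes TensorFlow 2.x in eager mode, and the 256x256 matrix size is an arbitrary choice.

```python
import tensorflow as tf

# List the GPUs TensorFlow can see; an empty list means CPU only.
gpus = tf.config.list_physical_devices('GPU')
print('Visible GPUs:', gpus)

# Run an op with no context manager and record where TensorFlow placed it.
a = tf.random.normal((256, 256))
b = tf.matmul(a, a)
print('Default placement: ', b.device)

# Run the same op with explicit GPU placement (only if a GPU is visible).
if gpus:
    with tf.device('/GPU:0'):
        c = tf.matmul(a, a)
    print('Explicit placement:', c.device)
    print('Default placement already uses the GPU:', b.device == c.device)
```

If the two device strings match, code outside the context manager is already running on the GPU.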

The plot shows training time (y-axis) for an MLP with 5, 10, 15, or 20 hidden layers (x-axis), each of size 1024 with ReLU activation, trained on the 50,000 CIFAR-10 training images for 3 epochs.
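To get a feel for how much work each configuration involves, the model sizes can be worked out from the description above. The following is a minimal sketch in plain Python. It assumes the architecture of the benchmark script further down (3072 flattened inputs, 10 outputs) and Keras' default batch size of 32 for model.fit; the helper names are made up for illustration.

```python
import math

INPUT_FEATURES = 32 * 32 * 3  # one flattened CIFAR-10 image
HIDDEN_UNITS = 1024
NUM_CLASSES = 10


def count_parameters(depth):
    """Weights plus biases of an MLP with `depth` hidden layers."""
    first = INPUT_FEATURES * HIDDEN_UNITS + HIDDEN_UNITS
    hidden = (depth - 1) * (HIDDEN_UNITS * HIDDEN_UNITS + HIDDEN_UNITS)
    output = HIDDEN_UNITS * NUM_CLASSES + NUM_CLASSES
    return first + hidden + output


def steps_per_epoch(num_samples=50_000, batch_size=32):
    """Gradient updates per epoch (the last batch may be partial)."""
    return math.ceil(num_samples / batch_size)


for depth in (5, 10, 15, 20):
    print(f"depth={depth:2d}: {count_parameters(depth):,} parameters")
print("steps per epoch:", steps_per_epoch())
```

Under these assumptions the networks range from roughly 7.4 million parameters (5 hidden layers) to roughly 23.1 million (20 hidden layers), with 1563 updates per epoch.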

The M1 looks comparable to a K80, which is nice if you always get locked out of Colab (like I do). But temps were worrying (~65 °C); this laptop is fanless, after all. 🥲 Kaggle's P100 is 4x faster, which is expected, as the P100 provides 1.6x more GFLOPs and 3x the memory bandwidth of the K80. The graph also confirms that the TF installation works and that TF code automatically runs on the GPU!

Extending the results

The code for running the benchmarks and consolidating the results in a plot is written so that it can easily incorporate results for new tests.

Run the following script in your environment:

import tensorflow as tf
import time
import pandas as pd
print(tf.__version__)

# Get CIFAR10 data; do basic preprocessing
(X_train, y_train), (X_test, y_test) = tf.keras.datasets.cifar10.load_data()
X_train_scaled = X_train / 255.0
y_train_encoded = tf.keras.utils.to_categorical(y_train, num_classes=10, dtype='float32')

# Define model constructor
def get_model(depth):
    model = tf.keras.Sequential()
    model.add(tf.keras.layers.Flatten(input_shape=(32, 32, 3)))
    for _ in range(depth):
        model.add(tf.keras.layers.Dense(1024, activation='relu'))
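    # Note: softmax is the usual pairing with categorical_crossentropy;
    # sigmoid is kept here as in the original benchmark.
    model.add(tf.keras.layers.Dense(10, activation='sigmoid'))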
    model.compile(optimizer='SGD', loss='categorical_crossentropy', metrics=['accuracy'])
    return model
    
YOUR_ENV_NAME = # Your environment's name here.
network_depth = [5, 10, 15, 20]
results = { depth: {} for depth in network_depth }
for depth in network_depth:
    default_start_time = time.time()
    model = get_model(depth)
    model.fit(X_train_scaled, y_train_encoded, epochs=3)
    results[depth][YOUR_ENV_NAME] = time.time() - default_start_time

# Save results
pd.DataFrame(results).to_csv(f'results_{YOUR_ENV_NAME}.csv', index=True)

Download the resulting CSV file and save it in the root directory alongside the other results_*.csv files.
Then run plot_results.py and open results.png. A line for your results should now appear in the plot above. 🥳
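For readers curious how the per-environment files might be combined, here is a rough sketch. It is not the repository's plot_results.py: the function names are hypothetical, and it assumes the CSV layout written by the script above (a header row of depths, then one row per environment). Plotting needs matplotlib, which is imported lazily so that the merging step works without it.

```python
import csv
from pathlib import Path


def load_results(path):
    """Read one results_*.csv file into {env_name: {depth: seconds}}."""
    with open(path, newline="") as f:
        rows = list(csv.reader(f))
    depths = [int(d) for d in rows[0][1:]]  # first header cell is empty
    return {
        row[0]: {d: float(t) for d, t in zip(depths, row[1:])}
        for row in rows[1:] if row
    }


def consolidate(directory="."):
    """Merge every results_*.csv found in `directory` into one dict."""
    merged = {}
    for path in sorted(Path(directory).glob("results_*.csv")):
        merged.update(load_results(path))
    return merged


def plot(merged, out_file="results.png"):
    """Draw one line per environment and save the figure."""
    import matplotlib.pyplot as plt  # imported lazily; only needed here

    for env, timings in merged.items():
        depths = sorted(timings)
        plt.plot(depths, [timings[d] for d in depths], marker="o", label=env)
    plt.xlabel("Number of hidden layers")
    plt.ylabel("Training time (s)")
    plt.legend()
    plt.savefig(out_file)
```

Calling plot(consolidate()) from the directory holding the CSV files would then draw every environment in one figure. Because files are picked up by name pattern, a new environment only has to drop in its own results_*.csv.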

Devices used

Kaggle's P100
Google Colab's Tesla K80
MacBook Air 2020 M1 GPU (macOS Monterey v12.1)
MacBook Air 2020 M1 CPU (macOS Monterey v12.1)

Contribute

Contributions are very welcome: add more tests with different architectures and datasets, or run the benchmarks in different environments, e.g. GTX or RTX cards, M1 Max, and M1 Pro.

Owner

particle
