MoveNetを用いたPythonでの姿勢推定のデモ

Last update: Dec 17, 2022

Overview

MoveNet-Python-Example

MoveNetのPythonでの動作サンプルです。
ONNXに変換したモデルも同梱しています。変換自体を試したい方はMoveNet_tf2onnx.ipynbを使用ください。

2021/08/24時点でTensorFlow Hubで提供されている以下モデルを使用しています。

Requirement

TensorFlow 2.3.0 or later
tensorflow-hub 0.12.0 or later
OpenCV 3.4.2 or later
onnxruntime 1.5.2 or later ※ONNX推論を使用する場合のみ

Demo

デモの実行方法は以下です。

SignlePose

python demo_singlepose.py

--device
カメラデバイス番号の指定
デフォルト：0
--file
動画ファイルの指定 ※指定時はカメラデバイスより優先
デフォルト：指定なし
--width
カメラキャプチャ時の横幅
デフォルト：960
--height
カメラキャプチャ時の縦幅
デフォルト：540
--mirror
VideoCapture()取り込みデータを左右反転するか否か
デフォルト：指定なし
--model_select
使用モデルの選択
Saved Model, ONNX：0→Lightning　1→Thunder
TFLite：0→Lightning(float16)　1→Thunder(float16)　2→Lightning(int8)　3→Thunder(int8)
デフォルト：0
--keypoint_score
キーポイント表示の閾値
デフォルト：0.4

MultiPose

python demo_multipose.py

--device
カメラデバイス番号の指定
デフォルト：0
--file
動画ファイルの指定 ※指定時はカメラデバイスより優先
デフォルト：指定なし
--width
カメラキャプチャ時の横幅
デフォルト：960
--height
カメラキャプチャ時の縦幅
デフォルト：540
--mirror
VideoCapture()取り込みデータを左右反転するか否か
デフォルト：指定なし
--keypoint_score
キーポイント表示の閾値
デフォルト：0.4
--bbox_score
バウンディングボックス表示の閾値
デフォルト：0.2

Reference

TensorFlow Hub：MoveNet

Author

高橋かずひと(https://twitter.com/KzhtTkhs)

License

MoveNet-Python-Example is under Apache-2.0 License.

License(Movie)

サンプル動画はNHKクリエイティブ・ライブラリーのストリートバスケットを使用しています。

MoveNetを用いたPythonでの姿勢推定のデモ

Related tags

Overview

MoveNet-Python-Example

Requirement

Demo

SignlePose

MultiPose

Reference

Author

License

License(Movie)

Owner

KazuhitoTakahashi

Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research

Deep and online learning with spiking neural networks in Python

Omnidirectional camera calibration in python

mbrl-lib is a toolbox for facilitating development of Model-Based Reinforcement Learning algorithms.

Face detection using deep learning.

FID calculation with proper image resizing and quantization steps

Code for testing various M1 Chip benchmarks with TensorFlow.

Official PyTorch implementation of GDWCT (CVPR 2019, oral)

Cross-modal Deep Face Normals with Deactivable Skip Connections

Concept drift monitoring for HA model servers.

Lightweight stereo matching network based on MobileNetV1 and MobileNetV2

Language Models Can See: Plugging Visual Controls in Text Generation

MTCNN face detection implementation for TensorFlow, as a PIP package.

Official code for "End-to-End Optimization of Scene Layout" -- including VAE, Diff Render, SPADE for colorization (CVPR 2020 Oral)

Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm

Neural Ensemble Search for Performant and Calibrated Predictions

ReferFormer - Official Implementation of ReferFormer

Code for Phase diagram of Stochastic Gradient Descent in high-dimensional two-layer neural networks

Source code for Fathony, Sahu, Willmott, & Kolter, "Multiplicative Filter Networks", ICLR 2021.

Vision-Language Pre-training for Image Captioning and Question Answering