Lightweight 3D human pose estimation network training using teacher-student learning

Published by Claudia Heyer at 2 March 2020

Dong-Hyun Hwang, Suntae Kim, Nicolas Monet, Hideki Koike, Soonmin Bae

Winter Conference on Applications of Computer Vision (WACV), Aspen, Colorado, USA, 2 March, 2020

Abstract

We present MoVNect, a lightweight deep neural network to capture 3D human pose using a single RGB camera.
To improve the overall performance of the model, we apply the teacher-student learning method based knowledge distillation to 3D human pose estimation. A realtime post-
processing makes the CNN output to yield temporally stable 3D skeletal information, which can apply to applications directly. We implement a 3D avatar application running on mobile in realtime to demonstrate that our network
achieves both high accuracy and fast inference time. Extensive evaluations show the advantages of our lightweight model with the proposed training method over previous 3D pose estimation methods on Human3.6M dataset and mobile devices.

Related Content

NAVER FRANCE Gender Equality 2024

All

Publications

Blog

News

Code & Data

Careers

People

ACTION

Providing embodied agents with sequential decision-making capabilities to safely execute complex tasks in dynamic environments.

INTERACTION

Equip robots to interact safely with humans, other robots and systems.

VISION

Perception to help robots understand and interact with the environment.

NAVER FRANCE Gender Equality 2023

Action

Lightweight 3D human pose estimation network training using teacher-student learning

Related Content

All

Publications

Blog

News

Code & Data

Careers

People

Cookie settings