AI-Based 3D Pose Estimation: Almost Real Time!

Показать описание

📝 The paper "3D Human Pose Machines with Self-supervised Learning" and its source code is available here:

🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
313V, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Anthony Vdovitchenko, Brian Gilman, Christian Ahlin, Christoph Jadanowski, Claudio Fernandes, Dennis Abts, Eric Haddad, Eric Martel, Evan Breznyik, Geronimo Moralez, Jason Rollins, Javier Bustamante, John De Witt, Kaiesh Vohra, Kasia Hayden, Kjartan Olason, Lorin Atzberger, Marcin Dukaczewski, Marten Rauschenberg, Maurits van Mastrigt, Michael Albrecht, Michael Jensen, Morten Punnerud Engelstad, Nader Shakerin, Owen Campbell-Moore, Owen Skarpness, Raul Araújo da Silva, Richard Reis, Rob Rowe, Robin Graham, Ryan Monsurate, Shawn Azman, Steef, Steve Messina, Sunil Kim, Thomas Krcmar, Torsten Reil, Zach Boldyga, Zach Doty.

Károly Zsolnai-Fehér's links:

Рекомендации по теме

Комментарии

Gait recognition is certainly becoming much easier.
I just want a good body tracker for VR.

kebakent

I think there are many more mass-market applications of this than are mentioned in the video. For example, broadcasting sports events. Instead of broadcasting a plain video of the event, the poses of the athletes are estimated and broadcast. They are then reskinned on the viewers' screens using "skins" resembling the athletes. The viewers are then able to watch the game from any angle and in any resolution their computer can handle. Sure, it wouldn't look photorealistic, but a generation used to Fortnite and Minecraft might not care.

DontfallasleeZZZZ

You mention 51ms for a single frame, but I assume if you were doing this on a real time feed, the algorithm could be significantly optimized using temporal smoothness since the pose changes very little between two frames.

ehsan_kia

Currently for AAA quality mocap you need 16 to 32 (or more) cameras in a perfectly well lit room, cable managed to a server which connects to your workstation
if it were just 1 camera the cost would be so much lower, and less room for error with fewer moving parts. perhaps you could train the algorithm with a second camera angle for added speed and reliability.

Curt-

This can help indie 3D game developers to create good mocap animations under budget.. Great invention!

alialtaf

Great, no need for expensive off-site motion capture studios and their limiting schedule constraints.

nononono

This is huge! Always good stuff from the channel.

ryanbrown

Could this allow google translate for sign language?

briandiehl

Very powerful stuff here. It would be interesting to see this technology applied to a game of soccer / football. The general goal of every player is known. Analyzing postures in a dynamic system could provide an agent with the information needed to best avoid the most likey threats to its current goal.

brigfiche

Why no videos examples? Is there a problem with temporal consistency?

Dragonblood

This is really cool! But please note that latency and fps (frames per second) are not related. Even though the latency is 51 ms, a fast computer can produce output at any desired frame rate. It is just a question of parallelism.

henriksundt

Why do you mention the miliseconds it takes but don't mention the resolution or hardware used to achieve that time?

timgo

This almost seems better than the kinect and it's not even using IR lights for depth. It just needs to be a bit faster.

john_hunter_

Can we use this for GAIT Recognition? I mean estimating the pose and then could we construct a GAIT Energy Image based on that?

rushirajparmar

It's obvious application is going to be military

SiddharthKulkarniN

Isn't there a video out there of these poses being fed in and used to generate CGI output? Anyone know what I'm talking about and have a link?

MatthiasTTV

Hahahaha please someone build this into a slouch detection algorithm for any of us spending hours on the computer

MobyMotion

51ms per scene while batch predicting 1000 scenes, versus 51ms for each independent scene, are drastically different performance numbers. You can't speed up real world inputs in real-realtime processing, unless you create a "buffer" and wait for the data to accumulate to a batch. But then it wouldn't really be realtime.
Reason I'm saying this: Recently I've coded an AI that can make thousands of predictions in tens of microseconds. Then when I pulled it to production, and have user requests come in one by one, it took literally *seconds* per sample. I'm so fired...

deep.space.

where can i get the algortihm
and code

RaselAhmed-ixee

Hi, is there any software that uses camera tracking or motion capture to create variety of animsets for games? I want to make a game myself with AI's help in mind and I need references for how to make games with AI. You could say it's somewhat my goal to do it. Thanks...

MILADISGONE

AI-Based 3D Pose Estimation: Almost Real Time!

AI-Based 3D Pose Estimation: Almost Real Time!

3D Pose Estimation Demo

Human Dancer vs AI Dancer (real-time 3D pose estimation demo)

3D Pose Estimation With AI For Heavily Occluded Images | Game Futurology #37

Multi-person pose estimation: first try

3D Human Pose Estimation Explained: Watch How to Use It in AI Fitness Coach Apps

Using AI for 3d pose estimation, data processed and exported to FBX, improved accuracy :)

3D Mobile Pose Estimation (ONNX)

Human Pose Estimation in Machine Learning Explained (2D & 3D)

Real-time 3D pose estimation for iOS with Unity.

Human Upper-Body Pose Estimation using Fully Convolutional Network and Joint Heatmap

Pose estimation Measurement using angles

Human Pose Estimation Machine Learning Demo

Pose Estimation and Video Analytics with Machine Learning

Yolov7 pose estimation #yolov7 #yolo #pose #estimation offical yolo

AI Powered 3D Human Pose Tracking and Analysis

Dense Pose AI for Real Time 3D Human Pose Estimation through Wifi.

Pose Estimation #artificialintelligence #deeplearning #objectdetection

Master Object Detection: Pose Estimation & YOLO Explained

AI Learns Human Pose Estimation From Videos | Two Minute Papers #237

3D Object Detection and Pose Estimation with Deep Learning in OpenCV Python

Deep Learning based on Human Pose Estimation

AI-Powered 3D Human Pose Tracking & Analysis

Demo of 3D pose estimation to classify all 82 yoga positions in real time