Fixing the root node: Efficient tracking and detection of 3D human pose through local solutions

Ben Daubney, Xianghua Xie, Jingjing Deng, Neil Mac Parthaláin , Reyer Zwiggelaar

Research output: Contribution to journalArticlepeer-review

233 Downloads (Pure)


3D human pose estimation is a very difficult task. In this paper we propose that this problem can be more easily solved by first finding the solutions to a set of easier sub-problems. These are to locally estimate pose conditioned on a fixed root node state, which defines the global position and orientation of the person. The global solution can then be found using information extracted during this procedure. This approach has two key benefits: The first is that each local solution can be found by modeling the articulated object as a kinematic chain, which has far less degrees of freedom than alternative models. The second is that by using this approach we can represent, or support, a much larger area of the posterior than is currently possible. This allows far more robust algorithms to be implemented since there is far less pressure to prune the search space to free up computational resources. We apply this approach to two problems: The first is single frame monocular 3D pose estimation, where we propose a method to directly extract 3D pose without first extracting any intermediate 2D representation or being dependent on strong spatial prior models. The second is multi-view 3D tracking where we show that using the above technique results in an approach that is far more robust than current approaches, without relying on strong temporal prior models. In both domains we demonstrate the strength and versatility of the proposed method.
Original languageEnglish
Pages (from-to)73-87
Number of pages15
JournalImage and Vision Computing
Early online date31 May 2016
Publication statusPublished - 1 Aug 2016


  • 3D pose estimation
  • Local solutions
  • Root node
  • Tracking


Dive into the research topics of 'Fixing the root node: Efficient tracking and detection of 3D human pose through local solutions'. Together they form a unique fingerprint.

Cite this