AdaptiveMotorControlLab/FMPose3D

FMPose3D: monocular 3D pose estimation via flow matching

PyPI version · License: Apache 2.0

This is the official implementation of the approach described in the preprint:

FMPose3D: monocular 3D pose estimation via flow matching
Ti Wang, Xiaohang Yu, Mackenzie Weygandt Mathis

🚀 TL;DR

FMPose3D estimates a 3D pose from a single 2D image. It leverages fast flow matching to generate multiple plausible 3D poses by solving an ODE in just a few steps, then aggregates them with a reprojection-based Bayesian module (RPEA) to produce accurate predictions, achieving state-of-the-art results on human and animal 3D pose benchmarks.
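The few-step ODE sampling can be sketched as a plain Euler integration. This is a minimal illustration only, not the released implementation: `velocity_fn` is a hypothetical stand-in for the learned, keypoint-conditioned velocity network, and the joint count and step count are placeholder values.

```python
import numpy as np

def sample_pose(velocity_fn, keypoints_2d, num_joints=17,
                num_steps=5, num_samples=10, seed=0):
    """Draw several 3D pose hypotheses by Euler-integrating a learned ODE.

    velocity_fn(x, t, cond) -> velocity field with the same shape as x.
    (Illustrative sketch; the real model and its API may differ.)
    """
    rng = np.random.default_rng(seed)
    # Start each hypothesis from Gaussian noise.
    x = rng.standard_normal((num_samples, num_joints, 3))
    dt = 1.0 / num_steps
    for step in range(num_steps):
        t = step * dt
        # One Euler step along the flow, conditioned on the 2D keypoints.
        x = x + dt * velocity_fn(x, t, keypoints_2d)
    return x  # (num_samples, num_joints, 3) candidate 3D poses
```

The resulting hypotheses would then be aggregated into a single prediction, e.g. by the reprojection-based RPEA module described above.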

News!

  • Feb 2026: the FMPose3D code and our arXiv paper are released - check out the demos here or on our project page
  • Planned: This method will be integrated into DeepLabCut

Installation

Set up an environment

Make sure you have Python 3.10+. You can set this up with:

conda create -n fmpose_3d python=3.10
conda activate fmpose_3d

pip install fmpose3d

Demos

Testing on in-the-wild images (humans)

This visualization script is designed for the single-frame model, allowing you to easily run 3D human pose estimation on any single image.

Before testing, make sure you have the pre-trained model ready. You may either use a model you trained yourself or download ours from here and place it in the ./pre_trained_models directory.

Next, put your test images into the demo/images folder. Then run the visualization script:

sh vis_in_the_wild.sh

The predictions will be saved to the demo/predictions folder.

Training and Inference

Dataset Setup

Setup from original source

You can obtain the Human3.6M dataset from the Human3.6M website, and then set it up using the instructions provided in VideoPose3D.

Setup from preprocessed dataset (Recommended)

You can also download the preprocessed data from here.

Place the downloaded files in the dataset/ folder of this project:

<project_root>/
├── dataset/
│   ├── data_3d_h36m.npz
│   ├── data_2d_h36m_gt.npz
│   └── data_2d_h36m_cpn_ft_h36m_dbb.npz
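As a quick sanity check on the layout above, a small script (assuming exactly the three file names shown) can report which files are still missing:

```python
from pathlib import Path

# The preprocessed Human3.6M files expected in dataset/ (as listed above).
EXPECTED = [
    "data_3d_h36m.npz",
    "data_2d_h36m_gt.npz",
    "data_2d_h36m_cpn_ft_h36m_dbb.npz",
]

def missing_dataset_files(dataset_dir):
    """Return the expected files that are not present in dataset_dir."""
    root = Path(dataset_dir)
    return [name for name in EXPECTED if not (root / name).is_file()]

if __name__ == "__main__":
    missing = missing_dataset_files("dataset")
    if missing:
        print("Missing files:", ", ".join(missing))
    else:
        print("All Human3.6M files found.")
```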

Training

The training logs, checkpoints, and related files for each training run will be saved in the ./checkpoint folder.

For training on Human3.6M:

sh ./scripts/FMPose3D_train.sh

Inference

First, download the pre-trained model folder from here and place it in the ./pre_trained_models directory.

To run inference on Human3.6M:

sh ./scripts/FMPose3D_test.sh

Experiments on non-human animals

For animal training/testing and demo scripts, see animals/README.md.

Citation

@misc{wang2026fmpose3dmonocular3dpose,
      title={FMPose3D: monocular 3D pose estimation via flow matching}, 
      author={Ti Wang and Xiaohang Yu and Mackenzie Weygandt Mathis},
      year={2026},
      eprint={2602.05755},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2602.05755}, 
}

Acknowledgements

We thank the Swiss National Science Foundation (SNSF Project # 320030-227871) and the Kavli Foundation for providing financial support for this project.

Our code is extended from the following repositories. We thank the authors for releasing the code.

About

A monocular 3D pose estimation algorithm for humans and other animals
