Make human motion capture easier.

Go to file

shuaiqing 64e0e48d29 📝 add results		2021-04-14 16:03:38 +08:00
apps	🚀 update to v0.2	2021-04-14 15:22:51 +08:00
code	update smpl config	2021-03-20 13:56:20 +08:00
data/smplx	🔥 remove unused files and images	2021-04-13 17:57:39 +08:00
doc	📝 add results	2021-04-14 16:03:38 +08:00
easymocap	🚀 update to v0.2	2021-04-14 15:22:51 +08:00
scripts	🚀 update to v0.2	2021-04-14 15:22:51 +08:00
.gitignore	🚀 update to v0.2	2021-04-14 15:22:51 +08:00
LICENSE	Create LICENSE	2021-01-14 22:13:55 +08:00
Readme.md	📝 add results	2021-04-14 16:03:38 +08:00
requirements.txt	update requirements	2021-01-14 21:56:17 +08:00
setup.py	🚀 update to v0.2	2021-04-14 15:22:51 +08:00

Readme.md

EasyMocap

EasyMocap is an open-source toolbox for markerless human motion capture from RGB videos. In this project, we provide a lot of motion capture demos in different settings.

Core features

Multiple views of a single person

This is the basic code for fitting SMPL[1]/SMPL+H[2]/SMPL-X[3] model to capture body+hand+face poses from multiple views.

^{Videos are from ZJU-MoCap, with 23 calibrated and synchronized cameras.}

Internet video with a mirror

^{The raw video is from Youtube.}

^{Captured with 6 cameras and a mirror}

Multiple Internet videos with a specific action (Coming soon)

^{Internet videos of Roger Federer's serving}

Multiple views of multiple people (Coming soon)

^{Captured with 4 consumer cameras}

Others

This project is used by many other projects:

[CVPR21] Dense Reconstruction and View Synthesis from Sparse Views

Other features

Camera calibration: a simple calibration tool based on OpenCV
Pose guided synchronization (comming soon)
Annotator: a simple GUI annotator based on OpenCV
Exporting of multiple data formats(bvh, asf/amc, ...)

Updates

04/12/2021: Mirrored-Human part is released. We also release the calibration tool and the annotator.

Installation

See doc/install for more instructions.

Evaluation

The weight parameters can be set according to your data.

More quantitative reports will be added in doc/evaluation.md

Acknowledgements

Here are the great works this project is built upon:

SMPL models and layer are from MPII SMPL-X model.
Some functions are borrowed from SPIN, VIBE, SMPLify-X
The method for fitting 3D skeleton and SMPL model is similar to TotalCapture, without using point clouds.
We integrate some easy-to-use functions for previous great work:
- easymocap/estimator/SPIN : an SMPL estimator[5]
- easymocap/estimator/YOLOv4: an object detector[6](Coming soon)
- easymocap/estimator/HRNet : a 2D human pose estimator[7](Coming soon)

We also would like to thank Wenduo Feng who is the performer in the sample data.

Contact

Please open an issue if you have any questions. We appreciate all contributions to improve our project.

Citation

This project is a part of our work iMocap, Mirrored-Human and Neural Body

Please consider citing these works if you find this repo is useful for your projects.

@inproceedings{dong2020motion,
  title={Motion capture from internet videos},
  author={Dong, Junting and Shuai, Qing and Zhang, Yuanqing and Liu, Xian and Zhou, Xiaowei and Bao, Hujun},
  booktitle={European Conference on Computer Vision},
  pages={210--227},
  year={2020},
  organization={Springer}
}

@inproceedings{peng2021neural,
  title={Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans},
  author={Peng, Sida and Zhang, Yuanqing and Xu, Yinghao and Wang, Qianqian and Shuai, Qing and Bao, Hujun and Zhou, Xiaowei},
  booktitle={CVPR},
  year={2021}
}

@inproceedings{fang2021mirrored,
  title={Reconstructing 3D Human Pose by Watching Humans in the Mirror},
  author={Fang, Qi and Shuai, Qing and Dong, Junting and Bao, Hujun and Zhou, Xiaowei},
  booktitle={CVPR},
  year={2021}
}

Reference

[1] Loper, Matthew, et al. "SMPL: A skinned multi-person linear model." ACM transactions on graphics (TOG) 34.6 (2015): 1-16.
[2] Romero, Javier, Dimitrios Tzionas, and Michael J. Black. "Embodied hands: Modeling and capturing hands and bodies together." ACM Transactions on Graphics (ToG) 36.6 (2017): 1-17.
[3] Pavlakos, Georgios, et al. "Expressive body capture: 3d hands, face, and body from a single image." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019.
Bogo, Federica, et al. "Keep it SMPL: Automatic estimation of 3D human pose and shape from a single image." European conference on computer vision. Springer, Cham, 2016.
[4] Cao, Z., Hidalgo, G., Simon, T., Wei, S.E., Sheikh, Y.: Openpose: real-time multi-person 2d pose estimation using part affinity fields. arXiv preprint arXiv:1812.08008 (2018)
[5] Kolotouros, Nikos, et al. "Learning to reconstruct 3D human pose and shape via model-fitting in the loop." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019
[6] Bochkovskiy, Alexey, Chien-Yao Wang, and Hong-Yuan Mark Liao. "Yolov4: Optimal speed and accuracy of object detection." arXiv preprint arXiv:2004.10934 (2020).
[7] Sun, Ke, et al. "Deep high-resolution representation learning for human pose estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019.