EasyMocap/Readme.md

<!--
 * @Date: 2021-01-13 20:32:12
 * @Author: Qing Shuai
 * @LastEditors: Qing Shuai
 * @LastEditTime: 2021-04-14 16:00:04
 * @FilePath: /EasyMocapRelease/Readme.md
-->

# EasyMocap

**EasyMocap** is an open-source toolbox for **markerless human motion capture** from RGB videos. In this project, we provide a lot of motion capture demos in different settings.

![python](https://img.shields.io/github/languages/top/zju3dv/EasyMocap)
![star](https://img.shields.io/github/stars/zju3dv/EasyMocap?style=social)

---

## Core features

### Multiple views of a single person

[![report](https://img.shields.io/badge/quickstart-green)](./doc/quickstart.md)

This is the basic code for fitting SMPL[1]/SMPL+H[2]/SMPL-X[3] model to capture body+hand+face poses from multiple views.

<div align="center">
    <img src="doc/feng/mv1pmf-smplx.gif" width="80%">
    <br>
    <sup>Videos are from ZJU-MoCap, with 23 calibrated and synchronized cameras.<sup/>
</div>

### Internet video with a mirror

[![report](https://img.shields.io/badge/CVPR21-mirror-red)](https://arxiv.org/pdf/2104.00340.pdf) [![quickstart](https://img.shields.io/badge/quickstart-green)](https://github.com/zju3dv/Mirrored-Human)

<div align="center">
    <img src="https://raw.githubusercontent.com/zju3dv/Mirrored-Human/main/doc/assets/smpl-avatar.gif" width="80%">
    <br>
    <sup>The raw video is from <a href="https://www.youtube.com/watch?v=KOCJJ27hhIE">Youtube<a/>.<sup/>
</div>

<div align="center">
    <img src="doc/imocap/mv1p-mirror.gif" width="80%"><br/>
    <sup>Captured with 6 cameras and a mirror<sup/>
</div>

### Multiple Internet videos with a specific action (Coming soon)

[![report](https://img.shields.io/badge/ECCV20-imocap-red)](https://arxiv.org/pdf/2008.07931.pdf) [![quickstart](https://img.shields.io/badge/quickstart-green)](./doc/todo.md)

<div align="center">
    <img src="doc/imocap/imocap.gif" width="80%"><br/>
    <sup>Internet videos of Roger Federer's serving<sup/>
</div>

### Multiple views of multiple people (Coming soon)

[![report](https://img.shields.io/badge/CVPR20-mvpose-red)](https://arxiv.org/pdf/1901.04111.pdf) [![quickstart](https://img.shields.io/badge/quickstart-green)](./doc/todo.md)

<div align="center">
    <img src="doc/imocap/mvpose.gif" width="80%"><br/>
    <sup>Captured with 4 consumer cameras<sup/>
</div>

### Others

This project is used by many other projects:

- [[CVPR21] Dense Reconstruction and View Synthesis from **Sparse Views**](https://zju3dv.github.io/neuralbody/)

## Other features

- [Camera calibration](apps/calibration/Readme.md): a simple calibration tool based on OpenCV
- [Pose guided synchronization](./doc/todo.md) (comming soon)
- [Annotator](apps/calibration/Readme.md): a simple GUI annotator based on OpenCV
- [Exporting of multiple data formats(bvh, asf/amc, ...)](./doc/02_output.md)

## Updates

- 04/12/2021: Mirrored-Human part is released. We also release the calibration tool and the annotator.

## Installation

See [doc/install](./doc/installation.md) for more instructions.

## Evaluation

The weight parameters can be set according to your data.

More quantitative reports will be added in [doc/evaluation.md](doc/evaluation.md)

## Acknowledgements

Here are the great works this project is built upon:

- SMPL models and layer are from MPII [SMPL-X model](https://github.com/vchoutas/smplx).
- Some functions are borrowed from [SPIN](https://github.com/nkolot/SPIN), [VIBE](https://github.com/mkocabas/VIBE), [SMPLify-X](https://github.com/vchoutas/smplify-x)
- The method for fitting 3D skeleton and SMPL model is similar to [TotalCapture](http://www.cs.cmu.edu/~hanbyulj/totalcapture/), without using point clouds.
- We integrate some easy-to-use functions for previous great work:
  - `easymocap/estimator/SPIN`  : an SMPL estimator[5]
  - `easymocap/estimator/YOLOv4`: an object detector[6](Coming soon)
  - `easymocap/estimator/HRNet` : a 2D human pose estimator[7](Coming soon)

We also would like to thank Wenduo Feng who is the performer in the sample data.

## Contact

Please open an issue if you have any questions. We appreciate all contributions to improve our project.

## Citation

This project is a part of our work [iMocap](https://zju3dv.github.io/iMoCap/), [Mirrored-Human](https://zju3dv.github.io/Mirrored-Human/) and [Neural Body](https://zju3dv.github.io/neuralbody/)

Please consider citing these works if you find this repo is useful for your projects.

```bibtex
@inproceedings{dong2020motion,
  title={Motion capture from internet videos},
  author={Dong, Junting and Shuai, Qing and Zhang, Yuanqing and Liu, Xian and Zhou, Xiaowei and Bao, Hujun},
  booktitle={European Conference on Computer Vision},
  pages={210--227},
  year={2020},
  organization={Springer}
}

@inproceedings{peng2021neural,
  title={Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans},
  author={Peng, Sida and Zhang, Yuanqing and Xu, Yinghao and Wang, Qianqian and Shuai, Qing and Bao, Hujun and Zhou, Xiaowei},
  booktitle={CVPR},
  year={2021}
}

@inproceedings{fang2021mirrored,
  title={Reconstructing 3D Human Pose by Watching Humans in the Mirror},
  author={Fang, Qi and Shuai, Qing and Dong, Junting and Bao, Hujun and Zhou, Xiaowei},
  booktitle={CVPR},
  year={2021}
}
```

## Reference

```bash
[1] Loper, Matthew, et al. "SMPL: A skinned multi-person linear model." ACM transactions on graphics (TOG) 34.6 (2015): 1-16.
[2] Romero, Javier, Dimitrios Tzionas, and Michael J. Black. "Embodied hands: Modeling and capturing hands and bodies together." ACM Transactions on Graphics (ToG) 36.6 (2017): 1-17.
[3] Pavlakos, Georgios, et al. "Expressive body capture: 3d hands, face, and body from a single image." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019.
Bogo, Federica, et al. "Keep it SMPL: Automatic estimation of 3D human pose and shape from a single image." European conference on computer vision. Springer, Cham, 2016.
[4] Cao, Z., Hidalgo, G., Simon, T., Wei, S.E., Sheikh, Y.: Openpose: real-time multi-person 2d pose estimation using part affinity fields. arXiv preprint arXiv:1812.08008 (2018)
[5] Kolotouros, Nikos, et al. "Learning to reconstruct 3D human pose and shape via model-fitting in the loop." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019
[6] Bochkovskiy, Alexey, Chien-Yao Wang, and Hong-Yuan Mark Liao. "Yolov4: Optimal speed and accuracy of object detection." arXiv preprint arXiv:2004.10934 (2020).
[7] Sun, Ke, et al. "Deep high-resolution representation learning for human pose estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019.
```
init 2021-01-14 21:17:40 +08:00			`<!--`
			`* @Date: 2021-01-13 20:32:12`
			`* @Author: Qing Shuai`
			`* @LastEditors: Qing Shuai`
:memo: add results 2021-04-14 16:03:38 +08:00			`* @LastEditTime: 2021-04-14 16:00:04`
update doc 2021-01-14 21:22:44 +08:00			`* @FilePath: /EasyMocapRelease/Readme.md`
init 2021-01-14 21:17:40 +08:00			`-->`
update output and support bvh 2021-03-13 21:58:16 +08:00
init 2021-01-14 21:17:40 +08:00			`# EasyMocap`
update output and support bvh 2021-03-13 21:58:16 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`EasyMocap is an open-source toolbox for markerless human motion capture from RGB videos. In this project, we provide a lot of motion capture demos in different settings.`
update readme 2021-01-17 21:08:07 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`![python](https://img.shields.io/github/languages/top/zju3dv/EasyMocap)`
			`![star](https://img.shields.io/github/stars/zju3dv/EasyMocap?style=social)`
init 2021-01-14 21:17:40 +08:00
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`---`
init 2021-01-14 21:17:40 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`## Core features`
update readme 2021-01-17 21:08:07 +08:00
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`### Multiple views of a single person`
init 2021-01-14 21:17:40 +08:00
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`[![report](https://img.shields.io/badge/quickstart-green)](./doc/quickstart.md)`

			`This is the basic code for fitting SMPL[1]/SMPL+H[2]/SMPL-X[3] model to capture body+hand+face poses from multiple views.`
update Readme 2021-01-14 21:41:31 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`<div align="center">`
			`<img src="doc/feng/mv1pmf-smplx.gif" width="80%">`
			`<br>`
			`<sup>Videos are from ZJU-MoCap, with 23 calibrated and synchronized cameras.<sup/>`
			`</div>`
update output and support bvh 2021-03-13 21:58:16 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`### Internet video with a mirror`
update Readme 2021-01-14 21:41:31 +08:00
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`[![report](https://img.shields.io/badge/CVPR21-mirror-red)](https://arxiv.org/pdf/2104.00340.pdf) [![quickstart](https://img.shields.io/badge/quickstart-green)](https://github.com/zju3dv/Mirrored-Human)`
update Readme 2021-01-14 21:41:31 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`<div align="center">`
			`<img src="https://raw.githubusercontent.com/zju3dv/Mirrored-Human/main/doc/assets/smpl-avatar.gif" width="80%">`
			`<br>`
:memo: add results 2021-04-14 16:03:38 +08:00			`<sup>The raw video is from <a href="https://www.youtube.com/watch?v=KOCJJ27hhIE">Youtube<a/>.<sup/>`
			`</div>`

			`<div align="center">`
			`<img src="doc/imocap/mv1p-mirror.gif" width="80%"><br/>`
			`<sup>Captured with 6 cameras and a mirror<sup/>`
update Readme 2021-04-02 12:28:46 +08:00			`</div>`
update output and support bvh 2021-03-13 21:58:16 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`### Multiple Internet videos with a specific action (Coming soon)`
init 2021-01-14 21:17:40 +08:00
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`[![report](https://img.shields.io/badge/ECCV20-imocap-red)](https://arxiv.org/pdf/2008.07931.pdf) [![quickstart](https://img.shields.io/badge/quickstart-green)](./doc/todo.md)`
update output and support bvh 2021-03-13 21:58:16 +08:00
:memo: add results 2021-04-14 16:03:38 +08:00			`<div align="center">`
			`<img src="doc/imocap/imocap.gif" width="80%"><br/>`
			`<sup>Internet videos of Roger Federer's serving<sup/>`
			`</div>`

:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`### Multiple views of multiple people (Coming soon)`
init 2021-01-14 21:17:40 +08:00
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`[![report](https://img.shields.io/badge/CVPR20-mvpose-red)](https://arxiv.org/pdf/1901.04111.pdf) [![quickstart](https://img.shields.io/badge/quickstart-green)](./doc/todo.md)`
update output and support bvh 2021-03-13 21:58:16 +08:00
:memo: add results 2021-04-14 16:03:38 +08:00			`<div align="center">`
			`<img src="doc/imocap/mvpose.gif" width="80%"><br/>`
			`<sup>Captured with 4 consumer cameras<sup/>`
			`</div>`

update Readme 2021-04-02 12:28:46 +08:00			`### Others`
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`This project is used by many other projects:`
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`- [[CVPR21] Dense Reconstruction and View Synthesis from Sparse Views](https://zju3dv.github.io/neuralbody/)`
update output and support bvh 2021-03-13 21:58:16 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`## Other features`
init 2021-01-14 21:17:40 +08:00
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`- [Camera calibration](apps/calibration/Readme.md): a simple calibration tool based on OpenCV`
			`- [Pose guided synchronization](./doc/todo.md) (comming soon)`
			`- [Annotator](apps/calibration/Readme.md): a simple GUI annotator based on OpenCV`
			`- [Exporting of multiple data formats(bvh, asf/amc, ...)](./doc/02_output.md)`
update output and support bvh 2021-03-13 21:58:16 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`## Updates`
update output and support bvh 2021-03-13 21:58:16 +08:00
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`- 04/12/2021: Mirrored-Human part is released. We also release the calibration tool and the annotator.`
update output and support bvh 2021-03-13 21:58:16 +08:00
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`## Installation`
init 2021-01-14 21:17:40 +08:00
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`See [doc/install](./doc/installation.md) for more instructions.`
:rocket: support SMPL+H/SMPL-X 2021-01-24 22:33:08 +08:00
update readme 2021-01-17 21:08:07 +08:00			`## Evaluation`

:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`The weight parameters can be set according to your data.`
:rocket: support SMPL+H/SMPL-X 2021-01-24 22:33:08 +08:00
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`More quantitative reports will be added in [doc/evaluation.md](doc/evaluation.md)`
update readme 2021-01-17 21:08:07 +08:00
init 2021-01-14 21:17:40 +08:00			`## Acknowledgements`
update Readme 2021-04-02 12:28:46 +08:00
Update Readme.md 2021-01-14 23:13:49 +08:00			`Here are the great works this project is built upon:`
init 2021-01-14 21:17:40 +08:00
Update Readme.md 2021-01-14 23:13:49 +08:00			`- SMPL models and layer are from MPII [SMPL-X model](https://github.com/vchoutas/smplx).`
init 2021-01-14 21:17:40 +08:00			`- Some functions are borrowed from [SPIN](https://github.com/nkolot/SPIN), [VIBE](https://github.com/mkocabas/VIBE), [SMPLify-X](https://github.com/vchoutas/smplify-x)`
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`- The method for fitting 3D skeleton and SMPL model is similar to [TotalCapture](http://www.cs.cmu.edu/~hanbyulj/totalcapture/), without using point clouds.`
			`- We integrate some easy-to-use functions for previous great work:`
			- `easymocap/estimator/SPIN` : an SMPL estimator[5]
			- `easymocap/estimator/YOLOv4`: an object detector[6](Coming soon)
			- `easymocap/estimator/HRNet` : a 2D human pose estimator[7](Coming soon)
init 2021-01-14 21:17:40 +08:00
Update Readme.md 2021-01-14 23:13:49 +08:00			`We also would like to thank Wenduo Feng who is the performer in the sample data.`
init 2021-01-14 21:17:40 +08:00
			`## Contact`
update Readme 2021-04-02 12:28:46 +08:00
			`Please open an issue if you have any questions. We appreciate all contributions to improve our project.`
init 2021-01-14 21:17:40 +08:00
			`## Citation`
update Readme 2021-04-02 12:28:46 +08:00
update output and support bvh 2021-03-13 21:58:16 +08:00			`This project is a part of our work [iMocap](https://zju3dv.github.io/iMoCap/), [Mirrored-Human](https://zju3dv.github.io/Mirrored-Human/) and [Neural Body](https://zju3dv.github.io/neuralbody/)`
Update Readme.md 2021-01-14 23:13:49 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`Please consider citing these works if you find this repo is useful for your projects.`
init 2021-01-14 21:17:40 +08:00
			```bibtex
			`@inproceedings{dong2020motion,`
			`title={Motion capture from internet videos},`
			`author={Dong, Junting and Shuai, Qing and Zhang, Yuanqing and Liu, Xian and Zhou, Xiaowei and Bao, Hujun},`
			`booktitle={European Conference on Computer Vision},`
			`pages={210--227},`
			`year={2020},`
			`organization={Springer}`
			`}`
Update Readme.md 2021-01-14 22:48:55 +08:00
Update Readme.md 2021-03-24 17:03:03 +08:00			`@inproceedings{peng2021neural,`
Update Readme.md 2021-01-14 22:48:55 +08:00			`title={Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans},`
Update Readme.md 2021-01-16 20:40:50 +08:00			`author={Peng, Sida and Zhang, Yuanqing and Xu, Yinghao and Wang, Qianqian and Shuai, Qing and Bao, Hujun and Zhou, Xiaowei},`
Update Readme.md 2021-03-24 17:04:20 +08:00			`booktitle={CVPR},`
Update Readme.md 2021-03-04 10:15:38 +08:00			`year={2021}`
Update Readme.md 2021-01-14 22:48:55 +08:00			`}`
update output and support bvh 2021-03-13 21:58:16 +08:00
fix reference 2021-03-23 09:33:47 +08:00			`@inproceedings{fang2021mirrored,`
			`title={Reconstructing 3D Human Pose by Watching Humans in the Mirror},`
update output and support bvh 2021-03-13 21:58:16 +08:00			`author={Fang, Qi and Shuai, Qing and Dong, Junting and Bao, Hujun and Zhou, Xiaowei},`
fix reference 2021-03-23 09:33:47 +08:00			`booktitle={CVPR},`
update output and support bvh 2021-03-13 21:58:16 +08:00			`year={2021}`
			`}`
:rocket: support SMPL+H/SMPL-X 2021-01-24 22:33:08 +08:00			```

			`## Reference`
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00
:rocket: support SMPL+H/SMPL-X 2021-01-24 22:33:08 +08:00			```bash
			`[1] Loper, Matthew, et al. "SMPL: A skinned multi-person linear model." ACM transactions on graphics (TOG) 34.6 (2015): 1-16.`
			`[2] Romero, Javier, Dimitrios Tzionas, and Michael J. Black. "Embodied hands: Modeling and capturing hands and bodies together." ACM Transactions on Graphics (ToG) 36.6 (2017): 1-17.`
			`[3] Pavlakos, Georgios, et al. "Expressive body capture: 3d hands, face, and body from a single image." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019.`
			`Bogo, Federica, et al. "Keep it SMPL: Automatic estimation of 3D human pose and shape from a single image." European conference on computer vision. Springer, Cham, 2016.`
			`[4] Cao, Z., Hidalgo, G., Simon, T., Wei, S.E., Sheikh, Y.: Openpose: real-time multi-person 2d pose estimation using part affinity fields. arXiv preprint arXiv:1812.08008 (2018)`
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`[5] Kolotouros, Nikos, et al. "Learning to reconstruct 3D human pose and shape via model-fitting in the loop." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019`
			`[6] Bochkovskiy, Alexey, Chien-Yao Wang, and Hong-Yuan Mark Liao. "Yolov4: Optimal speed and accuracy of object detection." arXiv preprint arXiv:2004.10934 (2020).`
			`[7] Sun, Ke, et al. "Deep high-resolution representation learning for human pose estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019.`
Update Readme.md Seems like SMPLH_male.pkl and SMPLH_female.pkl will not be used. 2021-01-27 15:26:55 +08:00			```