EasyMocap/Readme.md

<!--
 * @Date: 2021-01-13 20:32:12
 * @Author: Qing Shuai
 * @LastEditors: Qing Shuai
 * @LastEditTime: 2022-11-03 13:09:58
 * @FilePath: /EasyMocapRelease/Readme.md
-->

<div align="center">
    <img src="logo.png" width="40%">
</div>

**EasyMocap** is an open-source toolbox for **markerless human motion capture** and **novel view synthesis** from RGB videos. In this project, we provide a lot of motion capture demos in different settings.

![python](https://img.shields.io/github/languages/top/zju3dv/EasyMocap)
![star](https://img.shields.io/github/stars/zju3dv/EasyMocap?style=social)

## News

- :tada: Our SIGGRAPH 2022 [**Novel View Synthesis of Human Interactions From Sparse Multi-view Videos**](https://chingswy.github.io/easymocap-public-doc/works/multinb.html) is released! Check the [documentation](https://chingswy.github.io/easymocap-public-doc/works/multinb.html).
- :tada: EasyMocap v0.2 is released! We support motion capture from Internet videos. Please check the [Quick Start](https://chingswy.github.io/easymocap-public-doc/quickstart/quickstart.html) for more details.


---

## Core features

### Multiple views of a single person

[![report](https://img.shields.io/badge/quickstart-green)](./doc/quickstart.md) [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1Cyvu_lPFUajr2RKt6yJIfS3HQIIYl6QU?usp=sharing)

This is the basic code for fitting SMPL[^loper2015]/SMPL+H[^romero2017]/SMPL-X[^pavlakos2019]/MANO[^romero2017] model to capture body+hand+face poses from multiple views.

<div align="center">
    <img src="doc/feng/mv1pmf-smplx.gif" width="80%">
    <br>
    <img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/mv1p-dance-smpl.gif" width="80%">
    <br>
    <sup>Videos are from ZJU-MoCap, with 23 calibrated and synchronized cameras.</sup>
</div>

<div align="center">
    <img src="doc/feng/mano.gif" width="80%">
    <br>
    <sup>Captured with 8 cameras.</sup>
</div>

### Internet video

This part is the basic code for fitting SMPL[^loper2015] with 2D keypoints estimation[^cao2018][^hrnet] and CNN initialization[^kolotouros2019].

<div align="center">
    <img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/23EfsN7vEOA%2B003170%2B003670.gif" width="80%">
    <br>
    <sup>The raw video is from <a href="https://www.youtube.com/watch?v=23EfsN7vEOA">Youtube</a>.</sup>
</div>

### Internet video with a mirror

[![report](https://img.shields.io/badge/CVPR21-mirror-red)](https://arxiv.org/pdf/2104.00340.pdf) [![quickstart](https://img.shields.io/badge/quickstart-green)](https://github.com/zju3dv/Mirrored-Human)

<div align="center">
    <img src="https://raw.githubusercontent.com/zju3dv/Mirrored-Human/main/doc/assets/smpl-avatar.gif" width="80%">
    <br>
    <sup>The raw video is from <a href="https://www.youtube.com/watch?v=KOCJJ27hhIE">Youtube</a>.</sup>
</div>


### Multiple Internet videos with a specific action (Coming soon)

[![report](https://img.shields.io/badge/ECCV20-imocap-red)](https://arxiv.org/pdf/2008.07931.pdf) [![quickstart](https://img.shields.io/badge/quickstart-green)](./doc/todo.md)

<div align="center">
    <img src="doc/imocap/imocap.gif" width="80%"><br/>
    <sup>Internet videos of Roger Federer's serving</sup>
</div>

### Multiple views of multiple people

[![report](https://img.shields.io/badge/CVPR19-mvpose-red)](https://arxiv.org/pdf/1901.04111.pdf) [![quickstart](https://img.shields.io/badge/quickstart-green)](./doc/mvmp.md)

<div align="center">
    <img src="doc/assets/mvmp1f.gif" width="80%"><br/>
    <sup>Captured with 8 consumer cameras</sup>
</div>

### Novel view synthesis from sparse views
[![report](https://img.shields.io/badge/CVPR21-neuralbody-red)](https://arxiv.org/pdf/2012.15838.pdf) [![quickstart](https://img.shields.io/badge/quickstart-green)](https://github.com/zju3dv/neuralbody)

<div align="center">
    <img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/female-ballet.gif" width="80%"><br/>
    <sup>Novel view synthesis for chanllenge motion(coming soon)</sup>
</div>

<div align="center">
    <img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/nvs_mp_soccer1_6_rgb.gif" width="80%"><br/>
    <sup>Novel view synthesis for human interaction(coming soon)</sup>
</div>

    
## ZJU-MoCap

With our proposed method, we release two large dataset of human motion: LightStage and Mirrored-Human. See the [website](https://chingswy.github.io/Dataset-Demo/) for more details.

If you would like to download the ZJU-Mocap dataset, please sign the [agreement](https://pengsida.net/project_page_assets/files/ZJU-MoCap_Agreement.pdf), and email it to Qing Shuai (s_q@zju.edu.cn) and cc Xiaowei Zhou (xwzhou@zju.edu.cn) to request the download link.

<div align="center">
    <img src="doc/assets/ZJU-MoCap-lightstage.jpg" width="80%"><br/>
    <sup>LightStage: captured with LightStage system</sup>
</div>

<div align="center">
    <img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/mirrored-human.jpg" width="80%"><br/>
    <sup>Mirrored-Human: collected from the Internet</sup>
</div>

## Other features

### 3D Realtime visualization
[![quickstart](https://img.shields.io/badge/quickstart-green)](./doc/realtime_visualization.md)
<div align="center">
    <img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/assets/vis3d/skel-body25.gif" width="26%">
    <img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/assets/vis3d/skel-total.gif" width="26%">
    <img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/assets/vis3d/skel-multi.gif" width="26%">
</div>

<div align="center">
    <img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/assets/vis3d/mesh-smpl.gif" width="26%">
    <img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/assets/vis3d/mesh-smplx.gif" width="26%">
    <img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/assets/vis3d/mesh-manol.gif" width="26%">
</div>

### [Camera calibration](apps/calibration/Readme.md)

<div align="center">
    <img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/calib_intri.jpg" width="40%">
    <img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/calib_extri.jpg" width="40%">
    <br>
    <sup>Calibration for intrinsic and extrinsic parameters</sup>
</div>

### [Annotator](apps/annotation/Readme.md)

<div align="center">
    <img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/annot_keypoints.jpg" width="40%">
    <img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/annot_mask.jpg" width="40%">
    <br>
    <sup>Annotator for bounding box, keypoints and mask</sup>
</div>


## Updates
- 11/03/2022: Support MultiNeuralBody.
- 12/25/2021: Support mediapipe keypoints detector.
- 08/09/2021: Add a colab demo [here](https://colab.research.google.com/drive/1Cyvu_lPFUajr2RKt6yJIfS3HQIIYl6QU?usp=sharing).
- 06/28/2021: The **Multi-view Multi-person** part is released!
- 06/10/2021: The **real-time 3D visualization** part is released!
- 04/11/2021: The calibration tool and the annotator are released.
- 04/11/2021: **Mirrored-Human** part is released.

## Installation

See [documentation](https://chingswy.github.io/easymocap-public-doc/install/install.html) for more instructions.

## Acknowledgements

Here are the great works this project is built upon:

- SMPL models and layer are from MPII [SMPL-X model](https://github.com/vchoutas/smplx).
- Some functions are borrowed from [SPIN](https://github.com/nkolot/SPIN), [VIBE](https://github.com/mkocabas/VIBE), [SMPLify-X](https://github.com/vchoutas/smplify-x)
- The method for fitting 3D skeleton and SMPL model is similar to [SMPLify-X](https://github.com/vchoutas/smplify-x)(with 3D keypoints loss), [TotalCapture](http://www.cs.cmu.edu/~hanbyulj/totalcapture/)(without using point clouds).
- We integrate some easy-to-use functions for previous great work:
  - `easymocap/estimator/mediapipe_wrapper.py`: [MediaPipe](https://github.com/google/mediapipe)
  - `easymocap/estimator/SPIN`  : an SMPL estimator[^cao2018]
  - `easymocap/estimator/YOLOv4`: an object detector[^kolotouros2019]
  - `easymocap/estimator/HRNet` : a 2D human pose estimator[^bochkovskiy2020]

## Contact

Please open an issue if you have any questions. We appreciate all contributions to improve our project.
    

## Contributor

EasyMocap is **built by** researchers from the 3D vision group of Zhejiang University: [**Qing Shuai**](https://chingswy.github.io/), [**Qi Fang**](https://raypine.github.io/), [**Junting Dong**](https://jtdong.com/), [**Sida Peng**](https://pengsida.net/), **Di Huang**, [**Hujun Bao**](http://www.cad.zju.edu.cn/home/bao/), **and** [**Xiaowei Zhou**](https://xzhou.me/). 

We would like to thank Wenduo Feng, Di Huang, Yuji Chen, Hao Xu, Qing Shuai, Qi Fang, Ting Xie, Junting Dong, Sida Peng and Xiaopeng Ji who are the performers in the sample data. We would also like to thank all the people who has helped EasyMocap [in any way](https://github.com/zju3dv/EasyMocap/graphs/contributors).

## Citation

This project is a part of our work [iMocap](https://zju3dv.github.io/iMoCap/), [Mirrored-Human](https://zju3dv.github.io/Mirrored-Human/), [mvpose](https://zju3dv.github.io/mvpose/), [Neural Body](https://zju3dv.github.io/neuralbody/), [MultiNeuralBody](https://chingswy.github.io/easymocap-public-doc/works/multinb.html), [enerf]().

Please consider citing these works if you find this repo is useful for your projects.

```bibtex
@Misc{easymocap,  
    title = {EasyMoCap - Make human motion capture easier.},
    howpublished = {Github},  
    year = {2021},
    url = {https://github.com/zju3dv/EasyMocap}
}

@inproceedings{shuai2022multinb,
  title={Novel View Synthesis of Human Interactions from Sparse
Multi-view Videos},
  author={Shuai, Qing and Geng, Chen and Fang, Qi and Peng, Sida and Shen, Wenhao and Zhou, Xiaowei and Bao, Hujun},
  booktitle={SIGGRAPH Conference Proceedings},
  year={2022}
}

@inproceedings{lin2022efficient,
  title={Efficient Neural Radiance Fields for Interactive Free-viewpoint Video},
  author={Lin, Haotong and Peng, Sida and Xu, Zhen and Yan, Yunzhi and Shuai, Qing and Bao, Hujun and Zhou, Xiaowei},
  booktitle={SIGGRAPH Asia Conference Proceedings},
  year={2022}
}

@inproceedings{dong2021fast,
  title={Fast and Robust Multi-Person 3D Pose Estimation and Tracking from Multiple Views},
  author={Dong, Junting and Fang, Qi and Jiang, Wen and Yang, Yurou and Bao, Hujun and Zhou, Xiaowei},
  booktitle={T-PAMI},
  year={2021}
}
    
@inproceedings{dong2020motion,
  title={Motion capture from internet videos},
  author={Dong, Junting and Shuai, Qing and Zhang, Yuanqing and Liu, Xian and Zhou, Xiaowei and Bao, Hujun},
  booktitle={European Conference on Computer Vision},
  pages={210--227},
  year={2020},
  organization={Springer}
}

@inproceedings{peng2021neural,
  title={Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans},
  author={Peng, Sida and Zhang, Yuanqing and Xu, Yinghao and Wang, Qianqian and Shuai, Qing and Bao, Hujun and Zhou, Xiaowei},
  booktitle={CVPR},
  year={2021}
}

@inproceedings{fang2021mirrored,
  title={Reconstructing 3D Human Pose by Watching Humans in the Mirror},
  author={Fang, Qi and Shuai, Qing and Dong, Junting and Bao, Hujun and Zhou, Xiaowei},
  booktitle={CVPR},
  year={2021}
}

```

[^loper2015]: Loper, Matthew, et al. "SMPL: A skinned multi-person linear model." ACM transactions on graphics (TOG) 34.6 (2015): 1-16.

[^romero2017]: Romero, Javier, Dimitrios Tzionas, and Michael J. Black. "Embodied hands: Modeling and capturing hands and bodies together." ACM Transactions on Graphics (ToG) 36.6 (2017): 1-17.

[^pavlakos2019]: Pavlakos, Georgios, et al. "Expressive body capture: 3d hands, face, and body from a single image." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019.

<!-- [4] Bogo, Federica, et al. "Keep it SMPL: Automatic estimation of 3D human pose and shape from a single image." European conference on computer vision. Springer, Cham, 2016. -->

[^cao2018]: Cao, Z., Hidalgo, G., Simon, T., Wei, S.E., Sheikh, Y.: Openpose: real-time multi-person 2d pose estimation using part affinity fields. arXiv preprint arXiv:1812.08008 (2018)

[^kolotouros2019]: Kolotouros, Nikos, et al. "Learning to reconstruct 3D human pose and shape via model-fitting in the loop." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019

[^bochkovskiy2020]: Bochkovskiy, Alexey, Chien-Yao Wang, and Hong-Yuan Mark Liao. "Yolov4: Optimal speed and accuracy of object detection." arXiv preprint arXiv:2004.10934 (2020).

[^hrnet]: Sun, Ke, et al. "Deep high-resolution representation learning for human pose estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019.
init 2021-01-14 21:17:40 +08:00			`<!--`
			`* @Date: 2021-01-13 20:32:12`
			`* @Author: Qing Shuai`
			`* @LastEditors: Qing Shuai`
:memo: update readme 2022-11-03 13:11:37 +08:00			`* @LastEditTime: 2022-11-03 13:09:58`
update doc 2021-01-14 21:22:44 +08:00			`* @FilePath: /EasyMocapRelease/Readme.md`
init 2021-01-14 21:17:40 +08:00			`-->`
update output and support bvh 2021-03-13 21:58:16 +08:00
update readme 2022-08-08 13:43:40 +08:00			`<div align="center">`
			`<img src="logo.png" width="40%">`
Update Readme.md 2021-08-21 10:42:11 +08:00			`</div>`

update readme 2022-08-08 13:43:40 +08:00			`EasyMocap is an open-source toolbox for markerless human motion capture and novel view synthesis from RGB videos. In this project, we provide a lot of motion capture demos in different settings.`
update readme 2021-01-17 21:08:07 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`![python](https://img.shields.io/github/languages/top/zju3dv/EasyMocap)`
			`![star](https://img.shields.io/github/stars/zju3dv/EasyMocap?style=social)`
init 2021-01-14 21:17:40 +08:00
update readme 2022-08-08 13:43:40 +08:00			`## News`

:memo: update readme 2022-11-03 13:11:37 +08:00			`- :tada: Our SIGGRAPH 2022 [Novel View Synthesis of Human Interactions From Sparse Multi-view Videos](https://chingswy.github.io/easymocap-public-doc/works/multinb.html) is released! Check the [documentation](https://chingswy.github.io/easymocap-public-doc/works/multinb.html).`
:tada: update readme 2022-08-21 16:18:33 +08:00			`- :tada: EasyMocap v0.2 is released! We support motion capture from Internet videos. Please check the [Quick Start](https://chingswy.github.io/easymocap-public-doc/quickstart/quickstart.html) for more details.`
:memo: update readme 2022-11-03 13:11:37 +08:00
update readme 2022-08-08 13:43:40 +08:00
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`---`
init 2021-01-14 21:17:40 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`## Core features`
update readme 2021-01-17 21:08:07 +08:00
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`### Multiple views of a single person`
init 2021-01-14 21:17:40 +08:00
Add colab demo 2021-08-09 19:17:14 +08:00			`[![report](https://img.shields.io/badge/quickstart-green)](./doc/quickstart.md) [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1Cyvu_lPFUajr2RKt6yJIfS3HQIIYl6QU?usp=sharing)`
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00
Markdown footnote syntax 2022-01-15 20:47:47 +08:00			`This is the basic code for fitting SMPL[^loper2015]/SMPL+H[^romero2017]/SMPL-X[^pavlakos2019]/MANO[^romero2017] model to capture body+hand+face poses from multiple views.`
update Readme 2021-01-14 21:41:31 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`<div align="center">`
			`<img src="doc/feng/mv1pmf-smplx.gif" width="80%">`
			`<br>`
:memo: update figures 2021-07-21 15:07:37 +08:00			`<img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/mv1p-dance-smpl.gif" width="80%">`
			`<br>`
:memo: add demo gif 2022-05-08 19:55:56 +08:00			`<sup>Videos are from ZJU-MoCap, with 23 calibrated and synchronized cameras.</sup>`
update Readme 2021-04-02 12:28:46 +08:00			`</div>`
update output and support bvh 2021-03-13 21:58:16 +08:00
:memo: update Readme 2021-06-14 16:47:16 +08:00			`<div align="center">`
			`<img src="doc/feng/mano.gif" width="80%">`
			`<br>`
:memo: add demo gif 2022-05-08 19:55:56 +08:00			`<sup>Captured with 8 cameras.</sup>`
			`</div>`

:tada: update readme 2022-08-21 16:18:33 +08:00			`### Internet video`
:memo: add demo gif 2022-05-08 19:55:56 +08:00
:memo: update Readme 2022-05-08 20:05:25 +08:00			`This part is the basic code for fitting SMPL[^loper2015] with 2D keypoints estimation[^cao2018][^hrnet] and CNN initialization[^kolotouros2019].`
:memo: add demo gif 2022-05-08 19:55:56 +08:00
			`<div align="center">`
			`<img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/23EfsN7vEOA%2B003170%2B003670.gif" width="80%">`
			`<br>`
			`<sup>The raw video is from <a href="https://www.youtube.com/watch?v=23EfsN7vEOA">Youtube</a>.</sup>`
:memo: update Readme 2021-06-14 16:47:16 +08:00			`</div>`

update Readme 2021-04-02 12:28:46 +08:00			`### Internet video with a mirror`
update Readme 2021-01-14 21:41:31 +08:00
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`[![report](https://img.shields.io/badge/CVPR21-mirror-red)](https://arxiv.org/pdf/2104.00340.pdf) [![quickstart](https://img.shields.io/badge/quickstart-green)](https://github.com/zju3dv/Mirrored-Human)`
update Readme 2021-01-14 21:41:31 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`<div align="center">`
			`<img src="https://raw.githubusercontent.com/zju3dv/Mirrored-Human/main/doc/assets/smpl-avatar.gif" width="80%">`
			`<br>`
:memo: add demo gif 2022-05-08 19:55:56 +08:00			`<sup>The raw video is from <a href="https://www.youtube.com/watch?v=KOCJJ27hhIE">Youtube</a>.</sup>`
:memo: add results 2021-04-14 16:03:38 +08:00			`</div>`

update output and support bvh 2021-03-13 21:58:16 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`### Multiple Internet videos with a specific action (Coming soon)`
init 2021-01-14 21:17:40 +08:00
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`[![report](https://img.shields.io/badge/ECCV20-imocap-red)](https://arxiv.org/pdf/2008.07931.pdf) [![quickstart](https://img.shields.io/badge/quickstart-green)](./doc/todo.md)`
update output and support bvh 2021-03-13 21:58:16 +08:00
:memo: add results 2021-04-14 16:03:38 +08:00			`<div align="center">`
			`<img src="doc/imocap/imocap.gif" width="80%"><br/>`
:memo: add demo gif 2022-05-08 19:55:56 +08:00			`<sup>Internet videos of Roger Federer's serving</sup>`
:memo: add results 2021-04-14 16:03:38 +08:00			`</div>`

[vis] update realtime visualization 2021-06-28 12:14:56 +08:00			`### Multiple views of multiple people`
init 2021-01-14 21:17:40 +08:00
:memo: add link to dataset 2021-07-07 12:03:18 +08:00			`[![report](https://img.shields.io/badge/CVPR19-mvpose-red)](https://arxiv.org/pdf/1901.04111.pdf) [![quickstart](https://img.shields.io/badge/quickstart-green)](./doc/mvmp.md)`
update output and support bvh 2021-03-13 21:58:16 +08:00
:memo: add results 2021-04-14 16:03:38 +08:00			`<div align="center">`
[vis] update realtime visualization 2021-06-28 12:14:56 +08:00			`<img src="doc/assets/mvmp1f.gif" width="80%"><br/>`
:memo: add demo gif 2022-05-08 19:55:56 +08:00			`<sup>Captured with 8 consumer cameras</sup>`
:memo: add results 2021-04-14 16:03:38 +08:00			`</div>`

:memo: update Readme 2021-06-14 16:47:16 +08:00			`### Novel view synthesis from sparse views`
			`[![report](https://img.shields.io/badge/CVPR21-neuralbody-red)](https://arxiv.org/pdf/2012.15838.pdf) [![quickstart](https://img.shields.io/badge/quickstart-green)](https://github.com/zju3dv/neuralbody)`
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00
:memo: update Readme 2021-06-14 16:47:16 +08:00			`<div align="center">`
:memo: add demo gif 2022-05-08 19:55:56 +08:00			`<img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/female-ballet.gif" width="80%"><br/>`
			`<sup>Novel view synthesis for chanllenge motion(coming soon)</sup>`
:memo: update Readme 2021-06-14 16:47:16 +08:00			`</div>`
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00
Update Readme.md 2021-11-21 20:23:22 +08:00			`<div align="center">`
:memo: add demo gif 2022-05-08 19:55:56 +08:00			`<img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/nvs_mp_soccer1_6_rgb.gif" width="80%"><br/>`
			`<sup>Novel view synthesis for human interaction(coming soon)</sup>`
Update Readme.md 2021-11-21 20:23:22 +08:00			`</div>`


:memo: add link to dataset 2021-07-07 12:03:18 +08:00			`## ZJU-MoCap`

:memo: update figures 2021-07-21 15:07:37 +08:00			`With our proposed method, we release two large dataset of human motion: LightStage and Mirrored-Human. See the [website](https://chingswy.github.io/Dataset-Demo/) for more details.`

Update Readme.md 2023-03-27 19:40:36 +08:00			`If you would like to download the ZJU-Mocap dataset, please sign the [agreement](https://pengsida.net/project_page_assets/files/ZJU-MoCap_Agreement.pdf), and email it to Qing Shuai (s_q@zju.edu.cn) and cc Xiaowei Zhou (xwzhou@zju.edu.cn) to request the download link.`
Update Readme.md 2022-01-24 21:23:48 +08:00
:memo: update figures 2021-07-21 15:07:37 +08:00			`<div align="center">`
			`<img src="doc/assets/ZJU-MoCap-lightstage.jpg" width="80%"><br/>`
:memo: add demo gif 2022-05-08 19:55:56 +08:00			`<sup>LightStage: captured with LightStage system</sup>`
:memo: update figures 2021-07-21 15:07:37 +08:00			`</div>`

			`<div align="center">`
			`<img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/mirrored-human.jpg" width="80%"><br/>`
:memo: add demo gif 2022-05-08 19:55:56 +08:00			`<sup>Mirrored-Human: collected from the Internet</sup>`
:memo: update figures 2021-07-21 15:07:37 +08:00			`</div>`
:memo: add link to dataset 2021-07-07 12:03:18 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`## Other features`
init 2021-01-14 21:17:40 +08:00
:rocket: update mvmp 2021-06-28 19:37:15 +08:00			`### 3D Realtime visualization`
			`[![quickstart](https://img.shields.io/badge/quickstart-green)](./doc/realtime_visualization.md)`
			`<div align="center">`
			`<img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/assets/vis3d/skel-body25.gif" width="26%">`
			`<img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/assets/vis3d/skel-total.gif" width="26%">`
			`<img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/assets/vis3d/skel-multi.gif" width="26%">`
			`</div>`

			`<div align="center">`
			`<img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/assets/vis3d/mesh-smpl.gif" width="26%">`
			`<img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/assets/vis3d/mesh-smplx.gif" width="26%">`
			`<img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/assets/vis3d/mesh-manol.gif" width="26%">`
			`</div>`

:memo: update figures 2021-07-21 21:49:28 +08:00			`### [Camera calibration](apps/calibration/Readme.md)`

			`<div align="center">`
			`<img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/calib_intri.jpg" width="40%">`
			`<img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/calib_extri.jpg" width="40%">`
			`<br>`
:memo: add demo gif 2022-05-08 19:55:56 +08:00			`<sup>Calibration for intrinsic and extrinsic parameters</sup>`
:memo: update figures 2021-07-21 21:49:28 +08:00			`</div>`

			`### [Annotator](apps/annotation/Readme.md)`

			`<div align="center">`
			`<img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/annot_keypoints.jpg" width="40%">`
			`<img src="https://raw.githubusercontent.com/chingswy/Dataset-Demo/main/EasyMocap/annot_mask.jpg" width="40%">`
			`<br>`
:memo: add demo gif 2022-05-08 19:55:56 +08:00			`<sup>Annotator for bounding box, keypoints and mask</sup>`
:memo: update figures 2021-07-21 21:49:28 +08:00			`</div>`

update output and support bvh 2021-03-13 21:58:16 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`## Updates`
:memo: update readme 2022-11-03 13:11:37 +08:00			`- 11/03/2022: Support MultiNeuralBody.`
support mediapipe 2021-12-25 15:26:56 +08:00			`- 12/25/2021: Support mediapipe keypoints detector.`
Add colab demo 2021-08-09 19:17:14 +08:00			`- 08/09/2021: Add a colab demo [here](https://colab.research.google.com/drive/1Cyvu_lPFUajr2RKt6yJIfS3HQIIYl6QU?usp=sharing).`
[vis] update realtime visualization 2021-06-28 12:14:56 +08:00			`- 06/28/2021: The Multi-view Multi-person part is released!`
:memo: update Readme 2021-06-14 16:47:16 +08:00			`- 06/10/2021: The real-time 3D visualization part is released!`
			`- 04/11/2021: The calibration tool and the annotator are released.`
			`- 04/11/2021: Mirrored-Human part is released.`
update output and support bvh 2021-03-13 21:58:16 +08:00
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`## Installation`
init 2021-01-14 21:17:40 +08:00
:memo: update readme 2022-11-03 13:11:37 +08:00			`See [documentation](https://chingswy.github.io/easymocap-public-doc/install/install.html) for more instructions.`
:rocket: support SMPL+H/SMPL-X 2021-01-24 22:33:08 +08:00
init 2021-01-14 21:17:40 +08:00			`## Acknowledgements`
update Readme 2021-04-02 12:28:46 +08:00
Update Readme.md 2021-01-14 23:13:49 +08:00			`Here are the great works this project is built upon:`
init 2021-01-14 21:17:40 +08:00
Update Readme.md 2021-01-14 23:13:49 +08:00			`- SMPL models and layer are from MPII [SMPL-X model](https://github.com/vchoutas/smplx).`
init 2021-01-14 21:17:40 +08:00			`- Some functions are borrowed from [SPIN](https://github.com/nkolot/SPIN), [VIBE](https://github.com/mkocabas/VIBE), [SMPLify-X](https://github.com/vchoutas/smplify-x)`
:memo: update readme 2022-11-03 13:11:37 +08:00			`- The method for fitting 3D skeleton and SMPL model is similar to [SMPLify-X](https://github.com/vchoutas/smplify-x)(with 3D keypoints loss), [TotalCapture](http://www.cs.cmu.edu/~hanbyulj/totalcapture/)(without using point clouds).`
:rocket: update to v0.2 2021-04-14 15:22:51 +08:00			`- We integrate some easy-to-use functions for previous great work:`
support mediapipe 2021-12-25 15:26:56 +08:00			- `easymocap/estimator/mediapipe_wrapper.py`: [MediaPipe](https://github.com/google/mediapipe)
Markdown footnote syntax 2022-01-15 20:47:47 +08:00			- `easymocap/estimator/SPIN` : an SMPL estimator[^cao2018]
:memo: update readme 2022-11-03 13:11:37 +08:00			- `easymocap/estimator/YOLOv4`: an object detector[^kolotouros2019]
			- `easymocap/estimator/HRNet` : a 2D human pose estimator[^bochkovskiy2020]
init 2021-01-14 21:17:40 +08:00
			`## Contact`
update Readme 2021-04-02 12:28:46 +08:00
			`Please open an issue if you have any questions. We appreciate all contributions to improve our project.`
Update Readme.md 2021-11-21 20:23:22 +08:00
init 2021-01-14 21:17:40 +08:00
:memo: update figures 2021-07-21 15:07:37 +08:00			`## Contributor`

Update Readme.md 2021-07-26 11:49:57 +08:00			`EasyMocap is built by researchers from the 3D vision group of Zhejiang University: [Qing Shuai](https://chingswy.github.io/), [Qi Fang](https://raypine.github.io/), [Junting Dong](https://jtdong.com/), [Sida Peng](https://pengsida.net/), Di Huang, [Hujun Bao](http://www.cad.zju.edu.cn/home/bao/), and [Xiaowei Zhou](https://xzhou.me/).`
:memo: update figures 2021-07-21 15:07:37 +08:00
			`We would like to thank Wenduo Feng, Di Huang, Yuji Chen, Hao Xu, Qing Shuai, Qi Fang, Ting Xie, Junting Dong, Sida Peng and Xiaopeng Ji who are the performers in the sample data. We would also like to thank all the people who has helped EasyMocap [in any way](https://github.com/zju3dv/EasyMocap/graphs/contributors).`

init 2021-01-14 21:17:40 +08:00			`## Citation`
update Readme 2021-04-02 12:28:46 +08:00
:memo: update readme 2022-11-03 13:11:37 +08:00			`This project is a part of our work [iMocap](https://zju3dv.github.io/iMoCap/), [Mirrored-Human](https://zju3dv.github.io/Mirrored-Human/), [mvpose](https://zju3dv.github.io/mvpose/), [Neural Body](https://zju3dv.github.io/neuralbody/), [MultiNeuralBody](https://chingswy.github.io/easymocap-public-doc/works/multinb.html), [enerf]().`
Update Readme.md 2021-01-14 23:13:49 +08:00
update Readme 2021-04-02 12:28:46 +08:00			`Please consider citing these works if you find this repo is useful for your projects.`
init 2021-01-14 21:17:40 +08:00
			```bibtex
:memo: update Readme 2022-05-08 20:05:25 +08:00			`@Misc{easymocap,`
			`title = {EasyMoCap - Make human motion capture easier.},`
			`howpublished = {Github},`
			`year = {2021},`
			`url = {https://github.com/zju3dv/EasyMocap}`
			`}`

:memo: update readme 2022-11-03 13:11:37 +08:00			`@inproceedings{shuai2022multinb,`
			`title={Novel View Synthesis of Human Interactions from Sparse`
			`Multi-view Videos},`
			`author={Shuai, Qing and Geng, Chen and Fang, Qi and Peng, Sida and Shen, Wenhao and Zhou, Xiaowei and Bao, Hujun},`
			`booktitle={SIGGRAPH Conference Proceedings},`
			`year={2022}`
			`}`

			`@inproceedings{lin2022efficient,`
			`title={Efficient Neural Radiance Fields for Interactive Free-viewpoint Video},`
			`author={Lin, Haotong and Peng, Sida and Xu, Zhen and Yan, Yunzhi and Shuai, Qing and Bao, Hujun and Zhou, Xiaowei},`
			`booktitle={SIGGRAPH Asia Conference Proceedings},`
			`year={2022}`
			`}`

Update Readme.md 2021-12-04 22:24:34 +08:00			`@inproceedings{dong2021fast,`
			`title={Fast and Robust Multi-Person 3D Pose Estimation and Tracking from Multiple Views},`
			`author={Dong, Junting and Fang, Qi and Jiang, Wen and Yang, Yurou and Bao, Hujun and Zhou, Xiaowei},`
			`booktitle={T-PAMI},`
			`year={2021}`
			`}`

init 2021-01-14 21:17:40 +08:00			`@inproceedings{dong2020motion,`
			`title={Motion capture from internet videos},`
			`author={Dong, Junting and Shuai, Qing and Zhang, Yuanqing and Liu, Xian and Zhou, Xiaowei and Bao, Hujun},`
			`booktitle={European Conference on Computer Vision},`
			`pages={210--227},`
			`year={2020},`
			`organization={Springer}`
			`}`
Update Readme.md 2021-01-14 22:48:55 +08:00
Update Readme.md 2021-03-24 17:03:03 +08:00			`@inproceedings{peng2021neural,`
Update Readme.md 2021-01-14 22:48:55 +08:00			`title={Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans},`
Update Readme.md 2021-01-16 20:40:50 +08:00			`author={Peng, Sida and Zhang, Yuanqing and Xu, Yinghao and Wang, Qianqian and Shuai, Qing and Bao, Hujun and Zhou, Xiaowei},`
Update Readme.md 2021-03-24 17:04:20 +08:00			`booktitle={CVPR},`
Update Readme.md 2021-03-04 10:15:38 +08:00			`year={2021}`
Update Readme.md 2021-01-14 22:48:55 +08:00			`}`
update output and support bvh 2021-03-13 21:58:16 +08:00
fix reference 2021-03-23 09:33:47 +08:00			`@inproceedings{fang2021mirrored,`
			`title={Reconstructing 3D Human Pose by Watching Humans in the Mirror},`
update output and support bvh 2021-03-13 21:58:16 +08:00			`author={Fang, Qi and Shuai, Qing and Dong, Junting and Bao, Hujun and Zhou, Xiaowei},`
fix reference 2021-03-23 09:33:47 +08:00			`booktitle={CVPR},`
update output and support bvh 2021-03-13 21:58:16 +08:00			`year={2021}`
			`}`
:memo: update mvpose 2021-07-12 20:22:09 +08:00
:rocket: support SMPL+H/SMPL-X 2021-01-24 22:33:08 +08:00			```

Markdown footnote syntax 2022-01-15 20:47:47 +08:00			`[^loper2015]: Loper, Matthew, et al. "SMPL: A skinned multi-person linear model." ACM transactions on graphics (TOG) 34.6 (2015): 1-16.`

			`[^romero2017]: Romero, Javier, Dimitrios Tzionas, and Michael J. Black. "Embodied hands: Modeling and capturing hands and bodies together." ACM Transactions on Graphics (ToG) 36.6 (2017): 1-17.`

			`[^pavlakos2019]: Pavlakos, Georgios, et al. "Expressive body capture: 3d hands, face, and body from a single image." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019.`

			`<!-- [4] Bogo, Federica, et al. "Keep it SMPL: Automatic estimation of 3D human pose and shape from a single image." European conference on computer vision. Springer, Cham, 2016. -->`

			`[^cao2018]: Cao, Z., Hidalgo, G., Simon, T., Wei, S.E., Sheikh, Y.: Openpose: real-time multi-person 2d pose estimation using part affinity fields. arXiv preprint arXiv:1812.08008 (2018)`

			`[^kolotouros2019]: Kolotouros, Nikos, et al. "Learning to reconstruct 3D human pose and shape via model-fitting in the loop." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019`

			`[^bochkovskiy2020]: Bochkovskiy, Alexey, Chien-Yao Wang, and Hong-Yuan Mark Liao. "Yolov4: Optimal speed and accuracy of object detection." arXiv preprint arXiv:2004.10934 (2020).`

:memo: update Readme 2022-05-08 20:05:25 +08:00			`[^hrnet]: Sun, Ke, et al. "Deep high-resolution representation learning for human pose estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019.`