
[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation


MatAnyone Logo

Stable Video Matting with Consistent Memory Propagation

¹S-Lab, Nanyang Technological University   ²SenseTime Research, Singapore

MatAnyone is a practical human video matting framework supporting target assignment, with stable performance in both semantics of core regions and fine-grained boundary details.

🎥 For more visual results, check out our project page


📮 Update

  • [2025.03] Release our evaluation benchmark - YouTubeMatte.
  • [2025.03] Integrate MatAnyone with Hugging Face 🤗
  • [2025.02] Release inference code and Gradio demo.
  • [2025.02] This repo is created.

🔎 Overview

overall_structure

🔧 Installation

  1. Clone Repo

    git clone https://github.com/pq-yang/MatAnyone
    cd MatAnyone
  2. Create Conda Environment and Install Dependencies

    # create new conda env
    conda create -n matanyone python=3.8 -y
    conda activate matanyone
    
    # install python dependencies
    pip install -e .
    # [optional] install python dependencies for gradio demo
    pip3 install -r hugging_face/requirements.txt
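
To confirm the environment is set up correctly, you can run a quick import check (a minimal sketch; it only verifies that the editable install succeeded and whether a CUDA device is visible to PyTorch):

# sanity_check.py -- verify the editable install and GPU visibility (illustrative)
import torch
import matanyone  # provided by `pip install -e .` above

print("PyTorch version:", torch.__version__)
print("CUDA available :", torch.cuda.is_available())
if torch.cuda.is_available():
    print("GPU            :", torch.cuda.get_device_name(0))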

🤗 Load from Hugging Face

Alternatively, models can be loaded directly from Hugging Face for inference.

pip install -q git+https://github.com/pq-yang/MatAnyone

To extract the foreground and alpha videos, you can directly run the following lines. Please refer to inference_hf.py for more arguments.

from matanyone import InferenceCore
processor = InferenceCore("PeiqingYang/MatAnyone")

foreground_path, alpha_path = processor.process_video(
    input_path = "inputs/video/test-sample1.mp4",
    mask_path = "inputs/mask/test-sample1.png",
    output_path = "outputs"
)
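
The call returns the paths of the saved foreground and alpha videos. As an illustrative follow-up (not part of the MatAnyone API), these two outputs can be composited over a new background with standard alpha blending, continuing from the call above; OpenCV is assumed to be installed and new_background.jpg is a placeholder path:

# composite.py -- blend the returned foreground/alpha pair over a new background (illustrative)
import cv2
import numpy as np

fg_cap = cv2.VideoCapture(foreground_path)   # paths returned by process_video() above
a_cap  = cv2.VideoCapture(alpha_path)
w = int(fg_cap.get(cv2.CAP_PROP_FRAME_WIDTH))
h = int(fg_cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
fps = fg_cap.get(cv2.CAP_PROP_FPS) or 25.0
background = cv2.resize(cv2.imread("new_background.jpg"), (w, h)).astype(np.float32)

writer = cv2.VideoWriter("outputs/composited.mp4",
                         cv2.VideoWriter_fourcc(*"mp4v"), fps, (w, h))
while True:
    ok_f, fgr = fg_cap.read()
    ok_a, pha = a_cap.read()
    if not (ok_f and ok_a):
        break
    alpha = pha.astype(np.float32) / 255.0                 # alpha frames read as 3-channel, values in [0, 255]
    comp = fgr.astype(np.float32) * alpha + background * (1.0 - alpha)  # standard alpha blending
    writer.write(np.clip(comp, 0, 255).astype(np.uint8))
writer.release()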

🔥 Inference

Download Model

Download our pretrained model from MatAnyone v1.0.0 to the pretrained_models folder (the pretrained model can also be downloaded automatically during the first inference).

The directory structure will be arranged as:

pretrained_models
   |- matanyone.pth

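If you prefer a scripted download, the checkpoint can be fetched into pretrained_models with a few lines (a sketch; the asset URL is assumed from GitHub's standard release layout for the v1.0.0 tag and should be checked against the release page):

# download_ckpt.py -- fetch matanyone.pth into pretrained_models/ (illustrative)
import os
import urllib.request

# Assumed asset URL following the usual GitHub release pattern; verify on the v1.0.0 release page.
url = "https://github.com/pq-yang/MatAnyone/releases/download/v1.0.0/matanyone.pth"
os.makedirs("pretrained_models", exist_ok=True)
dst = os.path.join("pretrained_models", "matanyone.pth")
if not os.path.exists(dst):
    urllib.request.urlretrieve(url, dst)
print("checkpoint at", dst)
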
Quick Test

We provide some examples in the inputs folder. For each run, we take a video and its first-frame segmentation mask as input. The segmentation mask could be obtained from interactive segmentation models such as the SAM2 demo (a scripted alternative is sketched after the directory layout below). For example, the directory structure can be arranged as:

inputs
   |- video
      |- test-sample0          # folder containing all frames
      |- test-sample1.mp4      # .mp4, .mov, .avi
   |- mask
      |- test-sample0_1.png    # mask for person 1
      |- test-sample0_2.png    # mask for person 2
      |- test-sample1.png    
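
If you would rather script the first-frame mask than use an interactive demo, something along these lines works with the sam2 package (a minimal sketch; the model id, frame name, and click coordinates are placeholders, and the predictor interface is assumed from the official sam2 repository):

# make_mask.py -- sketch: generate a first-frame mask with SAM2 from a single click
import cv2
import numpy as np
from sam2.sam2_image_predictor import SAM2ImagePredictor

# Placeholder frame path and click point; adjust to your own clip and target.
frame = cv2.cvtColor(cv2.imread("inputs/video/test-sample0/00000.png"), cv2.COLOR_BGR2RGB)
predictor = SAM2ImagePredictor.from_pretrained("facebook/sam2-hiera-large")
predictor.set_image(frame)
masks, scores, _ = predictor.predict(
    point_coords=np.array([[640, 360]]),   # one positive click on the target person
    point_labels=np.array([1]),
)
best = masks[np.argmax(scores)]            # keep the highest-scoring mask
cv2.imwrite("inputs/mask/test-sample0_1.png", (best * 255).astype(np.uint8))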

Run the following command to try it out:

## single target
# short video; 720p
python inference_matanyone.py -i inputs/video/test-sample1.mp4 -m inputs/mask/test-sample1.png
# short video; 1080p
python inference_matanyone.py -i inputs/video/test-sample2.mp4 -m inputs/mask/test-sample2.png
# long video; 1080p
python inference_matanyone.py -i inputs/video/test-sample3.mp4 -m inputs/mask/test-sample3.png

## multiple targets (control by mask)
# obtain matte for target 1
python inference_matanyone.py -i inputs/video/test-sample0 -m inputs/mask/test-sample0_1.png --suffix target1
# obtain matte for target 2
python inference_matanyone.py -i inputs/video/test-sample0 -m inputs/mask/test-sample0_2.png --suffix target2

The results will be saved in the results folder, including the foreground output video and the alpha output video.

  • If you want to save the results as per-frame images, you can set --save_image.
  • If you want to set a limit for the maximum input resolution, you can set --max_size, and the video will be downsampled if min(w, h) exceeds it. By default, no limit is set. A scripted example that combines these flags with the multi-target runs above is sketched below.
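
As referenced above, a small driver script can sweep every mask of a clip in one go while passing the documented flags (a sketch; paths mirror the example inputs layout, and the flag values are only examples):

# batch_matting.py -- sketch: run inference for every target mask of one clip
import glob
import subprocess

video = "inputs/video/test-sample0"
for idx, mask in enumerate(sorted(glob.glob("inputs/mask/test-sample0_*.png")), start=1):
    subprocess.run(
        [
            "python", "inference_matanyone.py",
            "-i", video,
            "-m", mask,
            "--suffix", f"target{idx}",   # one output pair per target
            "--max_size", "1080",         # downsample if min(w, h) exceeds 1080
            "--save_image",               # also save per-frame images
        ],
        check=True,
    )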

🎪 Interactive Demo

To spare you the preparation of a first-frame segmentation mask, we provide a Gradio demo on Hugging Face, which can also be launched locally. Just drop in your video/image, assign the target masks with a few clicks, and get the matting results!

cd hugging_face

# install python dependencies
pip3 install -r requirements.txt # FFmpeg required

# launch the demo
python app.py

Once launched, an interactive interface will appear as follows:

overall_teaser

📊 Evaluation Benchmark

We provide a synthetic benchmark, YouTubeMatte, to enlarge the commonly used VideoMatte240K-Test. A comparison between them is summarized in the table below.

Dataset               #Foregrounds   Source              Harmonized
VideoMatte240K-Test   5              Purchased Footage   ❌
YouTubeMatte          32             YouTube Videos      ✅

It is noteworthy that we applied harmonization (using Harmonizer) when compositing the foreground onto a background. This operation effectively makes YouTubeMatte a more challenging benchmark that is closer to the real distribution. As shown in the figure below, while RVM is confused by the harmonized frame, our method still yields robust performance.
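
For reference, each benchmark frame is built with the standard matting composite I = alpha * F + (1 - alpha) * B; the Harmonizer-based harmonization applied when building YouTubeMatte is not reproduced here, and the snippet below only sketches the composite step (illustrative, NumPy-based):

# composite_step.py -- sketch of the standard matting composite used when building such benchmarks
import numpy as np

def composite(foreground, alpha, background):
    """foreground/background: HxWx3 floats in [0, 1]; alpha: HxW float matte in [0, 1]."""
    alpha = alpha[..., None]                          # broadcast the matte over color channels
    return alpha * foreground + (1.0 - alpha) * background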

harmonization

📑 Citation

If you find our repo useful for your research, please consider citing our paper:

@InProceedings{yang2025matanyone,
    title     = {{MatAnyone}: Stable Video Matting with Consistent Memory Propagation},
    author    = {Yang, Peiqing and Zhou, Shangchen and Zhao, Jixin and Tao, Qingyi and Loy, Chen Change},
    booktitle = {CVPR},
    year      = {2025},
}

📝 License

This project is licensed under NTU S-Lab License 1.0. Redistribution and use should follow this license.

👏 Acknowledgement

This project is built upon Cutie, with the interactive demo adapted from ProPainter, leveraging segmentation capabilities from Segment Anything Model and Segment Anything Model 2. Thanks for their awesome work!


This study is supported under the RIE2020 Industry Alignment Fund - Industry Collaboration Projects (IAF-ICP) Funding Initiative, as well as cash and in-kind contribution from the industry partner(s).

📧 Contact

If you have any questions, please feel free to reach us at peiqingyang99@outlook.com.