New Human Video Matting Framework
MatAnyone 2 introduces a state-of-the-art human video matting framework with a Matting Quality Evaluator (MQE) and a new VMReal dataset (28k clips, 2.4M frames).
The framework includes a Matting Quality Evaluator (MQE) that could standardize how matting results are judged, pushing the field towards more reliable benchmarks. This is particularly useful as the field currently lacks a consistent, objective way to assess matting quality. The VMReal dataset, with its 28k clips and 2.4M frames, offers a substantial resource for training and evaluating human video matting models. Its scale could enable models to generalize better across diverse real-world scenarios. MatAnyone's work addresses a key challenge in video editing and augmented reality: accurately separating foreground elements (people) from the background. Improved matting quality directly translates to more seamless and realistic visual effects.