Application Scenario: When I upload an image to generate a talking-head video, I would also like to be able to upload a reference video. In the final result, the character would remain the person from the uploaded image, but their movements...