I strongly recommend adding the ability to generate voice-driven digital human videos from start-frame and end-frame images, as this would bring the generated results closer to what users expect. I previously paid for a membership to test the image...