When a person starts speaking in a video their lips are already forming the word. Example is the pursing of the lips to form the word "Welcome". The person speaking has a contorted face before speaking. Is there a way to use an amount of time before...