You currently offer video translation. I suggest adding audio-only transcription and translation, so that a user only has to upload an audio file (NOT a video).