Hi Berni!
The first and the main question, did you try manually Google Speech and does it provide you satisfactory results? if yes, that's great, everything other described I can do.
After we will have working prototype for audio, I can do for video, it's a bit more programming, but still nothing too complicated.
Feel free to ask your questions
Thanks, Alex