Now the Google Speech Technology team is speaking about the challenges they faced to get auto-captioning operational. Their vision was to create accurate captions for all videos in all languages, but had to deal with huge vocabularies, background noise, poor recordings, accent variability, and distinguishing between song and speech.- Google’s approach is to deliver captions from the cloud, given them the ability to rapidly iterate and model at a large scale.