This looks remarkably like
something I saw demonstrated at IBC, maybe ten years ago: a self-indexing content aggregation system that indexed metadata and audio from bulk video stores. I don't recall how good its speech-to-text was, but I believe it read subtitles too if they were present in the video stream, which would have probably been a lot more accurate.