Will it ever be possible?
Subject context is another item that must make speech recognition very difficult.
A while back, I was reviewing the official police transcript of an interview with a man who was accused of killing his wife during a scuba accident.
Luckily the video of the same interview was available, as the number of Critical mis-transcriptions was amazing. Here was something that a human had transcribed from a very clear AV feed, and had seriously gotten wrong on several occaisions. Why? Because the transcriber had no knowledge of Scuba terms, and had transcribed what he/she thought they had heard.
But what they thought they had heard was not shaped by any knowledge of the subject, so was wrongly transcribed. I
supposed a compute may one day be able to work out the subject and then apply a specific set of industry / subject terms to it (which is why I guess they make medical specific transcription software) but as a human I can never follow along when my wife switches topic 13 times in one conversation, so how will a computer?