Many years ago (in the Dragon-type era?) a reviewer defined the problem with voice input perfectly.

At some point there will be an amusing misinterpretation of what you have been saying, and your voice will change slightly because of the incipient smile. That leads to more misinterpretations, until it reaches the point where it is laughter in, garbage out.

A secondary problem is that, for the computer to have a chance of working out what you are really asking, the person speaking has to roboticize themselves. In type you can see the difference between "two", "to" and "too", but spoken aloud they all sound the same.

This is a project much easier to describe in words than to implement.

