Yep, I said that a long way up there. IA64, VLIW, do the optimisation at compile-time, not run-time. Problem solved.

Not so fast, Mr. Bond. Getting the compiler working and producing efficient code proved hard. And you still need to drain the pipeline on occasion.
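
For a feel of what "do the optimisation at compile-time" means in practice, here is a toy sketch of a compiler packing independent operations into fixed-width instruction bundles. Everything in it is made up for illustration: the `Op` type, the `schedule` function, the `width` parameter, unit latency for every operation, and the assumption that each register is written only once. It is a plain greedy list scheduler, not what the IA64 or Multiflow compilers actually did.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Op:
    """Hypothetical operation: writes `dest`, reads the registers in `srcs`."""
    name: str
    dest: str
    srcs: tuple = ()


def schedule(ops, width=2):
    """Greedy list scheduling into bundles of at most `width` ops.

    Assumptions (illustrative only): every op has a latency of one
    bundle, and each register is written exactly once, so only
    read-after-write dependences need to be respected.
    """
    ready_at = {}  # register -> index of the first bundle that may read it
    bundles = []
    for op in ops:
        # Earliest bundle in which all source values are available.
        i = max((ready_at.get(s, 0) for s in op.srcs), default=0)
        # Skip bundles that have no free slot left.
        while i < len(bundles) and len(bundles[i]) >= width:
            i += 1
        # Grow the schedule if needed; an empty bundle is effectively a no-op.
        while len(bundles) <= i:
            bundles.append([])
        bundles[i].append(op.name)
        ready_at[op.dest] = i + 1
    return bundles


ops = [
    Op("load_a", "a"),
    Op("load_b", "b"),
    Op("add", "c", ("a", "b")),
    Op("load_d", "d"),
    Op("mul", "e", ("c", "d")),
]

for n, bundle in enumerate(schedule(ops, width=2)):
    print(n, bundle)
```

With two slots per bundle the five operations fit into three bundles instead of five sequential steps. The hard part the toy skips is everything a real compiler cannot know statically, such as cache misses and branch outcomes.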

See Multiflow for one of the first attempts at commercialising the architecture.

