Speech, itself, is nothing but sequences of different stimuli (phonemes), and perceiving their order is half of the decoding game. If the phonemes are far enough apart in time, you can perceive their order.
------C-----------------A------------------ T--------
(time)