Watch how a small draft model generates predictions that are validated by a larger target model, enabling faster text generation through parallel processing.
Token generation and validation flow
Step 1 of 25: Initial prompt loaded. Ready to begin generation.
Sequential token generation without speculation
Step 1 of 17: Initial prompt loaded. Ready to begin generation.