@individualkex: 2-3x inference speedup | specdec explained #machinelearning

2026-02-05 01:08:49

2026-02-15 09:56:04

2026-02-05 13:01:24

2026-03-13 01:58:53

2026-04-30 12:51:01

2026-04-05 23:37:36

2026-02-17 23:49:52

2026-02-05 07:58:02

2026-02-07 17:22:07

2026-02-25 23:45:21

2026-02-05 02:36:52

2026-02-05 22:18:56

2026-02-05 05:51:48

To see more videos from user @individualkex, please go to the Tikwm homepage.

@individualkex: 2-3x inference speedup | specdec explained #machinelearning

@individualkex: 2-3x inference speedup | specdec explained #machinelearning

Individualkex

Open In TikTok:

Region: US

Wednesday 04 February 2026 16:28:03 GMT

Music

Download

Comments

Noble :

You’re goated man

Jaden Stock :

I’ve seen the math for this but it still blows my mind. It actually doesn’t even matter how good the small model is, the sampling still works. The better the small model is, the further out you can go before resampling. (correct me if wrong)

reckless_dane :

swopping costly output tokens for cheaper input tokens -nice. unless your large model rejects every output of your smaller model 😂 but I’m just negatively speculating

Girth Brooks :

Jeremy Fisher :

I wish you had explained what everything meant better because idk what’s going on

Silly :

That’s pretty cool dann

Peter :

Spec decoding is goated and people should use it more often

AV :

link please

Mxrk :

awesome!

SonOfMan :

Wes :

I hope when we get more concrete agents, one of them communicates just like you. 🥰 Besides your amazing research, your style of communication is exactly the same as my inner monolog if I could bring it to my speech

Farhan Beats :

🤩👏👏

Avg redditor :

This made me feel stupidly

Other Videos

About

Legal