@larpu_acc: lisa su (ceo of amd) was holding a tiny mini-pc (about the size of a router or lunchbox) during the presentation inside it there’s the ryzen ai max+ 395 (strix halo) with 128 gb of unified memory (shared for cpu + powerful integrated gpu + npu). on linux around 110 gb of that is available as vram (* ˘ ω ˘ *) this little guy (like the gmktec evo-x2) can fully run the 235-billion-parameter model (qwen3-235b, mixture-of-experts) and feels super comfy with deepseek v3 and other big models. no cloud, no separate graphics card needed! (* > ω < *) why this is exciting: • it’s the first x86 chip that can handle ~200+ billion parameters on a single die • according to amd, in some inference tasks it beats the rtx 5080 by several times (especially with huge models that are memory-bound) • such a mini-pc costs roughly $1400–2500 depending on config, and it pays for itself in just a few months instead of those $200–400+ monthly cloud bills (claude, chatgpt pro, cursor, etc.) in short: amd really leveled up mobile and mini platforms for local ai, and this is actually threatening part of nvidia’s cloud business and all those model subscriptions(* ^ ω ^) ♡ #amd #llm #opensource #software #larpochka
☣︎ larpochka ☣︎
Region: DE
Sunday 14 June 2026 23:03:31 GMT
Music
Download
Comments
︎︎matts :
"you can now run one of the world's Ilm at home"
2026-06-15 12:06:07
421
japo_tech_genius_pro_the_best :
holy larp
2026-06-15 05:08:30
465
Lou.nfo :
boi would you rather pay a 20 dollar subscription to get 1-2T models or 1000 for a 200b shit model thats normally free on the cloud
2026-06-15 08:36:27
78
fbi.gov :
larp doesn't what he's talking about
2026-06-15 08:52:26
140
MurimPath :
What larp is this
2026-06-15 02:45:30
86
Stevefox123 :
how about telling ppl it rundt so slow youll have to wait 30 min for a response if the task us large enough?
2026-06-15 05:13:01
23
BadRally :
kimi k2.6 is a 1t model and is ~5 usd per million tokens
2026-06-15 12:48:21
5
mikgazer :
It’s unfortunate to see the big technology companies slowly prioritizing AI and the business opportunities it could produce, but on a more positive note, we get to see cool innovations such as the 395 strix halo, beating a 5080 with that form factor is incredibly impressive.
2026-06-17 02:05:12
1
wetware :
Or u can just ollama gpt-oss on ur spare GPU
2026-06-16 02:15:14
7
Hackless :
Can it run GLM 5.2?
2026-06-18 10:42:51
0
Fallor😴 :
AMD will always own nvidia when it comes to AI and LL
2026-06-15 06:35:07
3
🇹🇷🇨🇾 :
Local ai is the future because it’s what people want
2026-06-15 07:47:58
12
leafwayzz :
Larp overload
2026-06-15 07:04:42
14
killua🎀 :
2026-06-15 04:54:18
9
foxtheleague :
Why evb saying larp? Ion get it
2026-06-16 00:16:52
0
Sndrdrx024 :
128gb is not a little guy
2026-06-16 08:17:45
6
jebediah kerman :
larp
2026-06-15 07:09:55
7
Dfrcyt :
I’m not broke so I’ll stick with Claude
2026-06-17 02:00:58
1
Sing (Antarctic Singularity) :
tokens per second prob horrendous
2026-06-17 20:52:46
0
️ :
correction, they made a mini pc that can run ai models for 4000 dollars
2026-06-17 08:21:21
1
Purple :
I’d rather just pay for a claude/openai subscription
2026-06-17 05:15:00
0
Mith :
So what's new? I did this last year in September—ran a DeepSeek-V2 235B model on an AMD Epyc 7282 (24 cores, 48 threads), 128 GB DDR4. TTFT was 3 min, 1.5–2.3 tk/s
2026-06-16 08:10:09
1
darek :
fable 5 has a few trillion, 5 i think. and that tiny mini pc is worth like five thousand dollars.
2026-06-15 12:54:24
1
To see more videos from user @larpu_acc, please go to the Tikwm
homepage.