@individualkex: reinforcement learning (for llms) explained

Individualkex
Individualkex
Open In TikTok:
Region: US
Saturday 02 May 2026 14:00:00 GMT
14982
1341
10
53

Music

Download

Comments

the._.legend._.27
TheLegend27 :
I don't understand on what is it being reinforced, like, what is the "test" the "judge" is getting? how can you tell if something is more "unhinged" for the judge to be reinforced to mimic?
2026-06-11 18:58:17
0
through.my.eyes25
Will 🧡 :
Need this for myself
2026-06-08 12:31:23
0
dariton4000
Dariton :
2026-05-02 14:45:01
4
cyrilzgheib
cyrilzgheib :
I recently made a cnn program that identifies different garments and accessories the library I installed was perfect about 93% but when I want insert an image it confuses bag with shoes and vice versa
2026-05-09 05:33:37
0
ai.kyarisiimaishm
AI kyarisiimaishmael :
amazing 👏
2026-05-08 17:53:27
0
charlie6316
Charlie :
Niceeeee
2026-05-02 21:20:46
0
To see more videos from user @individualkex, please go to the Tikwm homepage.

Other Videos


About