AMD Announces "Instella" Fully Open-Source 3B Language Models (www.phoronix.com)
Posted by just_another_person@lemmy.world to Linux@lemmy.world · English · 15 days ago · 26 comments · cross-posted to: linux@lemmy.ml
HappyFrog@lemmy.blahaj.zone · 15 days ago: I see all these graphs about how much better this LLM is than another, but do those graphs actually translate to real world usefulness?
just_another_person@lemmy.world (OP) · 15 days ago: I think more of the issue is what constitutes actual open source. This is actually open source, and it performs well. If you’re familiar with the space, then it’s a big deal.
HappyFrog@lemmy.blahaj.zone · 14 days ago (edited): I see, thank you. Damn, they even chose a dataset with an open license.
Possibly linux@lemmy.zip · 14 days ago (edited): Is it really, or is it just a binary release like everything else? Edit: It is actually FOSS.
just_another_person@lemmy.world (OP) · 14 days ago: Everything is explained and linked in the project, so…
Possibly linux@lemmy.zip · 14 days ago: Yeah, I noticed that after writing this. Really cool stuff.
oldfart@lemm.ee · 12 days ago: I have yet to see a 3B model that’s not dumb.