Honestly, good for them. US tech CEOs deserve to have their lunch eaten for ducking the industry into stagnation with their short-sighted greed.
In one story they’re using PTX on Nvidia H800s. In another they’re on Huawei chips.
Which is it? Are we all just hypothesising?
An unknown quantization of R1 is running on the 3rd iteration of outdated 7nm hardware taken from Sophgo’s work with TSMC last year?
Is this meant to be impressive or alarming? Because I’m neither.
I’m not going to parse this shit article. What does interference mean here? Please and thank you.
That’s a very toxic attitude.
Inference is, in principle, the process of generating the AI's response. So when you run an LLM locally, you are using your GPU only for inference.
Yeah, I misread because I’m stupid. Thanks for replying, non-toxic man.
Training: Creating the model
Inference: Using the model

Inference? It's the actual running of the AI when you use it, as opposed to training.
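The split can be sketched with a deliberately tiny toy model (plain Python, nothing to do with DeepSeek's actual stack): "training" adjusts a weight to fit data, "inference" just runs the already-fitted weight on new input.

```python
# Toy illustration of training vs inference (hypothetical numbers,
# a single-weight linear model, not a real LLM).

def train(data, lr=0.01, steps=500):
    """Training: repeatedly adjust the model's weight to fit the data."""
    w = 0.0
    for _ in range(steps):
        for x, y in data:
            pred = w * x
            grad = 2 * (pred - y) * x  # gradient of squared error
            w -= lr * grad
    return w  # the "model" is just this learned weight

def infer(w, x):
    """Inference: run the already-trained model on a new input."""
    return w * x

model = train([(1, 2), (2, 4), (3, 6)])  # learns roughly y = 2x
print(infer(model, 10))                  # close to 20
```

Training is the expensive, one-time part; inference is what your GPU does every time you chat with the model locally.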
Sorry. I forgot to mention that I’m dumb.