DeepSeek’s distilled new R1 AI model can run on a single GPU
DeepSeek’s updated R1 reasoning AI model is getting the bulk of the AI community’s attention this week.
Chinese AI startup DeepSeek’s newest AI model, an updated version of the company’s R1 reasoning model, achieves impressive scores on benchmarks for coding, math, and general knowledge, nearly surpassing OpenAI’s flagship o3. But the upgraded R1, also known as “R1-0528,” might also be less willing to answer contentious questions, in particular questions about topics the […]
DeepSeek has gone viral. Chinese AI lab DeepSeek broke into the mainstream consciousness this week after its chatbot app rose to the top of the Apple App Store charts (and Google Play, as well). DeepSeek’s AI models, which were trained using compute-efficient techniques, have led Wall Street analysts — and technologists — to question whether the U.S. can maintain its […]
Chinese startup DeepSeek has released an updated version of its R1 reasoning AI model on the developer platform Hugging Face after announcing it in a WeChat message Wednesday morning. The updated R1, which is under a permissive MIT license, meaning it can be used commercially, is a “minor” upgrade, according to DeepSeek’s WeChat announcement. The […]