Latest Tech News from Ars Technica
AI firms follow DeepSeek’s lead, create cheaper models with “distillation” 3 March 2025 at 06:36

AI firms follow DeepSeek’s lead, create cheaper models with “distillation”

By: Cristina Criddle and Melissa Heikkilä， Financial Times

3 March 2025 at 06:36

Leading artificial intelligence firms including OpenAI, Microsoft, and Meta are turning to a process called “distillation” in the global race to create AI models that are cheaper for consumers and businesses to adopt.

The technique caught widespread attention after China’s DeepSeek used it to build powerful and efficient AI models based on open source systems released by competitors Meta and Alibaba. The breakthrough rocked confidence in Silicon Valley’s AI leadership, leading Wall Street investors to wipe billions of dollars of value from US Big Tech stocks.

Through distillation, companies take a large language model—dubbed a “teacher” model—which generates the next likely word in a sentence. The teacher model generates data which then trains a smaller “student” model, helping to quickly transfer knowledge and predictions of the bigger model to the smaller one.

Read full article

Comments

TechCrunch News
OpenAI spoke to government officials about its DeepSeek probe 10 February 2025 at 12:03

OpenAI spoke to government officials about its DeepSeek probe

TechCrunch News

By: Maxwell Zeff

10 February 2025 at 12:03

OpenAI says it has spoken to government officials about its ongoing investigation into DeepSeek. The ChatGPT-maker previously claimed to have evidence that DeepSeek trained its AI models using improperly obtained data from OpenAI’s API. During a Bloomberg TV interview on Monday, OpenAI’s chief global affairs officer, Chris Lehane, said the company has talked with government […]

TechCrunch News
Researchers created an open rival to OpenAI’s o1 ‘reasoning’ model for under $50 5 February 2025 at 15:38

Researchers created an open rival to OpenAI’s o1 ‘reasoning’ model for under $50

TechCrunch News

By: Maxwell Zeff

5 February 2025 at 15:38

AI researchers at Stanford and the University of Washington were able to train an AI “reasoning” model for under $50 in cloud compute credits, according to a new research paper released last Friday. The model, known as s1, performs similarly to cutting-edge reasoning models, such as OpenAI’s o1 and DeepSeek’s R1, on tests measuring math […]

TechCrunch News
Microsoft probing whether DeepSeek improperly used OpenAI APIs 29 January 2025 at 03:31

Microsoft probing whether DeepSeek improperly used OpenAI APIs

TechCrunch News

By: Romain Dillet

29 January 2025 at 03:31

Just a few hours after David Sacks claimed DeepSeek used OpenAI’s models to train its own models, Bloomberg Law reports that Microsoft is investigating DeepSeek’s use of OpenAI’s application programming interface (API). According to security researchers working for Microsoft, the Chinese company behind the R1 reasoning model may have exfiltrated a large amount of data […]

Normal view