Today, 3 March 2025

AI firms follow DeepSeek’s lead, create cheaper models with “distillation”

Leading artificial intelligence firms including OpenAI, Microsoft, and Meta are turning to a process called “distillation” in the global race to create AI models that are cheaper for consumers and businesses to adopt.

The technique caught widespread attention after China’s DeepSeek used it to build powerful and efficient AI models based on open source systems released by competitors Meta and Alibaba. The breakthrough rocked confidence in Silicon Valley’s AI leadership, leading Wall Street investors to wipe billions of dollars of value from US Big Tech stocks.

Through distillation, companies take a large language model, dubbed a “teacher” model, which predicts the next likely word in a sentence. The teacher generates data that is then used to train a smaller “student” model, quickly transferring the bigger model’s knowledge and predictions to the smaller one.
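To make the teacher-to-student transfer concrete, here is a minimal toy sketch in Python. It is not any company's actual pipeline. The three-word vocabulary, the teacher's probabilities, and the function names `teacher_generate` and `train_student` are all hypothetical, chosen only for illustration. The "teacher" is a fixed next-word distribution for a single context, and the "student" is a small softmax model fitted to text sampled from the teacher.

```python
import math
import random

# Hypothetical toy setup: a tiny vocabulary and the teacher's
# next-word probabilities for one fixed context (assumed values).
VOCAB = ["cat", "dog", "car"]
TEACHER_PROBS = [0.7, 0.2, 0.1]


def softmax(logits):
    """Turn raw scores into a probability distribution."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def teacher_generate(n, seed=0):
    """The teacher 'generates data': sample n next-word indices."""
    rng = random.Random(seed)
    return rng.choices(range(len(VOCAB)), weights=TEACHER_PROBS, k=n)


def train_student(samples, steps=500, lr=0.5):
    """Fit the student's logits to the teacher-generated samples.

    Full-batch gradient descent on cross-entropy: for a softmax model
    the averaged gradient is (student probs - empirical frequencies).
    """
    counts = [0] * len(VOCAB)
    for index in samples:
        counts[index] += 1
    target = [c / len(samples) for c in counts]

    logits = [0.0] * len(VOCAB)
    for _ in range(steps):
        probs = softmax(logits)
        logits = [z - lr * (p - t) for z, p, t in zip(logits, probs, target)]
    return softmax(logits)


samples = teacher_generate(5000)
student_probs = train_student(samples)
```

After training, `student_probs` should sit close to `TEACHER_PROBS`, even though the student only ever saw the teacher's outputs and never its internals. Real distillation works on far larger models and often trains on the teacher's full probability outputs rather than sampled words; this sketch only mirrors the "teacher generates data, student learns from it" description above.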



Before yesterday

OpenAI spoke to government officials about its DeepSeek probe

10 February 2025 at 12:03

OpenAI says it has spoken to government officials about its ongoing investigation into DeepSeek. The ChatGPT-maker previously claimed to have evidence that DeepSeek trained its AI models using improperly obtained data from OpenAI’s API. During a Bloomberg TV interview on Monday, OpenAI’s chief global affairs officer, Chris Lehane, said the company has talked with government […]

© 2024 TechCrunch. All rights reserved. For personal use only.

Researchers created an open rival to OpenAI’s o1 ‘reasoning’ model for under $50

5 February 2025 at 15:38

AI researchers at Stanford and the University of Washington were able to train an AI “reasoning” model for under $50 in cloud compute credits, according to a new research paper released last Friday. The model, known as s1, performs similarly to cutting-edge reasoning models, such as OpenAI’s o1 and DeepSeek’s R1, on tests measuring math […]


Microsoft probing whether DeepSeek improperly used OpenAI APIs

29 January 2025 at 03:31

Just a few hours after David Sacks claimed DeepSeek used OpenAI’s models to train its own models, Bloomberg Law reports that Microsoft is investigating DeepSeek’s use of OpenAI’s application programming interface (API). According to security researchers working for Microsoft, the Chinese company behind the R1 reasoning model may have exfiltrated a large amount of data […]

