Latest Tech News from Ars Technica
CMU research shows compression alone may unlock AI puzzle-solving abilities 6 March 2025 at 15:22

By: Benj Edwards

A pair of Carnegie Mellon University researchers recently discovered hints that the process of compressing information can solve complex reasoning tasks without pre-training on a large number of examples. Their system tackles some types of abstract pattern-matching tasks using only the puzzles themselves, challenging conventional wisdom about how machine-learning systems acquire problem-solving abilities.

"Can lossless information compression by itself produce intelligent behavior?" ask Isaac Liao, a first-year PhD student, and his advisor, Professor Albert Gu, from CMU's Machine Learning Department. Their work suggests the answer might be yes. To demonstrate, they created CompressARC and published the results in a comprehensive post on Liao's website.

The pair tested their approach on the Abstraction and Reasoning Corpus (ARC-AGI), an unbeaten visual benchmark created in 2019 by machine-learning researcher François Chollet to test AI systems' abstract reasoning skills. ARC presents systems with grid-based image puzzles. Each puzzle provides several examples demonstrating an underlying rule, and the system must infer that rule and apply it to a new example.
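
To make the puzzle format concrete, here is a minimal toy sketch in Python of an ARC-style task: a few example input/output grids plus one test input. The `task` data, the three candidate rules, and the `infer_rule` helper are all hypothetical and invented for illustration. This is not how CompressARC works, and real ARC puzzles use rules far more varied than any fixed menu of transformations could cover.

```python
from typing import Callable, Dict, List, Optional

Grid = List[List[int]]  # each cell is an integer color code


# Hypothetical candidate rules; real ARC rules are not limited to a fixed list.
def flip_horizontal(grid: Grid) -> Grid:
    """Mirror each row left to right."""
    return [list(reversed(row)) for row in grid]


def flip_vertical(grid: Grid) -> Grid:
    """Reverse the order of the rows."""
    return [list(row) for row in reversed(grid)]


def transpose(grid: Grid) -> Grid:
    """Swap rows and columns."""
    return [list(col) for col in zip(*grid)]


CANDIDATE_RULES: Dict[str, Callable[[Grid], Grid]] = {
    "flip_horizontal": flip_horizontal,
    "flip_vertical": flip_vertical,
    "transpose": transpose,
}


def infer_rule(train_pairs: List[dict]) -> Optional[str]:
    """Return the name of the first candidate rule that reproduces every
    example output from its input, or None if no candidate fits."""
    for name, rule in CANDIDATE_RULES.items():
        if all(rule(pair["input"]) == pair["output"] for pair in train_pairs):
            return name
    return None


# A made-up task: two demonstration pairs and one test input.
task = {
    "train": [
        {"input": [[1, 0, 0], [0, 2, 0]], "output": [[0, 0, 1], [0, 2, 0]]},
        {"input": [[3, 3, 0], [0, 0, 4]], "output": [[0, 3, 3], [4, 0, 0]]},
    ],
    "test": [{"input": [[5, 0], [0, 6]]}],
}

rule_name = infer_rule(task["train"])
prediction = (
    CANDIDATE_RULES[rule_name](task["test"][0]["input"]) if rule_name else None
)
print(rule_name, prediction)
```

In this toy case only the left-to-right mirror is consistent with both demonstration pairs, so that rule is applied to the test grid. The difficulty of the actual benchmark lies in the fact that the space of possible rules is open-ended rather than enumerated in advance.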

TechCrunch News
Why IQ is a poor test for AI 5 February 2025 at 12:47

By: Kyle Wiggers

During a recent press appearance, OpenAI CEO Sam Altman said that he’s observed the “IQ” of AI rapidly improve over the past several years. “Very roughly, it feels to me like — this is not scientifically accurate, this is just a vibe or spiritual answer — every year we move one standard deviation of IQ,” […]

TechCrunch News
People are benchmarking AI by having it make balls bounce in rotating shapes 24 January 2025 at 09:48

By: Kyle Wiggers

The list of informal, weird AI benchmarks keeps growing. Over the past few days, some in the AI community on X have become obsessed with a test of how different AI models, particularly so-called reasoning models, handle prompts like this: “Write a Python script for a bouncing yellow ball within a shape. Make the shape […]

TechCrunch News
AI benchmarking organization criticized for waiting to disclose funding from OpenAI 19 January 2025 at 11:58

By: Kyle Wiggers

An organization developing math benchmarks for AI didn’t disclose that it had received funding from OpenAI until relatively recently, drawing allegations of impropriety from some in the AI community. Epoch AI, a nonprofit primarily funded by Open Philanthropy, a research and grantmaking foundation, revealed on December 20 that OpenAI had supported the creation of FrontierMath. […]
