
Reasoning skills of large language models are often overestimated

“It is impossible to find any domain in which humans clearly outperformed crude extrapolation algorithms, less still sophisticated statistical ones.”
Michael J. Mauboussin • Think Twice: Harnessing the Power of Counterintuition
- What a modern LLM does during training is, essentially, very, very quickly skim the textbook, the words just flying by, not spending much brain power on it.
- Rather, when you or I read that math textbook, we read a couple pages slowly; then have an internal monologue about the material in our heads and talk about it with a few study-buddies; read an…
SITUATIONAL AWARENESS - The Decade Ahead • I. From GPT-4 to AGI: Counting the OOMs
LLMs struggle with tasks that require extensive knowledge, a limitation that highlights the need to supplement them with non-parametric knowledge. The paper “Prompting Large Language Models with Knowledge Graphs for Question Answering Involving Long-tail Facts” analyzes the effects of different types of non-parametric knowledge, such as textual passages and knowledge graphs.
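The core recipe the paper studies, retrieving facts and injecting them into the prompt rather than relying only on the model's parameters, can be sketched in a few lines. Below is a minimal sketch assuming a plain-text serialization of KG triples; the helper names and example facts are illustrative assumptions, not the paper's actual code.

```python
# Minimal sketch: prompt an LLM with knowledge-graph triples as
# non-parametric knowledge. All names here (Triple, build_prompt,
# the example facts) are illustrative, not the paper's implementation.
from typing import List, Tuple

Triple = Tuple[str, str, str]  # (subject, relation, object)

def triples_to_context(triples: List[Triple]) -> str:
    """Serialize KG triples into plain-text lines the model can read."""
    return "\n".join(f"({s}, {r}, {o})" for s, r, o in triples)

def build_prompt(question: str, triples: List[Triple]) -> str:
    """Prepend the retrieved facts to the question, so the answer can
    come from the supplied context instead of parametric memory."""
    return (
        "Answer the question using only the facts below.\n\n"
        f"Facts:\n{triples_to_context(triples)}\n\n"
        f"Question: {question}\nAnswer:"
    )

if __name__ == "__main__":
    # A hypothetical long-tail fact the model is unlikely to have memorized.
    facts = [
        ("Aldabra giant tortoise", "native to", "Aldabra Atoll"),
        ("Aldabra Atoll", "part of", "Seychelles"),
    ]
    print(build_prompt("Which country is the Aldabra giant tortoise native to?", facts))
```

The resulting prompt would then be sent to any chat or completion endpoint; the comparison the paper draws is how answer quality changes depending on which type of non-parametric knowledge fills the Facts block.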