International Business Weekly
AI’s reasoning ability in mathematics questionable, reveal Apple researchers

October 12, 2024


[Image: Apple researchers question AI’s reasoning ability in mathematics. Credit: IANS]

A team of Apple researchers has questioned the formal reasoning capabilities of large language models (LLMs), particularly in mathematics.

They found that LLMs exhibit noticeable variance when responding to different instantiations of the same question.

Prior literature suggests that the reasoning process in LLMs is probabilistic pattern-matching rather than formal reasoning.

Although LLMs can match more abstract reasoning patterns, they fall short of true logical reasoning. Small changes in input tokens can drastically alter model outputs, indicating a strong token bias and suggesting that these models are highly sensitive and fragile.

“Additionally, in tasks requiring the correct selection of multiple tokens, the probability of arriving at an accurate answer decreases exponentially with the number of tokens or steps involved, underscoring their inherent unreliability in complex reasoning scenarios,” said Apple researchers in their paper titled “GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models.”
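The exponential decline described in the quote can be illustrated with a short calculation. This is a minimal sketch, not the researchers' own analysis: it assumes every step succeeds independently with the same probability, and the function name `chain_accuracy` and the 95% figure are hypothetical choices made for illustration.

```python
def chain_accuracy(per_step_accuracy: float, steps: int) -> float:
    """Chance that every step is correct, assuming the steps are
    independent and equally reliable (an illustrative assumption)."""
    return per_step_accuracy ** steps


if __name__ == "__main__":
    # Hypothetical per-step accuracy of 95%, chosen only for illustration.
    for steps in (1, 5, 10, 20):
        print(f"{steps:>2} steps: {chain_accuracy(0.95, steps):.3f}")
```

Under these assumptions, a model that gets each step right 95% of the time would complete a 10-step problem correctly about 60% of the time, and a 20-step problem about 36% of the time.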

The GSM8K benchmark is widely used to assess models’ mathematical reasoning on grade-school-level questions.

[Image: GSM-Symbolic enables more controllable evaluations. Credit: IANS]

While the performance of LLMs on GSM8K has significantly improved in recent years, it remains unclear whether their mathematical reasoning capabilities have genuinely advanced, raising questions about the reliability of the reported metrics.

To address these concerns, the researchers conducted a large-scale study on several state-of-the-art open and closed models.

“To overcome the limitations of existing evaluations, we introduce GSM-Symbolic, an improved benchmark created from symbolic templates that allow for the generation of a diverse set of questions,” the authors wrote.

GSM-Symbolic enables more controllable evaluations, providing key insights and more reliable metrics for measuring the reasoning capabilities of models.
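The idea of generating many instantiations of one question from a symbolic template can be sketched roughly as follows. This is an illustration only, not the format used in GSM-Symbolic: the template text, the name list, the value ranges, and the helpers `make_instance` and `make_dataset` are all hypothetical.

```python
import random

# Hypothetical template: the name and the numbers are the symbolic slots.
TEMPLATE = (
    "{name} has {x} apples and buys {y} more bags with {z} apples each. "
    "How many apples does {name} have now?"
)
NAMES = ["Sophie", "Liam", "Ava", "Noah"]


def make_instance(rng: random.Random) -> tuple[str, int]:
    """Fill the template with random values and compute the true answer."""
    name = rng.choice(NAMES)
    x = rng.randint(2, 20)
    y = rng.randint(2, 9)
    z = rng.randint(2, 12)
    question = TEMPLATE.format(name=name, x=x, y=y, z=z)
    answer = x + y * z  # the reasoning is identical in every instance
    return question, answer


def make_dataset(n: int, seed: int = 0) -> list[tuple[str, int]]:
    """Generate n instantiations of the same underlying question."""
    rng = random.Random(seed)
    return [make_instance(rng) for _ in range(n)]


if __name__ == "__main__":
    for question, answer in make_dataset(3, seed=1):
        print(question, "->", answer)
```

Because every instance requires the same reasoning, scoring a model on several generated sets and comparing the accuracies would expose the kind of variance across instantiations that the researchers describe.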

“Our findings reveal that LLMs exhibit noticeable variance when responding to different instantiations of the same question,” the researchers said, adding that overall, “our work provides a more nuanced understanding of LLMs’ capabilities and limitations in mathematical reasoning”.

(With inputs from IANS)


