Abstract: Great efforts have been made to investigate AI’s ability in abstract reasoning, along with the proposal of various versions of RAVEN’s progressive matrices (RPM) as benchmarks. Previous ...
Researchers from Samsung Electronics Co. Ltd. have created a tiny artificial intelligence model that punches far above its weight on certain kinds of “reasoning” tasks, challenging the industry’s ...
A prompt-level hack for deeper LLM thinking, which applies abstract reasoning principles to direct LLMs to look at paradoxes and edge cases from different angles.
A new study from Arizona State University researchers suggests that the celebrated "Chain-of-Thought" (CoT) reasoning in Large Language Models (LLMs) may be more of a "brittle mirage" than genuine ...
Whether this is your first time presenting or you are a seasoned professional, here are some tips for presenting your research at ACS Spring 2026. Need help? Contact us for registration, hotel, ...
Singapore-based AI startup Sapient Intelligence has developed a new AI architecture that can match, and in some cases vastly outperform, large language models (LLMs) on complex reasoning tasks, all ...
RE-IMAGINE synthesizes new reasoning benchmarks by (1) symbolically mutating the solution processes from existing benchmarks, and (2) asking language models to imagine what would happen if the ...
Recent research indicates that LLMs, particularly smaller ones, frequently struggle with robust reasoning. They tend to perform well on familiar questions but falter when those same problems are ...
NOTE (*): This article has been edited to reflect that the paper, The Illusion of the Illusion of Thinking, was wrongfully attributed to Anthropic, the company, as the lead author. In fact, the lead ...