Logical Thinking Performance Task

Unlock the Full Power of DeepSeek R1 by Fine-Tuning Its Reasoning Tasks

Fine-tuning a large language model (LLM) like DeepSeek R1 for reasoning tasks can significantly enhance its ability to address domain-specific challenges. DeepSeek R1, an open source alternative to ...

1 天

GLM 4.7 AI Brings Stronger Reasoning, Higher HLE Scores & Cleaner Web Output with Tools

GLM version 4.7 lifts software engineering accuracy from 68% to 73.8%, helping you ship cleaner code and UI faster. Terminal Bench rises from 24.5% to 41%, giving teams steadier ...

VentureBeat

LLMs excel at inductive reasoning but struggle with deductive tasks, new research shows

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Large language models (LLMs) have shown impressive performance on various ...

EurekAlert!

AI makes human-like reasoning mistakes

Manipulating content within fixed logical structures. In each of the author’s three datasets, they instantiate different versions of the logical problems. Different versions of a problem offer the ...

Computerworld

Microsoft introduces Phi-4, an AI model for advanced reasoning tasks

Microsoft has announced Phi-4 — a new AI model with 14 billion parameters — designed for complex reasoning tasks, including mathematics. Phi-4 excels in areas such as STEM question-answering and ...

Tech Xplore on MSN

Enabling small language models to solve complex reasoning tasks

As language models (LMs) improve at tasks like image generation, trivia questions, and simple math, you might think that ...

9 天

Google launches Gemini 3 Flash, promising faster AI reasoning at lower cost

Gemini 3 Flash combines frontier-level performance with faster response times, and is now rolling out across Google’s ...

来自MSN

Scientists just developed a new AI modeled on the human brain — it's outperforming LLMs ...

Scientists have developed a new type of artificial intelligence (AI) model that can reason differently from most large language models (LLMs) like ChatGPT, resulting in much better performance in key ...

一些您可能无法访问的结果已被隐去。

显示无法访问的结果