Alibaba's Qwen visual models top the spatial reasoning leaderboard, surpassing Gemini and GPT.


In the newly released SpatialBench benchmark, Alibaba's Qwen3-VL and Qwen2.5-VL visual models took the top two spots with scores of 13.5 and 12.9 respectively, well ahead of Gemini 3.0 Pro Preview (9.6) and GPT-5.1 (7.5), though every model remains far below the human baseline of 80 points. SpatialBench focuses on 2D/3D spatial reasoning and covers complex tasks such as circuit analysis and CAD engineering. It has been described as a "litmus test for embodied intelligence," and its results are treated as a core indicator of an AI system's spatial understanding.
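To put the reported scores in proportion, the short sketch below expresses each one as a share of the 80-point human baseline. The scores and the baseline come from the figures above; the function name `share_of_baseline` is purely illustrative and not part of SpatialBench.

```python
# Scores as reported in the article; the human baseline is 80 points.
HUMAN_BASELINE = 80.0
scores = {
    "Qwen3-VL": 13.5,
    "Qwen2.5-VL": 12.9,
    "Gemini 3.0 Pro Preview": 9.6,
    "GPT-5.1": 7.5,
}


def share_of_baseline(score, baseline=HUMAN_BASELINE):
    """Return a score as a percentage of the baseline (illustrative helper)."""
    return 100.0 * score / baseline


for name, score in scores.items():
    print(f"{name}: {share_of_baseline(score):.1f}% of the human baseline")
```

Even the leading model reaches well under a fifth of the human score, so the ranking reflects relative rather than absolute progress.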

Technically, Qwen3-VL upgrades its 3D detection with rotated bounding box output and a depth estimation head, improving accuracy in occluded scenes by 18% and accurately determining object orientation and viewpoint changes. Its visual programming feature generates runnable Python code from input sketches or short videos, for a "what you see is what you get" experience. The model family spans sizes from 2B to 235B parameters and outperforms Gemini 2.5 Pro by an average of 6.4 points across 32 core tests.
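For readers unfamiliar with rotated bounding boxes, the sketch below shows the general idea in 2D: a box described by a center, a size, and a rotation angle, from which the four corners can be recovered. The `(cx, cy, w, h, angle_deg)` parameterization and the function name are assumptions chosen for illustration; the article does not specify the format Qwen3-VL actually emits.

```python
import math


def rotated_box_corners(cx, cy, w, h, angle_deg):
    """Return the four corners of a rotated 2D box.

    Hypothetical format: center (cx, cy), width w, height h, and a
    counter-clockwise rotation in degrees about the center.
    """
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    # Corner offsets of the unrotated, axis-aligned box.
    offsets = [(-w / 2, -h / 2), (w / 2, -h / 2), (w / 2, h / 2), (-w / 2, h / 2)]
    # Rotate each offset, then translate it to the box center.
    return [
        (cx + dx * cos_t - dy * sin_t, cy + dx * sin_t + dy * cos_t)
        for dx, dy in offsets
    ]


# A 4 x 2 box centered at the origin, rotated by 90 degrees.
for x, y in rotated_box_corners(0.0, 0.0, 4.0, 2.0, 90.0):
    print(f"({x:.2f}, {y:.2f})")
```

Unlike an axis-aligned box, this representation carries the object's orientation, which is the information the article says the model uses to judge orientation and viewpoint changes.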

According to the open-source plan, Qwen2.5-VL is fully open-source, while Qwen3-VL's weights and toolchain are due for release in the second quarter of 2025, alongside a free trial on the Qianwen App. Alibaba Cloud said the model has been validated in scenarios such as logistics robots and AR assembly, with a spatial positioning error of less than 2 cm, and that it plans to launch an end-to-end "vision-action" model in 2026 to give robots real-time visual servoing capabilities.

This achievement marks a breakthrough for Chinese AI in the multimodal field. Industry evaluations indicate that the Qwen-VL series has surpassed GPT-4V in tasks such as document analysis and Chinese image understanding, forming a global top three alongside Gemini and GPT.

Singapore's AISG releases next-generation large language model Qwen-Sea-Lion-v4, achieving 8.4% superior performance in Southeast Asian languages.

Singapore's AISG today officially released its next-generation large-scale language model, Qwen-Sea-Lion-v4, whose underlying architecture has been fully u

Apple sues India over new antitrust law, potentially avoiding a massive $38 billion fine.

Apple has filed a lawsuit in the Delhi High Court, challenging India's newly revised antitrust fines law to avoid a potential fine of up to $38 billion (ap

TSMC sues former executive Luo Weiren for allegedly leaking confidential information; Intel vehemently denies the allegations.

On November 25, global semiconductor giant TSMC officially filed a lawsuit against its former senior vice president, Luo Weiren, accusing him of potentially le

Google has announced its timeline: Google Assistant will officially cease operations on March 31, 2026, with Gemini taking over to lead the future.

Google recently announced the final transition timeline for its voice assistant service via its official blog, marking the countdown to the Google Assistant er

OpenAI upgrades ChatGPT voice mode: The main interface integrates multimodal interaction and supports real-time visual content display.

Recently, OpenAI released an official blog post announcing the full integration of ChatGPT's "Voice Mode" into the main chat interface, marking another ste

Dell releases Q3 FY2026 financial results: AI-driven revenue hits record high of $27 billion.

Dell Technologies (DELL) reported its fiscal third-quarter 2026 results on Tuesday, showing revenue of $27 billion (approximately RMB 191.748 billion), a recor

Kunlun Yuan AI releases BaiZe-Omni-14b-a2b, a multimodal fusion model whose multimodal capabilities surpass GPT-4.

At the 2025 World Computing Conference, Kunlun Yuan AI officially launched BaiZe-Omni-14b-a2b, a multimodal fusion model based on the Ascend platform, marking

Musk announced that Grok 5 will challenge top League of Legends teams: a fair battle between AI and human esports.

Elon Musk posted on the X platform yesterday that his xAI company's Grok 5 AI model will challenge top human teams in League of Legends in 2026, setting st

Assassin's Creed Shadows receives version 1.1.6 update and will feature a collaboration with Attack on Titan.

Ubisoft has announced that Assassin's Creed Shadows will receive a 1.1.6 update on November 25, 2025, at 10 PM Beijing time. This update will include a col

Call of Duty: Black Ops 7 Season 1 Trailer - Coming December 5th, with tons of new content coming soon!

Activision Blizzard has announced that its most comprehensive Season 1 update to date will launch on December 5, 2025, for *Call of Duty: Black Ops 7* and *Cal

ARC Raiders has sold nearly 7 million copies across all platforms, dominating recent game sales.

ARC Raiders, a sci-fi third-person PvPvE extraction shooter developed by Embark Studios of Sweden and published by NEXON, will be released on October 30, 2025,

AOC launches two new GAMING G4 series gaming monitors: Q27G4SMN and Q27G4SP.

AOC has announced two new monitors in its GAMING G4 series: the Q27G4SMN and Q27G4SP. The Q27G4SMN features full-distribution direct-lit glass-based mini LED b

ASUS unveils the ASUS AcePhase 60 Limited Edition PA602, featuring a walnut wood color scheme that blends seamlessly into home décor.

This year, ASUS launched the new ProArt PA602 Wood Edition, featuring walnut wood on the front grille, front I/O panel, and top handles. Now, ASUS has released

Client GPU shipments reached 76.6 million units in Q3 2025, representing year-on-year and quarter-on-quarter growth of 4% and 2.5%, respectively.

A recent market research report released by Jon Peddie Research shows that global PC client-based GPU shipments reached 76.6 million units in the third quarte
