Zhipu Open Sources the World's Most Powerful Visual Inference Model GLM-4.5V, Entering a New St

Zhipu Open Sources the World's Most Powerful Visual Inference Model GLM-4.5V, Entering a New St


Zhipu AI recently announced the launch and open-source release of the world's first 100-byte visual reasoning model, GLM-4.5V. With a total of 106 billion parameters and 12 billion activation parameters, it is now available for download simultaneously on the MoDa community and Hugging Face. As a key step toward artificial general intelligence (AGI), this model has achieved state-of-the-art performance among open-source models across 41 multimodal rankings, covering a full range of tasks, including image, video, and document parsing, and GUI interaction.

Based on the next-generation text-based framework, GLM-4.5-Air, this model achieves breakthrough capabilities through efficient hybrid training. A new "thinking mode" switch allows for flexible switching between fast response and deep reasoning, supports 64KB of context input, and utilizes 3D convolution and 3D-RoPE encoding technologies to enhance video and spatial relationship understanding. In actual testing, it can accurately locate objects in images, replicate web page structures, and even extract key information from complex documents containing dozens of pages.

To lower the barrier to entry, Zhipu has also open-sourced a desktop assistant application that can take real-time screenshots for visual tasks such as coding assistance and game guides. The API service is now available on BigModel.cn, offering a free quota of 20 million tokens. Call costs are as low as 2 yuan per million tokens, with response speeds of 60-80 tokens per second. Enterprise users can use this service to quickly deploy cost-effective multimodal solutions for scenarios such as industrial quality inspection and intelligent customer service.

Technically, the model innovatively integrates a visual encoder, an MLP adapter, and a language decoder, enhancing its ability to process images at extreme scales through bicubic interpolation. Analysts believe that the open-source release of GLM-4.5V will accelerate the industrialization of visual reasoning technology and take a key step towards the widespread application of AI in general scenarios.

New battery anode can withstand 2,100 cycles without wear and tear

South Korean scientists have proposed a new battery solution that could significantly extend the lifespan of electric vehicles and smartphones. This novel ano

New battery anode can withstand 2,100 cycles without wear and tear

Toyota pledges to launch an electric car with solid-state batteries by 2027

A Japanese company has announced it will launch the world's first electric vehicle equipped with solid-state batteries in 2027. This technology promises f

Toyota pledges to launch an electric car with solid-state batteries by 2027

Batteries powered by B vitamins and sugar could power electronic devices

Scientists have developed the world's first battery powered by vitamin B2 and glucose. It's based on the same principles the human body uses to conver

Batteries powered by B vitamins and sugar could power electronic devices

Artificial leaves mimic real photosynthesis

Scientists at the University of Cambridge have invented a "semi-synthetic leaf" that mimics photosynthesis, converting sunlight, water, and carbon dioxide int

Artificial leaves mimic real photosynthesis

BMW unveils its first self-inflating electric stand-up paddle board

BMW has unveiled its first self-inflating electric stand-up paddle board. This new product was developed in collaboration with Slovenian manufacturer SipaBoar

BMW unveils its first self-inflating electric stand-up paddle board

New Captery AA batteries charge in 160 seconds

Italian startup Captery has unveiled a rechargeable battery that charges in less than three minutes and lasts for decades. The company claims its technology wi

New Captery AA batteries charge in 160 seconds

The Prima eye implant restores vision to people.

Blind patients in the UK may be able to regain their reading ability with a new implant placed under the eye. Surgeons at London's Moorfields Hospital have

The Prima eye implant restores vision to people.

NASA plans to build a glass city on the moon

NASA is supporting an ambitious project aimed at enabling future human landings on the Moon. Skyeports, an American company, proposes building giant, transpar

NASA plans to build a glass city on the moon

Kohler launches smart toilet camera for health monitoring

Kohler, an American company known for its plumbing and kitchen appliances, has unveiled an unusual new product: the Dekoda camera. It attaches directly to the

Kohler launches smart toilet camera for health monitoring

LEDs can kill up to 92% of cancer cells

Scientists have developed a new light therapy that can destroy cancer cells without harming healthy cells. The method, which utilizes LEDs and tin nanosheets,

LEDs can kill up to 92% of cancer cells

New microturbine can operate on light winds

German engineers have invented a compact wind turbine that generates 83% more electricity than existing turbines of similar size. This invention could become

New microturbine can operate on light winds

The first fully recyclable electronic product has been created

Duke University researchers have developed a technology that could revolutionize the way displays are produced, even making them more environmentally friendly

The first fully recyclable electronic product has been created

Study: Neural networks speed up thinking but hinder deep analysis

Researchers at the University of Oxford have discovered how the use of neural networks affects students' cognitive functions. The so-called AI generation

Study: Neural networks speed up thinking but hinder deep analysis

Jason Schreier: Microsoft demands unattainable profits from Xbox

Bloomberg reporter Jason Schreier has once again exposed the hidden handcuffs in the gaming industry—this time, the focus is on Microsoft. It seems fans

Jason Schreier: Microsoft demands unattainable profits from Xbox

Bang & Olufsen has released a commemorative audio collection to mark the brand's 100th anniversary.

To celebrate its centennial, Danish brand Bang & Olufsen released special editions of its Beoplay H100 headphones and A9 and A5 speakers, dubbed the "Centenni

Bang & Olufsen has released a commemorative audio collection to mark the brand's 100th anniversary.