OpenAI and Anthropic review AI models: GPT accused of flattery, Claude's robustness against hallucinations

OpenAI and Anthropic review AI models: GPT accused of flattery, Claude's robustness against hallucinations

According to Engadget, OpenAI and Anthropic recently announced they will mutually evaluate the safety alignment of each other's public AI systems and share the results of their analysis. This move has garnered industry attention, particularly given that Anthropic previously banned OpenAI from using its tools due to a technical collaboration dispute between the two companies. The results revealed that both companies' products have their own strengths and weaknesses, providing guidance for future improvements in AI safety testing.

Anthropic's testing of OpenAI models focused on risks such as flattery, whistleblowing, and abusive support. The results showed that the o3 and o4-mini models performed similarly to Anthropic's own models, but the GPT-4o and GPT-4.1 general models presented potential abuse risks and, with the exception of o3, exhibited varying degrees of flattery. Notably, the testing did not include the newly released GPT-5, which features a new Safe Completions feature to address dangerous queries. This feature may be a targeted improvement given OpenAI's recent pressure stemming from a teen suicide lawsuit.

Separately, OpenAI tested Anthropic's Claude model on command hierarchies and hallucinations. Claude excels at following instructions and is more inclined to refuse to answer in scenarios with high uncertainty. This "conservative strategy" significantly reduces the risk of hallucinations. However, the test also indicates room for improvement for both models. For example, GPT needs to reduce its tendency to flatter, while Claude may need to balance the rigor and practicality of its answers.

New battery anode can withstand 2,100 cycles without wear and tear

South Korean scientists have proposed a new battery solution that could significantly extend the lifespan of electric vehicles and smartphones. This novel ano

New battery anode can withstand 2,100 cycles without wear and tear

Toyota pledges to launch an electric car with solid-state batteries by 2027

A Japanese company has announced it will launch the world's first electric vehicle equipped with solid-state batteries in 2027. This technology promises f

Toyota pledges to launch an electric car with solid-state batteries by 2027

Batteries powered by B vitamins and sugar could power electronic devices

Scientists have developed the world's first battery powered by vitamin B2 and glucose. It's based on the same principles the human body uses to conver

Batteries powered by B vitamins and sugar could power electronic devices

Artificial leaves mimic real photosynthesis

Scientists at the University of Cambridge have invented a "semi-synthetic leaf" that mimics photosynthesis, converting sunlight, water, and carbon dioxide int

Artificial leaves mimic real photosynthesis

BMW unveils its first self-inflating electric stand-up paddle board

BMW has unveiled its first self-inflating electric stand-up paddle board. This new product was developed in collaboration with Slovenian manufacturer SipaBoar

BMW unveils its first self-inflating electric stand-up paddle board

New Captery AA batteries charge in 160 seconds

Italian startup Captery has unveiled a rechargeable battery that charges in less than three minutes and lasts for decades. The company claims its technology wi

New Captery AA batteries charge in 160 seconds

The Prima eye implant restores vision to people.

Blind patients in the UK may be able to regain their reading ability with a new implant placed under the eye. Surgeons at London's Moorfields Hospital have

The Prima eye implant restores vision to people.

NASA plans to build a glass city on the moon

NASA is supporting an ambitious project aimed at enabling future human landings on the Moon. Skyeports, an American company, proposes building giant, transpar

NASA plans to build a glass city on the moon

Kohler launches smart toilet camera for health monitoring

Kohler, an American company known for its plumbing and kitchen appliances, has unveiled an unusual new product: the Dekoda camera. It attaches directly to the

Kohler launches smart toilet camera for health monitoring

LEDs can kill up to 92% of cancer cells

Scientists have developed a new light therapy that can destroy cancer cells without harming healthy cells. The method, which utilizes LEDs and tin nanosheets,

LEDs can kill up to 92% of cancer cells

New microturbine can operate on light winds

German engineers have invented a compact wind turbine that generates 83% more electricity than existing turbines of similar size. This invention could become

New microturbine can operate on light winds

The first fully recyclable electronic product has been created

Duke University researchers have developed a technology that could revolutionize the way displays are produced, even making them more environmentally friendly

The first fully recyclable electronic product has been created

Study: Neural networks speed up thinking but hinder deep analysis

Researchers at the University of Oxford have discovered how the use of neural networks affects students' cognitive functions. The so-called AI generation

Study: Neural networks speed up thinking but hinder deep analysis

Jason Schreier: Microsoft demands unattainable profits from Xbox

Bloomberg reporter Jason Schreier has once again exposed the hidden handcuffs in the gaming industry—this time, the focus is on Microsoft. It seems fans

Jason Schreier: Microsoft demands unattainable profits from Xbox

Bang & Olufsen has released a commemorative audio collection to mark the brand's 100th anniversary.

To celebrate its centennial, Danish brand Bang & Olufsen released special editions of its Beoplay H100 headphones and A9 and A5 speakers, dubbed the "Centenni

Bang & Olufsen has released a commemorative audio collection to mark the brand's 100th anniversary.