
Apple has released Pico-Banana-400K, a research dataset containing 400,000 images. Notably, the dataset was built with the help of Google's Gemini 2.5 models.
The accompanying paper, "Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing," describes the full 400,000-image dataset. It is released under a non-commercial research license: researchers and academic institutions may use it freely, but not for commercial purposes.
Several months ago, Google launched the Gemini 2.5 Flash Image model, better known as Nano-Banana, which performs exceptionally well in image editing tasks and is widely considered one of the most advanced image editing models available. Despite significant recent advances in image generation and editing models, Apple's research team points out that "despite continuous technological progress, open research remains hampered by a lack of large-scale, high-quality, and fully shareable image editing datasets. Existing datasets often rely on synthetic data generated by proprietary models or contain only limited, manually selected subsets. Furthermore, these datasets commonly suffer from domain shifts, uneven distribution of editing types, and inconsistent quality control, severely hindering the development of robust image editing models."
To address this bottleneck, the Apple team set out to build a more comprehensive and representative image editing dataset.
The research team first selected a large number of real-world photographs from the OpenImages dataset, ensuring diverse content including people, objects, and scenes containing text.
The team then designed 35 different types of image modification instructions, grouping them into eight main categories; examples include:
Pixel & Photometric Adjustments: such as adding film grain or retro filters;
Human-Centric Editing: such as transforming a person into a Funko-Pop style toy;
Scene Composition & Multi-Subject Editing: such as changing weather conditions (sunny/rainy/snowy);
Object-Level Semantic Editing: such as moving objects or adjusting spatial relationships;
Image Scaling: such as zooming in.
Next, the researchers fed each original image, along with an editing instruction, into the Nano-Banana model. Every generated result was then automatically evaluated by the Gemini 2.5 Pro model, which judged whether the edit faithfully followed the instruction and whether it had good visual quality. Only results that passed both checks were included in the final dataset.
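The generate-then-judge loop described above can be sketched as follows. This is an illustrative outline only: the model names are real, but the function signatures and record fields are invented here, since Apple's actual pipeline code is not public.

```python
def filter_edits(samples, edit_model, judge_model):
    """Keep only edits that pass both automated checks.

    samples:     iterable of (source_image, instruction) pairs
    edit_model:  callable standing in for Nano-Banana; returns an edited image
    judge_model: callable standing in for Gemini 2.5 Pro; returns a
                 (follows_instruction, good_quality) pair of booleans
    """
    kept = []
    for source, instruction in samples:
        edited = edit_model(source, instruction)             # generation step
        follows, quality = judge_model(source, edited, instruction)  # judging step
        if follows and quality:                              # both checks must pass
            kept.append({"source": source, "instruction": instruction, "edited": edited})
    return kept
```

In this sketch the judge sees the source image, the edited image, and the instruction together, so it can score both instruction-following and visual quality in one call.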
Pico-Banana-400K includes not only single-turn edits (i.e., edits completed with a single prompt), but also multi-turn edit sequences, and "preference pairs"—comparative samples of successful and unsuccessful edits—to help the model learn to distinguish between ideal and poor output.
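The three kinds of examples can be pictured as records like the following. The field names and file names here are purely illustrative, not the dataset's actual schema:

```python
# A single-turn edit: one instruction, one edited output.
single_turn = {
    "image": "000123.jpg",
    "instruction": "add film grain",
    "edited": "000123_edit.jpg",
}

# A multi-turn sequence: each instruction applies to the previous turn's output.
multi_turn = {
    "image": "000456.jpg",
    "turns": [
        {"instruction": "change the weather to snowy", "edited": "000456_t1.jpg"},
        {"instruction": "zoom in on the cabin", "edited": "000456_t2.jpg"},
    ],
}

# A preference pair: a successful and a failed edit of the same instruction,
# so a model can learn to prefer the "chosen" output over the "rejected" one.
preference_pair = {
    "image": "000789.jpg",
    "instruction": "move the cup to the left",
    "chosen": "000789_good.jpg",    # passed the automated checks
    "rejected": "000789_bad.jpg",   # failed them
}
```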
While the research team acknowledges that Nano-Banana still has limitations in fine-grained spatial control, layout extrapolation, and text typography, they emphasize that Pico-Banana-400K aims to provide a solid and reproducible foundation for training and evaluating the next generation of text-guided image editing models.
The research paper has been published on the preprint platform arXiv, and the complete Pico-Banana-400K dataset is freely available to researchers worldwide on GitHub.