Apple Shares Massive Dataset to Help Researchers Build Nano Banana-Like AI Models

Apple wants to help researchers and developers build better AI models with its dataset, even when it struggles to do so itself.

Written by Akash Dutta, Edited by Ketan Pratap | Updated: 29 October 2025 15:20 IST
Highlights
  • Apple’s Pico-Banana-400K is a dataset for text-guided image editing
  • The company used Nano Banana’s output to create the dataset
  • Apple’s dataset comes with a non-commercial research license

Apple said the dataset was created due to the absence of large-scale and openly accessible images

Photo Credit: Reuters

Apple researchers have released a large-scale dataset to help others develop artificial intelligence (AI) models for image editing. Dubbed Pico-Banana-400K, the dataset contains roughly 400,000 real images and their AI-edited counterparts, which can be used to train AI models to handle text-based image editing requests. The dataset is openly available under a research-only license, meaning it cannot be used for commercial purposes. Interestingly, the Cupertino-based tech giant's dataset release comes at a time when the company is itself struggling with its in-house AI models.

Apple's Pico-Banana-400K Will Help Others Build Image Editing Models

A research paper titled “Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing” was published on arXiv, a preprint repository. The dataset contains roughly 400,000 edit pairs based on real photos sourced from OpenImages, organised into a 35-type edit taxonomy and split into single-turn edits, multi-turn sequences and preference pairs.

These design choices matter because they shift the training signal from synthetic, narrowly curated examples to instruction-rich, real-world scenarios that resemble what users actually ask for.
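As a rough illustration of the three-way split described above, the records could be modelled along these lines. This is only a sketch: the class names, field names and the `count_by_subset` helper are hypothetical and are not taken from Apple's release, whose actual file layout may differ.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class SingleTurnEdit:
    """One instruction applied to one source photo (hypothetical schema)."""
    source_image: str   # path or URL of the original photo
    instruction: str    # text instruction describing the edit
    edited_image: str   # path or URL of the AI-edited result
    edit_type: str      # label from the edit taxonomy (label names assumed)


@dataclass
class MultiTurnSequence:
    """Several edits applied one after another to the same photo."""
    source_image: str
    turns: List[SingleTurnEdit] = field(default_factory=list)


@dataclass
class PreferencePair:
    """A better and a worse result for the same instruction."""
    source_image: str
    instruction: str
    preferred_image: str
    rejected_image: str


def count_by_subset(records: list) -> Dict[str, int]:
    """Count how many records fall into each subset, keyed by class name."""
    counts: Dict[str, int] = {}
    for record in records:
        name = type(record).__name__
        counts[name] = counts.get(name, 0) + 1
    return counts
```

Keeping the three subsets as separate record types makes it easy for a training script to pick only the portion it needs, for example preference pairs for alignment experiments.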

Pico-Banana-400K was produced by chaining two models: a powerful generative model (Nano Banana) created the edits, and another large multimodal model acted as an automated judge, filtering out failed attempts and triggering retries. The result is a dataset emphasising photographic diversity, human-centric scenes and text-heavy shots. It also captures nuance by including both long and short instructions to support research work.
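The generate, judge and retry loop described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not Apple's actual pipeline: `generate` and `judge` are hypothetical callables standing in for the editing model and the automated judge, and the retry limit of three is an arbitrary choice.

```python
from typing import Callable, List, Optional, Tuple


def edit_with_retries(
    image: str,
    instruction: str,
    generate: Callable[[str, str], str],     # hypothetical stand-in for the editing model
    judge: Callable[[str, str, str], bool],  # hypothetical stand-in for the automated judge
    max_attempts: int = 3,                   # assumed limit, not taken from the paper
) -> Tuple[Optional[str], List[str]]:
    """Try an edit up to max_attempts times.

    Returns the first candidate the judge accepts (or None if every
    attempt is rejected), together with the list of rejected candidates.
    """
    rejected: List[str] = []
    for _ in range(max_attempts):
        candidate = generate(image, instruction)
        if judge(image, instruction, candidate):
            return candidate, rejected
        rejected.append(candidate)
    return None, rejected
```

Returning the rejected candidates alongside the accepted one is a design choice made for this sketch: a pipeline of this kind could keep them as a possible source of negative examples instead of discarding them.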

The dataset also includes negative examples and preference pairs, which are crucial for alignment research and for teaching models not just what to do but what “better” looks like. The paper explicitly documents which edit types are robust (style transfers, global photometric changes) and which remain brittle (precise spatial relocations, text replacement on signs), making it unusually candid about limitations.

The dataset is currently available on GitHub and can be used for non-commercial purposes only.

Interestingly, Apple's in-house AI progress appears to have stalled. While it has integrated Apple Intelligence into more apps and features with the iPhone 17 series launch, the company continues to delay the Siri overhaul, which was first announced in 2024.

 
