OpenAI Announces Realtime API, Prompt Coaching and Vision Fine-Tuning on GPT-4o for Developers

OpenAI made the announcements at its DevDay conference on Tuesday.

Advertisement
Written by Akash Dutta, Edited by Manas Mitul | Updated: 3 October 2024 14:03 IST
Highlights
  • Realtime API supports low-latency speech-to-speech conversations
  • Prompt coaching will allow developers to reuse recently seen input tokens
  • OpenAI is also making the process of model distillation easier

These features are coming to all the developers using the paid version of ChatGPT API

Photo Credit: Unsplash/Levart_Photographer

OpenAI hosted its annual DevDay conference in San Francisco on Tuesday and announced several new upgrades to the application programming interface (API) version of ChatGPT, which can be remodelled and fine-tuned to power other applications and software. Among them, the major introductions are the realtime API, prompt coaching, and vision fine-tuning with GPT-4o. The company is also making the process of model distillation easier for developers. OpenAI also announced the completion of its funding round and stated it raised $6.6 billion (roughly Rs. 55 thousand crore) during the event.

OpenAI Announces New Features for Developers

In several blog posts, the AI firm highlighted the new features and tools for developers. The first is realtime API which will be available to the paid subscribers of ChatGPT API. This new capability offers a low-latency multimodal experience, allowing speech-to-speech conversations similar to the ChatGPT Advanced Voice Mode. Developers can also make use of the six preset voices that were earlier added to the API.

Another new introduction is the prompt coaching capability in the API. OpenAI is introducing this feature as a way for developers to save costs on prompts which are frequently used. The company noticed that developers usually keep sending the same input prompts when editing a codebase or having a multi-turn conversation with the chatbot. With prompt coaching, they can now reuse recently used input prompts at a discounted rate. The processing for the same will also be faster. The new rates can be checked here.

Advertisement

The GPT-4o model can also be fine-tuned for vision-related tasks. Developers can customise the large language model (LLM) by training it on a fixed set of visual data and improving its output efficiency. As per the blog post, the performance of GPT-4o for vision tasks can be improved with as few as 100 images.

Advertisement

Finally, the company is also making the process of model distillation easier for developers. Model distillation is the process of building smaller, fine-tuned AI models from a larger language model. Earlier, the process was convoluted and required taking a multi-step approach. Now, OpenAI is offering new tools such as Stored Completions (to easily generate distillation datasets), Evals (to run custom evaluations and measure performance), and Fine-Tuning (fine-tuning the smaller models directly after running an Eval).

Notably, all of these features are currently available in beta and will be available to all developers using the paid version of the API at a later date. Further, the company said it will be taking steps to further reduce the costs of input and output tokens.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. iQOO Neo 11 With Snapdragon 8 Elite SoC Launched: Price, Specifications
  2. Canva Brings Revamped Video Editor, New AI Tools and a Marketing Platform
  3. Top OTT Releases of the Week: Kantara Chapter 1, Lokah Chapter 1, Idli Kadai, and More
  4. Upcoming Smartphones in November: From OnePlus 15 to iQOO 15, Check List
  5. Gemini 3 AI Model Will Be Released Soon, Says Google CEO Sundar Pichai
  6. Reliance Offers Free 18-Month Google AI Pro with Gemini, Veo to Jio Users
  7. Samsung Galaxy S26 Series Teased to Launch With These Notable Upgrades
  8. Lava Agni 4 With Metal Design and Flat Edges Teased Ahead of Debut
  9. Gemini vs Perplexity vs ChatGPT: Which Free AI Plan Is Best For You
  10. Samsung Wallet Updated With Support for These UPI Improvements in India
  1. Scientists May Have Finally Solved the Sun’s Mysteriously Hot Atmosphere Puzzle
  2. Vivo X300 Series Launched Globally With 200-Megapixel Zeiss Camera, Up to 6.78-Inch Display: Price, Features
  3. Canva Introduces Revamped Video Editor, New AI Tools and a Marketing Platform
  4. Thode Door Thode Paas OTT Release Date: Know When and Where to Watch it Online
  5. Blackmail Now Streaming Online: Know Where to Watch This Tamil Crime Thriller Movie
  6. Eva Husson’s Playdate OTT Release Date: When and Where to Watch it Online?
  7. Raj Tarun's Chiranjeeva OTT Release Date: When and Where to Watch it Online?
  8. Bitchat Becomes Jamaica’s Go-to App as Hurricane Melissa Cripples Communication
  9. Google Maps Is Reportedly Developing a New Power Saving Mode for Navigation
  10. Take-Two CEO Says AI Won't Be 'Very Good' at Making a Game Like Grand Theft Auto
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.