MY KOLKATA EDUGRAPH
ADVERTISEMENT
regular-article-logo Friday, 22 November 2024

Explainer: Gemini 1.5 Pro or Flash or Nano?

One has to remember that Gemini 1.5 Pro and Flash are multimodal, meaning you can use prompts with text, images and videos

Mathures Paul Published 16.05.24, 10:35 AM
Google I/O 2024 developer conference was all about artificial intelligence.  

Google I/O 2024 developer conference was all about artificial intelligence.   Google

At Google’s annual developers conference, there were plenty of discussions about Gemini AI models —1.5 Flash and Nano, along with some more information about the 1.5 Pro model, which was announced in February.

One has to remember that Gemini 1.5 Pro and Flash are multimodal, meaning you can use prompts with text, images and videos.

ADVERTISEMENT

Gemini 1.5 Pro is available widely for developers and has a baseline token window of one million, with an option to sign up for a two million token window.

Gemini 1.5 Flash is the newest addition to the Gemini model family and the fastest Gemini model served in the API. It’s optimised for high-volume, high-frequency tasks at scale, is more cost-efficient to serve and features our breakthrough long context window. It excels at summarisation, chat applications, image and video captioning, data extraction from long documents and tables, and more. It has been trained by 1.5 Pro through a process called “distillation”.

Gemini Nano is the lightweight large language model Google introduced to the Pixel 8 Pro last year and, later, the Pixel 8. Gemini Nano is expanding beyond text-only inputs to include images as well. Starting with Pixel, applications using Gemini Nano with multimodality will be able to understand the world the way people do — not just through text, but also through sight, sound and spoken language.

Follow us on:
ADVERTISEMENT
ADVERTISEMENT