yugatech x infinix

Google unveils Gemini, its largest multimodal AI yet: Gemini Nano to enable on-device AI to Android

Listen to article

Google has introduced ‘Gemini’ as its largest and most capable AI model. Built from the start as multimodal, Gemini can “seamlessly operate across and combine different types of information.” This includes text, image, audio, video, and code.

Google Gemini Ai Fi

The first model, Gemini 1.0 comes in three ‘sizes’: the Gemini Nano aimed to run natively on-device to Android; a ‘fine-tuned version’ of Gemini Pro now empowers Google Bard; and Gemini Ultra capable for highly complex tasks that can understand and code in Python, Java, C++, and Go.

Google’s subsidiary, DeepMind with its CEO Demis Hassabis claims that Gemini outperforms OpenAI’s GPT-4 model in 30 out of 32 benchmarks.

Those benchmarks show narrow and even larger differences, with Gemini’s advantage of being multimodal by design.

Gemini Final Multimodal Table Benchmarks Vs Gpt 4 (yuga Fi)

Furthermore, Google specifically mentioned that the Pixel 8 Pro is the first smartphone engineered for Gemini Nano. It apparently powers new features like Summarize in the phone’s Recorder app and will roll out in Smart Reply to Gboard starting with WhatsApp.

“Gemini in Search” is also in development to make Search Generative Experience (SGE) faster with 40% reduced latency in English in the US.

Take a look at what Gemini can do with a video demo below:

React to this article:
Written by
JM Chavaria

JM Chavaria

Senior Writer

JM's highest stat is probably his curious ardor to anything tech—electronics and gaming in particular. He certainly heeds utmost regard to specsheet, visuals, and rule of thirds. If creativity and wit sometimes leave JM's system, watching films, anime and a good stroll for memes are his approved therapeutic claims.

View all posts by JM Chavaria →

0 Comments

Leave a Reply

Loading next article...