Monday, December 16, 2024

Google’s new Gemini model will power AI agents

Must read

In This Story

Google (GOOGL+5.52%) unveiled its most capable model to date to power the “new agentic era” of artificial intelligence.

The tech giant introduced Gemini 2.0, which “will enable us to build new AI agents that bring us closer to our vision of a universal assistant,” Google chief executive Sundar Pichai said Wednesday, adding that the company is working to roll out the new model across its products. AI agents are software that can complete complex tasks autonomously for a user.

Gemini 2.0 has new multimodal capabilities, including native image and audio output, Pichai said. Google launched Gemini 1.0 last December, which the company said was the first “natively multimodal” model, meaning it could process and respond to text, video, image, audio, and code inquiries.

Developers and testers will be the first to get 2.0, while all Gemini users will have access to the Gemini 2.0 Flash experimental model. The Flash model builds off of Gemini 1.5 Flash, which Google launched in July as its fastest, most cost-efficient model.

Google will add Gemini 2.0’s reasoning capabilities to its AI Overviews feature, which Pichai said now reaches one billion people and is “quickly becoming one of our most popular Search features ever.” With Gemini 2.0, AI Overviews will be able to solve advanced multi-step queries, such as mathematical equations and multimodal questions.

Limited testing for Gemini 2.0 in AI Overviews started this week, Pichai said, but the reasoning feature will roll out to more users early next year. The reasoning model runs on Google’s custom 6th-generation AI chip, Trillium, which became available to Google Cloud customers on Wednesday. The new chip delivers 4x better performance and is 67% more energy efficient than its predecessor, according to the company.

Pichai also announced a new Gemini feature called Deep Research, which can “act as a research assistant” by using advanced reasoning and long context capabilities. Deep Research, which is available in Gemini Advanced, can compile research reports on behalf of a user.

Latest article