Google says its robots are getting better at undertaking complex tasks 

  • Google is training robots with its Gemini AI so they can complete tasks more efficiently 
  • A recently published paper shows the robots ‘learning’ in an office environment
  • Google is hoping to train its bots to respond to language and visual instructions at the same time

Published on Jul 15, 2024 at 7:25 AM (UTC+4)
by Claire Reid

Last updated on Jul 15, 2024 at 8:31 PM (UTC+4)
Edited by Tom Wood

Google is training robots with its Gemini AI so they can complete tasks and navigate spaces more effectively.

The tech giant’s DeepMind robotics team has paired its Gemini AI engine with RT-2 robots in an attempt to make communication between humans and the bots easier and more natural. 

To get the process up and running, the team would film a specific area within the office and then ‘show’ this video to the robot so it could gain an understanding of the environment. 


Google is using its Gemini AI to train robots

A human can then ask the robot a question about that space, optionally showing it an object or image as a visual cue, and the robot will guide them to the right spot. 

For example, you could approach the robot and ask, ‘Where can I use this?’ while holding up a whiteboard marker, and it would take you to a whiteboard. 

In a recently published paper, Google DeepMind said the AI-powered robots had around a 90 percent success rate when given more than 50 instructions in a 9,000-plus-square-foot area. 

This tech would be a big leap forward for AI helpers – allowing humans to interact with them in a way that feels a bit more natural. 

The tech could come in useful in real-life situations

“Object goal and Vision Language navigation (ObjNav and VLN) are a giant leap forward in robot usability as they allow the use of open-vocabulary language to define navigation goals, such as ‘Go to the couch’,” the paper’s authors wrote. 

“To make robots truly useful and ubiquitous in our daily lives, we propose another leap forward by lifting ObjNav and VLN’s natural language space onto the multimodal space, meaning that the robot can accept natural language and/or image instructions simultaneously. 

“For example, a person unfamiliar with the building can ask ‘Where should I return this?’ while holding a plastic bin, and the robot guides the user to the shelf for returning the box based on verbal and visual context.”
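
For the curious, here’s a rough, hypothetical sketch of that idea in Python. It is not Google’s code – the names (TourFrame, choose_goal_frame) are made up for illustration, and simple keyword matching stands in for the actual Gemini model – but it shows how a question plus a photo of an object could be matched against frames from a pre-recorded tour of the building to pick a navigation goal.

```python
# Hypothetical sketch of multimodal instruction navigation as described above:
# the robot combines a typed/spoken question with a caption of the object the
# user is holding, and picks a goal frame from the pre-recorded office tour.
# All names here are illustrative placeholders, not Google's actual API.

from dataclasses import dataclass

@dataclass
class TourFrame:
    index: int
    description: str          # caption produced during the demonstration tour
    location: tuple           # (x, y) pose recorded when the frame was captured

def choose_goal_frame(question: str, user_image_caption: str,
                      tour_frames: list[TourFrame]) -> TourFrame:
    """Toy stand-in for the VLM step: score each tour frame against the
    combined text + image context and return the best match."""
    query_words = set((question + " " + user_image_caption).lower().split())
    def score(frame: TourFrame) -> int:
        return len(query_words & set(frame.description.lower().split()))
    return max(tour_frames, key=score)

# Example: "Where should I return this?" while holding a plastic bin.
frames = [
    TourFrame(0, "kitchen area with fridge and coffee machine", (2.0, 5.0)),
    TourFrame(1, "storage shelf for returning plastic bins", (8.0, 1.5)),
    TourFrame(2, "whiteboard next to the meeting room", (4.5, 7.0)),
]
goal = choose_goal_frame("Where should I return this?", "plastic bin", frames)
print(f"Navigate to frame {goal.index} at {goal.location}: {goal.description}")
```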

Smart stuff, right? 

The researchers went on to say that ‘preliminary evidence’ suggested the robots were able to plan how to carry out certain tasks beyond simple navigation. 

As an example, the team stacked cans of Coca-Cola on one worker’s desk and then had him ask the robot if his favorite drink was available.

The team said Gemini showed evidence that it ‘knows that the robot should navigate to the fridge, inspect if there are Cokes, and then return to the user to report the result’. 

So a bit like a robot butler, then? 

Better a robot butler than a robot overlord, right?
