Google has added a whole new dimension to its Gemini-driven AI Mode search, adding images to the text and links its conversational search platform already provides.
AI Mode now folds in elements of Google Image Search and Google Lens alongside the Gemini AI engine, so you can ask a question about a photo you upload, or view photos related to your queries.
For example, you might spot someone on the street with an aesthetic you like, snap an image, and ask AI Mode to "show me this style in lighter shades." Or you could ask for "retro 1950s living room designs" and see what people were sitting on roughly 75 years ago. Google pitches the feature as a way of replacing filters and awkward keywords with natural conversation.
The visual side of AI Mode uses a "visual search fan-out" technique built on top of the query fan-out approach AI Mode already uses to answer questions.
When you upload or point to an image, AI Mode breaks it down into elements such as objects, background, color, and texture, then sends multiple internal queries in parallel. That way, it returns relevant images rather than simply echoing what you already shared, recombining the results that best match your intent.
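The fan-out idea described above can be sketched roughly as follows. This is a minimal illustration, not Google's actual system: every function name, facet, and result here is a hypothetical stand-in, since the internal pipeline is not public.

```python
# Illustrative sketch of a "visual search fan-out" pipeline.
# All names and data here are hypothetical placeholders.
from concurrent.futures import ThreadPoolExecutor

def decompose_image(image):
    """Stand-in for the image-understanding step: extract facets such as
    objects, background, color, and texture. A real system would run
    vision models; here we return canned facets."""
    return {
        "objects": "mid-century armchair",
        "color": "teal",
        "texture": "velvet upholstery",
        "background": "living room",
    }

def run_query(facet, value):
    """Stand-in for one internal image-search query on a single facet."""
    return [f"result for {facet}={value} #{i}" for i in range(2)]

def visual_fan_out(image, user_prompt):
    facets = decompose_image(image)
    # Fan out: issue one internal query per facet, in parallel.
    with ThreadPoolExecutor() as pool:
        futures = [pool.submit(run_query, f, v) for f, v in facets.items()]
        results = [r for fut in futures for r in fut.result()]
    # Recombine: rank the pooled results against the user's stated intent
    # (here a naive keyword-overlap score stands in for real ranking).
    return sorted(results,
                  key=lambda r: -sum(w in r for w in user_prompt.split()))

hits = visual_fan_out("photo.jpg", "teal armchair")
print(len(hits))  # 8 -- two results per facet across four facets
```

The key property the sketch shows is that the queries are derived from the image's parts rather than the image as a whole, so the merged results can go beyond near-duplicates of the original photo.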
Of course, it also means Google's search engine has to decide which AI-drawn results to highlight and when to suppress noise. It can misread your intent, surface sponsored products, or favor large brands whose images are better optimized for AI. As search becomes more image-centric, businesses that lack polished visuals or visual metadata can disappear from the results, making the experience less useful, not more.
On the shopping side, all of this taps into Google's Shopping Graph, which indexes over 50 billion products and is refreshed hourly. So a picture of a pair of jeans might net you current prices, reviews, and local availability all at once.
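The lookup the article describes amounts to joining a visual match against a product index. A toy sketch under stated assumptions (the Shopping Graph is not publicly queryable this way; the index, SKUs, and matcher below are invented for illustration):

```python
# Hypothetical sketch: one visual match joined to product metadata.
# The index, SKUs, and matcher are made up; the Shopping Graph's real
# schema and APIs are not public.

PRODUCT_INDEX = {
    "sku-001": {"name": "slim-fit jeans", "price": 49.99,
                "rating": 4.3, "in_stock_nearby": True},
    "sku-002": {"name": "wide-leg jeans", "price": 59.99,
                "rating": 4.6, "in_stock_nearby": False},
}

def match_image_to_sku(image):
    """Stand-in for visual product matching; a real system would embed
    the image and run nearest-neighbor search over product images."""
    return "sku-001"

def lookup(image):
    """Return price, reviews, and local availability in one response,
    as the article describes."""
    sku = match_image_to_sku(image)
    return {"sku": sku, **PRODUCT_INDEX[sku]}

info = lookup("jeans_photo.jpg")
print(info["price"])  # 49.99
```

The point is the single round trip: one image resolves to a product identifier, and that identifier keys into pricing, review, and inventory data in the same response.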
An AI Mode that turns vague prompts and visuals into real opportunities for shopping, learning, or even discovering art is a big deal, at least if it works well. Google's scale lets it fuse search, image understanding, and e-commerce into a single flow.
Plenty of rivals will be watching closely, even those that beat Google to market with similar products. Pinterest Lens already lets you find similar looks from an image, and Microsoft's Copilot and Bing Visual Search let you start from images in some ways.
But few combine a global product database and live pricing data with conversational search through images. Specialized apps focused on niche areas may come out ahead, but Google's size and ubiquity mean it will have a massive lead in broader attempts to search for information with images.
For years we have typed queries and parsed results. Now online search is heading toward seeing, pointing, and describing, letting AI and search engines interpret not only our words but also what we see.
As aesthetics and design thinking matter more, systems that natively understand visuals are becoming increasingly critical, and AI Mode's evolution suggests the baseline may soon be that search tools should look as well as read.
Should errors creep in, all bets are off. If AI Mode's visual results misinterpret intent, mislead users, or show clear bias, people may fall back on brute-force filtering or more specialized tools. The success of this visual AI leap hinges on whether it feels useful or unreliable.