SoundHound is giving its AI the power of sight
SoundHound AI, already a major player in voice assistants, is now giving its technology a pair of eyes.
Imagine driving past a landmark and, without taking out your phone, asking your car, “What is that building over there?” and getting an immediate answer. That is what SoundHound AI is building.
With the launch of Vision AI, SoundHound's new system combines sight with sound to create a more natural way to interact with technology. The idea is to mimic how we operate as human beings: we don't just listen to someone, we also see their gestures and what they are looking at.
By bringing this same contextual understanding to artificial intelligence, SoundHound hopes to fix the often frustrating experience of using many of today's smart devices. The company is targeting real-world applications where this kind of common sense can make a big difference, whether in your next car, at a drive-thru, or on a factory floor.
“At SoundHound, we believe that the future of artificial intelligence is not only multimodal; it is deeply integrated, fast to respond, and built for real-world impact,” said Keyvan Mohajer, CEO of SoundHound AI.
“With Vision AI, we are extending our leadership in voice and conversational AI to redefine how humans interact with the products and services that companies provide and use.”
So, how does it work? Vision AI takes a live feed from a camera and fuses it with the company's voice technology, which already excels at understanding natural speech. By processing what it sees and what it hears at exactly the same time, the system can grasp the user's real intent in a way that a simple voice assistant never could.
Think of a mechanic wearing smart glasses who can simply look at an engine part and ask for instructions, receiving immediate visual and audio guidance without putting down their tools. In a store, an employee could scan shelves just by looking at them to get real-time inventory counts. For the rest of us, it could mean a drive-thru kiosk that visually confirms our order on screen the moment we say it.
One of the biggest technical challenges in building such a system is ensuring that the audio and visual elements are fully synchronized. Any lag would break the feel of a natural conversation.
“With Vision AI, we are fusing visual recognition and conversational intelligence into a single, synchronized flow. Every frame, every utterance, every intent is interpreted within the same ecosystem, enabling more natural user experiences.
“This is innovation at the intersection of intelligence and execution, delivering artificial intelligence that sees what you see, hears what you say, and responds in the moment.”
For companies that adopt this technology, the promise is faster service, fewer errors, and happier customers. It is about removing friction and making technology feel less like a tool you have to operate and more like a partner that helps you get things done.
This new visual capability is not the only upgrade SoundHound is offering. The company also recently upgraded its “brain” with a new release, Amelia 7.1. The update makes its AI agents faster and more accurate, and gives companies more control over, and transparency into, how they work.
By combining sight and sound, SoundHound aims to bring us closer to a world where interacting with artificial intelligence is as easy and intuitive as talking to another person.
(Photo by Christian Lowe)
2025-08-12 10:06:00



