Google can now recognize objects in videos using machine learning | Public Relations & Social Marketing Insight | Scoop.it

Fei-Fei Li, chief scientist of artificial intelligence and machine learning at Google Cloud, came on stage at Google’s Next Cloud conference today to talk about the current and next-generation applications of AI that Google’s working on. These technologies will make a difference in self-driving cars and healthcare, sure, but also Snapchat’s filters and Google Photos’ search capabilities. But the big highlight came when she announced a new way to allow software to parse video.

This new “Video Intelligence API” was demoed onstage, and it offered the kind of “whoa” moment you expect from a Google keynote. By playing a short commercial, the API was able to identify the dachshund in the video, when it appeared in the video, and then understand that the whole thing was a commercial. In another demo, we saw a simple search for “beach” and was able to find videos which had scenes from beaches in them, complete with timestamps. That’s similar to how Google Photos lets you search for “sunset” and pull up your best late-day snapshots.

Before now, computers couldn’t really understand the content of a video directly without manual tagging. “We are beginning to shine light on the dark matter of the digital universe,” Li said. At least in Google’s demo, it was genuinely impressive. And Google is making the API available to developers, just as it has with its other machine learning APIs.