Log in to bookmark your favorites and sync them to your phone or calendar.

Deep Learning Stage [clear filter]
Friday, January 25


Learned Video Compression
We present an algorithm for video coding, learned end-to-end for the low-latency mode. In this setting, our approach outperforms all standard video codecs across nearly the entire bitrate range. To our knowledge, this is the first ML-based method to do so. We propose a novel architecture for video compression which generalizes motion estimation to perform any learned compensation beyond simpler translations. Our architecture allows for joint compression of motion and residual and can dynamically trade-off between them. It is also able to model multiple flow fields in the same frame. We propose an ML-based spatial rate control, which allows or model to adaptively change the bitrate across space for each frame. For the same quality traditional codecs achieve up to 60% larger code.

avatar for Lubomir Bourdev, WaveOne

Lubomir Bourdev, WaveOne

Co-Founder & CEO, WaveOne
Lubomir Bourdev is a co-founder and the CEO of WaveOne, Inc., a startup focusing on video compression with deep learning. He is also a founding member of Facebook AI Research and he founded and led the Facebook AML Computer Vision team responsible for the image and video content recognition... Read More →

Friday January 25, 2019 9:10am - 9:25am
Grand Ballroom Hyatt Regency San Francisco, 5 Embarcadero Center, San Francisco, CA 94111, USA


Using AI to Transform Informational Videos and Our Watching Behavior
Videos account for about 75% of the internet traffic and enterprises are increasingly using videos for various informational purposes, including training of customers, partners and employees, marketing and internal communication. However, most viewers do not have the patience to watch these videos end-to-end and our video watching experience has not evolved much in over a decade. We present an AI-based approach to automatically index videos in the form of a table-of-contents, a phrase cloud and a searchable transcript, which helps summarize the key topics in a video and lets viewers navigate directly to the topics of interest. We use a combination of visual classification, object detection, automated speech recognition, text summarization, and domain classification, and show the results achieved on a range of informational videos. We conclude with some thoughts on the promise of transforming how informational videos are consumed as well as open problems and future directions.

avatar for Manish Gupta, VideoKen

Manish Gupta, VideoKen

CEO & Co-Founder, VideoKen
Dr. Manish Gupta is the co-founder and CEO of VideoKen Inc., a video technology startup. He has served as the Vice President and Director of Xerox Research Centre India and has held various leadership positions with IBM, including that of Director, IBM Research - India and Chief Technologist... Read More →

Friday January 25, 2019 9:25am - 9:40am
Grand Ballroom Hyatt Regency San Francisco, 5 Embarcadero Center, San Francisco, CA 94111, USA


DNA of an AI Powered Robotic Workforce for Extreme Environments
As practical applications of AI emerge in industrial robotics, we are starting to realize the potential of highly autonomous robotic systems not only capable of performing AI driven tasks in simulation or structured environments, but out in the field and even in extreme environments. However, there are optimum solutions that do not require the use of deep reinforcement learning or other Machine Learning methodologies. What is the right balance between powerful implementations of AI and traditional automation control to achieve a highly autonomous robotic system that can operate in remote unforgiving locations? Is there a right DNA for an AI powered robotic workforce for extreme environments?

avatar for Alicia Kavelaars, OffWorld

Alicia Kavelaars, OffWorld

Co-Founder and CTO, OffWorld
Alicia is Co-Founder and Chief Technology Officer at OffWorld Inc. She brings over 15 years of experience in the aerospace industry developing and successfully launching systems for NASA, NOAA and the Telecommunications industry. In 2015, Alicia made the jump to New Space to work... Read More →

Friday January 25, 2019 9:40am - 9:55am
Grand Ballroom Hyatt Regency San Francisco, 5 Embarcadero Center, San Francisco, CA 94111, USA