Waymo is Sharing its Massive Self-Driving Dataset With Researchers
【Summary】Waymo, which spun out of Google’s self-driving car project in 2016, announced that its releasing the ‘Waymo Open Dataset’ for researchers and developers working on autonomous driving and other related mobility projects. Waymo says its dataset is the the largest, richest, and most diverse self-driving dataset ever released for research.
Before self-driving cars can safely navigate urban streets, the vehicles need large datasets to train the machine learning models that are used for navigation, recognizing street signs, pedestrians and other vehicles. However, training machine learning models requires an enormous amount of data and collecting it is long and painstaking process, especially for many of the budding startups working on autonomous driving.
Today Waymo, which spun out of Google's self-driving car project in 2016, announced that its releasing the ‘Waymo Open Dataset' for researchers and developers working on autonomous driving and other related mobility projects. Waymo says its dataset is the the largest, richest, and most diverse self-driving dataset ever released for research.
The data was collected by a fleet of Waymo self-driving vehicles that traveled over 10 million miles in 25 different cities.
The dataset includes high-resolution sensor data covering a wide variety of environments, include dense urban areas and suburban streets. That data was also collected in a wide variety of real-world conditions, including day and night, at dawn and dusk, in bright sunlight and rain.
This data is an invaluable tool for other parties working on autonomous driving. Waymo's own engineers use the dataset to develop self-driving technology and innovative machine learning models and algorithms. With the release of dataset, engineers outside of Waymo are getting access to the same data the Waymo's uses for the first time ever.
All of this data is then fed into Waymo's Open Dataset then crunched and processed by machine learning algorithms. Developers can also use datasets to improve upon existing algorithms by analyzing the data and using it to improve software to behave more like a human driver.
Waymo's Dataset Will Help Others to Improve Self-Driving Technology
Waymo believes that offering the dataset it will help speed up the developement of self-driving technology by sharing data and thereby promoting collaboration among developers, even if they are outside of the company.
"The more smart brains you can get working on the problem, whether inside or outside the company, the better," says Waymo principal scientist Drago Anguelov in a statement.
All of the data has been labeled and formatted to aid in research.
The dataset includes Camera-lidar synchronization. Waymo is currently working on 3D perception models that fuse data from multiple cameras and lidar.
Waymo's dataset contains data from 1,000 driving segments. Each segment captures 20 seconds of continuous driving, corresponding to 200,000 frames at 10 Hz per sensor, according to Waymo. This longer footage allows researchers to develop models for predicting the behavior of other road users.
Each segment contains sensor data gathered from five high-resolution Waymo lidars and five front-and-side-facing HD cameras. The dataset includes lidar frames and images with vehicles, pedestrians, cyclists, and signage that's been carefully labeled, capturing a total of 12 million 3D labels and 1.2 million 2D labels.
The Waymo Open Dataset has the potential to help researchers make advances in 2D and 3D perception, and improve behavior prediction, which is especially helpful for navigating dense urban areas where there are many pedestrians for a self-driving car to deal with.
The dataset includes camera footage from Waymo's high-definition cameras and 1.2 million 2D labels.
By releasing the dataset, Waymo hopes that the research community can use it to make self-driving vehicles more capable and safer. However, the comprehensive dataset can be used for applications outside of autonomous driving such as the related fields of computer vision and robotics.
Waymo's vehicle collected the data in Phoenix, AZ, Kirkland, WA, Mountain View, CA and San Francisco, CA capturing a wide spectrum of driving conditions (day and night, dawn and dusk, sun and rain).
Waymo is not the only company sharing its data for autonomous driving. Researchers at the University of California, Berkeley shared its DeepDrive dataset, which was once the largest dataset for self-driving AI. DeepDrive contains over 100,000 videos of over 1,100-hour driving events across different times of the day, and varying weather conditions.
China's Baidu released its ApolloScape dataset in March 2018 as part of its open Apollo autonomous driving platform. Baidu's ApolloScape dataset has 26 pre-defined semantic items, like cars, buildings, people walking on the sidewalk, traffic lights, street lights, etc. This has been done using a pixel-by-pixel semantic segmentation technique, according to Baidu.
Waymo said that the release of its self-driving dataset is just the first step. The Alphabet subsidiary is welcoming feedback from the developer community on how to make its dataset even more useful with future updates.
The dataset is available free of charge to researchers at waymo.com/open.
Originally from New Jersey, Eric is a automotive & technology reporter covering the high-tech industry in Silicon Valley. Eric has over 15 years of automotive experience and a bachelors degree in computer science. These skills, combined with technical writing and news reporting, allows him to fully understand and identify new and innovative technologies in the auto industry and beyond. He has worked at Uber on self-driving cars and as a technical writer, helping people to understand and work with technology. Outside of work, Eric likes to travel to new places, play guitar, and explore the outdoors.
Volkswagen is Investing $66 Billion in E-Mobility and Digital Technology
Ford Officially Confirms the 'Mustang Mach-E' Name for its First Fully-Electric SUV
Xilinx Announces New High-Performance Adaptive Devices for Autonomous Driving & ADAS Applications
Chinese Electric Vehicle Startup Xpeng Motors Raises $400 Million in Series C Funding
Hyundai Develops the World’s First Active Noise Cancelling Technology for Passenger Vehicles
California EV Startup Lucid Motors Begins Construction on its Arizona Factory
Toyota & BYD Establish Joint Company in China for Electric Vehicle Development
Lyft is Encouraging People to Sell Their Vehicles for Ride-hailing Credit
- MIT Teaching Autonomous Cars to See Around Corners
- Toyota Enters Partnership With Chinese Startup Pony.Ai for Autonomous Tech
- Hyundai Develops the World’s First Active Noise Cancelling Technology for Passenger Vehicles
- Audi’s Latest PPE Platform Is Part of a Four-Pronged EV Approach
- China’s Tencent Launches WeChat App for Drivers
- Tesla Model 3 Becomes Most Efficient Electric Vehicle on Sale
- AAA Study Finds Pedestrian Detection Systems Perform Inconsistently
- Designated Driver Deploys its ‘Remote Driver’ Technology for Autonomous Shuttles at Texas A&M
- Huawei to Develop Radar for Self-Driving Cars in Push into Auto Industry, Execs Say
- Waymo, AAA Partner to Educate Children on Autonomous Cars