Waymo is Sharing its Massive Self-Driving Dataset With Researchers
【Summary】Waymo, which spun out of Google’s self-driving car project in 2016, announced that its releasing the ‘Waymo Open Dataset’ for researchers and developers working on autonomous driving and other related mobility projects. Waymo says its dataset is the the largest, richest, and most diverse self-driving dataset ever released for research.
Before self-driving cars can safely navigate urban streets, the vehicles need large datasets to train the machine learning models that are used for navigation, recognizing street signs, pedestrians and other vehicles. However, training machine learning models requires an enormous amount of data and collecting it is long and painstaking process, especially for many of the budding startups working on autonomous driving.
Today Waymo, which spun out of Google's self-driving car project in 2016, announced that its releasing the ‘Waymo Open Dataset' for researchers and developers working on autonomous driving and other related mobility projects. Waymo says its dataset is the the largest, richest, and most diverse self-driving dataset ever released for research.
The data was collected by a fleet of Waymo self-driving vehicles that traveled over 10 million miles in 25 different cities.
The dataset includes high-resolution sensor data covering a wide variety of environments, include dense urban areas and suburban streets. That data was also collected in a wide variety of real-world conditions, including day and night, at dawn and dusk, in bright sunlight and rain.
This data is an invaluable tool for other parties working on autonomous driving. Waymo's own engineers use the dataset to develop self-driving technology and innovative machine learning models and algorithms. With the release of dataset, engineers outside of Waymo are getting access to the same data the Waymo's uses for the first time ever.
All of this data is then fed into Waymo's Open Dataset then crunched and processed by machine learning algorithms. Developers can also use datasets to improve upon existing algorithms by analyzing the data and using it to improve software to behave more like a human driver.
Waymo's Dataset Will Help Others to Improve Self-Driving Technology
Waymo believes that offering the dataset it will help speed up the developement of self-driving technology by sharing data and thereby promoting collaboration among developers, even if they are outside of the company.
"The more smart brains you can get working on the problem, whether inside or outside the company, the better," says Waymo principal scientist Drago Anguelov in a statement.
All of the data has been labeled and formatted to aid in research.
The dataset includes Camera-lidar synchronization. Waymo is currently working on 3D perception models that fuse data from multiple cameras and lidar.
Waymo's dataset contains data from 1,000 driving segments. Each segment captures 20 seconds of continuous driving, corresponding to 200,000 frames at 10 Hz per sensor, according to Waymo. This longer footage allows researchers to develop models for predicting the behavior of other road users.
Each segment contains sensor data gathered from five high-resolution Waymo lidars and five front-and-side-facing HD cameras. The dataset includes lidar frames and images with vehicles, pedestrians, cyclists, and signage that's been carefully labeled, capturing a total of 12 million 3D labels and 1.2 million 2D labels.
The Waymo Open Dataset has the potential to help researchers make advances in 2D and 3D perception, and improve behavior prediction, which is especially helpful for navigating dense urban areas where there are many pedestrians for a self-driving car to deal with.
The dataset includes camera footage from Waymo's high-definition cameras and 1.2 million 2D labels.
By releasing the dataset, Waymo hopes that the research community can use it to make self-driving vehicles more capable and safer. However, the comprehensive dataset can be used for applications outside of autonomous driving such as the related fields of computer vision and robotics.
Waymo's vehicle collected the data in Phoenix, AZ, Kirkland, WA, Mountain View, CA and San Francisco, CA capturing a wide spectrum of driving conditions (day and night, dawn and dusk, sun and rain).
Waymo is not the only company sharing its data for autonomous driving. Researchers at the University of California, Berkeley shared its DeepDrive dataset, which was once the largest dataset for self-driving AI. DeepDrive contains over 100,000 videos of over 1,100-hour driving events across different times of the day, and varying weather conditions.
China's Baidu released its ApolloScape dataset in March 2018 as part of its open Apollo autonomous driving platform. Baidu's ApolloScape dataset has 26 pre-defined semantic items, like cars, buildings, people walking on the sidewalk, traffic lights, street lights, etc. This has been done using a pixel-by-pixel semantic segmentation technique, according to Baidu.
Waymo said that the release of its self-driving dataset is just the first step. The Alphabet subsidiary is welcoming feedback from the developer community on how to make its dataset even more useful with future updates.
The dataset is available free of charge to researchers at waymo.com/open.
Originally hailing from New Jersey, Eric is a automotive & technology reporter covering the high-tech industry here in Silicon Valley. He has over 15 years of automotive experience and a bachelors degree in computer science. These skills, combined with technical writing and news reporting, allows him to fully understand and identify new and innovative technologies in the auto industry and beyond. He has worked at Uber on self-driving cars and as a technical writer, helping people to understand and work with technology.
BMW Invests $25 Million in Silicon Valley-based Software Motor Company
Cruise Debuts the 'Origin', an Autonomous Urban Robotaxi Built for Ride-Hailing
Tesla Announces it Will Use Baidu Maps in its China-Made Vehicles
Uber is Testing a New Feature in California Allowing Some Drivers to Set Their Own Fares
The NHTSA Will Review a Petition to Investigate Sudden Acceleration Complaints on Tesla Vehicles
Volvo to Build an Electric Vehicle Battery Plant in the U.S.
Toyota Leads $590 Million Funding Round in California VTOL Aircraft Startup Joby Aviation
EV Startup Lucid Motors to Reveal a Near Production Version of its Ultra-Luxurious Air Sedan at the NY Auto Show
- NVIDIA Announces DRIVE AXG Orin, One of the Most Advanced Platforms for Autonomous Vehicles
- Waymo Commits to Using Safety Drivers in Autonomous Vehicles Long Term
- China’s BYD Receives Order for 130 Electric Buses, its Biggest Order to Date in the U.S.
- Toyota Believes There’s No Market for EVs Yet
- Hyundai Launches ‘BotRide’ Autonomous SUVs in California with Pony.ai & Via
- General Motors May Bring Back Hummer as a New Electric Truck Brand
- Volvo to Build an Electric Vehicle Battery Plant in the U.S.
- Walmart is Launching an Autonomous Grocery Delivery Pilot with Startup Nuro
- Electric Vehicles Getting Hit With Higher Fees at the State Level
- Didi Chuxing to Trial Luxury Ride-Hailing Service with Tesla & Mercedes Benz Vehicles