Parsing Petabytes, SpaceML Taps Satellite Images to Help Model Wildfire Risks


Teams of experts and citizen scientists help build image classifiers from satellite imagery of Earth to spot signs of natural disasters.

by SCOTT MARTIN

When freak lightning ignited massive wildfires across Northern California last year, it also sparked efforts by data scientists to improve wildfire predictions.

One effort came from SpaceML, an initiative of the Frontier Development Lab, which is an AI research lab for NASA in partnership with the SETI Institute. Dedicated to open-source research, the SpaceML developer community is creating image recognition models to help advance the study of natural disaster risks, including wildfires.

SpaceML uses accelerated computing on petabytes of data for the study of Earth and space sciences, with the goal of advancing projects for NASA researchers. It brings together data scientists and volunteer citizen scientists on projects that tap into the NASA Earth Observing System Data and Information System. The satellite archive holds daily images covering all 197 million square miles of Earth's surface, recorded over 20 years and amounting to 40 petabytes of unlabeled data.

“We are lucky to be living in an age where such an unprecedented amount of data is available. It’s like a gold mine, and all we need to build are the shovels to tap its full potential,” said Anirudh Koul, machine learning lead and mentor at SpaceML.

Stoked to Make a Difference

Koul, who works by day as a data scientist at Pinterest, said the California wildfires damaged areas near his home last fall. The San Jose resident and avid hiker said they scorched some of his favorite hiking spots on nearby Mount Hamilton. His first impulse was to sign up as a volunteer firefighter, but he realized his biggest contribution could come from lending his data science chops.

Koul enjoys work that helps others. Before volunteering at SpaceML, he led AI and research efforts at startup Aira, which pairs image recognition with natural language processing in augmented reality glasses to describe to blind users what is in front of them.

Aira, a member of the NVIDIA Inception accelerator program for startups in AI and data science, was acquired last year.

Inclusive Interdisciplinary Research

The work at SpaceML pairs volunteers without backgrounds in AI with tech industry professionals who serve as project mentors. Their goal is to build image classifiers from satellite imagery of Earth to spot signs of natural disasters.

Groups take on three-week projects that can examine everything from wildfires and hurricanes to floods and oil spills. They meet monthly with NASA scientists, who bring domain expertise to evaluate progress.

Contributors to SpaceML range from high school students to graduate students and beyond. The work has included participants from Nigeria, Mexico, Korea, Germany and Singapore.

SpaceML’s team members for this project include Rudy Venguswamy, Tarun Narayanan, Ajay Krishnan and Jeanessa Patterson. The mentors are Koul, Meher Kasam and Siddha Ganju, a data scientist at NVIDIA.

Assembling a SpaceML Toolkit

SpaceML provides a collection of machine learning tools that groups use for tasks such as self-supervised learning with SimCLR, multi-resolution image search and data labeling. Ease of use is key to the suite.
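To give a sense of the self-supervised piece, here is a minimal sketch of a SimCLR-style contrastive (NT-Xent) loss in PyTorch. It is illustrative only; the temperature, batch handling and the `encoder`/`augment` names are generic assumptions rather than SpaceML's actual implementation.

```python
import torch
import torch.nn.functional as F

def nt_xent_loss(z1, z2, temperature=0.5):
    """SimCLR-style contrastive loss for two augmented views of the same tiles.

    z1, z2: [batch, dim] embeddings produced by the encoder for each view.
    """
    batch = z1.shape[0]
    z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)             # [2B, D], unit-length rows
    sim = z @ z.t() / temperature                                  # pairwise cosine similarities
    sim.masked_fill_(torch.eye(2 * batch, dtype=torch.bool,
                               device=z.device), float("-inf"))    # ignore self-similarity
    # The positive for view i is the other augmentation of the same tile.
    targets = torch.cat([torch.arange(batch, 2 * batch),
                         torch.arange(0, batch)]).to(z.device)
    return F.cross_entropy(sim, targets)

# Usage (hypothetical names): loss = nt_xent_loss(encoder(augment(x)), encoder(augment(x)))
```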

Among their model-building tools, SpaceML contributors rely on NVIDIA DALI for fast data preprocessing. DALI turns raw, unstructured imagery, which can't be fed directly into convolutional neural networks, into batches ready for training classifiers.

“Using DALI we were able to do this relatively quickly,” said Venguswamy.
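A minimal sketch of what such a DALI preprocessing pipeline can look like appears below. The tile directory, batch size and output resolution are illustrative assumptions, not the project's actual configuration.

```python
from nvidia.dali import pipeline_def, fn, types

# Hypothetical directory of JPEG satellite tiles; the path and sizes are illustrative.
@pipeline_def(batch_size=64, num_threads=4, device_id=0)
def tile_pipeline(tile_dir):
    jpegs, labels = fn.readers.file(file_root=tile_dir, random_shuffle=True)
    images = fn.decoders.image(jpegs, device="mixed")          # decode JPEGs on the GPU
    images = fn.resize(images, resize_x=224, resize_y=224)     # uniform tile size for the CNN
    images = fn.crop_mirror_normalize(                         # HWC uint8 -> CHW float, normalized
        images,
        dtype=types.FLOAT,
        mean=[0.485 * 255, 0.456 * 255, 0.406 * 255],
        std=[0.229 * 255, 0.224 * 255, 0.225 * 255],
    )
    return images, labels

pipe = tile_pipeline(tile_dir="/data/worldview_tiles")  # hypothetical path
pipe.build()
images, labels = pipe.run()  # one GPU-resident batch, ready to feed a classifier
```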

Findings from SpaceML were published at the Committee on Space Research (COSPAR) so that other researchers can replicate the team's methods.

Classifiers for Big Data

The group developed Curator to train classifiers with a human in the loop, requiring fewer labeled examples thanks to its self-supervised learning. Curator's interface works like Tinder, explains Koul: novices swipe left to reject candidate images for their classifiers or swipe right to accept those that will be used in the training pipeline.

The process lets them quickly collect a small set of labeled images and run it against the GIBS Worldview satellite imagery to find every matching image around the globe, creating a massive dataset for further scientific research.

“The idea of this entire pipeline was that we can train a self-supervised learning model against the entire Earth, which is a lot of data,” said Venguswamy.
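In spirit, that matching step is an embedding-based similarity search. The sketch below illustrates the idea with a generic ResNet backbone standing in for the self-supervised encoder; the model, tensor shapes and similarity threshold are assumptions for illustration, not the Curator pipeline's actual components.

```python
import torch
import torch.nn.functional as F
import torchvision.models as models

# A generic ResNet backbone stands in for the self-supervised encoder (illustrative only).
encoder = models.resnet50(weights=None)
encoder.fc = torch.nn.Identity()   # keep the 2048-d features instead of class logits
encoder.eval()

@torch.no_grad()
def embed(batch):
    """batch: [N, 3, 224, 224] float tensor of preprocessed tiles."""
    return F.normalize(encoder(batch), dim=1)

def find_matches(labeled_tiles, candidate_tiles, threshold=0.8):
    """Return indices of candidate tiles whose best cosine similarity
    to any labeled tile clears the (assumed) threshold."""
    sims = embed(candidate_tiles) @ embed(labeled_tiles).t()   # [num_candidates, num_labeled]
    best = sims.max(dim=1).values                              # best match per candidate
    return (best >= threshold).nonzero(as_tuple=True)[0]
```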

The CNNs run on NVIDIA GPU instances in the cloud.

To learn more about SpaceML, check out these speaker sessions at GTC 2021:

Space ML: Distributed Open-Source Research with Citizen-Scientists for Advancing Space Technology for NASA (GTC registration required to view)

Curator: A No-Code, Self-Supervised Learning and Active Labeling Tool to Create Labeled Image Datasets from Petabyte-Scale Imagery (GTC registration required to view)

The GTC keynote can be viewed on April 12 at 8:30 a.m. Pacific time and will be available for replay.

Photo credit: Emil Jarfelt, Unsplash
