Machine Learning

Publish your training data on Radiant MLHub for NeurIPS 2021

Submissions to the new Datasets and Benchmarks track require data documentation and availability on an open repository.

Organizers of the NeurIPS 2021 conference recently announced a new track for Datasets and Benchmarks. This is a significant development for a major machine learning (ML) conference to highlight the importance of data in developing algorithms for real-world problems. We at Radiant Earth Foundation welcome this initiative and applaud the organizers for establishing this new track.

In recent years, there have been many discussions and arguments to incentive ML researchers to work on real-world problems. One of those incentive mechanisms is the opportunity to publish a paper in a peer-reviewed conference, and getting recognition for working on these problems. The new track at NeurIPS is a necessary step to realize these incentives.

 

Machine Learning

Socially Responsible Data Labeling

Generating a global training dataset while supporting social initiatives and sustainable practices.

Labeling satellite imagery is the process of applying tags to scenes to provide context or confirm information. These labeled training datasets form the basis for machine learning (ML) algorithms. The labeling undertaking (in many cases) requires humans to meticulously and manually assign captions to the data, allowing the model to learn patterns and estimate them for other observations.

For a wide range of Earth observation applications, training data labels can be generated by annotating satellite imagery. Images can be classified to identify the entire image as a class (e.g., water body) or for specific objects within the satellite image. However, annotation tasks can only identify features observable in the imagery. For example, with Sentinel-2 imagery at the 10-meter spatial resolution, one cannot detect the more detailed features of interest, such as crop types but would be able to distinguish large croplands from other land cover classes.

Community Voices, Machine Learning

Igor Ivanov: Harnessing Machine Learning Skills to Reduce Damages from Tropical Storms

A conversation with the First Place winner of the Radiant Earth Tropical Cyclone Wind Estimation Data Competition

We recently announced the Radiant Earth Tropical Cyclone Wind Estimation Data Competition winners, a contest designed to build a machine learning (ML) model to improve NASA IMPACT’s Deep Learning-based Hurricane Intensity Estimator. Seven hundred thirty-three participants leveraged NOAA’s Geostationary Operational Environmental Satellites (GOES) imagery to estimate the wind speeds of storms at different points in time using satellite images captured throughout a storm’s life cycle. In this Q&A, we sat down with Igor Ivanov from Ukraine, winner of the first place Development Seed Award, to talk about his journey to become a data scientist and winning the contest.

Community Voices

Ashiraf Nsibambi Kyabainze: Technology and Social Entrepreneurship in Uganda

A conversation about using technology for a smart value chain to boldly impact food insecurity in Uganda

Meet Ashiraf Nsibambi Kyabainze, the founder of At HAUSE Limited, is a Ugandan entrepreneur working on African technology. The recipient of several awards, including Africa’s Top Young Entrepreneurs Award (RUFORUM) in 2018 and the Mandela Washington Fellowship for Young African Leaders Initiative Network agribusiness champion in 2019, Ashiraf founded At HAUSE to support agricultural and agribusiness workers by improving their packaging with insect-resistant solutions. At HAUSE’s business model reduces crop waste all while ensuring better financial returns for farmers.

Machine Learning

Radiant MLHub Python Client — Beta Release

Using the Python client to discover and download training datasets without managing API requests.

Community Voices, Machine Learning

Celebrating Women Leading the ML4EO Community

Meet the rising stars of women around the world at the forefront of machine learning for Earth observation.

Happy International Women’s Day!

Today, we celebrate the women who break barriers and expand the frontiers of machine learning for Earth observation. This essential field can help us understand the planet’s ecosystem, its different elements, interactions, and changes.

These 15 leading women were selected from 56 outstanding nominations from the ML4EO community. The Radiant Earth Foundation selection committee created a set of criteria to rank the nominees.

Standards

SpatioTemporal Asset Catalog (STAC) 1.0.0-rc.1 Released

Standards

The Path to STAC 1.0.0

It’s almost time! The SpatioTemporal Asset Catalog (STAC) specification has been maturing for over 3 years, and already has a rich ecosystem of tools with hundreds of millions of assets cataloged. The core community has agreed that it’s time to put a pin on it and lock in a super stable specification that can be a core building block for years to come. We believe STAC will be the foundation of something truly special: A transformation to a ‘Cloud Native Geospatial’ world that will open up unimaginable innovation.

Our goal has been to build simple, flexible building blocks to expose geospatial data in order to enable others to build incredible value on top of that. This can only happen if there’s a stable base to rely upon. In the next couple of months, we hope to release 1.0.0. Our goal goes beyond just releasing the specification: We aim to make sure there is a ‘complete’ core ecosystem of tools to make it easy for people to create, use, and get value from STAC. See below for our plan, details on our sprint, and also ways that organizations can directly support STAC.

Machine Learning

Archived Training Dataset Downloads now Available on Radiant MLHub

A little over a year ago, we launched the first iteration of Radiant MLHub in the form of a STAC-compliant API, which allows you to browse our training data collections and list and download individual assets from the items within those collections. Today, we’re announcing the ability to download an archived version of training datasets with just a single-click download. In this post, we’ll describe the process for downloading datasets, the structure of the archived datasets, and provide some tips for effectively traversing the downloaded datasets.

We are now offering three different methods of downloading our datasets. The easiest method, downloading on our registry, can be accessed by navigating to a dataset page and clicking on the “Download” link for each collection you would like to download. Clicking this link will direct you to our dashboard, which will ask you to login if you are not already authenticated and then begin the download process for that collection.

Machine Learning, News

Radiant MLHub in 2021: Realizing a Data Ecosystem

In December 2019, we publicly launched Radiant MLHub, the first open-access cloud-based repository for geospatial training datasets. Since then, we have continuously published new datasets and expanded the ecosystem around Radiant MLHub.

The idea of Radiant MLHub was born in Spring 2018 after several discussions and feedback from members of the community and funders. We had started a new project to develop a global and geographically diverse land cover training dataset using human verification called LandCoverNet. Soon after the launch of LandCoverNet in 2018, we identified a gap in the ecosystem to facilitate publication and uptake of training datasets in our community. That gap in the data value chain led us to the design and implementation of Radiant MLHub.