Satellite Images Computer Vision
Table of Contents
Expertise
We are part of a large consortium of universities and companies with advantage expertise in processing satellite images with computer vision and artificial intelligence. Our expertise spans many areas:
Available Dataset Knowhow
We know existing datasets, their strengths and deficiencies. As an example consider a newly announced dataset (worldstrat) that allow the development of machine learning algorithms of various kind, as superresolution. This allows us to bring together our expertise in advanced deep learning algorithms and apply it to satellite images. From the original paper (open review link), (PDF Downdload)
The largest and most varied such publicly available dataset, at Airbus SPOT 6/7 satellites’ high resolution of up to 1.5 m/pixel, empowered by European Space Agency’s Phi-Lab as part of the ESA-funded QueryPlanet project, we curate 10,000 sq km of unique locations to ensure stratified representation of all types of land-use across the world: from agriculture to ice caps, from forests to multiple urbanization densities. We also enrich those with locations typically under-represented in ML datasets: sites of humanitarian interest, illegal mining sites, and settlements of persons at risk. We temporally-match each high-resolution image with multiple low-resolution images from the freely accessible lower-resolution Sentinel-2 satellites at 10 m/pixel.
This is just one of multiple dataset that we use in our research with our research partners. In Figure 1 you can see a visual summary of the construction of the dataset.
As an example here some recent useful and interesting datasets that can be used for machine learning and generally any computer vision task (list partially adapted from github link). Images and demos are taken from the dataset websites to give an example of the possibilities.
Recent Satellite Image Datasets
-
FloodNet (University of Maryland, Jun 2021)
2343 image chips (drone imagery), 10 landcover categories (background, water, building flooded, building non-flooded, road-flooded, others). Paper: Rahnemoonfar et al., 2021 -
PASTIS : Panoptic Agricultural Satellite TIme Series (IGN, July 2021)
124,422 Agricultural parcels, 2,433 Sentinel-2 image chip timeseries, France, panoptic labels (instance index + semantic label for each pixel). Paper: Garnot & Landrieu 2021 -
xView3 Dark Vessel Detection 2021 (xView3 Team, Aug 2021)
Maritime object bounding boxes for 1k Sentinel-1 scenes (VH & VV polarizations), ancillary data (land/ice mask, bathymetry, wind speed, direction, quality).
Object Detection Datasets
-
Airbus Aircraft Detection (Airbus, Mar 2021)
Aircraft bounding boxes, 103 images of worlwide airports (Pleiades, 0.5m res., 2560px).
-
Airbus Oil Storage Detection (Airbus, Mar 2021)
Oil storage tank annotations, 98 worldwide images (SPOT, 1.2m res., 2560px). -
xView3 Dark Vessel Detection 2021 (xView3 Team, Aug 2021)
Maritime object bounding boxes for 1k Sentinel-1 scenes (VH & VV polarizations), ancillary data (land/ice mask, bathymetry, wind speed, direction, quality). -
AFO - Aerial dataset of floating objects (Ga̧sienica-Józkowy et al, Jun 2020)
3647 drone images from 50 scenes, 39991 objects with 6 categories (human, wind/sup-board, boat, bouy, sailboat, kayak), Darknet YOLO format, Paper: Authors: Ga̧sienica-Józkowy et al. 2021 -
NEON Tree Crowns Dataset (Weinstein et al., 2020)
Individual tree crown objects, height&area estimates, 100 million instances, 37 geographic sites across the US, DeepForest Python package, Paper: Weinstein et al. 2020 -
xView 2018 Detection Challenge (DIUx, Jul 2018)
60 categories from helicopter to stadium, 1 million instances, Worldview-3 imagery (0.3m res.), COCO data format, pre-trained Tensorflow and Pytorch baseline models, Paper: Lam et al. 2018 -
Open AI Challenge: Aerial Imagery of South Pacific Islands (WeRobotics & Worldbank, May 2018)
Tree position & 4 tree species, RGB UAV imagery (0.4m/0.8m res.), multiple AOIs in Tonga -
NIST DSE Plant Identification with NEON Remote Sensing Data (inria.fr, Oct 2017)
Tree position, tree species and crown parameters, hyperspectral (1m res.) & RGB imagery (0.25m res.), LiDAR point cloud and canopy height model -
NOAA Fisheries Steller Sea Lion Population Count (NOAA, Jun 2017)
5 sea lion categories, ~ 80k instances, ~ 1k aerial images, Kaggle kernels -
Stanford Drone Data (Stanford University, Oct 2016)
60 aerial UAV videos over Stanford campus and bounding boxes, 6 classes (Pedestrian, Biker, Skateboarder, Cart, Car, Bus), Paper: Robicquet et al. 2016 -
Cars Overhead With Context (COWC) (Lawrence Livermore National Laboratory, Sep 2016)
32k car bounding boxes, aerial imagery (0.15m res.), 6 cities, Paper: Mundhenk et al. 2016
Semantic Segmentation Datasets
3. Semantic Segmentation
-
FloodNet (University of Maryland, Jun 2021)
2343 image chips (drone imagery), 10 landcover categories (background, water, building flooded, building non-flooded, road-flooded, …). Paper: Rahnemoonfar et al., 2021 -
LoveDA (Wuhan University, Oct 2021)
5987 image chips (Google Earth), 7 landcover categories, 166768 labels, 3 cities in China. Paper: Wang et al., 2021 -
Sentinel-2 Cloud Mask Catalogue (Francis, A., et al., Nov 2020) 513 cropped subscenes (1022x1022 pixels) taken randomly from entire 2018 Sentinel-2 archive. All bands resampled to 20m, stored as numpy arrays. Includes clear, cloud and cloud-shadow classes. Also comes with binary classification tags for each subscene, describing what surface types, cloud types, etc. are present.
-
LandCover.ai: Dataset for Automatic Mapping of Buildings, Woodlands and Water from Aerial Imagery (Boguszewski, A., et al., May 2020) 41 orthophotos (9000x9000 px) over Poland, Aerial Imagery (25cm & 50cm res.), manual segmentations masks for Buildings, Woodland and Water, Paper: Boguszewski et al., 2020
-
95-Cloud: A Cloud Segmentation Dataset (S. Mohajerani et. all, Jan 2020)
34701 manually segmented 384x384 patches with cloud masks, Landsat 8 imagery (R,G,B,NIR; 30 m res.), Paper: Mohajerani et al. 2021
-
Open Cities AI Challenge (GFDRR, Mar 2020) .
790k building footprints from Openstreetmap (2 label quality categories), aerial imagery (0.03-0.2m resolution, RGB, 11k 1024x1024 chips, COG format), 10 cities in Africa. -
DroneDeploy Segmentation Dataset (DroneDeploy, Dec 2019)
Drone imagery (0.1m res., RGB), labels (7 land cover catageories: building, clutter, vegetation, water, ground, car) & elevation data, baseline model implementation. -
SkyScapes: Urban infrastructure & lane markings (DLR, Nov 2019)
Highly accurate street lane markings (12 categories e.g. dash line, long line, zebra zone) & urban infrastructure (19 categories e.g. buildings, roads, vegetation). Aerial imagery (0.13 m res.) for 5.7 km2 of Munich, Germany. Paper: Azimi et al. 2019 -
SpaceNet 5: Automated Road Network Extraction & Route Travel Time Estimation (CosmiQ Works, Maxar, Intel, AWS, Sep 2019)
2300 image chips, street geometries with location, shape and estimated travel time, 3/8band Worldview-3 imagery (0.3m res.), 4 global cities, 1 holdout city for leaderboard evaluation, APLS metric, baseline model
-
SEN12MS (TUM, Jun 2019)
180,748 corresponding image triplets containing Sentinel-1 (VV&VH), Sentinel-2 (all bands, cloud-free), and MODIS-derived land cover maps (IGBP, LCCS, 17 classes, 500m res.). All data upsampled to 10m res., georeferenced, covering all continents and meterological seasons, Paper: Schmitt et al. 2018 -
Slovenia Land Cover Classification (Sinergise, Feb 2019)
10 land cover classes, temporal stack of hyperspectral Sentinel-2 imagery (R,G,B,NIR,SWIR1,SWIR2; 10 m res.) for year 2017 with cloud masks, Official Slovenian land use land cover layer as ground truth.
-
ALCD Reference Cloud Masks (CNES, Oct 2018)
8 classes (inc. cloud and cloud shadow) for 38 Sentinel-2 scenes (10 m res.). Manual labeling & active learning, Paper: Baetens et al. 2019 -
RoadNet (Wuhan, Oct 2018)
Road network labels, high-res Google Earth imagery, 21 regions, Paper: Liu et al. 2018 -
SpaceNet 3: Road Network Detection (CosmiQ Works, Radiant Solutions, Feb 2018)
8000 km of roads in 5 city aois, 3/8band Worldview-3 imagery (0.3m res.), SpaceNet Challenge Asset Library, Paper: Van Etten et al. 2018 -
DSTL Satellite Imagery Feature Detection Challenge (Dstl, Feb 2017)
10 land cover categories from crops to vehicle small, 57 1x1km images, 3/16-band Worldview 3 imagery (0.3m-7.5m res.), Kaggle kernels -
SPARCS: S2 Cloud Validation data (USGS, 2016)
7 categories (cloud, cloud shadows, cloud shadows over water, water etc.), 80 1kx1k px. subset Landsat 8 scenes (30m res.), Paper: Hughes, J.M. & Hayes D.J. 2014 -
Biome: L8 Cloud Cover Validation data (USGS, 2016)
4 cloud categories (cloud, thin cloud, cloud shadows, clear), 96 Landsat 8 scenes (30m res.), 12 biomes with 8 scenes each, Paper: Foga et al. 2017 -
Inria Aerial Image Labeling (inria.fr)
Building footprint masks, RGB aerial imagery (0.3m res.), 5 cities
Research Partnership for Fast Technology Customisation
We partner with researchers and companies with a highly advanced expertise in satellite image analysis. In the figure below you can see an example of an application developed by Graniot that has uses satellite images to do precision agriculture and can extract information from images and compare them between different moments of time. We work with Graniot to adapt this technology to different use cases. We can customise it to your use cases, by starting from a sound and working base.
Their technology allow a comparison of the images and the extracted information at different moments in time (depending on the availablility of the images of course). In Figure 3 you can see for example comparison of ground humidity of a piece of land.