GeoAIR_icon

iLit4GEE-AI: An Interactive Web Map App for GEE-AI Literature

Abbr List

The following abbreviations are listed in alphabetical order:

Abbr Full Term
ADL Active Deep Learning
AI Artificial Intelligence
AL Active Learning
ALOS Advanced Land Observing Satellite
ANN Artificial Neural Network
API Application Program Interfaces
ASTER Advanced Spaceborne Thermal Emission and Reflection Radiometer
AUC Area Under The Curve
BCLL Biodiversity Characterization at Landscape Level
BGT Bagging Trees
BRT Boosted Regression Tree
BST Boosted Trees
BT Bagged Trees
CART Classification and Regression Tree
CBERS China–Brazil Earth Resources Satellite
CDL Cropland Data Layer
CGLS-LC100 Copernicus Global Land Cover Layer
CNB Continuous NaiveBayes
CNN Convolutional Neural Network
CV Computer Vision
CE Commission Error
CZMIL Coastal Zone Mapping and Imaging LiDAR
DEM Digital Elevation Model
DL Deep Learning
DMSP NTL Defense Meteorological Satellite Program Nighttime Lights
DnCNN Denoising Convolutional Neural Network
DSM Digital Surface Model
DT Decision Tree
DTM Digital Terrain Model
ELR Extreme Learning Machine Regression
ESA European Space Agency
FCN Fully Convolutional Network
FireCCI51 MODIS Fire v 5.1
FROM-GLC Finer Resolution Observation and Monitoring of Global Land Cover
GBRT Gradient Boosted Regression Trees
GDEM Global Digital Elevation Map
GEE Google Earth Engine
GeoAI Geospatial Artificial Intelligence
GFSAD Global Food Security-Support Analysis Data
GHSL Global Human Settlement Layers
GIScience Geographic Information Science
GLCM Gray-Level Co-occurrence Matrix
GLDAS Global Land Data Assimilation System
GREON Great Rivers Ecological Observation Network
GPR Gaussian Process Regression
IKPamir Intersection Kernel Passive Aggressive Method for Information Retrieval
INPE National Institute for Space Research (Brazil)
IoU Intersection over Union
IRS Indian Remote Sensing
KNN K-Nearest Neighbor
Landsat 8 OLI Operational Land Imager
LSTM Long Short-Term Memory
LULC Land Use Land Cover
MAE Mean Absolute Error
Markov-CA Markov-based Cellular Automata
MERIT Multi-Error Removed Improved-Terrain
MCD12Q1 MODIS Land Cover Type
MCD43A1 MODIS Bidirectional Reflectance Distribution Function (BRDF) Model Parameters
MCD43A4 MODIS Nadir BRDF-Adjusted Reflectance (NBAR)
MCD64A1 MODIS Burned Area Product
MIoU Mean Intersection over Union
ML Machine Learning
MLP Multi-Layer Perceptron
MLR Multiple Linear Regression
MNDWI Modified Normalized Difference Water Index
MODIS Moderate Resolution Imaging Spectrometer
MOD09A1 MODIS Terra Surface Reflectance
MOD11A2 MODIS Terra Land Surface Temperature and Emissivity
MOD13Q1 MODIS Vegetation
MOD15A3 MODIS Terra Leaf Area Index/FPARMODIS Vegetation
MSCNN Multiscale Convolutional Neural Network
MTBS Monitoring Trends in Burn Severity dataset
MuWI-R Multi-Spectral Water Index
MYD11A2 MODIS Aqua Land Surface Temperature and Emissivity
NASA National Aeronautics and Space Administration
NASS National Agricultural Statistics Service
NB Naive Bayes
NICFI Norway’s International Climate and Forest Initiative
NN Neural Network
NDVI Normalized Difference Vegetation Index
NDWI Normalized Difference Water Index
OA Overall Accuracy
OE Omission Error
PA Producer’s Accuracy
PB Petabyte
Pegasos Primal Estimated sub-GrAdient SOlver for SVM
PRODES Amazon Deforestation Monitoring Project
PSNR Peak Signal-to-Noise Ratio
QA60 Sentinel 2 Quality Assurance Bitmask Cloud Band
RF Random Forest
RHSeg Recursive Hierarchical Segmentation
RMSE Receiver Operating Characteristic
ROC Root Mean Square Error
RS Remote Sensing
RVM Relevance Vector Machine
SAE Stacked AutoEncoder
SNIC Simple Non-Iterative Clustering
SRTM Shuttle Radar Topography Mission
SSIM Structural Similarity Index
SVM Support Vector Machine
UA User’s Accuracy
UAS Unmanned Aircraft Systems
UAV Unmanned Aerial Vehicle
USDA United States Department of Agriculture
USGS United States Geological Survey
VIIRS NTL Visible Infrared Imaging Radiometer Suite Nighttime Lights

Data Explanation

See below for a brief explanation of some of the data fields used in our web app whose meaning may not be easily conveyed by their names.

*Note that in the data used in our iLit4GEE-AI web app, no GEE review paper is included. The reasons are: (1) review papers do not have much of the content specified in the data fields; (2) also at the time we wrote up the GEE integrated with AI review paper when this web app was designed and developed, there were no review papers focused solely on GEE + AI. Other GEE review papers talk about ML/DL models in GEE papers, but it is not their sole focus like it is for ours and we given much more details than those papers. Other GEE review papers have sections for ML models (e.g., "X papers use RF models and Y use SVMs..."). But again, ours is the most focused and comprehensive in this regard.

Data Field Brief Explanation
Google Scholar Page of Paper

This is the Google scholar page of the paper. On the map pop up (when clicking a paper point on the map, the popped-up info box) and on the data table, which is at the bottom of the web app, there is a clickable URL that will direct you to the paper’s Google Scholar page. This is where readers can find new papers citing each paper, so new paperas can be easily tracked. Note that if a paper does not have a Google Scholar page, we provide the ResearchGate link of the paper, if available.

Cited by URL

Not every paper seems to have a Google Scholar page.However, if a paper has at least one citation, it will have a Google Scholar Cited by URL to allow our users/readers to easily track new publications citing the paper.

Application Focus

This field is for what application a paper focused on (e.g., vegetation, wetland, water mapping).

Method/Application

Method: The authors used novel CV, ML, or DL methods. Note that we determined this based on the CV, ML, DL part of the analysis, not on pre-processing methods even if they were novel. So this value is used to represent a paper that moves forward the integration of GEE with AI and its branches, CV/ML/DL.
Application: This could look like the authors using ready-to-use models on GEE with their own data or data they found on GEE (“plug-and-play”).
(Note: this field is one of the three fields used to build the methods hierarchical sunburst chart)

Evaluation Metrics

The metrics the authors used to evaluate their model(s) and prediction(s).

Models Compared

The CV algorithms and/or ML/DL models the authors use in their paper.

Study Area

The location where the study focused on. If there is more than one place analyzed, we list the countries separated by a comma. If it is at global scale, we provide the value “global”.

Computed on Cloud

The cloud computing platform the authors use, if any. Note that not all authors use GEE for processing or analyzing data. Some authors simply used the platform for finding and downloading data, in which case this column should be empty (i.e., blank). Also, GEE is not the only possible value, though it is the most common. For example, one set of authors combined GEE and Google Cloud AI, where a few used Google Colab and Google Drive. Note that, if authors provide hardware specifications (e.g., GPU, RAM, CPU, and runtimes) about the cloud platform they used, we also report the information here.

Computed Offline

This column is for specifying what hardware (e.g., GPU, CPU, RAM, hard disk, runtimes) authors used if they completed part of their analysis offline. However, the majority of studies do not report hardware specifications, runtimes, etc. so this column will be listed as “NS”, which refers to “not specified”. For those only used cloud platforms and no local computers involved, the entry will be “NA” for “not applicable”.

Other Software

This column is useful for identifying which software, including libraries (if specified), authors use to complete their analysis. Software listed here are for both online and/or offline analyses. Examples are: ArcGIS, R, Python, MATLAB, eCognition, etc. Some authors do not specify which software they use to complete their analysis, in which case “NS” or “offline/NS” are listed, which mean “not specified” and “offline - not specified” respectively. Note: if a publication only> used Google Earth Engine or its APIs, this is not listed under this “Other Software” column, but instead given a value of “NA” for “not applicable”. Instead, it is included in the “Computed on Cloud” column because GEE is both a cloud computing and online software platform. Additionally, not all authors specify whether they use JavaScript, Python, or R when working specifically with GEE (and not other software packages within those programming languages), so it would be difficult to specify in any useful detail how authors were using GEE software for this column.

RS Data Type

The types of RS data or RS-derived data products the authors used in their analysis, either for direct analysis and processing or for validation purposes. (Note: This does not include whether or not the authors used field data or some other type of non-RS data.)

Method (macro)

Whether or not the authors used machine learning (ML), deep learning (DL), and/or computer vision (CV). If a paper uses more than one method, we list them and separate them by a comma.
(Note: this field is one of the three fields used to build the methods hierarchical sunburst chart)

Method (detailed)

Classification, segmentation, regression, or some combination of these choices, separated by a comma.
(Note: this field is one of the three fields used to build the methods hierarchical sunburst chart)

Data Structure

In order to facilitate the development and optimization of the web apps, we formulated web app design, aligning it with our desired deliverables for intended users. Then we collected and organized pertinent data in a CSV format encompassing relevant data columns, essential for generating diverse visual representations. Here we provide the data structure that we used to collect data. Data column names are listed under the information category such as basic information of the paper, and information of authors.

1 Basic information of the paper

Column Note
title

Paper title

openAccessPaper_pdf_link

Link of paper’s pdf file

journal

Journal of the paper

journal_link

Link of the journal

year

Year of publication

paper_citedBy_url

Not every paper seems to have a Google Scholar page. However, if a paper has at least one citation, it will have a Google Scholar Cited by URL to allow our users/readers to easily track new publications citing the paper.

paper_citedBy_AsOf_dataEntryDate

Google Scholar Cited by page. Not every paper seems to have a Google Scholar page. However, if a paper has at least one citation, it will have a Google Scholar Cited by URL to allow our users/readers to easily track new publications citing the paper.

paper_GS_url

This is the Google Scholar page of the paper. On the map pop-up (when clicking a paper point on the map, the popped-up info box) and on the data table, which is at the bottom of the web app, there is a clickable URL that will direct you to the paper’s Google Scholar page. This is where readers can find new papers citing each paper, so new papers can be easily tracked. Note that if a paper does not have a Google Scholar page, we provide the ResearchGate link of the paper, if available.

2 Information of authors

Column Note
authors

All author names of the corresponding paper

first_author

First author name

first_author_GS_profile

First author Google Scholar (GS) profile

first_author_institution

First author institution

firstAuthor_institution_country

First author institution country

first_author_institution_lat

First author institution country latitude

first_author_institution_lon

First author institution country longitude

3 Paper content related information

Column Note
method_or_application_oriented

Method: The authors used novel CV, ML, or DL methods. Note that we determined this based on the CV, ML, and DL part of the analysis, not on pre-processing methods (even if they themselves were novel). So this value is used to represent a paper that moves forward the integration of GEE with AI and its branches, CV/ML/DL. Application: This could look like the authors using ready-to-use models on GEE with their own data or data they found on GEE (i.e., “plug-and-play”).

application_focus

This field is for what application a paper focused on (e.g., vegetation, wetland, water mapping)

abstract

Paper abstract

keyword_1

Paper’s first keyword

keyword_2

Paper’s second keyword

keyword_3

Paper’s third keyword

​​keyword_4

Paper’s fourth keyword

keyword_5

Paper’s fifth keyword

RSDataType

The types of RS data or RS-derived data products the authors used in their analysis, either for direct analysis and processing or for validation purposes. (Note: This does not include whether or not the authors used field data or some other type of non-RS data.)

studyArea

The location where the study focused on. If there is more than one place analyzed, we list the countries separated by a comma. If it is at global scale, we provide the value “global”.

method_macro

Whether or not the authors used ML, DL, and/or CV. If a paper uses more than one method, we list them and separate them by a comma.

method_detailed

Classification, segmentation, regression, or some combination of these choices, separated by a comma.

modelsCompared

The CV algorithms and/or ML/DL models the authors use in their paper.

compute_cloud_platform

The cloud computing platform the authors use, if any. Note that not all authors use GEE for processing or analyzing data. Some authors simply used the platform for finding and downloading data, in which case this column should be empty (i.e., blank). Also, GEE is not the only possible value, though it is the most common. For example, one set of authors combined GEE and Google Cloud AI, where a few used Google Colab and Google Drive. Note that, if authors provide hardware specifications (e.g., GPU, RAM, CPU, and runtimes) about the cloud platform they used, we also report the information here.

compute_offline_hardware

This column is for specifying what hardware (e.g., GPU, CPU, RAM, hard disk, runtimes) authors used if they completed part of their analysis offline. However, the majority of studies do not report hardware specifications, runtimes, etc. so this column will be listed as “NS”, which refers to “not specified”. For those only used cloud platforms and no local computers involved, the entry will be “NA” for “not applicable”.

software

This column is useful for identifying which software, including libraries (if specified), authors use to complete their analysis (other than Google Earth Engine APIs, in which case we list “GEE” as the software entry). Note: software listed here are for both online and/or offline analyses. Examples are: ArcGIS, R, Python, MATLAB, eCognition, etc. Some authors do not specify which software they use to complete their analysis, in which case “NS” is listed, which means “not specified”.

evaluationMetrics

The metrics the authors used to evaluate their model(s) and prediction(s).

keywords

The column is merged from the following raw columns: keyword_1, keyword_2, keyword_3, keyword_4, keyword_5.