iLit4GEE-AI

Abbr List

The following abbreviations are listed in alphabetical order:

Abbr	Full Term
ADL	Active Deep Learning
AI	Artificial Intelligence
AL	Active Learning
ALOS	Advanced Land Observing Satellite
ANN	Artificial Neural Network
API	Application Program Interfaces
ASTER	Advanced Spaceborne Thermal Emission and Reflection Radiometer
AUC	Area Under The Curve
BCLL	Biodiversity Characterization at Landscape Level
BGT	Bagging Trees
BRT	Boosted Regression Tree
BST	Boosted Trees
BT	Bagged Trees
CART	Classification and Regression Tree
CBERS	China–Brazil Earth Resources Satellite
CDL	Cropland Data Layer
CGLS-LC100	Copernicus Global Land Cover Layer
CNB	Continuous NaiveBayes
CNN	Convolutional Neural Network
CV	Computer Vision
CE	Commission Error
CZMIL	Coastal Zone Mapping and Imaging LiDAR
DEM	Digital Elevation Model
DL	Deep Learning
DMSP NTL	Defense Meteorological Satellite Program Nighttime Lights
DnCNN	Denoising Convolutional Neural Network
DSM	Digital Surface Model
DT	Decision Tree
DTM	Digital Terrain Model
ELR	Extreme Learning Machine Regression
ESA	European Space Agency
FCN	Fully Convolutional Network
FireCCI51	MODIS Fire v 5.1
FROM-GLC	Finer Resolution Observation and Monitoring of Global Land Cover
GBRT	Gradient Boosted Regression Trees
GDEM	Global Digital Elevation Map
GEE	Google Earth Engine
GeoAI	Geospatial Artificial Intelligence
GFSAD	Global Food Security-Support Analysis Data
GHSL	Global Human Settlement Layers
GIScience	Geographic Information Science
GLCM	Gray-Level Co-occurrence Matrix
GLDAS	Global Land Data Assimilation System
GREON	Great Rivers Ecological Observation Network
GPR	Gaussian Process Regression
IKPamir	Intersection Kernel Passive Aggressive Method for Information Retrieval
INPE	National Institute for Space Research (Brazil)
IoU	Intersection over Union
IRS	Indian Remote Sensing
KNN	K-Nearest Neighbor
Landsat 8 OLI	Operational Land Imager
LSTM	Long Short-Term Memory
LULC	Land Use Land Cover
MAE	Mean Absolute Error
Markov-CA	Markov-based Cellular Automata
MERIT	Multi-Error Removed Improved-Terrain
MCD12Q1	MODIS Land Cover Type
MCD43A1	MODIS Bidirectional Reflectance Distribution Function (BRDF) Model Parameters
MCD43A4	MODIS Nadir BRDF-Adjusted Reflectance (NBAR)
MCD64A1	MODIS Burned Area Product
MIoU	Mean Intersection over Union
ML	Machine Learning
MLP	Multi-Layer Perceptron
MLR	Multiple Linear Regression
MNDWI	Modified Normalized Difference Water Index
MODIS	Moderate Resolution Imaging Spectrometer
MOD09A1	MODIS Terra Surface Reflectance
MOD11A2	MODIS Terra Land Surface Temperature and Emissivity
MOD13Q1	MODIS Vegetation
MOD15A3	MODIS Terra Leaf Area Index/FPARMODIS Vegetation
MSCNN	Multiscale Convolutional Neural Network
MTBS	Monitoring Trends in Burn Severity dataset
MuWI-R	Multi-Spectral Water Index
MYD11A2	MODIS Aqua Land Surface Temperature and Emissivity
NASA	National Aeronautics and Space Administration
NASS	National Agricultural Statistics Service
NB	Naive Bayes
NICFI	Norway’s International Climate and Forest Initiative
NN	Neural Network
NDVI	Normalized Difference Vegetation Index
NDWI	Normalized Difference Water Index
OA	Overall Accuracy
OE	Omission Error
PA	Producer’s Accuracy
PB	Petabyte
Pegasos	Primal Estimated sub-GrAdient SOlver for SVM
PRODES	Amazon Deforestation Monitoring Project
PSNR	Peak Signal-to-Noise Ratio
QA60	Sentinel 2 Quality Assurance Bitmask Cloud Band
RF	Random Forest
RHSeg	Recursive Hierarchical Segmentation
RMSE	Receiver Operating Characteristic
ROC	Root Mean Square Error
RS	Remote Sensing
RVM	Relevance Vector Machine
SAE	Stacked AutoEncoder
SNIC	Simple Non-Iterative Clustering
SRTM	Shuttle Radar Topography Mission
SSIM	Structural Similarity Index
SVM	Support Vector Machine
UA	User’s Accuracy
UAS	Unmanned Aircraft Systems
UAV	Unmanned Aerial Vehicle
USDA	United States Department of Agriculture
USGS	United States Geological Survey
VIIRS NTL	Visible Infrared Imaging Radiometer Suite Nighttime Lights

Data Explanation

See below for a brief explanation of some of the data fields used in our web app whose meaning may not be easily conveyed by their names.

*Note that in the data used in our iLit4GEE-AI web app, no GEE review paper is included. The reasons are: (1) review papers do not have much of the content specified in the data fields; (2) also at the time we wrote up the GEE integrated with AI review paper when this web app was designed and developed, there were no review papers focused solely on GEE + AI. Other GEE review papers talk about ML/DL models in GEE papers, but it is not their sole focus like it is for ours and we given much more details than those papers. Other GEE review papers have sections for ML models (e.g., "X papers use RF models and Y use SVMs..."). But again, ours is the most focused and comprehensive in this regard.

Data Field	Brief Explanation
Google Scholar Page of Paper	This is the Google scholar page of the paper. On the map pop up (when clicking a paper point on the map, the popped-up info box) and on the data table, which is at the bottom of the web app, there is a clickable URL that will direct you to the paper’s Google Scholar page. This is where readers can find new papers citing each paper, so new paperas can be easily tracked. Note that if a paper does not have a Google Scholar page, we provide the ResearchGate link of the paper, if available.
Cited by URL	Not every paper seems to have a Google Scholar page.However, if a paper has at least one citation, it will have a Google Scholar Cited by URL to allow our users/readers to easily track new publications citing the paper.
Application Focus	This field is for what application a paper focused on (e.g., vegetation, wetland, water mapping).
Method/Application	Method: The authors used novel CV, ML, or DL methods. Note that we determined this based on the CV, ML, DL part of the analysis, not on pre-processing methods even if they were novel. So this value is used to represent a paper that moves forward the integration of GEE with AI and its branches, CV/ML/DL. Application: This could look like the authors using ready-to-use models on GEE with their own data or data they found on GEE (“plug-and-play”). (Note: this field is one of the three fields used to build the methods hierarchical sunburst chart)
Evaluation Metrics	The metrics the authors used to evaluate their model(s) and prediction(s).
Models Compared	The CV algorithms and/or ML/DL models the authors use in their paper.
Study Area	The location where the study focused on. If there is more than one place analyzed, we list the countries separated by a comma. If it is at global scale, we provide the value “global”.
Computed on Cloud	The cloud computing platform the authors use, if any. Note that not all authors use GEE for processing or analyzing data. Some authors simply used the platform for finding and downloading data, in which case this column should be empty (i.e., blank). Also, GEE is not the only possible value, though it is the most common. For example, one set of authors combined GEE and Google Cloud AI, where a few used Google Colab and Google Drive. Note that, if authors provide hardware specifications (e.g., GPU, RAM, CPU, and runtimes) about the cloud platform they used, we also report the information here.
Computed Offline	This column is for specifying what hardware (e.g., GPU, CPU, RAM, hard disk, runtimes) authors used if they completed part of their analysis offline. However, the majority of studies do not report hardware specifications, runtimes, etc. so this column will be listed as “NS”, which refers to “not specified”. For those only used cloud platforms and no local computers involved, the entry will be “NA” for “not applicable”.
Other Software	This column is useful for identifying which software, including libraries (if specified), authors use to complete their analysis. Software listed here are for both online and/or offline analyses. Examples are: ArcGIS, R, Python, MATLAB, eCognition, etc. Some authors do not specify which software they use to complete their analysis, in which case “NS” or “offline/NS” are listed, which mean “not specified” and “offline - not specified” respectively. Note: if a publication *only*> used Google Earth Engine or its APIs, this is not listed under this “Other Software” column, but instead given a value of “NA” for “not applicable”. Instead, it is included in the “Computed on Cloud” column because GEE is both a cloud computing and online software platform. Additionally, not all authors specify whether they use JavaScript, Python, or R when working specifically with GEE (and not other software packages within those programming languages), so it would be difficult to specify in any useful detail how authors were using GEE software for this column.
RS Data Type	The types of RS data or RS-derived data products the authors used in their analysis, either for direct analysis and processing or for validation purposes. (Note: This does not include whether or not the authors used field data or some other type of non-RS data.)
Method (macro)	Whether or not the authors used machine learning (ML), deep learning (DL), and/or computer vision (CV). If a paper uses more than one method, we list them and separate them by a comma. (Note: this field is one of the three fields used to build the methods hierarchical sunburst chart)
Method (detailed)	Classification, segmentation, regression, or some combination of these choices, separated by a comma. (Note: this field is one of the three fields used to build the methods hierarchical sunburst chart)

Data Structure

In order to facilitate the development and optimization of the web apps, we formulated web app design, aligning it with our desired deliverables for intended users. Then we collected and organized pertinent data in a CSV format encompassing relevant data columns, essential for generating diverse visual representations. Here we provide the data structure that we used to collect data. Data column names are listed under the information category such as basic information of the paper, and information of authors.

1 Basic information of the paper

Column	Note
title	Paper title
openAccessPaper_pdf_link	Link of paper’s pdf file
journal	Journal of the paper
journal_link	Link of the journal
year	Year of publication
paper_citedBy_url	Not every paper seems to have a Google Scholar page. However, if a paper has at least one citation, it will have a Google Scholar Cited by URL to allow our users/readers to easily track new publications citing the paper.
paper_citedBy_AsOf_dataEntryDate	Google Scholar Cited by page. Not every paper seems to have a Google Scholar page. However, if a paper has at least one citation, it will have a Google Scholar Cited by URL to allow our users/readers to easily track new publications citing the paper.
paper_GS_url	This is the Google Scholar page of the paper. On the map pop-up (when clicking a paper point on the map, the popped-up info box) and on the data table, which is at the bottom of the web app, there is a clickable URL that will direct you to the paper’s Google Scholar page. This is where readers can find new papers citing each paper, so new papers can be easily tracked. Note that if a paper does not have a Google Scholar page, we provide the ResearchGate link of the paper, if available.

2 Information of authors

Column	Note
authors	All author names of the corresponding paper
first_author	First author name
first_author_GS_profile	First author Google Scholar (GS) profile
first_author_institution	First author institution
firstAuthor_institution_country	First author institution country
first_author_institution_lat	First author institution country latitude
first_author_institution_lon	First author institution country longitude

3 Paper content related information

Column	Note
method_or_application_oriented	Method: The authors used novel CV, ML, or DL methods. Note that we determined this based on the CV, ML, and DL part of the analysis, not on pre-processing methods (even if they themselves were novel). So this value is used to represent a paper that moves forward the integration of GEE with AI and its branches, CV/ML/DL. Application: This could look like the authors using ready-to-use models on GEE with their own data or data they found on GEE (i.e., “plug-and-play”).
application_focus	This field is for what application a paper focused on (e.g., vegetation, wetland, water mapping)
abstract	Paper abstract
keyword_1	Paper’s first keyword
keyword_2	Paper’s second keyword
keyword_3	Paper’s third keyword
keyword_4	Paper’s fourth keyword
keyword_5	Paper’s fifth keyword
RSDataType	The types of RS data or RS-derived data products the authors used in their analysis, either for direct analysis and processing or for validation purposes. (Note: This does not include whether or not the authors used field data or some other type of non-RS data.)
studyArea	The location where the study focused on. If there is more than one place analyzed, we list the countries separated by a comma. If it is at global scale, we provide the value “global”.
method_macro	Whether or not the authors used ML, DL, and/or CV. If a paper uses more than one method, we list them and separate them by a comma.
method_detailed	Classification, segmentation, regression, or some combination of these choices, separated by a comma.
modelsCompared	The CV algorithms and/or ML/DL models the authors use in their paper.
compute_cloud_platform	The cloud computing platform the authors use, if any. Note that not all authors use GEE for processing or analyzing data. Some authors simply used the platform for finding and downloading data, in which case this column should be empty (i.e., blank). Also, GEE is not the only possible value, though it is the most common. For example, one set of authors combined GEE and Google Cloud AI, where a few used Google Colab and Google Drive. Note that, if authors provide hardware specifications (e.g., GPU, RAM, CPU, and runtimes) about the cloud platform they used, we also report the information here.
compute_offline_hardware	This column is for specifying what hardware (e.g., GPU, CPU, RAM, hard disk, runtimes) authors used if they completed part of their analysis offline. However, the majority of studies do not report hardware specifications, runtimes, etc. so this column will be listed as “NS”, which refers to “not specified”. For those only used cloud platforms and no local computers involved, the entry will be “NA” for “not applicable”.
software	This column is useful for identifying which software, including libraries (if specified), authors use to complete their analysis (other than Google Earth Engine APIs, in which case we list “GEE” as the software entry). Note: software listed here are for both online and/or offline analyses. Examples are: ArcGIS, R, Python, MATLAB, eCognition, etc. Some authors do not specify which software they use to complete their analysis, in which case “NS” is listed, which means “not specified”.
evaluationMetrics	The metrics the authors used to evaluate their model(s) and prediction(s).
keywords	The column is merged from the following raw columns: keyword_1, keyword_2, keyword_3, keyword_4, keyword_5.

iLit4GEE-AI: An Interactive Web Map App for GEE-AI Literature