The Cropland Data Layer (CDL) is a data product produced by the National Agricultural Statistics Service of U.S. Department of Agriculture. It provides geo-referenced, high accuracy, 30 or 56 meter resolution, crop-specific cropland land cover information for up to 48 contiguous states in the U.S. from 1997 to the present. This data product has been extensively used in agricultural research [1].
CropScape is an interactive Web CDL exploring system, and it was developed to query, visualize, disseminate, and analyze CDL data geospatially through standard geospatial Web services in a publicly accessible online environment[2]. The development of the CropScapeR
package is to allow R users to easily utilize the geospatial processing services provided by CropScape, so that they can effectively and efficiently access and analyze the CDL data.
We implement four geospatial processing services provided by CropScape in R
:
GetCDLValue
/GetCDLFile
The GetCDLValue
service finds the pixel value at a given location (defined by a coordinate), and the GetCDLFile
service fetches irregularly shaped CDL data[2]. The GetCDLValue
and GetCDLFile
services are implemented in R
by one function: GetCDLData
. The GetCDLData
function takes an Area of Interest (AOI) and a year value as inputs and return the requested CDL raster data. This function does the data request in two steps. First, the function sends data requests to the CropScape online server using the GET
function from the httr
package. Second, the function reads the requested data into R
using the raster
function from the raster
package. By default, the data returned from the CropScape are in the raster-based GeoTIFF file format. Users can choose to save the raw data in TIF format into their local drives.
GetCDLImage
The GetCDLImage
service generates the preview images of the customized CDL data and the Keyhole Markup Language (KML) file with links to actual images that can be displayed in Google Earth[2]. This service is implemented by the GetCDLImage
function.
GetCDLStat
The GetCDLImage
service generates statistical information (for example, value, category name, and acreage) of the CDL data of an AOI[2]. This service is implemented by the GetCDLStat
function.
GetCDLComp
The GetCDLComp
service performs cropland change analysis by comparing the pixels of the cropland area defined by AOI between two given years[2]. This service is implemented by the GetCDLComp
function.
The four functions introduced above take three necessary inputs to work: aoi
, year
, type
. An API key is not required.
aoi
: Area of Interest. An AOI can take various shapes. Specifically, it can be:
sf
object (type = ‘b’);year
: a year value.type
: Type of the AOI. See above.Here I provide some examples to demonstrate how to use the package based on the GetCDLData
function. This is probably the most useful function since it acquires data from the CropScape
. The usage of the other three functions (GetCDLImage
, GetCDLStat
, GetCDLComp
) are similar to the usage of GetCDLData
.
# State of Connecticut, FIPS code: 09
data <- GetCDLData(aoi = '09', year = 2018, type = 'f')
# State of Connecticut, FIPS code: 10
data <- GetCDLData(aoi = 10, year = 2018, type = 'f')
Note: The input of aoi
shall be a 2-digit state FIPS code. Usually, users can provide a numeric value. In cases that the FIPS code starts with a zero, users can specify the aoi
as a character.
#### Get data for a county
# Champaign county in Illinois, FIPS code: 17019
data <- GetCDLData(aoi = 17019, year = 2018, type = 'f')
The AOI should be a numeric vector with 4 elements. The format to define the box is (min x, min y, max x, max y).
data <- GetCDLData(aoi = c(130783,2203171,153923,2217961), year = 2018, type = 'b')
Note that the default coordinate system is the Albers projection system. Users can use an alternative coordinate system by specifying it in the crs
argument. The GetCDLData
function would convert the points under the specified coordinate system back to the Albers projection system and then make data requests. For example, the following example uses the commonly used longitude/latitude to request data, and the corresponding crs
is ‘+init=epsg:4326’.
data <- GetCDLData(aoi = c(-88.2, 40.03, -88.1, 40.1), year = '2018', type = 'b', crs = '+init=epsg:4326')
If users have a shapefile, users can extract the coordinates of bounding box of the shapefile and then make data request:
# Read the shapefile into R
spdata <- sf::st_read("Your shapefile. File name ended with .shp")
# Extract data (Use sf::st_bbox function to extract bounding box points)
data <- GetCDLData(aoi = sf::st_bbox(spdata), year = '2018', type = 'b')
# Only use the data inside the shape
data_shape <- raster::mask(data, spdata)
The above example assumes that the shapefile has the Albers projection system. If not, make sure that you specify the correct system (same to the shapefile) in crs
.
The GetCDLData
function depends on the sf
package to process the spatial data, and it can take a sf
object as input. When a sf
object is used as the aoi
, the GetCDLData
function would extract bounding box points from theobject and then request for the data. Here is an example:
# Extract coordinates for the Champaign county using the us_map function.
counties_df <- usmap::us_map(regions = "counties", include = 17019)
# Specify projection system used in the us_map function.
crs <- '+proj=laea +lat_0=45 +lon_0=-100 +x_0=0 +y_0=0 +a=6370997 +b=6370997 +units=m +no_defs'
# Create a sf object
test_sf <- sf::st_as_sf(counties_df, coords = c('x', 'y'), crs = crs)
# Get CDL data
data <- GetCDLData(aoi = test_sf, year = 2018, type = 'b')
The AOI should be a numeric vector with at least 6 elements. The format to define the polygon is (x1, y2, x2, y2, …, xn, yn).
# A triangle area defined by 3 coordinates
data <- GetCDLData(aoi = c(175207,2219600,175207,2235525,213693,2219600), year = '2018', type = 'ps')
The AOI should be a numeric vector with 2 elements. The format to define the point is (x, y).
data <- GetCDLData(aoi = c(-94.6754,42.1197), year = 2018, type = 'p', crs = '+init=epsg:4326')
The CropScape server takes shapefile as an AOI to make data request. Yet, it requires users to provide a URL of a compressed ESRI shapefile. The .shp, .shx, .dbf, and .prj files must all be compressed with no subdirectories in a single ZIP file. In cases that the compressed shapefile is saved in the local disk, this shapefile needs to be published to a website URL (so CropScape can read the shapefile).
There are many ways to generate a URL for a file saved in the computer. Here is an example. Assume that you save the zipped shapefile in Dropbox. First, create a link to the file by right clicking on the file and then selecting ‘Copy Dropbox link’. In my case, I get this link: https://www.dropbox.com/s/cvcxjpyakxfyfpm/York_SF_watershed.zip?dl=0 Second, remove ‘?dl=0’ from the link content to make sure that the link ends with ‘zip’. Then copy to R
and make data request:
link <- 'https://www.dropbox.com/s/cvcxjpyakxfyfpm/York_SF_watershed.zip'
data <- GetCDLData(aoi = link, year = 2018, type = 's')
Because the zipped shapefile is directly sent to CropScape, the GetCDLData
function cannot convert the coordinate system before sending the request. Therefore, users must ensure that the zipping shapefile has the Albers projection system.
This kind of request takes relatively more steps. For convenience, users can request data for the custom area defined by a shapefile as a box (type = ‘b’) and then remove the data outside of the custom area (see examples above).
The CropScapeR
package was developed under the Windows system. Some unanticipated technical issues might occur when using the CropScapeR
package in a Mac operating system. A notable one is the SSL certificate problem. SSL refers to the Secure Sockets Layer, and SSL certificate displays important information for verifying the owner of a website and encrypting web traffic with SSL/TLS for securing connecttion. Several Mac users have reported errors called ‘SSL certificate problem: SSL certificate expired’. As the name suggests, this is because CropScape server has an expired certificate, which affects the Mac users. Windows users should not expect this issue.
With an invalid SSL certificate, the GetCDLData
function would fail because: (1) it cannot send httr GET request any more; and (2) it cannot read the requested raster TIF data via the raster
function any more. Here is a two-step workaround of the certificate issue. At step 1, specify in R
that you want to skip the certificate validation when making the httr GET request. At step 2, download the raster TIF data into your local drive using the download.file
function, and then read the downloaded raster file using the raster
function. The second step is automatically processed inside the GetCDLData
function. So you just have to do the first step manually. Below is an example to get the CDL data for the Champaign county in 2018 on a Mac computer.
# Skip the SSL check
httr::set_config(httr::config(ssl_verifypeer = 0L))
# Automatically generate a temporary path to save the data
tif_file <- tempfile(fileext = '.tif')
# Download the raster TIF file into specified path, also read into R
data <- GetCDLData(aoi = 17019, year = 2018, type = 'f', save_path = tif_file)
The CropScapeR
package is accepted by CRAN
, it can be directly installed in R
install.packages("CropScapeR")
To install development version of the package, run the following codes in R
:
install.packages("devtools") # Run this if the devtools package is not installed.
devtools::install_github("cbw1243/CropScapeR")
The development version provides the most recent updates of the package.
Please see more advanced uses of the CropScapeR
package from section 9.2 in the ‘R as GIS for Economists’ book.
Bowen Chen, PhD (bwchen0719@gmail.com)
The development of this package was supported by USDA-NRCS Agreement No. NR193A750016C001 through the Cooperative Ecosystem Studies Units network. Any opinions, findings, conclusions, or recommendations expressed are those of the author(s) and do not necessarily reflect the view of the U.S. Department of Agriculture.
Dr. Benjamin Gramig and Dr. Taro Mieno are contributors of this package.
[1] Boryan, Claire, Zhengwei Yang, Rick Mueller, and Mike Craig. 2011. Monitoring US Agriculture: The US Department of Agriculture, National Agricultural Statistics Service, Cropland Data Layer Program. Geocarto International 26 (5): 341–58. https://doi.org/10.1080/10106049.2011.562309.
[2] Han, Weiguo, Zhengwei Yang, Liping Di, and Richard Mueller. 2012. CropScape: A Web Service Based Application for Exploring and Disseminating US Conterminous Geospatial Cropland Data Products for Decision Support. Computers and Electronics in Agriculture 84 (June): 111–23. https://doi.org/10.1016/j.compag.2012.03.005.