diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/00/5b8c9612c07b8cc074888d10fbc1dd975cdf873c4a385add8b02d886b3ff06 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/00/5b8c9612c07b8cc074888d10fbc1dd975cdf873c4a385add8b02d886b3ff06 deleted file mode 100644 index 69cf429..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/00/5b8c9612c07b8cc074888d10fbc1dd975cdf873c4a385add8b02d886b3ff06 +++ /dev/null @@ -1,656 +0,0 @@ -I"ô

Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

- - - -

project outline

-
- -

I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

- -

I. cartography in r part one

- - -

II. cartography in r part two

- - -

III. cartography in r part three

- - -

IV. cartography in r part four

- - -

V. cartography in r part five

- - -

VI. cartography in r part six

- - -

1. load libraries

-
- -
1
-2
-3
-4
-5
-
    ## load libraries
-    library("tidyverse")        # data manipulation & map creation
-    library("sf")               # loads shapefile
-    library("leaflet")          # creates the map
-    library("operator.tools")   # not-in function
-
- -

I am not going to explain in detail what each of these packages do because I already covered it in part one.

- -

2. load data

-
- -
1
-2
-3
-
    ## load data
-    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
-    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
-
- -

Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

- -

If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

- -

3. download state park data

-
- -

Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

- -

When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

- -

The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

- -
- - - Image: PAD-US download - - -
PAD-US download
-
- -

On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

- -
- - - Image: National data download - PAD-US - - -
National data download - PAD-US
-
- -

When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

- -

4. find the right layer

-
- -

Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

- -
1
-
    layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
-
- -

I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

- -

st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

- -
- - - Image: PAD-US Layer List - - -
PAD-US Layer List
-
- -

The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

- - -

I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

- -

Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

- -
- - - Image: PAD-US Viewer - - -
PAD-US Viewer
-
- -

In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

- -
- - - Image: Search for Valley of Fire State Park - - -
Search for Valley of Fire State Park
-
- -

The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

- -

I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

- -
- - - Image: Valley of Fire State Park Polygon - - -
Valley of Fire State Park Polygon
-
- -

The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

- -

For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

- -
- - - Image: Federal Fee Managers Layer - - -
Federal Fee Managers Layer
-
- -

5. find the right data

-
- -

The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

- -

Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

- -

For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

- -
- - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
Valley of Fire (left), Crissey Field (center), Salton Sea (right)
-
- -
- -Note: -Clicking on an image will open it in a new tab. -
- -

First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

- -
- - - Image: Valley of Fire State Park Table - - -
Valley of Fire State Park Table
-
- -

In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

- -
    -
  1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
  2. -
  3. Mang_Type is the name of the agency that manages the land.
  4. -
  5. Mang_Name is the type of agency that owns the land, here state.
  6. -
  7. Des_Tp is the designation type.
  8. -
  9. Unit_Nm is the land’s name.
  10. -
- -

When we look at Crissey Field, many of these fields have different values.

- -
- - - Image: Crissey Field State Recreation Site Table - - -
Crissey Field State Recreation Site Table
-
- -

The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

- -

Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

- -

On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

- -

In the viewer, I’ll look for the blueish violet color (boxed in red above).

- -
- - - Image: Salton Sea State Recreation Area Table - - -
Salton Sea State Recreation Area Table
-
- -

After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

- -

Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

- -

6. process state park data

-
- -
- Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
- -
1
-2
-3
-4
-5
-6
-7
-8
-9
-10
-11
-12
-13
-14
-15
-16
-17
-18
-19
-20
-21
-22
-23
-24
-25
-26
-27
-28
-29
-30
-31
-32
-33
-34
-35
-36
-37
-38
-39
-40
-41
-42
-43
-
    ## create territories list
-    territories <- c("AS", "GU", "MP", "PR", "VI")
-
-    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
-    filter(State_Nm %!in% territories & 
-           Own_Type == "STAT")  %>% 
-    filter(Des_Tp == "ACC" |
-           Des_Tp == "HCA" |
-           Des_Tp == "REC" |
-           Des_Tp == "SCA" |
-           Des_Tp == "SHCA" |
-           Des_Tp == "SP" |
-           Des_Tp == "SREC" |
-           Des_Tp == "SRMA" |
-           Des_Tp == "SW") %>% 
-    filter(d_Pub_Acce != "Closed" & 
-           d_Pub_Acce != "Unknown") %>% 
-    filter(Loc_Ds != "ACC" &
-           Loc_Ds != "Hunter Access",
-           Loc_Ds != "Public Boat Ramp")  %>% 
-    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
-    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
-                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
-    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
-                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
-                               Unit_Nm == "Salton Sea" ~ "visited",
-                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
-                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
-                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
-                               TRUE ~ "not visited")  %>% 
-    shift_geometry(preserve_area = FALSE, 
-                    position = "below") %>% 
-    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-
-    ## save shifted park data
-    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
-
- -

These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

- - - -
2    territories <- c("AS", "GU", "MP", "PR", "VI")
- -

Line 2 creates the list of US territories that I filter out in line 5.

- -

If you want to read more about this line visit part III of the series.

- -
- - - -
4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
- -

Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

- -

st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

- -

The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

- -
- - - -
5    filter(State_Nm %!in% territories  &
-6           Own_Type == "STAT")
- -

R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

- -

The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

- -
- -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
- -

The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

- -
- - - -
7    filter(Des_Tp == "ACC" |
-8           Des_Tp == "HCA" |
-9           Des_Tp == "REC" |
-10          Des_Tp == "SCA" |
-11          Des_Tp == "SHCA" |
-12          Des_Tp == "SP" |
-13          Des_Tp == "SREC" |
-14          Des_Tp == "SRMA" |
-15          Des_Tp == "SW")  
- -

Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

- -

Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

- -
- -Note: -You can view the land designation types by referencing the PAD-US documentation. -
- -

I am going to restrict my data to include the following designations:

- - -

This will leave me with 50,102 rows.

- -
- -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
- -
- - - -
16    filter(d_Pub_Acce != "Closed" & 
-17           d_Pub_Acce != "Unknown")  %>% 
- -

Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

- -

The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

- -

This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

- -
- -

*lines 18-20

- -
18    filter(Loc_Ds != "ACC" &
-19           Loc_Ds != "Hunter Access",
-20           Loc_Ds != "Public Boat Ramp")
- -

Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

- -
- - - -
21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
- -

Now that I’ve pared down the data a little bit, I want discard any column don’t need.

- -

select() lets me choose the columns I want to keep by name, rather than by index number.

- -

I decided to keep:

- - -
- - - -
22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
-23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
-28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
-29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
-30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
- -

mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

- -

I wanted the state park designations to match closely with the types I used in the National Park data.

- -

I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

- -

In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

- -
- - - -
31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
-32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
-33                             Unit_Nm == "Salton Sea" ~ "visited",
-34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
-35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
-36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
-37                             TRUE ~ "not visited")  %>% 
- -

Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

- -

I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

- -
- Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
- -

In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

- -

The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

- -
- - - -
38   shift_geometry(preserve_area = FALSE, 
-39                  position = "below") %>% 
-40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-41
-42   ## save shifted park data
-43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
- -

I’ve covered these lines extensively in part II and part III of this series.

- -

Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

- -

Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

- -

Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

- -

7. divide by state

-
- -
- - -
- - - Image: chaos ensues - - -
chaos ensues
-
- - -
- -

I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

- -

Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

- -

I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

- -
1
-2
-3
-4
-5
-
    split_states <- split(state_parks, f = state_parks$State_Nm)
-    all_names <- names(split_states)   
-
-    for(name in all_names){            
-        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
-
- -

Look ma, new code!

- - - -
1    split_states <- split(state_parks, f = state_parks$State_Nm) 
- -

The split() is part of base R.

- -

8. conclusion

-
- -

* I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is.
-** print("Hello World!")

-:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/09/86aa331313387daff0cb78b8bccc4736640b02d7f76350b2d19b0edb19e634 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/09/86aa331313387daff0cb78b8bccc4736640b02d7f76350b2d19b0edb19e634 deleted file mode 100644 index 2b5cd6f..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/09/86aa331313387daff0cb78b8bccc4736640b02d7f76350b2d19b0edb19e634 +++ /dev/null @@ -1,330 +0,0 @@ -I"˙Y

Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

- - - -

project outline

-
- -

I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

- -

I. cartography in r part one

- - -

II. cartography in r part two

- - -

III. cartography in r part three

- - -

IV. cartography in r part four [this post]

- - -

V. cartography in r part five

- - -

1. load libraries

-
- -
1
-2
-3
-4
-5
-
    ## load libraries
-    library("tidyverse")        # data manipulation & map creation
-    library("sf")               # loads shapefile
-    library("leaflet")          # creates the map
-    library("operator.tools")   # not-in function
-
- -

I am not going to explain in detail what each of these packages do because I already covered it in part one.

- -

2. load data

-
- -
1
-2
-3
-
    ## load data
-    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
-    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
-
- -

Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

- -

If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

- -

3. download state park data

-
- -

Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

- -

When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

- -

The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

- -
- - - Image: PAD-US download - - -
PAD-US download
-
- -

On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

- -
- - - Image: National data download - PAD-US - - -
National data download - PAD-US
-
- -

When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

- -

4. find the right layer

-
- -

Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

- -
1
-
    layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
-
- -

I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

- -

st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

- -
- - - Image: PAD-US Layer List - - -
PAD-US Layer List
-
- -

The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

- - -

I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

- -

Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

- -
- - - Image: PAD-US Viewer - - -
PAD-US Viewer
-
- -

In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

- -
- - - Image: Search for Valley of Fire State Park - - -
Search for Valley of Fire State Park
-
- -

The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

- -

I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

- -
- - - Image: Valley of Fire State Park Polygon - - -
Valley of Fire State Park Polygon
-
- -

The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

- -

For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

- -
- - - Image: Federal Fee Managers Layer - - -
Federal Fee Managers Layer
-
- -

5. find the right data

-
- -

The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

- -

Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

- -

For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

- -
- - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
Valley of Fire (left), Crissey Field (center), Salton Sea (right)
-
- -
- -Note: -Clicking on an image will open it in a new tab. -
- -

First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

- -
- - - Image: Valley of Fire State Park Table - - -
Valley of Fire State Park Table
-
- -

In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

- -
    -
  1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
  2. -
  3. Mang_Type is the name of the agency that manages the land.
  4. -
  5. Mang_Name is the type of agency that owns the land, here state.
  6. -
  7. Des_Tp is the designation type.
  8. -
  9. Unit_Nm is the land’s name.
  10. -
- -

When we look at Crissey Field, many of these fields have different values.

- -
- - - Image: Crissey Field State Recreation Site Table - - -
Crissey Field State Recreation Site Table
-
- -

The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

- -

Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

- -

On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

- -

In the viewer, I’ll look for the blueish violet color (boxed in red above).

- -
- - - Image: Salton Sea State Recreation Area Table - - -
Salton Sea State Recreation Area Table
-
- -

After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

- -

Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

- -

6. pre-process data in R

-
- -
- Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
- -
1
-2
-3
-4
-5
-6
-
    ## create territories list
-    territories <- c("AS", "GU", "MP", "PR", "VI")
-
-    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
-        filter(State_Nm %!in% territories)  %>% 
-        filter(Own_Type == "State")
-
- -

These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

- - - -
- - - -

st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

- -

The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

- -
- -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/0a/e49974e904514b566f2f41baa8e5a450dfb82869f195b10c0f011875b83278 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/0a/e49974e904514b566f2f41baa8e5a450dfb82869f195b10c0f011875b83278 deleted file mode 100644 index 2fdf5e9..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/0a/e49974e904514b566f2f41baa8e5a450dfb82869f195b10c0f011875b83278 +++ /dev/null @@ -1,636 +0,0 @@ -I"·đ

Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

- - - -

project outline

-
- -

I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

- -

I. cartography in r part one

- - -

II. cartography in r part two

- - -

III. cartography in r part three

- - -

IV. cartography in r part four

- - -

V. cartography in r part five

- - -

VI. cartography in r part six

- - -

1. load libraries

-
- -
1
-2
-3
-4
-5
-
    ## load libraries
-    library("tidyverse")        # data manipulation & map creation
-    library("sf")               # loads shapefile
-    library("leaflet")          # creates the map
-    library("operator.tools")   # not-in function
-
- -

I am not going to explain in detail what each of these packages do because I already covered it in part one.

- -

2. load data

-
- -
1
-2
-3
-
    ## load data
-    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
-    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
-
- -

Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

- -

If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

- -

3. download state park data

-
- -

Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

- -

When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

- -

The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

- -
- - - Image: PAD-US download - - -
PAD-US download
-
- -

On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

- -
- - - Image: National data download - PAD-US - - -
National data download - PAD-US
-
- -

When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

- -

4. find the right layer

-
- -

Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

- -
1
-
    layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
-
- -

I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

- -

st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

- -
- - - Image: PAD-US Layer List - - -
PAD-US Layer List
-
- -

The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

- - -

I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

- -

Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

- -
- - - Image: PAD-US Viewer - - -
PAD-US Viewer
-
- -

In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

- -
- - - Image: Search for Valley of Fire State Park - - -
Search for Valley of Fire State Park
-
- -

The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

- -

I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

- -
- - - Image: Valley of Fire State Park Polygon - - -
Valley of Fire State Park Polygon
-
- -

The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

- -

For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

- -
- - - Image: Federal Fee Managers Layer - - -
Federal Fee Managers Layer
-
- -

5. find the right data

-
- -

The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

- -

Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

- -

For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

- -
- - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
Valley of Fire (left), Crissey Field (center), Salton Sea (right)
-
- -
- -Note: -Clicking on an image will open it in a new tab. -
- -

First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

- -
- - - Image: Valley of Fire State Park Table - - -
Valley of Fire State Park Table
-
- -

In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

- -
    -
  1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
  2. -
  3. Mang_Type is the name of the agency that manages the land.
  4. -
  5. Mang_Name is the type of agency that owns the land, here state.
  6. -
  7. Des_Tp is the designation type.
  8. -
  9. Unit_Nm is the land’s name.
  10. -
- -

When we look at Crissey Field, many of these fields have different values.

- -
- - - Image: Crissey Field State Recreation Site Table - - -
Crissey Field State Recreation Site Table
-
- -

The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

- -

Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

- -

On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

- -

In the viewer, I’ll look for the blueish violet color (boxed in red above).

- -
- - - Image: Salton Sea State Recreation Area Table - - -
Salton Sea State Recreation Area Table
-
- -

After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

- -

Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

- -

6. process state park data

-
- -
- Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
- -
1
-2
-3
-4
-5
-6
-7
-8
-9
-10
-11
-12
-13
-14
-15
-16
-17
-18
-19
-20
-21
-22
-23
-24
-25
-26
-27
-28
-29
-30
-31
-32
-33
-34
-35
-36
-37
-38
-39
-40
-41
-42
-43
-
    ## create territories list
-    territories <- c("AS", "GU", "MP", "PR", "VI")
-
-    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
-    filter(State_Nm %!in% territories & 
-           Own_Type == "STAT")  %>% 
-    filter(Des_Tp == "ACC" |
-           Des_Tp == "HCA" |
-           Des_Tp == "REC" |
-           Des_Tp == "SCA" |
-           Des_Tp == "SHCA" |
-           Des_Tp == "SP" |
-           Des_Tp == "SREC" |
-           Des_Tp == "SRMA" |
-           Des_Tp == "SW") %>% 
-    filter(d_Pub_Acce != "Closed" & 
-           d_Pub_Acce != "Unknown") %>% 
-    filter(Loc_Ds != "ACC" &
-           Loc_Ds != "Hunter Access",
-           Loc_Ds != "Public Boat Ramp")  %>% 
-    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
-    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
-                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
-    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
-                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
-                               Unit_Nm == "Salton Sea" ~ "visited",
-                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
-                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
-                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
-                               TRUE ~ "not visited")  %>% 
-    shift_geometry(preserve_area = FALSE, 
-                    position = "below") %>% 
-    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-
-    ## save shifted park data
-    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
-
- -

These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

- - - -
2    territories <- c("AS", "GU", "MP", "PR", "VI")
- -

Line 2 creates the list of US territories that I filter out in line 5.

- -

If you want to read more about this line visit part III of the series.

- -
- - - -
4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
- -

Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

- -

st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

- -

The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

- -
- - - -
5    filter(State_Nm %!in% territories  &
-6           Own_Type == "STAT")
- -

R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

- -

The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

- -
- -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
- -

The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

- -
- - - -
7    filter(Des_Tp == "ACC" |
-8           Des_Tp == "HCA" |
-9           Des_Tp == "REC" |
-10          Des_Tp == "SCA" |
-11          Des_Tp == "SHCA" |
-12          Des_Tp == "SP" |
-13          Des_Tp == "SREC" |
-14          Des_Tp == "SRMA" |
-15          Des_Tp == "SW")  
- -

Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

- -

Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

- -
- -Note: -You can view the land designation types by referencing the PAD-US documentation. -
- -

I am going to restrict my data to include the following designations:

- - -

This will leave me with 50,102 rows.

- -
- -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
- -
- - - -
16    filter(d_Pub_Acce != "Closed" & 
-17           d_Pub_Acce != "Unknown")  %>% 
- -

Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

- -

The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

- -

This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

- -
- -

*lines 18-20

- -
18    filter(Loc_Ds != "ACC" &
-19           Loc_Ds != "Hunter Access",
-20           Loc_Ds != "Public Boat Ramp")
- -

Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

- -
- - - -
21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
- -

Now that I’ve pared down the data a little bit, I want discard any column don’t need.

- -

select() lets me choose the columns I want to keep by name, rather than by index number.

- -

I decided to keep:

- - -
- - - -
22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
-23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
-28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
-29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
-30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
- -

mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

- -

I wanted the state park designations to match closely with the types I used in the National Park data.

- -

I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

- -

In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

- -
- - - -
31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
-32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
-33                             Unit_Nm == "Salton Sea" ~ "visited",
-34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
-35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
-36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
-37                             TRUE ~ "not visited")  %>% 
- -

Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

- -

I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

- -
- Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
- -

In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

- -

The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

- -
- - - -
38   shift_geometry(preserve_area = FALSE, 
-39                  position = "below") %>% 
-40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-41
-42   ## save shifted park data
-43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
- -

I’ve covered these lines extensively in part II and part III of this series.

- -

Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

- -

Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

- -

Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

- -

7. divide by state

-
- -
-
-
"chaos ensued"
- -
- -

I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

- -

Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

- -

I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

- -
1
-2
-3
-4
-5
-6
-
    split_states <- split(state_parks, f = state_parks$State_Nm) # split the data by state
-    all_names <- names(split_states)   
-
-    for(name in all_names){            
-        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp')) 
-    }
-
- -

* I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is.
-** print("Hello World!")

-:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/0b/955a42afe80794768b9378db95664cf8a0e673e5c5479a3fedc81ca80a96c4 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/0b/955a42afe80794768b9378db95664cf8a0e673e5c5479a3fedc81ca80a96c4 new file mode 100644 index 0000000..7f69409 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/0b/955a42afe80794768b9378db95664cf8a0e673e5c5479a3fedc81ca80a96c4 @@ -0,0 +1,30 @@ +I"
+
+
+
+ + +
+ Here you'll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from. + + I have done my best to keep my views out of it. + + I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden. + + If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [link is below] + + Helpful Resources: +
  1. Register to vote (Deadline May 23): [https://registertovote.ca.gov/](https://registertovote.ca.gov/){:target="_blank" rel="noopener noreferrer"}
  2. +
  3. Check your registration status: [https://voterstatus.sos.ca.gov/](https://voterstatus.sos.ca.gov/){:target="_blank" rel="noopener noreferrer"}
  4. +
  5. Access the official voter guide: [https://voterguide.sos.ca.gov/](https://voterguide.sos.ca.gov/){:target="_blank" rel="noopener noreferrer"}
  6. +
  7. Early voting & ballot drop off locations: [https://caearlyvoting.sos.ca.gov/](https://caearlyvoting.sos.ca.gov/){:target="_blank" rel="noopener noreferrer"}
  8. +
  9. Track your ballot: [https://california.ballottrax.net/voter/](https://california.ballottrax.net/voter/){:target="_blank" rel="noopener noreferrer"}
  10. +
  11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
+
+ +
+
+ +
+
+:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/0e/666c62197ff3edbe81f1ee1a243ed9bf45f84e78e7857c07dc991ad544d6bb b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/0e/666c62197ff3edbe81f1ee1a243ed9bf45f84e78e7857c07dc991ad544d6bb deleted file mode 100644 index e0b08c3..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/0e/666c62197ff3edbe81f1ee1a243ed9bf45f84e78e7857c07dc991ad544d6bb +++ /dev/null @@ -1,600 +0,0 @@ -I"iá

Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

- - - -

project outline

-
- -

I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

- -

I. cartography in r part one

- - -

II. cartography in r part two

- - -

III. cartography in r part three

- - -

IV. cartography in r part four [this post]

- - -

V. cartography in r part five

- - -

1. load libraries

-
- -
1
-2
-3
-4
-5
-
    ## load libraries
-    library("tidyverse")        # data manipulation & map creation
-    library("sf")               # loads shapefile
-    library("leaflet")          # creates the map
-    library("operator.tools")   # not-in function
-
- -

I am not going to explain in detail what each of these packages do because I already covered it in part one.

- -

2. load data

-
- -
1
-2
-3
-
    ## load data
-    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
-    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
-
- -

Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

- -

If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

- -

3. download state park data

-
- -

Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

- -

When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

- -

The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

- -
- - - Image: PAD-US download - - -
PAD-US download
-
- -

On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

- -
- - - Image: National data download - PAD-US - - -
National data download - PAD-US
-
- -

When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

- -

4. find the right layer

-
- -

Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

- -
1
-
    layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
-
- -

I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

- -

st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

- -
- - - Image: PAD-US Layer List - - -
PAD-US Layer List
-
- -

The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

- - -

I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

- -

Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

- -
- - - Image: PAD-US Viewer - - -
PAD-US Viewer
-
- -

In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

- -
- - - Image: Search for Valley of Fire State Park - - -
Search for Valley of Fire State Park
-
- -

The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

- -

I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

- -
- - - Image: Valley of Fire State Park Polygon - - -
Valley of Fire State Park Polygon
-
- -

The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

- -

For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

- -
- - - Image: Federal Fee Managers Layer - - -
Federal Fee Managers Layer
-
- -

5. find the right data

-
- -

The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

- -

Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

- -

For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

- -
- - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
Valley of Fire (left), Crissey Field (center), Salton Sea (right)
-
- -
- -Note: -Clicking on an image will open it in a new tab. -
- -

First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

- -
- - - Image: Valley of Fire State Park Table - - -
Valley of Fire State Park Table
-
- -

In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

- -
    -
  1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
  2. -
  3. Mang_Type is the name of the agency that manages the land.
  4. -
  5. Mang_Name is the type of agency that owns the land, here state.
  6. -
  7. Des_Tp is the designation type.
  8. -
  9. Unit_Nm is the land’s name.
  10. -
- -

When we look at Crissey Field, many of these fields have different values.

- -
- - - Image: Crissey Field State Recreation Site Table - - -
Crissey Field State Recreation Site Table
-
- -

The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

- -

Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

- -

On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

- -

In the viewer, I’ll look for the blueish violet color (boxed in red above).

- -
- - - Image: Salton Sea State Recreation Area Table - - -
Salton Sea State Recreation Area Table
-
- -

After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

- -

Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

- -

6. process state park data

-
- -
- Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
- -
1
-2
-3
-4
-5
-6
-7
-8
-9
-10
-11
-12
-13
-14
-15
-16
-17
-18
-19
-20
-21
-22
-23
-24
-25
-26
-27
-28
-29
-30
-31
-32
-33
-34
-35
-36
-37
-38
-39
-40
-41
-42
-43
-
    ## create territories list
-    territories <- c("AS", "GU", "MP", "PR", "VI")
-
-    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
-    filter(State_Nm %!in% territories & 
-           Own_Type == "STAT")  %>% 
-    filter(Des_Tp == "ACC" |
-           Des_Tp == "HCA" |
-           Des_Tp == "REC" |
-           Des_Tp == "SCA" |
-           Des_Tp == "SHCA" |
-           Des_Tp == "SP" |
-           Des_Tp == "SREC" |
-           Des_Tp == "SRMA" |
-           Des_Tp == "SW") %>% 
-    filter(d_Pub_Acce != "Closed" & 
-           d_Pub_Acce != "Unknown") %>% 
-    filter(Loc_Ds != "ACC" &
-           Loc_Ds != "Hunter Access",
-           Loc_Ds != "Public Boat Ramp")  %>% 
-    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
-    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
-                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
-    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
-                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
-                               Unit_Nm == "Salton Sea" ~ "visited",
-                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
-                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
-                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
-                               TRUE ~ "not visited")  %>% 
-    shift_geometry(preserve_area = FALSE, 
-                    position = "below") %>% 
-    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-
-    ## save shifted park data
-    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
-
- -

These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

- - - -
2    territories <- c("AS", "GU", "MP", "PR", "VI")
- -

Line 2 creates the list of US territories that I filter out in line 5.

- -

If you want to read more about this line visit part III of the series.

- -
- - - -
4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
- -

Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

- -

st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

- -

The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

- -
- - - -
5    filter(State_Nm %!in% territories  &
-6           Own_Type == "STAT")
- -

R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

- -

The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

- -
- -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
- -

The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

- -
- - - -
7    filter(Des_Tp == "ACC" |
-8           Des_Tp == "HCA" |
-9           Des_Tp == "REC" |
-10          Des_Tp == "SCA" |
-11          Des_Tp == "SHCA" |
-12          Des_Tp == "SP" |
-13          Des_Tp == "SREC" |
-14          Des_Tp == "SRMA" |
-15          Des_Tp == "SW")  
- -

Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

- -

Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

- -
- -Note: -You can view the land designation types by referencing the PAD-US documentation. -
- -

I am going to restrict my data to include the following designations:

- - -

This will leave me with 50,102 rows.

- -
- -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
- -
- - - -
16    filter(d_Pub_Acce != "Closed" & 
-17           d_Pub_Acce != "Unknown")  %>% 
- -

Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

- -

The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

- -

This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

- -
- -

*lines 18-20

- -
18    filter(Loc_Ds != "ACC" &
-19           Loc_Ds != "Hunter Access",
-20           Loc_Ds != "Public Boat Ramp")
- -

Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

- -
- - - -
21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
- -

Now that I’ve pared down the data a little bit, I want discard any column don’t need.

- -

select() lets me choose the columns I want to keep by name, rather than by index number.

- -

I decided to keep:

- - -
- - - -
22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
-23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
-28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
-29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
-30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
- -

mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

- -

I wanted the state park designations to match closely with the types I used in the National Park data.

- -

I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

- -

In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

- -
- - - -
31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
-32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
-33                             Unit_Nm == "Salton Sea" ~ "visited",
-34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
-35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
-36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
-37                             TRUE ~ "not visited")  %>% 
- -

Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

- -

I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

- -
- Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
- -

In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

- -

The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

- -
- - - -
38   shift_geometry(preserve_area = FALSE, 
-39                  position = "below") %>% 
-40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-41
-42   ## save shifted park data
-43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
- -

I’ve covered these lines extensively in part II and part III of this series.

- -

Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

- -

Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

- -

Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

- -

7. add state parks to the map

-
- -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/0e/76d780ba19960f2f682ac3a5adcb614aaed029dbfc59c88bd436133b64dff2 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/0e/76d780ba19960f2f682ac3a5adcb614aaed029dbfc59c88bd436133b64dff2 deleted file mode 100644 index b5b1834..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/0e/76d780ba19960f2f682ac3a5adcb614aaed029dbfc59c88bd436133b64dff2 +++ /dev/null @@ -1,402 +0,0 @@ -I" s

Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

- - - -

project outline

-
- -

I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

- -

I. cartography in r part one

- - -

II. cartography in r part two

- - -

III. cartography in r part three

- - -

IV. cartography in r part four [this post]

- - -

V. cartography in r part five

- - -

1. load libraries

-
- -
1
-2
-3
-4
-5
-
    ## load libraries
-    library("tidyverse")        # data manipulation & map creation
-    library("sf")               # loads shapefile
-    library("leaflet")          # creates the map
-    library("operator.tools")   # not-in function
-
- -

I am not going to explain in detail what each of these packages do because I already covered it in part one.

- -

2. load data

-
- -
1
-2
-3
-
    ## load data
-    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
-    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
-
- -

Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

- -

If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

- -

3. download state park data

-
- -

Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

- -

When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

- -

The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

- -
- - - Image: PAD-US download - - -
PAD-US download
-
- -

On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

- -
- - - Image: National data download - PAD-US - - -
National data download - PAD-US
-
- -

When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

- -

4. find the right layer

-
- -

Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

- -
1
-
    layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
-
- -

I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

- -

st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

- -
- - - Image: PAD-US Layer List - - -
PAD-US Layer List
-
- -

The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

- - -

I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

- -

Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

- -
- - - Image: PAD-US Viewer - - -
PAD-US Viewer
-
- -

In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

- -
- - - Image: Search for Valley of Fire State Park - - -
Search for Valley of Fire State Park
-
- -

The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

- -

I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

- -
- - - Image: Valley of Fire State Park Polygon - - -
Valley of Fire State Park Polygon
-
- -

The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

- -

For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

- -
- - - Image: Federal Fee Managers Layer - - -
Federal Fee Managers Layer
-
- -

5. find the right data

-
- -

The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

- -

Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

- -

For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

- -
- - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
Valley of Fire (left), Crissey Field (center), Salton Sea (right)
-
- -
- -Note: -Clicking on an image will open it in a new tab. -
- -

First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

- -
- - - Image: Valley of Fire State Park Table - - -
Valley of Fire State Park Table
-
- -

In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

- -
    -
  1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
  2. -
  3. Mang_Type is the name of the agency that manages the land.
  4. -
  5. Mang_Name is the type of agency that owns the land, here state.
  6. -
  7. Des_Tp is the designation type.
  8. -
  9. Unit_Nm is the land’s name.
  10. -
- -

When we look at Crissey Field, many of these fields have different values.

- -
- - - Image: Crissey Field State Recreation Site Table - - -
Crissey Field State Recreation Site Table
-
- -

The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

- -

Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

- -

On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

- -

In the viewer, I’ll look for the blueish violet color (boxed in red above).

- -
- - - Image: Salton Sea State Recreation Area Table - - -
Salton Sea State Recreation Area Table
-
- -

After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

- -

Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

- -

6. process state park data

-
- -
- Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
- -
1
-2
-3
-4
-5
-6
-7
-8
-9
-10
-11
-12
-13
-14
-15
-
    ## create territories list
-    territories <- c("AS", "GU", "MP", "PR", "VI")
-
-    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
-    filter(State_Nm %!in% territories & 
-           Own_Type == "STAT")  %>% 
-    filter(Des_Tp == "ACC" |
-           Des_Tp == "HCA" |
-           Des_Tp == "REC" |
-           Des_Tp == "SCA" |
-           Des_Tp == "SHCA" |
-           Des_Tp == "SP" |
-           Des_Tp == "SREC" |
-           Des_Tp == "SRMA" |
-           Des_Tp == "SW")  
-
- -

These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

- - - -
    territories <- c("AS", "GU", "MP", "PR", "VI")
- -

Line 2 creates the list of US territories that I filter out in line 5.

- -

If you want to read more about this line visit part III of the series.

- -
- - - -
    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
- -

Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

- -

st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

- -

The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

- -
- - - -
    filter(State_Nm %!in% territories & Own_Type == "STAT")
- -

R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

- -

The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

- -
- -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
- -

The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

- -
- - - -

Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

- -
- -Note: -You can view the land designation types by referencing the PAD-US documentation. -
- -

I am going to restrict my data to include the following designations:

- - -
-:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/13/5dfb2e86e8529faa0748d3779e369bee5b5203db7ecec416fe0bab1e1a43d2 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/13/5dfb2e86e8529faa0748d3779e369bee5b5203db7ecec416fe0bab1e1a43d2 deleted file mode 100644 index ba036e9..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/13/5dfb2e86e8529faa0748d3779e369bee5b5203db7ecec416fe0bab1e1a43d2 +++ /dev/null @@ -1,587 +0,0 @@ -I"„ä

This is part three of my cartography in R series. If you are just finding this, I suggest taking a look at part I and part II first.

- -

In this post, I will download and process the National Park data. Once that’s done, I’ll add it to the base map I created in part II.

- - - -
- Note: I know that my posts are verbose. I am sorry that I am not more concise but one of the biggest problems I have with code documentation is that it assumes you already know what you're doing. I'm writing these for absolute beginners that want to know what each part of the code does. If you want more concise explanations, I suggest looking at the annotated r files in the project's GitHub. -
- -

project outline

-
- -

I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

- -

I. cartography in r part one

- - -

II. cartography in r part two

- - -

III. cartography in r part three [this post]

- - -

IV. cartography in r part four

- - -

V. cartography in r part five

- - -

1. load libraries

-
- -
1
-2
-3
-4
-5
-
    ## load libraries
-    library("tidyverse")        # data manipulation & map creation
-    library("sf")               # loads shapefile
-    library("leaflet")          # creates the map
-    library("operator.tools")   # not-in function
-
- -

I am not going to explain in detail what each of these packages do because I already covered it in part one.

- -

2. load usa data

-
- -
1
-2
-
    ## load data
-    states <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
-
- -

Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp to reflect wherever you saved the shifted shapefile.

- -

If your data processing and base map creation are in the same file, you can skip this line, and when you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

- -

3. download national park data

-
- -

The National Park Service provides all the data we’ll need to make the map. The data is accessible on the ArcGIS’ Open Data website. Once you click on the link you’ll see a bunch of icons that lead to different data that’s available for download. Click on the one for boundaries.

- -
- - - Image: NPS GIS website - - -
NPS GIS website
-
- -

From here, you’ll be taken to a list of available National Park data. The second link should be nps boundary which contains the shape data for all the National Parks in the United States. The file contains all the data for the park outlines along with hiking trails, rest areas, and lots of other data.

- -
- - - Image: NPS download page - - -
NPS download page
-
- -

The nps boundary link will take you to a map showing the national parks. On the left, there will be a download link on the left.

- -
- - - Image: NPS map and download link - - -
NPS map and download link
-
- -

From here, you’ll have a few download options. The National Park Service provides the data in different formats including CSV and Shapefile. You’ll want to download the shapefile version.

- -
- - - Image: download nps shapefile - - -
download nps shapefile
-
- -

Be sure to save the file somewhere on your hard drive that is easy to find. When it finishes downloading, be sure to unzip the file. There will be four files inside the folder. All of them need to be kept in the same location. Even though we’ll only load the .shp file, R uses the three others to create the necessary shapes.

- -

4. process national park data

-
- -

The code below may look intimidating, but it’s fairly straight forward. I’ll go over each line below.

- -
- Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
- -
1
-2
-3
-4
-5
-6
-7
-8
-9
-10
-11
-12
-13
-14
-15
-16
-17
-18
-19
-20
-21
-22
-23
-24
-25
-26
-27
-28
-29
-30
-31
-32
-33
-34
-35
-36
-37
-38
-39
-40
-41
-42
-43
-
    ## create territories list
-    territories <- c("AS", "GU", "MP", "PR", "VI")
-
-    ## load and process nps data
-    nps <- read_sf("./shapefiles/original/nps/NPS_-_Land_Resources_Division_Boundary_and_Tract_Data_Service.shp")  %>% 
-    select(STATE, UNIT_TYPE, PARKNAME, Shape__Are, geometry) %>% 
-    filter(STATE %!in% territories) %>%  
-    mutate(type = case_when(UNIT_TYPE == "International Historic Site" ~ "International Historic Site", # there's 23 types of national park, I wanted to reduce this number.
-                UNIT_TYPE == "National Battlefield Site" ~ "National Military or Battlefield", # lines 56-77 reduce the number of park types
-                UNIT_TYPE == "National Military Park" ~ "National Military or Battlefield", 
-                UNIT_TYPE == "National Battlefield" ~ "National Military or Battlefield",
-                UNIT_TYPE == "National Historical Park" ~ "National Historical Park, Site, Monument, or Memorial",
-                UNIT_TYPE == "National Historic Site" ~ "National Historical Park, Site, Monument, or Memorial",
-                UNIT_TYPE == "National Historic Trail" ~ "National Historical Park, Site, Monument, or Memorial",
-                UNIT_TYPE == "National Memorial" ~ "National Historical Park, Site, Monument, or Memorial",
-                UNIT_TYPE == "National Monument" ~ "National Historical Park, Site, Monument, or Memorial",
-                UNIT_TYPE == "National Preserve" ~ "National Preserve, Reserve, or Recreation Area",
-                UNIT_TYPE == "National Reserve" ~ "National Preserve, Reserve, or Recreation Area",
-                UNIT_TYPE == "National Recreation Area" ~ "National Preserve, Reserve, or Recreation Area",
-                UNIT_TYPE == "National River" ~ "National River, Lakeshore, or Seashore",
-                UNIT_TYPE == "National Lakeshore" ~ "National River, Lakeshore, or Seashore",
-                UNIT_TYPE == "National Wild & Scenic River" ~ "National River, Lakeshore, or Seashore",
-                UNIT_TYPE == "National Seashore" ~ "National River, Lakeshore, or Seashore",
-                UNIT_TYPE == "National Trails Syste" ~ "National Trail",
-                UNIT_TYPE == "National Scenic Trail" ~ "National Trail",
-                UNIT_TYPE == "National Park" ~ "National Park or Parkway",
-                UNIT_TYPE == "Park" ~ "National Park or Parkway",
-                UNIT_TYPE == "Parkway" ~ "National Park or Parkway",
-                UNIT_TYPE == "Other Designation" ~ "Other National Land Area"))  %>%  
-    mutate(visited = case_when(PARKNAME == "Joshua Tree" ~ "visited", 
-                                PARKNAME == "Redwood" ~ "visited", 
-                                PARKNAME == "Santa Monica Mountains" ~ "visited", 
-                                PARKNAME == "Sequoia" ~ "visited", 
-                                PARKNAME == "Kings Canyon" ~ "visited",
-                                PARKNAME == "Lewis and Clark" ~ "visited",
-                                PARKNAME == "Mount Rainier" ~ "visited",
-                                TRUE ~ "not visited")) %>%  
-    shift_geometry(preserve_area = FALSE, 
-                    position = "below") %>% 
-    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-
-    ## save the shifted national park data
-    st_write(nps, "~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
-
- - - -
2    territories <- c("AS", "GU", "MP", "PR", "VI")
- -

In part I of this series I talked about how R has an %in% function, but not a %!in% function. Here’s where the latter function shines.

- -

The United States is still an empire with its associated territories and islands. In this project I am interested in the 50 states - without these other areas. As a result, I need to filter them out. Using base R’s %in% function I would have to create a variable that contains the postal abbreviations for all 50 states. That is annoying. Instead, I want to use the shorter list that only includes the US’ associated islands and territories. To do so, however, I need to use the operator tools’ %!in% function.

- -

Line 2 creates the list of US territories that I filter out in line 7. The c() function in R means combine or concatenate. Inside the parenthesis are the five postal codes for the American Samoa, Guam, the Northern Mariana Islands, Puerto Rico, and the Virgin Islands.

- -
- - - -
5   nps <- read_sf("path/to/file/NPS_-_Land_Resources_Division_Boundary_and_Tract_Data_Service.shp")  %>% 
- -

nps <- read_sf("path/to/file.shp") loads the National Park data set to a variable called nps using the read_sf() function that is part of the sf package. You will need to change the file path so it reflects where you saved the data on your hard drive.

- -

The %>% operator is part of the tidyverse package. It tells R to go to the next line and process the next command. It has to go at the end of a line, rather than the beginning.

- -
- - - -
6    select(STATE, UNIT_TYPE, PARKNAME, Shape__Are, geometry) %>%
- -

select is part of the tidyverse package. With it, we can select columns by their name rather than their associated number. Large data sets take more computing power because the computer has to iterate over more rows. Unfortunately, rendering maps also takes a lot of computing power so I like to discard any unnecessary columns to reduce the amount of effort my computer has to exert.

- -

Deciding on which columns to keep will depend on the data you’re using and what you want to map (or analyze). I know for my project I want to include a few things:

- - -

There’s a couple ways to inspect the data to see what kind of information is available.

- -
    -
  1. You can view the entire data set by using view(nps) but as the number of data points increases, so does R’s struggle with opening it. I’ve found that VSCode doesn’t throw as big of a fit as R Studio when opening large data sets.
  2. -
  3. Another way is to just look at the column names using data.frame(colnames(nps)). This will return a list of the data set’s column names. This is my preferred method. I then go to the documentation to see what each column contains. This isn’t fool-proof because it really depends on if the data has good documentation.
  4. -
- -

The National Park data includes a lot of information about who created the data and maintains the property. I’m not interested in this, so in line 6 I select the following columns:

- - -
Note: The column name must be exact for select() to work. For example, in the NPS data, the park area is stored the Shape__Are column - two underscores and no a in area - and I have to use that exact format when selecting the column.
- -

The geometry column is specific to shapefiles and it includes the coordinates of the shape. It will be kept automatically - unless you use the st_drop_geometry() function. I like to specifically select so I remember it’s there.

- -
- - - -
7    filter(STATE %!in% territories) %>%  
- -

In line 7 I use the territories list I created in line 2 to filter out the United States’ associated areas. Since the nps data uses the two character state abbreviation, I have to use the two character abbreviation for the territories. Searching for “Guam,” for example, won’t work.

- -

filter() is part of the tidyverse and it uses conditional language. In the parentheses is a condition that must be true if the tidyverse is going to keep the row. Starting at the top of the data, R goes “alright, does the value in the STATE column match any of the values in the territories list?” If the condition is TRUE, R adds the row to the new data frame.

- -
Note: Since we're using the %!in% operator, any row that evaluates as TRUE will be kept because the value is NOT found in the territories list. If I wanted to keep only the territories, I would use the %in% operator and only the rows with STATE abbreviations found in the territories list would be kept.
- -

For example, if the STATE value in row 1 is CA, filter looks at it and goes “is CA NOT IN territories?” If that is TRUE, keep it because we want only the values that are NOT IN the territories list.

- -
- - - -
8    mutate(type = case_when(UNIT_TYPE == "International Historic Site" ~ "International Historic Site",
- -

mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

- -

The NPS data set has 23 different types of National Parks listed (you can view all of them by running levels(as.factor(nps$UNIT_TYPE))). I know that in later posts, I’m going to color code the land by type (blue for rivers, green for national parks, etc) so I wanted to reduce the number of colors I would have to use.

- -

mutate()’s first argument, type = creates a new column called type. R will populate the newly created column with whatever comes after the first (singular) equal = sign. For example, I can put type = NA and every row in the column will say NA.

- -

Here, I am using the case_when() function, which is also part of the tidyverse. The logic of case_when is fairly straight forward. The first value is the name of the column you want R to look in (here: UNIT_TYPE). Next, is a conditional. Here I am looking for an exact match (==) to the string (words) inside the first set of quotation marks (in line 8: "International Historic Site"). The last part of the argument is what I want R to put in the type column when it finds a row where the UNIT_TYPE is "International Historic Site".

- -

In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value")

- -

Lines 9-29 do the same thing for the other park types. You can reduce the parks however you want or use all 23 types. Just remember that the value before the tilde ~ has to match the values found in the data exactly. For example, in line 24 I change the NPS data’s National Trail Syste value to be National Trail. Whomever created the data set did not spell system correctly, so for R to match the value I also have to omit the last letter in system.

- -
- - - -
30    mutate(visited = case_when(PARKNAME == "Joshua Tree" ~ "visited",
- -

Lines 30-37 use the same mutate() and case_when logic as above. Instead of reducing the number of park types, I use it to mark the different parks I have visited.

- -

Line 30 creates the new column, visited and uses case_when to look for the names of the parks that I’ve been to. If I have visited them, it adds visited to the column of the same name.

- -

The last line, TRUE ~ "not_visited)), acts as an else statement. For any park not listed above, it will put not visited in the visited column I created.

- -

This feels like a very brute-force method of tracking which parks I’ve visited, but I haven’t spend much time trying to find another way.

- -
- - - -
38    shift_geometry(preserve_area = FALSE, 
-39                   position = "below") %>%
- -

In part I, when I made the base map, I moved Alaska and Hawaii so they were of similar size and closer to the continental USA. For the map to display the parks correctly, I have to shift them as well.

- -

I went over these two lines in part II, so I won’t go over them again here. If you want to read more about them, check out that post.

- -
- - - -
40  sf::st_transform('+proj=longlat +datum=WGS84')
- -

The last line uses the st_transform() function from the sf package to covert the data set from NAD83 to WGS84. Leaflet requires WGS84, so be sure to include this line at the end of your data manipulation.

- -

I covered the WGS84 ellipsoid in part I, if you want to read more about it.

- -
- - - -
43  st_write(nps, "~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
- -

Strictly speaking, this line isn’t necessary. You can do all your data processing in the same file where you make your map, but I prefer to separate the steps into different files.

- -

As a result, I save the shifted data to my hard drive so it’s easier to load later. I usually have this line commented out (by placing # at the start of the line) after I save it the first time. I don’t want it to save every time I run the rest of the code.

- -

5. add national parks to base map

-
- -
1
-2
-3
-4
-5
-6
-7
-8
-9
-10
-11
-12
-13
-14
-15
-16
-17
-18
-19
-20
-21
-22
-23
-24
-25
-26
-27
-28
-29
-30
-31
-32
-33
-34
-
## create usa Base Map using leaflet()
-    map <- leaflet() %>%
-    addPolygons(data = states,
-        smoothFactor = 0.2,
-        fillColor = "#808080",
-        fillOpacity = 0.5,
-        stroke = TRUE,
-        weight = 0.5,
-        opacity = 0.5,
-        color = "#808080",
-        highlight = highlightOptions(
-            weight = 0.5,
-            color = "#000000",
-            fillOpacity = 0.7,
-            bringToFront = FALSE),
-        group = "Base Map")  %>% 
-    addPolygons(data = nps,
-        smoothFactor = 0.2,                 
-        fillColor = "#354f52",
-        fillOpacity = 1,
-        stroke = TRUE,
-        weight = 1,     
-        opacity = 0.5,                       
-        color = "#354f52",             
-        highlight = highlightOptions(
-            weight = 3,
-            color = "#fff",
-            fillOpacity = 0.8,
-            bringToFront = TRUE),
-        group = "National Parks")  %>%
-    addLayersControl(
-        baseGroups = "Base Map",
-        overlayGroups = "National Parks",
-        options = layersControlOptions(collapsed = FALSE))
-
- - - -

Lines 2-16 are identical to those in part II where I created the base map. I am not going to cover these sections in detail, because I covered it previously.

- -
- - - -
17    addPolygons(data = nps,
- -

To add the National Park data to the base map, we call addPolygons() again. The arguments are the same as before - color, opacity, outline style - just with different values. By changing those values, we can differentiate the base map from the national park data.

- -

Since we’re mapping the National Parks and not the states, we have to tell R where the data is located using data = nps.

- -
- - - -
18   smoothFactor = 0.2,
- -

smoothFactor() determines how detailed the park boundaries should be. The lower the number, the more detailed the shape. The higher the number, the smoother the parks will render. I usually match this to whatever I set for the base map for consistency.

- -
- - - -
19   fillColor = "#354f52",
-20   fillOpacity = 1,
- -

Define the color and transparency of the National Parks. In a future post, I am going to change the color of each type of public land, but for now, I’ll make them all a nice sage green color #354f52. I also want to make the parks to be fully opaque.

- -
- - - -
21    stroke = TRUE,
-22    weight = 1,     
-23    opacity = 0.5,                       
-24    color = "#354f52",
- -

The next four lines (21-24) define what kind of outline the National Parks will have. I detail each of these arguments in part II of this series.

- -

Briefly, I want there to be an outline to each park (stroke = TRUE) that’s thicker weight = 1 than the outline used on the base map. I do not like the way it looks at full opacity, so I make it half-transparent (opacity = 0.5). Finally, I want the outline color = "#354f52 to be the same color as the fill. This will matter more when I change the fill color of the parks later on.

- -
- - - -
25  highlight = highlightOptions(
-26              weight = 3,
-27              color = "#fff",
-28              fillOpacity = 0.8,
- -

Lines 25-28 define the National Park’s behavior on mouseover. First we have to define and initialize the highlightOptions() function. The function take similar arguments as the addPolygons function - both of which I go over in detail in part II.

- -

I want to keep the mouseover behavior noticeable, but simple. To do so, I set the outline’s thickness to be weight = 3. This will give the shape a nice border that differentiates it from the rest of the map.

- -

color = "#fff sets the outline’s color on mouseover only. So, when inactive, the outline color will match the fill color, but on mouseover the outline color switches to white (#fff).

- -
- Note: When the hex code repeats (like, white - #ffffff, black - #000000, or grey #808080) you only need to include the first three characters.
- -
- - - -
29    bringToFront = TRUE),
- -

bringToFront can either be TRUE or FALSE. If TRUE, Leaflet will bring the park to the forefront on mouseover. This is useful later when we add in the state parks because national and state parks tend to be close together.

- -

When FALSE the shape will remain static.

- -
- - - -
30    group = "National Parks")  %>%
- -

Since Leaflet adds all new data to the top of the base map, I think it’s useful to group the layers together. In the next block of code, we add in some layer functionality. For now, though, I want to add the National Parks to their own group so I can hide the National Parks if I want.

- -
- - - -
31    addLayersControl(
-32        baseGroups = "Base Map",
-33        overlayGroups = "National Parks",
-34        options = layersControlOptions(collapsed = FALSE))
- -

addLayersControl defines how layers are displayed on the final map. The function takes three arguments.

- -

First, we have to tell Leaflet which layer should be used as the base map: baseGroups = "Base Map". The name in the quotations (here: "Base Map") has to match the name given to the layer you set in the addPolygons() call. In line 14, I put the 50 states into a group called "Base Map", but you can name it anything you like.

- -

There can be more than one base map, too. It’s not super helpful here since I shifted Alaska and Hawaii, but when using map tiles you can add multiple types of base maps that users can switch between.

- -

Next, we have to define the layers that are shown on top of the base group: overlayGroups = "National Parks". Just like the base map, this is defined in the corresponding addPolygons call. Here, I called the layer National Parks in line 30.

- -

Finally, on the map I don’t want the layers to be collapsed, so I set options = layersControlOptions(collapsed = FALSE). When TRUE the map will display an icon in the top right that, when clicked, will show the available layers.

- -
- - - Image: Layers collapsed - - -
Layers collapsed
-
- -

6. conclusion

-

Hey, look at that! You made a base map and you added some National Park data to it. You’re a certified cartographer now!

- -

In the next part IV post we’ll download and process the state park data before adding it to the map. Part V of this series we’ll add Shiny functionality and some additional markers.

- - -
Map showing US National Parks
-

</figure>

-:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/13/bac6e2f146265aecc927e21f50544292fd950718c1028eb350f5d38c3096d1 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/13/bac6e2f146265aecc927e21f50544292fd950718c1028eb350f5d38c3096d1 deleted file mode 100644 index 0223795..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/13/bac6e2f146265aecc927e21f50544292fd950718c1028eb350f5d38c3096d1 +++ /dev/null @@ -1,587 +0,0 @@ -I"ä

This is part three of my cartography in R series. If you are just finding this, I suggest taking a look at part I and part II first.

- -

In this post, I will download and process the National Park data. Once that’s done, I’ll add it to the base map I created in part II.

- - - -
- Note: I know that my posts are verbose. I am sorry that I am not more concise but one of the biggest problems I have with code documentation is that it assumes you already know what you're doing. I'm writing these for absolute beginners that want to know what each part of the code does. If you want more concise explanations, I suggest looking at the annotated r files in the project's GitHub. -
- -

project outline

-
- -

I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

- -

I. cartography in r part one

- - -

II. cartography in r part two

- - -

III. cartography in r part three

- - -

IV. cartography in r part four [this post]

- - -

V. cartography in r part five

- - -

1. load libraries

-
- -
1
-2
-3
-4
-5
-
    ## load libraries
-    library("tidyverse")        # data manipulation & map creation
-    library("sf")               # loads shapefile
-    library("leaflet")          # creates the map
-    library("operator.tools")   # not-in function
-
- -

I am not going to explain in detail what each of these packages do because I already covered it in part one.

- -

2. load usa data

-
- -
1
-2
-
    ## load data
-    states <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
-
- -

Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp to reflect wherever you saved the shifted shapefile.

- -

If your data processing and base map creation are in the same file, you can skip this line, and when you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

- -

3. download national park data

-
- -

The National Park Service provides all the data we’ll need to make the map. The data is accessible on the ArcGIS’ Open Data website. Once you click on the link you’ll see a bunch of icons that lead to different data that’s available for download. Click on the one for boundaries.

- -
- - - Image: NPS GIS website - - -
NPS GIS website
-
- -

From here, you’ll be taken to a list of available National Park data. The second link should be nps boundary which contains the shape data for all the National Parks in the United States. The file contains all the data for the park outlines along with hiking trails, rest areas, and lots of other data.

- -
- - - Image: NPS download page - - -
NPS download page
-
- -

The nps boundary link will take you to a map showing the national parks. On the left, there will be a download link on the left.

- -
- - - Image: NPS map and download link - - -
NPS map and download link
-
- -

From here, you’ll have a few download options. The National Park Service provides the data in different formats including CSV and Shapefile. You’ll want to download the shapefile version.

- -
- - - Image: download nps shapefile - - -
download nps shapefile
-
- -

Be sure to save the file somewhere on your hard drive that is easy to find. When it finishes downloading, be sure to unzip the file. There will be four files inside the folder. All of them need to be kept in the same location. Even though we’ll only load the .shp file, R uses the three others to create the necessary shapes.

- -

4. process national park data

-
- -

The code below may look intimidating, but it’s fairly straight forward. I’ll go over each line below.

- -
- Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
- -
1
-2
-3
-4
-5
-6
-7
-8
-9
-10
-11
-12
-13
-14
-15
-16
-17
-18
-19
-20
-21
-22
-23
-24
-25
-26
-27
-28
-29
-30
-31
-32
-33
-34
-35
-36
-37
-38
-39
-40
-41
-42
-43
-
    ## create territories list
-    territories <- c("AS", "GU", "MP", "PR", "VI")
-
-    ## load and process nps data
-    nps <- read_sf("./shapefiles/original/nps/NPS_-_Land_Resources_Division_Boundary_and_Tract_Data_Service.shp")  %>% 
-    select(STATE, UNIT_TYPE, PARKNAME, Shape__Are, geometry) %>% 
-    filter(STATE %!in% territories) %>%  
-    mutate(type = case_when(UNIT_TYPE == "International Historic Site" ~ "International Historic Site", # there's 23 types of national park, I wanted to reduce this number.
-                UNIT_TYPE == "National Battlefield Site" ~ "National Military or Battlefield", # lines 56-77 reduce the number of park types
-                UNIT_TYPE == "National Military Park" ~ "National Military or Battlefield", 
-                UNIT_TYPE == "National Battlefield" ~ "National Military or Battlefield",
-                UNIT_TYPE == "National Historical Park" ~ "National Historical Park, Site, Monument, or Memorial",
-                UNIT_TYPE == "National Historic Site" ~ "National Historical Park, Site, Monument, or Memorial",
-                UNIT_TYPE == "National Historic Trail" ~ "National Historical Park, Site, Monument, or Memorial",
-                UNIT_TYPE == "National Memorial" ~ "National Historical Park, Site, Monument, or Memorial",
-                UNIT_TYPE == "National Monument" ~ "National Historical Park, Site, Monument, or Memorial",
-                UNIT_TYPE == "National Preserve" ~ "National Preserve, Reserve, or Recreation Area",
-                UNIT_TYPE == "National Reserve" ~ "National Preserve, Reserve, or Recreation Area",
-                UNIT_TYPE == "National Recreation Area" ~ "National Preserve, Reserve, or Recreation Area",
-                UNIT_TYPE == "National River" ~ "National River, Lakeshore, or Seashore",
-                UNIT_TYPE == "National Lakeshore" ~ "National River, Lakeshore, or Seashore",
-                UNIT_TYPE == "National Wild & Scenic River" ~ "National River, Lakeshore, or Seashore",
-                UNIT_TYPE == "National Seashore" ~ "National River, Lakeshore, or Seashore",
-                UNIT_TYPE == "National Trails Syste" ~ "National Trail",
-                UNIT_TYPE == "National Scenic Trail" ~ "National Trail",
-                UNIT_TYPE == "National Park" ~ "National Park or Parkway",
-                UNIT_TYPE == "Park" ~ "National Park or Parkway",
-                UNIT_TYPE == "Parkway" ~ "National Park or Parkway",
-                UNIT_TYPE == "Other Designation" ~ "Other National Land Area"))  %>%  
-    mutate(visited = case_when(PARKNAME == "Joshua Tree" ~ "visited", 
-                                PARKNAME == "Redwood" ~ "visited", 
-                                PARKNAME == "Santa Monica Mountains" ~ "visited", 
-                                PARKNAME == "Sequoia" ~ "visited", 
-                                PARKNAME == "Kings Canyon" ~ "visited",
-                                PARKNAME == "Lewis and Clark" ~ "visited",
-                                PARKNAME == "Mount Rainier" ~ "visited",
-                                TRUE ~ "not visited")) %>%  
-    shift_geometry(preserve_area = FALSE, 
-                    position = "below") %>% 
-    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-
-    ## save the shifted national park data
-    st_write(nps, "~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
-
- - - -
2    territories <- c("AS", "GU", "MP", "PR", "VI")
- -

In part I of this series I talked about how R has an %in% function, but not a %!in% function. Here’s where the latter function shines.

- -

The United States is still an empire with its associated territories and islands. In this project I am interested in the 50 states - without these other areas. As a result, I need to filter them out. Using base R’s %in% function I would have to create a variable that contains the postal abbreviations for all 50 states. That is annoying. Instead, I want to use the shorter list that only includes the US’ associated islands and territories. To do so, however, I need to use the operator tools’ %!in% function.

- -

Line 2 creates the list of US territories that I filter out in line 7. The c() function in R means combine or concatenate. Inside the parenthesis are the five postal codes for the American Samoa, Guam, the Northern Mariana Islands, Puerto Rico, and the Virgin Islands.

- -
- - - -
5   nps <- read_sf("path/to/file/NPS_-_Land_Resources_Division_Boundary_and_Tract_Data_Service.shp")  %>% 
- -

nps <- read_sf("path/to/file.shp") loads the National Park data set to a variable called nps using the read_sf() function that is part of the sf package. You will need to change the file path so it reflects where you saved the data on your hard drive.

- -

The %>% operator is part of the tidyverse package. It tells R to go to the next line and process the next command. It has to go at the end of a line, rather than the beginning.

- -
- - - -
6    select(STATE, UNIT_TYPE, PARKNAME, Shape__Are, geometry) %>%
- -

select is part of the tidyverse package. With it, we can select columns by their name rather than their associated number. Large data sets take more computing power because the computer has to iterate over more rows. Unfortunately, rendering maps also takes a lot of computing power so I like to discard any unnecessary columns to reduce the amount of effort my computer has to exert.

- -

Deciding on which columns to keep will depend on the data you’re using and what you want to map (or analyze). I know for my project I want to include a few things:

- - -

There’s a couple ways to inspect the data to see what kind of information is available.

- -
    -
  1. You can view the entire data set by using view(nps) but as the number of data points increases, so does R’s struggle with opening it. I’ve found that VSCode doesn’t throw as big of a fit as R Studio when opening large data sets.
  2. -
  3. Another way is to just look at the column names using data.frame(colnames(nps)). This will return a list of the data set’s column names. This is my preferred method. I then go to the documentation to see what each column contains. This isn’t fool-proof because it really depends on if the data has good documentation.
  4. -
- -

The National Park data includes a lot of information about who created the data and maintains the property. I’m not interested in this, so in line 6 I select the following columns:

- - -
Note: The column name must be exact for select() to work. For example, in the NPS data, the park area is stored the Shape__Are column - two underscores and no a in area - and I have to use that exact format when selecting the column.
- -

The geometry column is specific to shapefiles and it includes the coordinates of the shape. It will be kept automatically - unless you use the st_drop_geometry() function. I like to specifically select so I remember it’s there.

- -
- - - -
7    filter(STATE %!in% territories) %>%  
- -

In line 7 I use the territories list I created in line 2 to filter out the United States’ associated areas. Since the nps data uses the two character state abbreviation, I have to use the two character abbreviation for the territories. Searching for “Guam,” for example, won’t work.

- -

filter() is part of the tidyverse and it uses conditional language. In the parentheses is a condition that must be true if the tidyverse is going to keep the row. Starting at the top of the data, R goes “alright, does the value in the STATE column match any of the values in the territories list?” If the condition is TRUE, R adds the row to the new data frame.

- -
Note: Since we're using the %!in% operator, any row that evaluates as TRUE will be kept because the value is NOT found in the territories list. If I wanted to keep only the territories, I would use the %in% operator and only the rows with STATE abbreviations found in the territories list would be kept.
- -

For example, if the STATE value in row 1 is CA, filter looks at it and goes “is CA NOT IN territories?” If that is TRUE, keep it because we want only the values that are NOT IN the territories list.

- -
- - - -
8    mutate(type = case_when(UNIT_TYPE == "International Historic Site" ~ "International Historic Site",
- -

mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

- -

The NPS data set has 23 different types of National Parks listed (you can view all of them by running levels(as.factor(nps$UNIT_TYPE))). I know that in later posts, I’m going to color code the land by type (blue for rivers, green for national parks, etc) so I wanted to reduce the number of colors I would have to use.

- -

mutate()’s first argument, type = creates a new column called type. R will populate the newly created column with whatever comes after the first (singular) equal = sign. For example, I can put type = NA and every row in the column will say NA.

- -

Here, I am using the case_when() function, which is also part of the tidyverse. The logic of case_when is fairly straight forward. The first value is the name of the column you want R to look in (here: UNIT_TYPE). Next, is a conditional. Here I am looking for an exact match (==) to the string (words) inside the first set of quotation marks (in line 8: "International Historic Site"). The last part of the argument is what I want R to put in the type column when it finds a row where the UNIT_TYPE is "International Historic Site".

- -

In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value")

- -

Lines 9-29 do the same thing for the other park types. You can reduce the parks however you want or use all 23 types. Just remember that the value before the tilde ~ has to match the values found in the data exactly. For example, in line 24 I change the NPS data’s National Trail Syste value to be National Trail. Whomever created the data set did not spell system correctly, so for R to match the value I also have to omit the last letter in system.

- -
- - - -
30    mutate(visited = case_when(PARKNAME == "Joshua Tree" ~ "visited",
- -

Lines 30-37 use the same mutate() and case_when logic as above. Instead of reducing the number of park types, I use it to mark the different parks I have visited.

- -

Line 30 creates the new column, visited and uses case_when to look for the names of the parks that I’ve been to. If I have visited them, it adds visited to the column of the same name.

- -

The last line, TRUE ~ "not_visited)), acts as an else statement. For any park not listed above, it will put not visited in the visited column I created.

- -

This feels like a very brute-force method of tracking which parks I’ve visited, but I haven’t spend much time trying to find another way.

- -
- - - -
38    shift_geometry(preserve_area = FALSE, 
-39                   position = "below") %>%
- -

In part I, when I made the base map, I moved Alaska and Hawaii so they were of similar size and closer to the continental USA. For the map to display the parks correctly, I have to shift them as well.

- -

I went over these two lines in part II, so I won’t go over them again here. If you want to read more about them, check out that post.

- -
- - - -
40  sf::st_transform('+proj=longlat +datum=WGS84')
- -

The last line uses the st_transform() function from the sf package to covert the data set from NAD83 to WGS84. Leaflet requires WGS84, so be sure to include this line at the end of your data manipulation.

- -

I covered the WGS84 ellipsoid in part I, if you want to read more about it.

- -
- - - -
43  st_write(nps, "~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
- -

Strictly speaking, this line isn’t necessary. You can do all your data processing in the same file where you make your map, but I prefer to separate the steps into different files.

- -

As a result, I save the shifted data to my hard drive so it’s easier to load later. I usually have this line commented out (by placing # at the start of the line) after I save it the first time. I don’t want it to save every time I run the rest of the code.

- -

5. add national parks to base map

-
- -
1
-2
-3
-4
-5
-6
-7
-8
-9
-10
-11
-12
-13
-14
-15
-16
-17
-18
-19
-20
-21
-22
-23
-24
-25
-26
-27
-28
-29
-30
-31
-32
-33
-34
-
## create usa Base Map using leaflet()
-    map <- leaflet() %>%
-    addPolygons(data = states,
-        smoothFactor = 0.2,
-        fillColor = "#808080",
-        fillOpacity = 0.5,
-        stroke = TRUE,
-        weight = 0.5,
-        opacity = 0.5,
-        color = "#808080",
-        highlight = highlightOptions(
-            weight = 0.5,
-            color = "#000000",
-            fillOpacity = 0.7,
-            bringToFront = FALSE),
-        group = "Base Map")  %>% 
-    addPolygons(data = nps,
-        smoothFactor = 0.2,                 
-        fillColor = "#354f52",
-        fillOpacity = 1,
-        stroke = TRUE,
-        weight = 1,     
-        opacity = 0.5,                       
-        color = "#354f52",             
-        highlight = highlightOptions(
-            weight = 3,
-            color = "#fff",
-            fillOpacity = 0.8,
-            bringToFront = TRUE),
-        group = "National Parks")  %>%
-    addLayersControl(
-        baseGroups = "Base Map",
-        overlayGroups = "National Parks",
-        options = layersControlOptions(collapsed = FALSE))
-
- - - -

Lines 2-16 are identical to those in part II where I created the base map. I am not going to cover these sections in detail, because I covered it previously.

- -
- - - -
17    addPolygons(data = nps,
- -

To add the National Park data to the base map, we call addPolygons() again. The arguments are the same as before - color, opacity, outline style - just with different values. By changing those values, we can differentiate the base map from the national park data.

- -

Since we’re mapping the National Parks and not the states, we have to tell R where the data is located using data = nps.

- -
- - - -
18   smoothFactor = 0.2,
- -

smoothFactor() determines how detailed the park boundaries should be. The lower the number, the more detailed the shape. The higher the number, the smoother the parks will render. I usually match this to whatever I set for the base map for consistency.

- -
- - - -
19   fillColor = "#354f52",
-20   fillOpacity = 1,
- -

Define the color and transparency of the National Parks. In a future post, I am going to change the color of each type of public land, but for now, I’ll make them all a nice sage green color #354f52. I also want to make the parks to be fully opaque.

- -
- - - -
21    stroke = TRUE,
-22    weight = 1,     
-23    opacity = 0.5,                       
-24    color = "#354f52",
- -

The next four lines (21-24) define what kind of outline the National Parks will have. I detail each of these arguments in part II of this series.

- -

Briefly, I want there to be an outline to each park (stroke = TRUE) that’s thicker weight = 1 than the outline used on the base map. I do not like the way it looks at full opacity, so I make it half-transparent (opacity = 0.5). Finally, I want the outline color = "#354f52 to be the same color as the fill. This will matter more when I change the fill color of the parks later on.

- -
- - - -
25  highlight = highlightOptions(
-26              weight = 3,
-27              color = "#fff",
-28              fillOpacity = 0.8,
- -

Lines 25-28 define the National Park’s behavior on mouseover. First we have to define and initialize the highlightOptions() function. The function take similar arguments as the addPolygons function - both of which I go over in detail in part II.

- -

I want to keep the mouseover behavior noticeable, but simple. To do so, I set the outline’s thickness to be weight = 3. This will give the shape a nice border that differentiates it from the rest of the map.

- -

color = "#fff sets the outline’s color on mouseover only. So, when inactive, the outline color will match the fill color, but on mouseover the outline color switches to white (#fff).

- -
- Note: When the hex code repeats (like, white - #ffffff, black - #000000, or grey #808080) you only need to include the first three characters.
- -
- - - -
29    bringToFront = TRUE),
- -

bringToFront can either be TRUE or FALSE. If TRUE, Leaflet will bring the park to the forefront on mouseover. This is useful later when we add in the state parks because national and state parks tend to be close together.

- -

When FALSE the shape will remain static.

- -
- - - -
30    group = "National Parks")  %>%
- -

Since Leaflet adds all new data to the top of the base map, I think it’s useful to group the layers together. In the next block of code, we add in some layer functionality. For now, though, I want to add the National Parks to their own group so I can hide the National Parks if I want.

- -
- - - -
31    addLayersControl(
-32        baseGroups = "Base Map",
-33        overlayGroups = "National Parks",
-34        options = layersControlOptions(collapsed = FALSE))
- -

addLayersControl defines how layers are displayed on the final map. The function takes three arguments.

- -

First, we have to tell Leaflet which layer should be used as the base map: baseGroups = "Base Map". The name in the quotations (here: "Base Map") has to match the name given to the layer you set in the addPolygons() call. In line 14, I put the 50 states into a group called "Base Map", but you can name it anything you like.

- -

There can be more than one base map, too. It’s not super helpful here since I shifted Alaska and Hawaii, but when using map tiles you can add multiple types of base maps that users can switch between.

- -

Next, we have to define the layers that are shown on top of the base group: overlayGroups = "National Parks". Just like the base map, this is defined in the corresponding addPolygons call. Here, I called the layer National Parks in line 30.

- -

Finally, on the map I don’t want the layers to be collapsed, so I set options = layersControlOptions(collapsed = FALSE). When TRUE the map will display an icon in the top right that, when clicked, will show the available layers.

- -
- - - Image: Layers collapsed - - -
Layers collapsed
-
- -

6. conclusion

-

Hey, look at that! You made a base map and you added some National Park data to it. You’re a certified cartographer now!

- -

In the next part IV post we’ll download and process the state park data before adding it to the map. Part V of this series we’ll add Shiny functionality and some additional markers.

- - -
Map showing US National Parks
-

</figure>

-:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/17/b7cba20a27e3021ce8f6ec95153175849f4d20efc196fd16f500aca543adc6 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/17/b7cba20a27e3021ce8f6ec95153175849f4d20efc196fd16f500aca543adc6 deleted file mode 100644 index ecc58f5..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/17/b7cba20a27e3021ce8f6ec95153175849f4d20efc196fd16f500aca543adc6 +++ /dev/null @@ -1,654 +0,0 @@ -I"Mô

Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

- - - -

project outline

-
- -

I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

- -

I. cartography in r part one

- - -

II. cartography in r part two

- - -

III. cartography in r part three

- - -

IV. cartography in r part four

- - -

V. cartography in r part five

- - -

VI. cartography in r part six

- - -

1. load libraries

-
- -
1
-2
-3
-4
-5
-
    ## load libraries
-    library("tidyverse")        # data manipulation & map creation
-    library("sf")               # loads shapefile
-    library("leaflet")          # creates the map
-    library("operator.tools")   # not-in function
-
- -

I am not going to explain in detail what each of these packages do because I already covered it in part one.

- -

2. load data

-
- -
1
-2
-3
-
    ## load data
-    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
-    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
-
- -

Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

- -

If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

- -

3. download state park data

-
- -

Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

- -

When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

- -

The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

- -
- - - Image: PAD-US download - - -
PAD-US download
-
- -

On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

- -
- - - Image: National data download - PAD-US - - -
National data download - PAD-US
-
- -

When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

- -

4. find the right layer

-
- -

Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

- -
1
-
    layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
-
- -

I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

- -

st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

- -
- - - Image: PAD-US Layer List - - -
PAD-US Layer List
-
- -

The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

- - -

I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

- -

Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

- -
- - - Image: PAD-US Viewer - - -
PAD-US Viewer
-
- -

In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

- -
- - - Image: Search for Valley of Fire State Park - - -
Search for Valley of Fire State Park
-
- -

The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

- -

I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

- -
- - - Image: Valley of Fire State Park Polygon - - -
Valley of Fire State Park Polygon
-
- -

The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

- -

For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

- -
- - - Image: Federal Fee Managers Layer - - -
Federal Fee Managers Layer
-
- -

5. find the right data

-
- -

The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

- -

Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

- -

For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

- -
- - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
Valley of Fire (left), Crissey Field (center), Salton Sea (right)
-
- -
- -Note: -Clicking on an image will open it in a new tab. -
- -

First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

- -
- - - Image: Valley of Fire State Park Table - - -
Valley of Fire State Park Table
-
- -

In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

- -
    -
  1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
  2. -
  3. Mang_Type is the name of the agency that manages the land.
  4. -
  5. Mang_Name is the type of agency that owns the land, here state.
  6. -
  7. Des_Tp is the designation type.
  8. -
  9. Unit_Nm is the land’s name.
  10. -
- -

When we look at Crissey Field, many of these fields have different values.

- -
- - - Image: Crissey Field State Recreation Site Table - - -
Crissey Field State Recreation Site Table
-
- -

The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

- -

Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

- -

On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

- -

In the viewer, I’ll look for the blueish violet color (boxed in red above).

- -
- - - Image: Salton Sea State Recreation Area Table - - -
Salton Sea State Recreation Area Table
-
- -

After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

- -

Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

- -

6. process state park data

-
- -
- Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
- -
1
-2
-3
-4
-5
-6
-7
-8
-9
-10
-11
-12
-13
-14
-15
-16
-17
-18
-19
-20
-21
-22
-23
-24
-25
-26
-27
-28
-29
-30
-31
-32
-33
-34
-35
-36
-37
-38
-39
-40
-41
-42
-43
-
    ## create territories list
-    territories <- c("AS", "GU", "MP", "PR", "VI")
-
-    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
-    filter(State_Nm %!in% territories & 
-           Own_Type == "STAT")  %>% 
-    filter(Des_Tp == "ACC" |
-           Des_Tp == "HCA" |
-           Des_Tp == "REC" |
-           Des_Tp == "SCA" |
-           Des_Tp == "SHCA" |
-           Des_Tp == "SP" |
-           Des_Tp == "SREC" |
-           Des_Tp == "SRMA" |
-           Des_Tp == "SW") %>% 
-    filter(d_Pub_Acce != "Closed" & 
-           d_Pub_Acce != "Unknown") %>% 
-    filter(Loc_Ds != "ACC" &
-           Loc_Ds != "Hunter Access",
-           Loc_Ds != "Public Boat Ramp")  %>% 
-    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
-    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
-                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
-    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
-                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
-                               Unit_Nm == "Salton Sea" ~ "visited",
-                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
-                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
-                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
-                               TRUE ~ "not visited")  %>% 
-    shift_geometry(preserve_area = FALSE, 
-                    position = "below") %>% 
-    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-
-    ## save shifted park data
-    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
-
- -

These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

- - - -
2    territories <- c("AS", "GU", "MP", "PR", "VI")
- -

Line 2 creates the list of US territories that I filter out in line 5.

- -

If you want to read more about this line visit part III of the series.

- -
- - - -
4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
- -

Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

- -

st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

- -

The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

- -
- - - -
5    filter(State_Nm %!in% territories  &
-6           Own_Type == "STAT")
- -

R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

- -

The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

- -
- -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
- -

The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

- -
- - - -
7    filter(Des_Tp == "ACC" |
-8           Des_Tp == "HCA" |
-9           Des_Tp == "REC" |
-10          Des_Tp == "SCA" |
-11          Des_Tp == "SHCA" |
-12          Des_Tp == "SP" |
-13          Des_Tp == "SREC" |
-14          Des_Tp == "SRMA" |
-15          Des_Tp == "SW")  
- -

Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

- -

Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

- -
- -Note: -You can view the land designation types by referencing the PAD-US documentation. -
- -

I am going to restrict my data to include the following designations:

- - -

This will leave me with 50,102 rows.

- -
- -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
- -
- - - -
16    filter(d_Pub_Acce != "Closed" & 
-17           d_Pub_Acce != "Unknown")  %>% 
- -

Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

- -

The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

- -

This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

- -
- -

*lines 18-20

- -
18    filter(Loc_Ds != "ACC" &
-19           Loc_Ds != "Hunter Access",
-20           Loc_Ds != "Public Boat Ramp")
- -

Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

- -
- - - -
21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
- -

Now that I’ve pared down the data a little bit, I want discard any column don’t need.

- -

select() lets me choose the columns I want to keep by name, rather than by index number.

- -

I decided to keep:

- - -
- - - -
22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
-23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
-28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
-29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
-30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
- -

mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

- -

I wanted the state park designations to match closely with the types I used in the National Park data.

- -

I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

- -

In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

- -
- - - -
31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
-32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
-33                             Unit_Nm == "Salton Sea" ~ "visited",
-34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
-35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
-36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
-37                             TRUE ~ "not visited")  %>% 
- -

Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

- -

I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

- -
- Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
- -

In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

- -

The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

- -
- - - -
38   shift_geometry(preserve_area = FALSE, 
-39                  position = "below") %>% 
-40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-41
-42   ## save shifted park data
-43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
- -

I’ve covered these lines extensively in part II and part III of this series.

- -

Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

- -

Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

- -

Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

- -

7. divide by state

-
- -
- - -
- - - Image: chaos ensues - - -
chaos ensues
-
- - -
- -

I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

- -

Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

- -

I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

- -
1
-2
-3
-4
-5
-
    split_states <- split(state_parks, f = state_parks$State_Nm) # split the data by state
-    all_names <- names(split_states)   
-
-    for(name in all_names){            
-        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
-
- -

Look ma, new code!

- - - -
    split_states <- split(state_parks, f = state_parks$State_Nm) # split the data by state
- -

8. conclusion

-
- -

* I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is.
-** print("Hello World!")

-:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/18/028e1e523f6238826ed65e72bbb3b7a0a1a519fd003d48233c066b06c1a44c b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/18/028e1e523f6238826ed65e72bbb3b7a0a1a519fd003d48233c066b06c1a44c deleted file mode 100644 index f1c2a3a..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/18/028e1e523f6238826ed65e72bbb3b7a0a1a519fd003d48233c066b06c1a44c +++ /dev/null @@ -1,687 +0,0 @@ -I"îţ

Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

- - - -

project outline

-
- -

I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

- -

I. cartography in r part one

- - -

II. cartography in r part two

- - -

III. cartography in r part three

- - -

IV. cartography in r part four

- - -

V. cartography in r part five

- - -

VI. cartography in r part six

- - -

1. load libraries

-
- -
1
-2
-3
-4
-5
-
    ## load libraries
-    library("tidyverse")        # data manipulation & map creation
-    library("sf")               # loads shapefile
-    library("leaflet")          # creates the map
-    library("operator.tools")   # not-in function
-
- -

I am not going to explain in detail what each of these packages do because I already covered it in part one.

- -

2. load data

-
- -
1
-2
-3
-
    ## load data
-    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
-    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
-
- -

Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

- -

If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

- -

3. download state park data

-
- -

Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

- -

When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

- -

The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

- -
- - - Image: PAD-US download - - -
PAD-US download
-
- -

On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

- -
- - - Image: National data download - PAD-US - - -
National data download - PAD-US
-
- -

When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

- -

4. find the right layer

-
- -

Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

- -
1
-
    layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
-
- -

I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

- -

st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

- -
- - - Image: PAD-US Layer List - - -
PAD-US Layer List
-
- -

The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

- - -

I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

- -

Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

- -
- - - Image: PAD-US Viewer - - -
PAD-US Viewer
-
- -

In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

- -
- - - Image: Search for Valley of Fire State Park - - -
Search for Valley of Fire State Park
-
- -

The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

- -

I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

- -
- - - Image: Valley of Fire State Park Polygon - - -
Valley of Fire State Park Polygon
-
- -

The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

- -

For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

- -
- - - Image: Federal Fee Managers Layer - - -
Federal Fee Managers Layer
-
- -

5. find the right data

-
- -

The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

- -

Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

- -

For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

- -
- - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
Valley of Fire (left), Crissey Field (center), Salton Sea (right)
-
- -
- -Note: -Clicking on an image will open it in a new tab. -
- -

First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

- -
- - - Image: Valley of Fire State Park Table - - -
Valley of Fire State Park Table
-
- -

In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

- -
    -
  1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
  2. -
  3. Mang_Type is the name of the agency that manages the land.
  4. -
  5. Mang_Name is the type of agency that owns the land, here state.
  6. -
  7. Des_Tp is the designation type.
  8. -
  9. Unit_Nm is the land’s name.
  10. -
- -

When we look at Crissey Field, many of these fields have different values.

- -
- - - Image: Crissey Field State Recreation Site Table - - -
Crissey Field State Recreation Site Table
-
- -

The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

- -

Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

- -

On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

- -

In the viewer, I’ll look for the blueish violet color (boxed in red above).

- -
- - - Image: Salton Sea State Recreation Area Table - - -
Salton Sea State Recreation Area Table
-
- -

After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

- -

Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

- -

6. process state park data

-
- -
- Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
- -
1
-2
-3
-4
-5
-6
-7
-8
-9
-10
-11
-12
-13
-14
-15
-16
-17
-18
-19
-20
-21
-22
-23
-24
-25
-26
-27
-28
-29
-30
-31
-32
-33
-34
-35
-36
-37
-38
-39
-40
-41
-42
-43
-
    ## create territories list
-    territories <- c("AS", "GU", "MP", "PR", "VI")
-
-    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
-    filter(State_Nm %!in% territories & 
-           Own_Type == "STAT")  %>% 
-    filter(Des_Tp == "ACC" |
-           Des_Tp == "HCA" |
-           Des_Tp == "REC" |
-           Des_Tp == "SCA" |
-           Des_Tp == "SHCA" |
-           Des_Tp == "SP" |
-           Des_Tp == "SREC" |
-           Des_Tp == "SRMA" |
-           Des_Tp == "SW") %>% 
-    filter(d_Pub_Acce != "Closed" & 
-           d_Pub_Acce != "Unknown") %>% 
-    filter(Loc_Ds != "ACC" &
-           Loc_Ds != "Hunter Access",
-           Loc_Ds != "Public Boat Ramp")  %>% 
-    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
-    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
-                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
-    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
-                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
-                               Unit_Nm == "Salton Sea" ~ "visited",
-                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
-                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
-                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
-                               TRUE ~ "not visited")  %>% 
-    shift_geometry(preserve_area = FALSE, 
-                    position = "below") %>% 
-    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-
-    ## save shifted park data
-    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
-
- -

These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

- - - -
2    territories <- c("AS", "GU", "MP", "PR", "VI")
- -

Line 2 creates the list of US territories that I filter out in line 5.

- -

If you want to read more about this line visit part III of the series.

- -
- - - -
4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
- -

Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

- -

st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

- -

The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

- -
- - - -
5    filter(State_Nm %!in% territories  &
-6           Own_Type == "STAT")
- -

R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

- -

The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

- -
- -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
- -

The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

- -
- - - -
7    filter(Des_Tp == "ACC" |
-8           Des_Tp == "HCA" |
-9           Des_Tp == "REC" |
-10          Des_Tp == "SCA" |
-11          Des_Tp == "SHCA" |
-12          Des_Tp == "SP" |
-13          Des_Tp == "SREC" |
-14          Des_Tp == "SRMA" |
-15          Des_Tp == "SW")  
- -

Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

- -

Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

- -
- -Note: -You can view the land designation types by referencing the PAD-US documentation. -
- -

I am going to restrict my data to include the following designations:

- - -

This will leave me with 50,102 rows.

- -
- -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
- -
- - - -
16    filter(d_Pub_Acce != "Closed" & 
-17           d_Pub_Acce != "Unknown")  %>% 
- -

Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

- -

The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

- -

This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

- -
- -

*lines 18-20

- -
18    filter(Loc_Ds != "ACC" &
-19           Loc_Ds != "Hunter Access",
-20           Loc_Ds != "Public Boat Ramp")
- -

Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

- -
- - - -
21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
- -

Now that I’ve pared down the data a little bit, I want discard any column don’t need.

- -

select() lets me choose the columns I want to keep by name, rather than by index number.

- -

I decided to keep:

- - -
- - - -
22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
-23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
-28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
-29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
-30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
- -

mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

- -

I wanted the state park designations to match closely with the types I used in the National Park data.

- -

I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

- -

In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

- -
- - - -
31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
-32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
-33                             Unit_Nm == "Salton Sea" ~ "visited",
-34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
-35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
-36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
-37                             TRUE ~ "not visited")  %>% 
- -

Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

- -

I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

- -
- Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
- -

In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

- -

The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

- -
- - - -
38   shift_geometry(preserve_area = FALSE, 
-39                  position = "below") %>% 
-40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-41
-42   ## save shifted park data
-43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
- -

I’ve covered these lines extensively in part II and part III of this series.

- -

Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

- -

Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

- -

Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

- -

7. divide by state

-
- -
- - -
- - - Image: chaos ensues - - -
chaos ensues
-
- - -
- -

I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

- -

Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

- -

I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

- -
1
-2
-3
-4
-5
-
    split_states <- split(state_parks, f = state_parks$State_Nm)
-    all_names <- names(split_states)   
-
-    for(name in all_names){            
-        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
-
- -

Look ma, new code!

- - - -
1    split_states <- split(state_parks, f = state_parks$State_Nm) 
- -

The split() is part of base R. It takes quite a few arguments, most of which are optional.

- -

The first argument is the vector (or data frame) that you want to split into different groups. I want to split the state_parks data into its corresponding states, so it is listed first.

- -

The second argument f = is how you want the data split. f in this instance stands for factor. If we run levels(as.factor(state_parks$State_Nm)) in the terminal, it will return a list of the 50 state abbreviations. That is what we’re telling R to do here.

- -

You can access an individual state using the $ operator. split_states$CA will return the state park data for California.

- -
- - - -
2    all_names <- names(split_states)
- -

names is also part of base R. It does what it sounds like - it gets the names of an object. Here, I want to get the names of each split data sets.

- -
- - - -
4    for(name in all_names){            
-5        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
- -

Here’s the actual for loop.

- -

The basic logic of a for loop is:
- for(x in y) {
- do something}

- -

8. conclusion

-
- -

* I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me an email or a tweet if it is.
-** print("Hello World!")

-:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/18/77c2528811abb332b533fe512455fb18637d9f51bf105d96005a223fdae708 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/18/77c2528811abb332b533fe512455fb18637d9f51bf105d96005a223fdae708 deleted file mode 100644 index a891988..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/18/77c2528811abb332b533fe512455fb18637d9f51bf105d96005a223fdae708 +++ /dev/null @@ -1,243 +0,0 @@ -I"/+
-
-
-
- -
- - - -

Oldcart

-
- -
1
-2
-
    ## create territories list
-    territories <- c("AS", "GU", "MP", "PR", "VI")
-
- - - - - | - - - -read more -
- - - -

cartography in R part four

-
-

Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

- - - - - - | - - - -read more -
- - - -

cartography in R part three

-
-

This is part three of my cartography in R series. If you are just finding this, I suggest taking a look at part I and part II first.

- -

In this post, I will download and process the National Park data. Once that’s done, I’ll add it to the base map I created in part II.

- - - - - - | - - - -read more -
- - - -

Converting Gdb Files

-
-
    -
  1. download gdb data from the PAD-US Website: - https://www.sciencebase.gov/catalog/item/61794fc2d34ea58c3c6f9f69 - under Attached Files download “PADUS3_0Geodatabase.zip” - Unzip the folder
  2. -
- - - - - | - - - -read more -
- - - -

dvc (data version control)

-
-

Version control is helpful when you want to track your project’s changes. However, GitHub has one major (yet, understandable) shortcoming: file size. The free version of GitHub will warn you if your file is over 50MB and completely reject your push if the file is over 100MB. This is a huge problem when you’re working with shapefiles (.shp) which contain the geographic coordinates necessary for cartography. “Officially” there are three ways around GitHub’s file size limits, but I have a clear favorite.

- - - - - - | - - - -read more -
- - - -

github website

-
-

Building off my post about using Git & GitHub, this post is about using the GitHub website to initialize repos and get URLs from existing repos to clone them.

- - - - - - | - - - -read more -
- - - -

github desktop

-
-

This is part of my tutorial series on using Git and GitHub. In particular, this guide is about using GitHub’s desktop app for creating and managing repos. You can download the app here..

- - - - - | - - - -read more -
- - - -

git & github

-
-

Arguably one of the best things you can do before starting a PhD is invest time in learning how to properly use version control. With version control, you can track, save, and revert changes to any kind of project. There are several options available, but I’m partial to Git & GitHub. Even if you never touch a piece of code, version control is very helpful.

- - - - - - | - - - -read more -
- - - -

cartography in R part two

-
-

This is a continuation of my previous post where I walked through how to download and modify shape data. I also showed how to shift Alaska and Hawaii so they are closer to the continental usa.

- - - - - | - - - -read more -
- - - -

cartography in R part one

-
-

Twitter is a great resource for engaging with the academic community. For example, I saw this Tweet by PhD Genie asking users to name one positive skill learned during their PhD. I love this question for a number of reasons. First, it helps PhDs reframe their experience so it’s applicable outside of academia - which can help when applying to jobs. Second, it’s really cool to see what skills other people have learned during their program.

- - - - - | - - - -read more -
- - - -

hello

-
-

-This area will not be a real blog, in the sense that it will probably not have regular updates. -

- - - - - | - - - -read more -
- - - -

Citation Manager

-
-

GET A CITATION MANAGER.

- - - - - | - - - -read more -
- - - -
-
- -
-
-
- - -
-
- - -
-
- -
-
- -
-
- -
-:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/19/08d62ffa7d41ea9bd246a480604d8ec07cce5f9cf5ffdef1fdf41ed9d5ab86 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/19/08d62ffa7d41ea9bd246a480604d8ec07cce5f9cf5ffdef1fdf41ed9d5ab86 new file mode 100644 index 0000000..3260542 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/19/08d62ffa7d41ea9bd246a480604d8ec07cce5f9cf5ffdef1fdf41ed9d5ab86 @@ -0,0 +1,29 @@ +I"w

welcome

+
+ +

Here you’ll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from.

+ +

I have done my best to keep my views out of it.

+ +

I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden.

+ +

If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page]

+ +

helpful resources:

+
+ +
    +
  1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
  2. +
  3. Check your registration status: https://voterstatus.sos.ca.gov/
  4. +
  5. Access the official voter guide: https://voterguide.sos.ca.gov/
  6. +
  7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
  8. +
  9. Track your ballot: https://california.ballottrax.net/voter/
  10. +
  11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
  12. +
+ +

ballot guides

+
+ +

2022 Primary California

+ +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/19/75fea0c2525ada12b8783ac92c5114c49f061d3aad55439a27cdbcc64884cf b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/19/75fea0c2525ada12b8783ac92c5114c49f061d3aad55439a27cdbcc64884cf deleted file mode 100644 index dcbba3f..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/19/75fea0c2525ada12b8783ac92c5114c49f061d3aad55439a27cdbcc64884cf +++ /dev/null @@ -1,369 +0,0 @@ -I"ďh

Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

- - - -

project outline

-
- -

I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

- -

I. cartography in r part one

- - -

II. cartography in r part two

- - -

III. cartography in r part three

- - -

IV. cartography in r part four [this post]

- - -

V. cartography in r part five

- - -

1. load libraries

-
- -
1
-2
-3
-4
-5
-
    ## load libraries
-    library("tidyverse")        # data manipulation & map creation
-    library("sf")               # loads shapefile
-    library("leaflet")          # creates the map
-    library("operator.tools")   # not-in function
-
- -

I am not going to explain in detail what each of these packages do because I already covered it in part one.

- -

2. load data

-
- -
1
-2
-3
-
    ## load data
-    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
-    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
-
- -

Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

- -

If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

- -

3. download state park data

-
- -

Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

- -

When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

- -

The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

- -
- - - Image: PAD-US download - - -
PAD-US download
-
- -

On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

- -
- - - Image: National data download - PAD-US - - -
National data download - PAD-US
-
- -

When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

- -

4. find the right layer

-
- -

Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

- -
1
-
    layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
-
- -

I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

- -

st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

- -
- - - Image: PAD-US Layer List - - -
PAD-US Layer List
-
- -

The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

- - -

I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

- -

Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

- -
- - - Image: PAD-US Viewer - - -
PAD-US Viewer
-
- -

In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

- -
- - - Image: Search for Valley of Fire State Park - - -
Search for Valley of Fire State Park
-
- -

The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

- -

I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

- -
- - - Image: Valley of Fire State Park Polygon - - -
Valley of Fire State Park Polygon
-
- -

The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

- -

For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

- -
- - - Image: Federal Fee Managers Layer - - -
Federal Fee Managers Layer
-
- -

5. find the right data

-
- -

The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

- -

Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

- -

For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

- -
- - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
Valley of Fire (left), Crissey Field (center), Salton Sea (right)
-
- -
- -Note: -Clicking on an image will open it in a new tab. -
- -

First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

- -
- - - Image: Valley of Fire State Park Table - - -
Valley of Fire State Park Table
-
- -

In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

- -
    -
  1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
  2. -
  3. Mang_Type is the name of the agency that manages the land.
  4. -
  5. Mang_Name is the type of agency that owns the land, here state.
  6. -
  7. Des_Tp is the designation type.
  8. -
  9. Unit_Nm is the land’s name.
  10. -
- -

When we look at Crissey Field, many of these fields have different values.

- -
- - - Image: Crissey Field State Recreation Site Table - - -
Crissey Field State Recreation Site Table
-
- -

The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

- -

Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

- -

On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

- -

In the viewer, I’ll look for the blueish violet color (boxed in red above).

- -
- - - Image: Salton Sea State Recreation Area Table - - -
Salton Sea State Recreation Area Table
-
- -

After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

- -

Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

- -

6. pre-process data in R

-
- -
- Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
- -
1
-2
-3
-4
-5
-6
-
    ## create territories list
-    territories <- c("AS", "GU", "MP", "PR", "VI")
-
-    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
-        filter(State_Nm %!in% territories)  %>% 
-        filter(Own_Type == "State")
-
- -

These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

- - - -
    territories <- c("AS", "GU", "MP", "PR", "VI")
- -

Line 2 creates the list of US territories that I filter out in line 5.

- -

If you want to read more about this line visit part III of the series.

- -
- - - -
    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
- -

Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

- -

st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

- -

The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

- -
- - - -
    filter(State_Nm %!in% territories & Own_Type == "STAT")
- -

R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

- -

The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

- -
- -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
- -

The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

- -

Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

- -

I am going to restrict my data to include the following designations:

- -
- -Note: -You can view the land designation types by referencing the PAD-US documentation. -
- -
-:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1a/0fe9b7c3cc8e7bd13deaab82d7f5eea49184be644f94f4c54e109db1d68d26 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1a/0fe9b7c3cc8e7bd13deaab82d7f5eea49184be644f94f4c54e109db1d68d26 new file mode 100644 index 0000000..a0ab078 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1a/0fe9b7c3cc8e7bd13deaab82d7f5eea49184be644f94f4c54e109db1d68d26 @@ -0,0 +1,28 @@ +I"˝

welcome

+
+ +

Here you’ll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from.

+ +

I have done my best to keep my views out of it.

+ +

I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden.

+ +

If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page]

+ +

helpful resources

+
+ +
    +
  1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
  2. +
  3. Check your registration status: https://voterstatus.sos.ca.gov/
  4. +
  5. Access the official voter guide: https://voterguide.sos.ca.gov/
  6. +
  7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
  8. +
  9. Track your ballot: https://california.ballottrax.net/voter/
  10. +
  11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
  12. +
+ +

ballot guides

+
+ +

Click on a link to download the PDF.

+:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1b/345711ba56a1cc2bb0a759c50a6ba2ba9291d2425fc99348b32889455a4850 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1b/345711ba56a1cc2bb0a759c50a6ba2ba9291d2425fc99348b32889455a4850 deleted file mode 100644 index d65c0f0..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1b/345711ba56a1cc2bb0a759c50a6ba2ba9291d2425fc99348b32889455a4850 +++ /dev/null @@ -1,686 +0,0 @@ -I"ľţ

Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

- - - -

project outline

-
- -

I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

- -

I. cartography in r part one

- - -

II. cartography in r part two

- - -

III. cartography in r part three

- - -

IV. cartography in r part four

- - -

V. cartography in r part five

- - -

VI. cartography in r part six

- - -

1. load libraries

-
- -
1
-2
-3
-4
-5
-
    ## load libraries
-    library("tidyverse")        # data manipulation & map creation
-    library("sf")               # loads shapefile
-    library("leaflet")          # creates the map
-    library("operator.tools")   # not-in function
-
- -

I am not going to explain in detail what each of these packages do because I already covered it in part one.

- -

2. load data

-
- -
1
-2
-3
-
    ## load data
-    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
-    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
-
- -

Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

- -

If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

- -

3. download state park data

-
- -

Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

- -

When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

- -

The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

- -
- - - Image: PAD-US download - - -
PAD-US download
-
- -

On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

- -
- - - Image: National data download - PAD-US - - -
National data download - PAD-US
-
- -

When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

- -

4. find the right layer

-
- -

Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

- -
1
-
    layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
-
- -

I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

- -

st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

- -
- - - Image: PAD-US Layer List - - -
PAD-US Layer List
-
- -

The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

- - -

I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

- -

Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

- -
- - - Image: PAD-US Viewer - - -
PAD-US Viewer
-
- -

In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

- -
- - - Image: Search for Valley of Fire State Park - - -
Search for Valley of Fire State Park
-
- -

The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

- -

I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

- -
- - - Image: Valley of Fire State Park Polygon - - -
Valley of Fire State Park Polygon
-
- -

The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

- -

For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

- -
- - - Image: Federal Fee Managers Layer - - -
Federal Fee Managers Layer
-
- -

5. find the right data

-
- -

The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

- -

Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

- -

For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

- -
- - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
Valley of Fire (left), Crissey Field (center), Salton Sea (right)
-
- -
- -Note: -Clicking on an image will open it in a new tab. -
- -

First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

- -
- - - Image: Valley of Fire State Park Table - - -
Valley of Fire State Park Table
-
- -

In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

- -
    -
  1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
  2. -
  3. Mang_Type is the name of the agency that manages the land.
  4. -
  5. Mang_Name is the type of agency that owns the land, here state.
  6. -
  7. Des_Tp is the designation type.
  8. -
  9. Unit_Nm is the land’s name.
  10. -
- -

When we look at Crissey Field, many of these fields have different values.

- -
- - - Image: Crissey Field State Recreation Site Table - - -
Crissey Field State Recreation Site Table
-
- -

The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

- -

Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

- -

On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

- -

In the viewer, I’ll look for the blueish violet color (boxed in red above).

- -
- - - Image: Salton Sea State Recreation Area Table - - -
Salton Sea State Recreation Area Table
-
- -

After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

- -

Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

- -

6. process state park data

-
- -
- Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
- -
1
-2
-3
-4
-5
-6
-7
-8
-9
-10
-11
-12
-13
-14
-15
-16
-17
-18
-19
-20
-21
-22
-23
-24
-25
-26
-27
-28
-29
-30
-31
-32
-33
-34
-35
-36
-37
-38
-39
-40
-41
-42
-43
-
    ## create territories list
-    territories <- c("AS", "GU", "MP", "PR", "VI")
-
-    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
-    filter(State_Nm %!in% territories & 
-           Own_Type == "STAT")  %>% 
-    filter(Des_Tp == "ACC" |
-           Des_Tp == "HCA" |
-           Des_Tp == "REC" |
-           Des_Tp == "SCA" |
-           Des_Tp == "SHCA" |
-           Des_Tp == "SP" |
-           Des_Tp == "SREC" |
-           Des_Tp == "SRMA" |
-           Des_Tp == "SW") %>% 
-    filter(d_Pub_Acce != "Closed" & 
-           d_Pub_Acce != "Unknown") %>% 
-    filter(Loc_Ds != "ACC" &
-           Loc_Ds != "Hunter Access",
-           Loc_Ds != "Public Boat Ramp")  %>% 
-    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
-    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
-                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
-    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
-                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
-                               Unit_Nm == "Salton Sea" ~ "visited",
-                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
-                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
-                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
-                               TRUE ~ "not visited")  %>% 
-    shift_geometry(preserve_area = FALSE, 
-                    position = "below") %>% 
-    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-
-    ## save shifted park data
-    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
-
- -

These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

- - - -
2    territories <- c("AS", "GU", "MP", "PR", "VI")
- -

Line 2 creates the list of US territories that I filter out in line 5.

- -

If you want to read more about this line visit part III of the series.

- -
- - - -
4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
- -

Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

- -

st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

- -

The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

- -
- - - -
5    filter(State_Nm %!in% territories  &
-6           Own_Type == "STAT")
- -

R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

- -

The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

- -
- -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
- -

The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

- -
- - - -
7    filter(Des_Tp == "ACC" |
-8           Des_Tp == "HCA" |
-9           Des_Tp == "REC" |
-10          Des_Tp == "SCA" |
-11          Des_Tp == "SHCA" |
-12          Des_Tp == "SP" |
-13          Des_Tp == "SREC" |
-14          Des_Tp == "SRMA" |
-15          Des_Tp == "SW")  
- -

Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

- -

Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

- -
- -Note: -You can view the land designation types by referencing the PAD-US documentation. -
- -

I am going to restrict my data to include the following designations:

- - -

This will leave me with 50,102 rows.

- -
- -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
- -
- - - -
16    filter(d_Pub_Acce != "Closed" & 
-17           d_Pub_Acce != "Unknown")  %>% 
- -

Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

- -

The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

- -

This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

- -
- -

*lines 18-20

- -
18    filter(Loc_Ds != "ACC" &
-19           Loc_Ds != "Hunter Access",
-20           Loc_Ds != "Public Boat Ramp")
- -

Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

- -
- - - -
21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
- -

Now that I’ve pared down the data a little bit, I want discard any column don’t need.

- -

select() lets me choose the columns I want to keep by name, rather than by index number.

- -

I decided to keep:

- - -
- - - -
22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
-23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
-28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
-29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
-30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
- -

mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

- -

I wanted the state park designations to match closely with the types I used in the National Park data.

- -

I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

- -

In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

- -
- - - -
31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
-32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
-33                             Unit_Nm == "Salton Sea" ~ "visited",
-34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
-35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
-36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
-37                             TRUE ~ "not visited")  %>% 
- -

Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

- -

I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

- -
- Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
- -

In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

- -

The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

- -
- - - -
38   shift_geometry(preserve_area = FALSE, 
-39                  position = "below") %>% 
-40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-41
-42   ## save shifted park data
-43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
- -

I’ve covered these lines extensively in part II and part III of this series.

- -

Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

- -

Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

- -

Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

- -

7. divide by state

-
- -
- - -
- - - Image: chaos ensues - - -
chaos ensues
-
- - -
- -

I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

- -

Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

- -

I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

- -
1
-2
-3
-4
-5
-
    split_states <- split(state_parks, f = state_parks$State_Nm)
-    all_names <- names(split_states)   
-
-    for(name in all_names){            
-        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
-
- -

Look ma, new code!

- - - -
1    split_states <- split(state_parks, f = state_parks$State_Nm) 
- -

The split() is part of base R. It takes quite a few arguments, most of which are optional.

- -

The first argument is the vector (or data frame) that you want to split into different groups. I want to split the state_parks data into its corresponding states, so it is listed first.

- -

The second argument f = is how you want the data split. f in this instance stands for factor. If we run levels(as.factor(state_parks$State_Nm)) in the terminal, it will return a list of the 50 state abbreviations. That is what we’re telling R to do here.

- -

You can access an individual state using the $ operator. split_states$CA will return the state park data for California.

- -
- - - -
2    all_names <- names(split_states)
- -

names is also part of base R. It does what it sounds like - it gets the names of an object. Here, I want to get the names of each split data sets.

- -
- - - -
4    for(name in all_names){            
-5        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
- -

Here’s the actual for loop.

- -

The for(x in y) { - do something}

- -

8. conclusion

-
- -

* I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me an email or a tweet if it is.
-** print("Hello World!")

-:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1b/7af65aa14a4b4c037b1b212abefa6089a04283d57a4f3ac88f19a3cc7c9779 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1b/7af65aa14a4b4c037b1b212abefa6089a04283d57a4f3ac88f19a3cc7c9779 new file mode 100644 index 0000000..b2dece2 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1b/7af65aa14a4b4c037b1b212abefa6089a04283d57a4f3ac88f19a3cc7c9779 @@ -0,0 +1,30 @@ +I"ă
+
+
+
+ + +
+ Here you'll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from. + + I have done my best to keep my views out of it. + + I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden. + + If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [link is below] + + Helpful Resources: +
  1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
  2. +
  3. Check your registration status: https://voterstatus.sos.ca.gov/
  4. +
  5. Access the official voter guide: https://voterguide.sos.ca.gov/
  6. +
  7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
  8. +
  9. Track your ballot: https://california.ballottrax.net/voter/
  10. +
  11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
+
+ +
+
+ +
+
+:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1b/dd906283bfce8f38c585d3c382aeec50bc877500792c2a12acdb550a2d8280 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1b/dd906283bfce8f38c585d3c382aeec50bc877500792c2a12acdb550a2d8280 deleted file mode 100644 index ef8cf44..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1b/dd906283bfce8f38c585d3c382aeec50bc877500792c2a12acdb550a2d8280 +++ /dev/null @@ -1,708 +0,0 @@ -I" 

Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

- - - -

project outline

-
- -

I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

- -

I. cartography in r part one

- - -

II. cartography in r part two

- - -

III. cartography in r part three

- - -

IV. cartography in r part four

- - -

V. cartography in r part five

- - -

VI. cartography in r part six

- - -

1. load libraries

-
- -
1
-2
-3
-4
-5
-
    ## load libraries
-    library("tidyverse")        # data manipulation & map creation
-    library("sf")               # loads shapefile
-    library("leaflet")          # creates the map
-    library("operator.tools")   # not-in function
-
- -

I am not going to explain in detail what each of these packages do because I already covered it in part one.

- -

2. load data

-
- -
1
-2
-3
-
    ## load data
-    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
-    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
-
- -

Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

- -

If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

- -

3. download state park data

-
- -

Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

- -

When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

- -

The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

- -
- - - Image: PAD-US download - - -
PAD-US download
-
- -

On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

- -
- - - Image: National data download - PAD-US - - -
National data download - PAD-US
-
- -

When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

- -

4. find the right layer

-
- -

Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

- -
1
-
    layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
-
- -

I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

- -

st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

- -
- - - Image: PAD-US Layer List - - -
PAD-US Layer List
-
- -

The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

- - -

I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

- -

Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

- -
- - - Image: PAD-US Viewer - - -
PAD-US Viewer
-
- -

In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

- -
- - - Image: Search for Valley of Fire State Park - - -
Search for Valley of Fire State Park
-
- -

The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

- -

I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

- -
- - - Image: Valley of Fire State Park Polygon - - -
Valley of Fire State Park Polygon
-
- -

The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

- -

For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

- -
- - - Image: Federal Fee Managers Layer - - -
Federal Fee Managers Layer
-
- -

5. find the right data

-
- -

The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

- -

Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

- -

For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

- -
- - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
Valley of Fire (left), Crissey Field (center), Salton Sea (right)
-
- -
- -Note: -Clicking on an image will open it in a new tab. -
- -

First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

- -
- - - Image: Valley of Fire State Park Table - - -
Valley of Fire State Park Table
-
- -

In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

- -
    -
  1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
  2. -
  3. Mang_Type is the name of the agency that manages the land.
  4. -
  5. Mang_Name is the type of agency that owns the land, here state.
  6. -
  7. Des_Tp is the designation type.
  8. -
  9. Unit_Nm is the land’s name.
  10. -
- -

When we look at Crissey Field, many of these fields have different values.

- -
- - - Image: Crissey Field State Recreation Site Table - - -
Crissey Field State Recreation Site Table
-
- -

The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

- -

Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

- -

On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

- -
- - - Image: Salton Sea - - -
Salton Sea
-
- -

I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

- -

In the viewer, I’ll look for the blueish violet color (boxed in red above).

- -
- - - Image: Salton Sea State Recreation Area Table - - -
Salton Sea State Recreation Area Table
-
- -

After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

- -

Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

- -

6. process state park data

-
- -
- Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
- -
1
-2
-3
-4
-5
-6
-7
-8
-9
-10
-11
-12
-13
-14
-15
-16
-17
-18
-19
-20
-21
-22
-23
-24
-25
-26
-27
-28
-29
-30
-31
-32
-33
-34
-35
-36
-37
-38
-39
-40
-41
-42
-43
-
    ## create territories list
-    territories <- c("AS", "GU", "MP", "PR", "VI")
-
-    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
-    filter(State_Nm %!in% territories & 
-           Own_Type == "STAT")  %>% 
-    filter(Des_Tp == "ACC" |
-           Des_Tp == "HCA" |
-           Des_Tp == "REC" |
-           Des_Tp == "SCA" |
-           Des_Tp == "SHCA" |
-           Des_Tp == "SP" |
-           Des_Tp == "SREC" |
-           Des_Tp == "SRMA" |
-           Des_Tp == "SW") %>% 
-    filter(d_Pub_Acce != "Closed" & 
-           d_Pub_Acce != "Unknown") %>% 
-    filter(Loc_Ds != "ACC" &
-           Loc_Ds != "Hunter Access",
-           Loc_Ds != "Public Boat Ramp")  %>% 
-    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
-    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
-                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
-                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
-    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
-                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
-                               Unit_Nm == "Salton Sea" ~ "visited",
-                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
-                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
-                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
-                               TRUE ~ "not visited")  %>% 
-    shift_geometry(preserve_area = FALSE, 
-                    position = "below") %>% 
-    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-
-    ## save shifted park data
-    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
-
- -

These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

- - - -
2    territories <- c("AS", "GU", "MP", "PR", "VI")
- -

Line 2 creates the list of US territories that I filter out in line 5.

- -

If you want to read more about this line visit part III of the series.

- -
- - - -
4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
- -

Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

- -

st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

- -

The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

- -
- - - -
5    filter(State_Nm %!in% territories  &
-6           Own_Type == "STAT")
- -

R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

- -

The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

- -
- -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
- -

The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

- -
- - - -
7    filter(Des_Tp == "ACC" |
-8           Des_Tp == "HCA" |
-9           Des_Tp == "REC" |
-10          Des_Tp == "SCA" |
-11          Des_Tp == "SHCA" |
-12          Des_Tp == "SP" |
-13          Des_Tp == "SREC" |
-14          Des_Tp == "SRMA" |
-15          Des_Tp == "SW")  
- -

Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

- -

Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

- -
- -Note: -You can view the land designation types by referencing the PAD-US documentation. -
- -

I am going to restrict my data to include the following designations:

- - -

This will leave me with 50,102 rows.

- -
- -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
- -
- - - -
16    filter(d_Pub_Acce != "Closed" & 
-17           d_Pub_Acce != "Unknown")  %>% 
- -

Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

- -

The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

- -

This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

- -
- -

*lines 18-20

- -
18    filter(Loc_Ds != "ACC" &
-19           Loc_Ds != "Hunter Access",
-20           Loc_Ds != "Public Boat Ramp")
- -

Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

- -
- - - -
21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
- -

Now that I’ve pared down the data a little bit, I want discard any column don’t need.

- -

select() lets me choose the columns I want to keep by name, rather than by index number.

- -

I decided to keep:

- - -
- - - -
22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
-23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
-25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
-27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
-28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
-29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
-30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
- -

mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

- -

I wanted the state park designations to match closely with the types I used in the National Park data.

- -

I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

- -

In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

- -
- - - -
31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
-32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
-33                             Unit_Nm == "Salton Sea" ~ "visited",
-34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
-35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
-36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
-37                             TRUE ~ "not visited")  %>% 
- -

Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

- -

I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

- -
- Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
- -

In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

- -

The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

- -
- - - -
38   shift_geometry(preserve_area = FALSE, 
-39                  position = "below") %>% 
-40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-41
-42   ## save shifted park data
-43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
- -

I’ve covered these lines extensively in part II and part III of this series.

- -

Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

- -

Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

- -

Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

- -

7. divide by state

-
- -
- - -
- - - Image: chaos ensues - - -
chaos ensues
-
- - -
- -

I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

- -

Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

- -

I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

- -
1
-2
-3
-4
-5
-
    split_states <- split(state_parks, f = state_parks$State_Nm)
-    all_names <- names(split_states)   
-
-    for(name in all_names){            
-        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
-
- -

Look ma, new code!

- - - -
1    split_states <- split(state_parks, f = state_parks$State_Nm) 
- -

The split() is part of base R. It takes quite a few arguments, most of which are optional.

- -

The first argument is the vector (or data frame) that you want to split into different groups. I want to split the state_parks data into its corresponding states, so it is listed first.

- -

The second argument f = is how you want the data split. f in this instance stands for factor. If we run levels(as.factor(state_parks$State_Nm)) in the terminal, it will return a list of the 50 state abbreviations. That is what we’re telling R to do here.

- -

You can access an individual state using the $ operator. split_states$CA will return the state park data for California.

- -
- - - -
2    all_names <- names(split_states)
- -

names is also part of base R. It does what it sounds like - it gets the names of an object. Here, I want to get the names of each split data sets.

- -
- - - -
4    for(name in all_names){            
-5        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
- -

Here’s the actual for loop.

- -

The basic logic of a for loop is:
- for(x in y){
- do something}

- -

Inside the parenthesis is the condition that must evaluate to TRUE if the content in the curly braces is to run.

- -

In line 4, for(name in all_names){ says as long as there’s a name in the list of all names, do whatever is inside the curly braces. name can be whatever you want. It’s a placeholder value. I can have it say for(dogs in all_names){ it will still do the exact same thing. A lot of time you’ll see it as an i for item. I like to use more descriptive language because, again, for loops are my Achilles’ heel.

- -

The all_names part is where ever you want R to look for the data. It will change based on your data set and variable naming conventions.

- -

In line 5, I save the split data sets.

- -

st_write() is part of the sf package which allows us to create shapefiles. This can be any saving function (eg. write_csv() if you want to save CSVs). The function takes several arguments. In line 43 above I showed the basic structure: st_write(data, path/to/file.shp). This is good if you only have one file, but since I’m saving them in a loop I don’t want all of the files to have the same name. R will error out after the first and tell you the file already exists.

- -

The first part split_states[[name]] is still telling R what data to save, but using an index instead of a specific data frame name. To access an index you use data[[some-value]] where some-value is the index location. In my code, R will take the split_states data and go alright the first index location in [[name]] is 1 and return whatever value is stored in that index (here, AK). It will then do that for every index location as it loops through the split_states data.

- -

paste0() is also part of base R - it’s apparently faster than paste(). It concatenates (or links together) different pieces into one. I’m using it to create the filename. Within the paste0 call anything within quotation marks is static. So every file will be saved to "shapefiles/shifted/states/individual/" and every file will have the extension .shp. What will change with each loop is the name of the file. One by one, R will loop through and save each file using the name it pulled from all_names.

- -

st_write() automatically creates the other three files that each “shapefile” needs. When the loop is done, you should have a folder of 200 files (50 states * 4 files each). Which is why I strongly recommend using DVC if you’re doing any kind of version control.

- -

8. conclusion

-
- -

That’s all the processing done for the state files… for now. In part VI I’ll return to the states to create each state’s own map. Next up, in part V, I’m going back to my base map with the National Parks to add in some informational tool tips and interactivity.

- -




-
- -

* I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me an email or a tweet if it is.
-** print("Hello World!")

-:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1c/ebb05fe0b5ccedc7cd969006c17ad5c4e73c15915cb4b3882529f679e63771 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1c/ebb05fe0b5ccedc7cd969006c17ad5c4e73c15915cb4b3882529f679e63771 deleted file mode 100644 index 8d7f66c..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1c/ebb05fe0b5ccedc7cd969006c17ad5c4e73c15915cb4b3882529f679e63771 +++ /dev/null @@ -1,595 +0,0 @@ -I"®ĺ

This is part three of my cartography in R series. If you are just finding this, I suggest taking a look at part I and part II first.

- -

In this post, I will download and process the National Park data. Once that’s done, I’ll add it to the base map I created in part II.

- - - -
- Note: I know that my posts are verbose. I am sorry that I am not more concise but one of the biggest problems I have with code documentation is that it assumes you already know what you're doing. I'm writing these for absolute beginners that want to know what each part of the code does. If you want more concise explanations, I suggest looking at the annotated r files in the project's GitHub. -
- -

project outline

-
- -

I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

- -

I. cartography in r part one

- - -

II. cartography in r part two

- - -

III. cartography in r part three

- - -

IV. cartography in r part four

- - -

V. cartography in r part five

- - -

V. cartography in r part six

- - -

1. load libraries

-
- -
1
-2
-3
-4
-5
-
    ## load libraries
-    library("tidyverse")        # data manipulation & map creation
-    library("sf")               # loads shapefile
-    library("leaflet")          # creates the map
-    library("operator.tools")   # not-in function
-
- -

I am not going to explain in detail what each of these packages do because I already covered it in part one.

- -

2. load usa data

-
- -
1
-2
-
    ## load data
-    states <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
-
- -

Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp to reflect wherever you saved the shifted shapefile.

- -

If your data processing and base map creation are in the same file, you can skip this line, and when you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

- -

3. download national park data

-
- -

The National Park Service provides all the data we’ll need to make the map. The data is accessible on the ArcGIS’ Open Data website. Once you click on the link you’ll see a bunch of icons that lead to different data that’s available for download. Click on the one for boundaries.

- -
- - - Image: NPS GIS website - - -
NPS GIS website
-
- -

From here, you’ll be taken to a list of available National Park data. The second link should be nps boundary which contains the shape data for all the National Parks in the United States. The file contains all the data for the park outlines along with hiking trails, rest areas, and lots of other data.

- -
- - - Image: NPS download page - - -
NPS download page
-
- -

The nps boundary link will take you to a map showing the national parks. On the left, there will be a download link on the left.

- -
- - - Image: NPS map and download link - - -
NPS map and download link
-
- -

From here, you’ll have a few download options. The National Park Service provides the data in different formats including CSV and Shapefile. You’ll want to download the shapefile version.

- -
- - - Image: download nps shapefile - - -
download nps shapefile
-
- -

Be sure to save the file somewhere on your hard drive that is easy to find. When it finishes downloading, be sure to unzip the file. There will be four files inside the folder. All of them need to be kept in the same location. Even though we’ll only load the .shp file, R uses the three others to create the necessary shapes.

- -

4. process national park data

-
- -

The code below may look intimidating, but it’s fairly straight forward. I’ll go over each line below.

- -
- Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
- -
1
-2
-3
-4
-5
-6
-7
-8
-9
-10
-11
-12
-13
-14
-15
-16
-17
-18
-19
-20
-21
-22
-23
-24
-25
-26
-27
-28
-29
-30
-31
-32
-33
-34
-35
-36
-37
-38
-39
-40
-41
-42
-43
-
    ## create territories list
-    territories <- c("AS", "GU", "MP", "PR", "VI")
-
-    ## load and process nps data
-    nps <- read_sf("./shapefiles/original/nps/NPS_-_Land_Resources_Division_Boundary_and_Tract_Data_Service.shp")  %>% 
-    select(STATE, UNIT_TYPE, PARKNAME, Shape__Are, geometry) %>% 
-    filter(STATE %!in% territories) %>%  
-    mutate(type = case_when(UNIT_TYPE == "International Historic Site" ~ "International Historic Site", # there's 23 types of national park, I wanted to reduce this number.
-                UNIT_TYPE == "National Battlefield Site" ~ "National Military or Battlefield", # lines 56-77 reduce the number of park types
-                UNIT_TYPE == "National Military Park" ~ "National Military or Battlefield", 
-                UNIT_TYPE == "National Battlefield" ~ "National Military or Battlefield",
-                UNIT_TYPE == "National Historical Park" ~ "National Historical Park, Site, Monument, or Memorial",
-                UNIT_TYPE == "National Historic Site" ~ "National Historical Park, Site, Monument, or Memorial",
-                UNIT_TYPE == "National Historic Trail" ~ "National Historical Park, Site, Monument, or Memorial",
-                UNIT_TYPE == "National Memorial" ~ "National Historical Park, Site, Monument, or Memorial",
-                UNIT_TYPE == "National Monument" ~ "National Historical Park, Site, Monument, or Memorial",
-                UNIT_TYPE == "National Preserve" ~ "National Preserve, Reserve, or Recreation Area",
-                UNIT_TYPE == "National Reserve" ~ "National Preserve, Reserve, or Recreation Area",
-                UNIT_TYPE == "National Recreation Area" ~ "National Preserve, Reserve, or Recreation Area",
-                UNIT_TYPE == "National River" ~ "National River, Lakeshore, or Seashore",
-                UNIT_TYPE == "National Lakeshore" ~ "National River, Lakeshore, or Seashore",
-                UNIT_TYPE == "National Wild & Scenic River" ~ "National River, Lakeshore, or Seashore",
-                UNIT_TYPE == "National Seashore" ~ "National River, Lakeshore, or Seashore",
-                UNIT_TYPE == "National Trails Syste" ~ "National Trail",
-                UNIT_TYPE == "National Scenic Trail" ~ "National Trail",
-                UNIT_TYPE == "National Park" ~ "National Park or Parkway",
-                UNIT_TYPE == "Park" ~ "National Park or Parkway",
-                UNIT_TYPE == "Parkway" ~ "National Park or Parkway",
-                UNIT_TYPE == "Other Designation" ~ "Other National Land Area"))  %>%  
-    mutate(visited = case_when(PARKNAME == "Joshua Tree" ~ "visited", 
-                                PARKNAME == "Redwood" ~ "visited", 
-                                PARKNAME == "Santa Monica Mountains" ~ "visited", 
-                                PARKNAME == "Sequoia" ~ "visited", 
-                                PARKNAME == "Kings Canyon" ~ "visited",
-                                PARKNAME == "Lewis and Clark" ~ "visited",
-                                PARKNAME == "Mount Rainier" ~ "visited",
-                                TRUE ~ "not visited")) %>%  
-    shift_geometry(preserve_area = FALSE, 
-                    position = "below") %>% 
-    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
-
-    ## save the shifted national park data
-    st_write(nps, "~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
-
- - - -
2    territories <- c("AS", "GU", "MP", "PR", "VI")
- -

In part I of this series I talked about how R has an %in% function, but not a %!in% function. Here’s where the latter function shines.

- -

The United States is still an empire with its associated territories and islands. In this project I am interested in the 50 states - without these other areas. As a result, I need to filter them out. Using base R’s %in% function I would have to create a variable that contains the postal abbreviations for all 50 states. That is annoying. Instead, I want to use the shorter list that only includes the US’ associated islands and territories. To do so, however, I need to use the operator tools’ %!in% function.

- -

Line 2 creates the list of US territories that I filter out in line 7. The c() function in R means combine or concatenate. Inside the parenthesis are the five postal codes for the American Samoa, Guam, the Northern Mariana Islands, Puerto Rico, and the Virgin Islands.

- -
- - - -
5   nps <- read_sf("path/to/file/NPS_-_Land_Resources_Division_Boundary_and_Tract_Data_Service.shp")  %>% 
- -

nps <- read_sf("path/to/file.shp") loads the National Park data set to a variable called nps using the read_sf() function that is part of the sf package. You will need to change the file path so it reflects where you saved the data on your hard drive.

- -

The %>% operator is part of the tidyverse package. It tells R to go to the next line and process the next command. It has to go at the end of a line, rather than the beginning.

- -
- - - -
6    select(STATE, UNIT_TYPE, PARKNAME, Shape__Are, geometry) %>%
- -

select is part of the tidyverse package. With it, we can select columns by their name rather than their associated number. Large data sets take more computing power because the computer has to iterate over more rows. Unfortunately, rendering maps also takes a lot of computing power so I like to discard any unnecessary columns to reduce the amount of effort my computer has to exert.

- -

Deciding on which columns to keep will depend on the data you’re using and what you want to map (or analyze). I know for my project I want to include a few things:

- - -

There’s a couple ways to inspect the data to see what kind of information is available.

- -
    -
  1. You can view the entire data set by using view(nps) but as the number of data points increases, so does R’s struggle with opening it. I’ve found that VSCode doesn’t throw as big of a fit as R Studio when opening large data sets.
  2. -
  3. Another way is to just look at the column names using data.frame(colnames(nps)). This will return a list of the data set’s column names. This is my preferred method. I then go to the documentation to see what each column contains. This isn’t fool-proof because it really depends on if the data has good documentation.
  4. -
- -

The National Park data includes a lot of information about who created the data and maintains the property. I’m not interested in this, so in line 6 I select the following columns:

- - -
Note: The column name must be exact for select() to work. For example, in the NPS data, the park area is stored the Shape__Are column - two underscores and no a in area - and I have to use that exact format when selecting the column.
- -

The geometry column is specific to shapefiles and it includes the coordinates of the shape. It will be kept automatically - unless you use the st_drop_geometry() function. I like to specifically select so I remember it’s there.

- -
- - - -
7    filter(STATE %!in% territories) %>%  
- -

In line 7 I use the territories list I created in line 2 to filter out the United States’ associated areas. Since the nps data uses the two character state abbreviation, I have to use the two character abbreviation for the territories. Searching for “Guam,” for example, won’t work.

- -

filter() is part of the tidyverse and it uses conditional language. In the parentheses is a condition that must be true if the tidyverse is going to keep the row. Starting at the top of the data, R goes “alright, does the value in the STATE column match any of the values in the territories list?” If the condition is TRUE, R adds the row to the new data frame.

- -
Note: Since we're using the %!in% operator, any row that evaluates as TRUE will be kept because the value is NOT found in the territories list. If I wanted to keep only the territories, I would use the %in% operator and only the rows with STATE abbreviations found in the territories list would be kept.
- -

For example, if the STATE value in row 1 is CA, filter looks at it and goes “is CA NOT IN territories?” If that is TRUE, keep it because we want only the values that are NOT IN the territories list.

- -
- - - -
8    mutate(type = case_when(UNIT_TYPE == "International Historic Site" ~ "International Historic Site",
- -

mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

- -

The NPS data set has 23 different types of National Parks listed (you can view all of them by running levels(as.factor(nps$UNIT_TYPE))). I know that in later posts, I’m going to color code the land by type (blue for rivers, green for national parks, etc) so I wanted to reduce the number of colors I would have to use.

- -

mutate()’s first argument, type = creates a new column called type. R will populate the newly created column with whatever comes after the first (singular) equal = sign. For example, I can put type = NA and every row in the column will say NA.

- -

Here, I am using the case_when() function, which is also part of the tidyverse. The logic of case_when is fairly straight forward. The first value is the name of the column you want R to look in (here: UNIT_TYPE). Next, is a conditional. Here I am looking for an exact match (==) to the string (words) inside the first set of quotation marks (in line 8: "International Historic Site"). The last part of the argument is what I want R to put in the type column when it finds a row where the UNIT_TYPE is "International Historic Site".

- -

In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value")

- -

Lines 9-29 do the same thing for the other park types. You can reduce the parks however you want or use all 23 types. Just remember that the value before the tilde ~ has to match the values found in the data exactly. For example, in line 24 I change the NPS data’s National Trail Syste value to be National Trail. Whomever created the data set did not spell system correctly, so for R to match the value I also have to omit the last letter in system.

- -
- - - -
30    mutate(visited = case_when(PARKNAME == "Joshua Tree" ~ "visited",
- -

Lines 30-37 use the same mutate() and case_when logic as above. Instead of reducing the number of park types, I use it to mark the different parks I have visited.

- -

Line 30 creates the new column, visited and uses case_when to look for the names of the parks that I’ve been to. If I have visited them, it adds visited to the column of the same name.

- -

The last line, TRUE ~ "not_visited)), acts as an else statement. For any park not listed above, it will put not visited in the visited column I created.

- -

This feels like a very brute-force method of tracking which parks I’ve visited, but I haven’t spend much time trying to find another way.

- -
- - - -
38    shift_geometry(preserve_area = FALSE, 
-39                   position = "below") %>%
- -

In part I, when I made the base map, I moved Alaska and Hawaii so they were of similar size and closer to the continental USA. For the map to display the parks correctly, I have to shift them as well.

- -

I went over these two lines in part II, so I won’t go over them again here. If you want to read more about them, check out that post.

- -
- - - -
40  sf::st_transform('+proj=longlat +datum=WGS84')
- -

The last line uses the st_transform() function from the sf package to covert the data set from NAD83 to WGS84. Leaflet requires WGS84, so be sure to include this line at the end of your data manipulation.

- -

I covered the WGS84 ellipsoid in part I, if you want to read more about it.

- -
- - - -
43  st_write(nps, "~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
- -

Strictly speaking, this line isn’t necessary. You can do all your data processing in the same file where you make your map, but I prefer to separate the steps into different files.

- -

As a result, I save the shifted data to my hard drive so it’s easier to load later. I usually have this line commented out (by placing # at the start of the line) after I save it the first time. I don’t want it to save every time I run the rest of the code.

- -

5. add national parks to base map

-
- -
1
-2
-3
-4
-5
-6
-7
-8
-9
-10
-11
-12
-13
-14
-15
-16
-17
-18
-19
-20
-21
-22
-23
-24
-25
-26
-27
-28
-29
-30
-31
-32
-33
-34
-
## create usa Base Map using leaflet()
-    map <- leaflet() %>%
-    addPolygons(data = states,
-        smoothFactor = 0.2,
-        fillColor = "#808080",
-        fillOpacity = 0.5,
-        stroke = TRUE,
-        weight = 0.5,
-        opacity = 0.5,
-        color = "#808080",
-        highlight = highlightOptions(
-            weight = 0.5,
-            color = "#000000",
-            fillOpacity = 0.7,
-            bringToFront = FALSE),
-        group = "Base Map")  %>% 
-    addPolygons(data = nps,
-        smoothFactor = 0.2,                 
-        fillColor = "#354f52",
-        fillOpacity = 1,
-        stroke = TRUE,
-        weight = 1,     
-        opacity = 0.5,                       
-        color = "#354f52",             
-        highlight = highlightOptions(
-            weight = 3,
-            color = "#fff",
-            fillOpacity = 0.8,
-            bringToFront = TRUE),
-        group = "National Parks")  %>%
-    addLayersControl(
-        baseGroups = "Base Map",
-        overlayGroups = "National Parks",
-        options = layersControlOptions(collapsed = FALSE))
-
- - - -

Lines 2-16 are identical to those in part II where I created the base map. I am not going to cover these sections in detail, because I covered it previously.

- -
- - - -
17    addPolygons(data = nps,
- -

To add the National Park data to the base map, we call addPolygons() again. The arguments are the same as before - color, opacity, outline style - just with different values. By changing those values, we can differentiate the base map from the national park data.

- -

Since we’re mapping the National Parks and not the states, we have to tell R where the data is located using data = nps.

- -
- - - -
18   smoothFactor = 0.2,
- -

smoothFactor() determines how detailed the park boundaries should be. The lower the number, the more detailed the shape. The higher the number, the smoother the parks will render. I usually match this to whatever I set for the base map for consistency.

- -
- - - -
19   fillColor = "#354f52",
-20   fillOpacity = 1,
- -

Define the color and transparency of the National Parks. In a future post, I am going to change the color of each type of public land, but for now, I’ll make them all a nice sage green color #354f52. I also want to make the parks to be fully opaque.

- -
- - - -
21    stroke = TRUE,
-22    weight = 1,     
-23    opacity = 0.5,                       
-24    color = "#354f52",
- -

The next four lines (21-24) define what kind of outline the National Parks will have. I detail each of these arguments in part II of this series.

- -

Briefly, I want there to be an outline to each park (stroke = TRUE) that’s thicker weight = 1 than the outline used on the base map. I do not like the way it looks at full opacity, so I make it half-transparent (opacity = 0.5). Finally, I want the outline color = "#354f52 to be the same color as the fill. This will matter more when I change the fill color of the parks later on.

- -
- - - -
25  highlight = highlightOptions(
-26              weight = 3,
-27              color = "#fff",
-28              fillOpacity = 0.8,
- -

Lines 25-28 define the National Park’s behavior on mouseover. First we have to define and initialize the highlightOptions() function. The function take similar arguments as the addPolygons function - both of which I go over in detail in part II.

- -

I want to keep the mouseover behavior noticeable, but simple. To do so, I set the outline’s thickness to be weight = 3. This will give the shape a nice border that differentiates it from the rest of the map.

- -

color = "#fff sets the outline’s color on mouseover only. So, when inactive, the outline color will match the fill color, but on mouseover the outline color switches to white (#fff).

- -
- Note: When the hex code repeats (like, white - #ffffff, black - #000000, or grey #808080) you only need to include the first three characters.
- -
- - - -
29    bringToFront = TRUE),
- -

bringToFront can either be TRUE or FALSE. If TRUE, Leaflet will bring the park to the forefront on mouseover. This is useful later when we add in the state parks because national and state parks tend to be close together.

- -

When FALSE the shape will remain static.

- -
- - - -
30    group = "National Parks")  %>%
- -

Since Leaflet adds all new data to the top of the base map, I think it’s useful to group the layers together. In the next block of code, we add in some layer functionality. For now, though, I want to add the National Parks to their own group so I can hide the National Parks if I want.

- -
- - - -
31    addLayersControl(
-32        baseGroups = "Base Map",
-33        overlayGroups = "National Parks",
-34        options = layersControlOptions(collapsed = FALSE))
- -

addLayersControl defines how layers are displayed on the final map. The function takes three arguments.

- -

First, we have to tell Leaflet which layer should be used as the base map: baseGroups = "Base Map". The name in the quotations (here: "Base Map") has to match the name given to the layer you set in the addPolygons() call. In line 14, I put the 50 states into a group called "Base Map", but you can name it anything you like.

- -

There can be more than one base map, too. It’s not super helpful here since I shifted Alaska and Hawaii, but when using map tiles you can add multiple types of base maps that users can switch between.

- -

Next, we have to define the layers that are shown on top of the base group: overlayGroups = "National Parks". Just like the base map, this is defined in the corresponding addPolygons call. Here, I called the layer National Parks in line 30.

- -

Finally, on the map I don’t want the layers to be collapsed, so I set options = layersControlOptions(collapsed = FALSE). When TRUE the map will display an icon in the top right that, when clicked, will show the available layers.

- -
- - - Image: Layers collapsed - - -
Layers collapsed
-
- -

6. conclusion

-

Hey, look at that! You made a base map and you added some National Park data to it. You’re a certified cartographer now!

- -

In the next part IV post we’ll download and process the state park data before adding it to the map. Part V of this series we’ll add Shiny functionality and some additional markers.

- - -
Map showing US National Parks
-

</figure>

-:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1d/f02237ce189d314dfa286e0f263a1e488e15a42f4f48134ec26601492652fe b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1d/f02237ce189d314dfa286e0f263a1e488e15a42f4f48134ec26601492652fe new file mode 100644 index 0000000..dd9c293 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/1d/f02237ce189d314dfa286e0f263a1e488e15a42f4f48134ec26601492652fe @@ -0,0 +1,30 @@ +I"Ů

welcome

+
+ +

Here you’ll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from.

+ +

I have done my best to keep my views out of it.

+ +

I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden.

+ +

If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page]

+ +

helpful resources

+
+ +
    +
  1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
  2. +
  3. Check your registration status: https://voterstatus.sos.ca.gov/
  4. +
  5. Access the official voter guide: https://voterguide.sos.ca.gov/
  6. +
  7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
  8. +
  9. Track your ballot: https://california.ballottrax.net/voter/
  10. +
  11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
  12. +
+ +

ballot guides

+
+ +

Clink on a link to download the PDF.

+ +

primary elections

+:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/20/02c813a7a0562d670ff6fd205e5ae9f57da744e69cc7978282b999513251b0 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/20/02c813a7a0562d670ff6fd205e5ae9f57da744e69cc7978282b999513251b0 new file mode 100644 index 0000000..b35dab2 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/20/02c813a7a0562d670ff6fd205e5ae9f57da744e69cc7978282b999513251b0 @@ -0,0 +1,73 @@ +I" +
It made a difference to that one. + +
+ +

I have a simple motto in life: Do what you can, where you are, with what you have. As a result, I believe strongly in doing whatever is in my means to make the world a better place. Below are some groups that I have either founded or joined in order to help those around me.</p>

+ +

Current Groups

+
+ + + +

Previous Groups

+
+ + + +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/21/2b0032250ee0c110ac1a4e54a659611bd8ce0b33c6f3300eeda916a2cc6f74 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/21/2b0032250ee0c110ac1a4e54a659611bd8ce0b33c6f3300eeda916a2cc6f74 new file mode 100644 index 0000000..835fa05 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/21/2b0032250ee0c110ac1a4e54a659611bd8ce0b33c6f3300eeda916a2cc6f74 @@ -0,0 +1,30 @@ +I"Ŕ
+
+
+
+ + +
+ Here you'll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from. + + I have done my best to keep my views out of it. + + I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden. + + If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [link is below] + + Helpful Resources: +
  1. Register to vote (Deadline May 23): <a href="https://registertovote.ca.gov/??(https://registertovote.ca.gov/){:target="_blank" rel="noopener noreferrer"}
  2. +
  3. Check your registration status: <a href="https://voterstatus.sos.ca.gov/](https://voterstatus.sos.ca.gov/){:target="_blank" rel="noopener noreferrer"}
  4. +
  5. Access the official voter guide: <a href="https://voterguide.sos.ca.gov/](https://voterguide.sos.ca.gov/){:target="_blank" rel="noopener noreferrer"}
  6. +
  7. Early voting & ballot drop off locations: <a href="https://caearlyvoting.sos.ca.gov/](https://caearlyvoting.sos.ca.gov/){:target="_blank" rel="noopener noreferrer"}
  8. +
  9. Track your ballot: <a href="https://california.ballottrax.net/voter/](https://california.ballottrax.net/voter/){:target="_blank" rel="noopener noreferrer"}
  10. +
  11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
+
+ +
+
+ +
+
+:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4b/e9a9a6ca692d0dd9c67238e42b1c5b3c500bfb231a0e20e354360c932af009 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/22/56328d56265155b7cc82831a1663a93bf01c262c5f1cc90dc1a0c81266cbb8 similarity index 92% rename from .jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4b/e9a9a6ca692d0dd9c67238e42b1c5b3c500bfb231a0e20e354360c932af009 rename to .jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/22/56328d56265155b7cc82831a1663a93bf01c262c5f1cc90dc1a0c81266cbb8 index 195896a..633d270 100644 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4b/e9a9a6ca692d0dd9c67238e42b1c5b3c500bfb231a0e20e354360c932af009 +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/22/56328d56265155b7cc82831a1663a93bf01c262c5f1cc90dc1a0c81266cbb8 @@ -1,4 +1,4 @@ -I"=1 +I"4
@@ -14,7 +14,7 @@
  • tutorial - 8 + 9
  • @@ -25,24 +25,28 @@
  • cartography - 4 + 5
  • nps - 4 + 5
  • r - 4 + 5
  • + + + +
  • version control @@ -114,6 +118,8 @@
    • +
      cartography in R part five
      +
      cartography in R part four
      cartography in R part three
      @@ -166,6 +172,8 @@
      • +
        cartography in R part five
        +
        cartography in R part four
        cartography in R part three
        @@ -207,6 +215,8 @@
        • +
          cartography in R part five
          +
          cartography in R part four
          cartography in R part three
          @@ -240,6 +250,8 @@
          • +
            cartography in R part five
            +
            cartography in R part four
            cartography in R part three
            @@ -267,6 +279,9 @@ + + +

            version control

            diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/25/d6ec4fbe0174aa0894ef7164d871adfe77a0bd18816c97728687cd605b0b76 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/25/d6ec4fbe0174aa0894ef7164d871adfe77a0bd18816c97728687cd605b0b76 deleted file mode 100644 index 71b5fe7..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/25/d6ec4fbe0174aa0894ef7164d871adfe77a0bd18816c97728687cd605b0b76 +++ /dev/null @@ -1,228 +0,0 @@ -I"¦>

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -

            The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management and the Bureau of Reclamation. Having visited the park, I can tell you there’s no fences blocking these areas off. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area. It will be a good test case to make sure I’m selecting the correct data.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/27/12aa8f79bd9e22edc713eea4ef92bd6a4a4a31cbbca0bd31ab37fb4c291d03 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/27/12aa8f79bd9e22edc713eea4ef92bd6a4a4a31cbbca0bd31ab37fb4c291d03 new file mode 100644 index 0000000..8c88e6e --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/27/12aa8f79bd9e22edc713eea4ef92bd6a4a4a31cbbca0bd31ab37fb4c291d03 @@ -0,0 +1,3 @@ +I"…

            Welcome to part five of my cartography in R series. In this post I’ll return to the maps created in part II and part III to include a Shiny information box and popups linking to posts about my adventures in the National Parks.

            + +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/29/9ccf70e786b1023deae25b915c0bbde5d7f79a5973d22363bc86205b4ca3cb b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/29/9ccf70e786b1023deae25b915c0bbde5d7f79a5973d22363bc86205b4ca3cb deleted file mode 100644 index b77547b..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/29/9ccf70e786b1023deae25b915c0bbde5d7f79a5973d22363bc86205b4ca3cb +++ /dev/null @@ -1,434 +0,0 @@ -I"š

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") 
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/2b/7808a3160d4680bffbf39d28cbfb14419007e1a79454655e9c6125a06c5318 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/2b/7808a3160d4680bffbf39d28cbfb14419007e1a79454655e9c6125a06c5318 deleted file mode 100644 index 522d943..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/2b/7808a3160d4680bffbf39d28cbfb14419007e1a79454655e9c6125a06c5318 +++ /dev/null @@ -1,524 +0,0 @@ -I"Ó¸

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/2b/ccc85dee2a42e37ec6c1d3163037d7653766e17979194dc1841202c50b7182 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/2b/ccc85dee2a42e37ec6c1d3163037d7653766e17979194dc1841202c50b7182 new file mode 100644 index 0000000..0ea8350 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/2b/ccc85dee2a42e37ec6c1d3163037d7653766e17979194dc1841202c50b7182 @@ -0,0 +1,31 @@ +I"ľ

            welcome

            +
            + +

            Here you’ll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from.

            + +

            I have done my best to keep my views out of it.

            + +

            I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden.

            + +

            If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page]

            + +

            helpful resources:

            +
            + +
              +
            1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
            2. +
            3. Check your registration status: https://voterstatus.sos.ca.gov/
            4. +
            5. Access the official voter guide: https://voterguide.sos.ca.gov/
            6. +
            7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
            8. +
            9. Track your ballot: https://california.ballottrax.net/voter/
            10. +
            11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
            12. +
            + +

            ballot guides

            +
            + +

            Clink on a link to download the PDF.

            +

            primary elections

            +

            2022 Primary California

            + +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/2f/6fb60b3b2619c1e86123645096c892e3530120480586f700080da8b2dab8a6 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/2f/6fb60b3b2619c1e86123645096c892e3530120480586f700080da8b2dab8a6 deleted file mode 100644 index 8f7778b..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/2f/6fb60b3b2619c1e86123645096c892e3530120480586f700080da8b2dab8a6 +++ /dev/null @@ -1,604 +0,0 @@ -I"Bâ

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            V. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/30/51b85da51e12c435558dc283062090267ccd72d1c77cf35f981d565ca84322 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/30/51b85da51e12c435558dc283062090267ccd72d1c77cf35f981d565ca84322 new file mode 100644 index 0000000..d4e4d6b --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/30/51b85da51e12c435558dc283062090267ccd72d1c77cf35f981d565ca84322 @@ -0,0 +1,30 @@ +I"î
            +
            +
            +
            + + +
            + Here you'll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from. + + I have done my best to keep my views out of it. + + I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden. + + If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [link is below] + + Helpful Resources: +
            1. Register to vote (Deadline May 23): <a href="https://registertovote.ca.gov/>https://registertovote.ca.gov/</a>
            2. +
            3. Check your registration status: https://voterstatus.sos.ca.gov/
            4. +
            5. Access the official voter guide: https://voterguide.sos.ca.gov/
            6. +
            7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
            8. +
            9. Track your ballot: https://california.ballottrax.net/voter/
            10. +
            11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
            +
            + +
            +
            + +
            +
            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/32/3ad0e4b753f492c45ef38f092356a7df354769f922611a6c2d0b1304a95815 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/32/3ad0e4b753f492c45ef38f092356a7df354769f922611a6c2d0b1304a95815 deleted file mode 100644 index 89af035..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/32/3ad0e4b753f492c45ef38f092356a7df354769f922611a6c2d0b1304a95815 +++ /dev/null @@ -1,298 +0,0 @@ -I"g[

            This is a continuation of my previous post where I walked through how to download and modify shape data. I also showed how to shift Alaska and Hawaii so they are closer to the continental usa. -

            - -

            In this post, I’ll go over how to use Leaflet to map the shapefile we made in the previous post. If you’ve come here from part one of the series, you probably have the libraries and data loaded already. However, if you don’t, be sure to load the libraries and shapefiles before moving to number two.

            - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two [this post]

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -
                ## load libraries
            -    library("tidyverse")    # data manipulation & map creation
            -    library("sf")           # loads shapefile
            -    library("leaflet")      # creates the map
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -
                ## load data
            -    states <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp to reflect wherever you saved the shifted shapefile.

            - -

            If your data processing and base map creation are in the same file, you can skip this line, and when you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. create the base map

            -
            - -

            At its most basic, all Leaflet needs to create a map is a base map and data layers. The code below may look intimidating, but it’s mostly style options.

            - -

            This is the map we’re going to create. It’s a simple grey map and each state darkens in color as you hover over it. I’ll show the same map after each style option is added so you can see what effect it has.

            - -
            - - - Image: Final USA basemap - - -
            Final USA basemap
            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line. -
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -
                ## create usa base map using leaflet()
            -    map <- leaflet() %>%
            -    addPolygons(data = states,
            -        smoothFactor = 0.2,
            -        fillColor = "#808080",
            -        fillOpacity = 0.5,
            -        stroke = TRUE,
            -        weight = 0.5,
            -        opacity = 0.5,
            -        color = "#808080",
            -        highlight = highlightOptions(
            -            weight = 0.5,
            -            color = "#000000",
            -            fillOpacity = 0.7,
            -            bringToFront = FALSE),
            -        group = "Base Map")
            -
            - -
              -
            • line 2:
            • -
            - -
            2    map <- leaflet() %>% 
            - -

            leaflet() initializes the map widget. I save it to a variable called map (map <-) so I can run other code in the file without recreating the map each time. When you want to see the map, you can type map (or whatever you want to name your map) in the terminal and hit enter. R will display the map in the viewer.

            - -
            - -
              -
            • line 3:
            • -
            - -
            3    addPolygons(data = states,
            - -

            addPolygons() adds a layer to the map widget. Leaflet has different layer options, including addTiles and addMarkers which do different things. You can read about them on the leaflet website. Since we’re using a previously created shapefile, we’ll add the shapefile to the map using addPolygons().

            - -

            The first argument you need to specify after calling addPolygons is data = [data-source]. [data-source] is whatever variable your data is stored in. For me, it’s called states. This is either the processed data from part I of this series or the saved shapefile loaded above under the section called load data.

            - -

            When you run only the first two lines, Leaflet will use its default styling. The base color will be a light blue and the outlines of the states will be dark blue and fairly thick.

            - -
            - - - Image: USA map with default leaflet styling - - -
            USA map with default leaflet styling
            -
            - -

            You can leave the base map like this if you want, but all additional data will be added as a layer on top</i>* of this map which can become distracting very quickly. I prefer to make my base maps as basic and unobtrusive as possible so the data I add on top of the base map is more prominent.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    smoothFactor = 0.2,
            - -

            smoothFactor controls how much the polygon shape should be smoothed at each zoom level. The lower the number the more accurate your shapes will be. A larger number, on the other hand, will lead to better performance, but can distort the shapes of known areas.

            - -

            I keep the smoothFactor low because I want the United States to appear as a coherent land mass. The image below shows three different maps, each with a different smoothFactor to illustrate what this argument does. On the left, the map’s smoothFactor=0.2, the center map’s smoothFactor=10, and the right’s smoothFactor=100.

            - -
            - - - Image: SmoothFactor = 0.2 (left), 10 (center), 100 (right) - - -
            SmoothFactor = 0.2 (left), 10 (center), 100 (right)
            -
            - -

            As you can see, the higher the smoothFactor the less coherent the United States becomes.

            - -
            - Note: Leaflet - all map programs, really - don't know that continents, countries, states, or any other land mass is a coherent set. It reads the shape data and spits out a map without regard to where things "should" be. It doesn't know California goes on the left or that Alaska is part of the United States. It's important to keep this in mind because each state's shape is rendered individually when using addPolygons(). -
            - -
            - -
              -
            • lines 5-6:
            • -
            - -
            5    fillColor = "#808080",
            -6    fillOpacity = 0.5,
            - -

            fillColor refers to what color is on the inside of the polygons. Since I want a minimal base map, I usually set this value to be some shade of grey. If you want a different color, you only need to replace #808080 with the corresponding hex code for the color you want. Here is a useful hex color picker. If you have a hex value and you want the same color in a different shade, this is a useful site.

            - -

            fillOpacity determines how transparent the color inside the shape should be. I set mine to be 0.5 because I like the way it looks. The number can be between 0 and 1 with 1 being fully opaque and 0 being fully transparent.

            - -
            - -
              -
            • line 7-10:
            • -
            - -
            7    stroke = TRUE,
            -8    weight = 0.5,
            -9    opacity = 0.5,
            -10   color = "#808080",
            - -

            The next four lines define the appearance of the shapes’ outline.

            - -

            The stroke property can be set to either TRUE or FALSE. When true, Leaflet adds an outline around each polygon. When false, the polygons have no outline. In the image below, the map on the left has the default outlines and on the right stroke = FALSE.

            - -
            - - - Image: Default outlines on the left, no outline on the right - - -
            Default outlines on the left, no outline on the right
            -
            - -

            weight = 0.5 sets the thickness of the outlines to be 0.5 pixels. This can be any value you want with higher numbers corresponding to thicker lines. Lower numbers correspond to thinner lines.

            - -

            The opacity property operates in the same way as fill opacity above, but on the outlines. The number can be between 0 and 1. Lower numbers correspond to the lines being more transparent and 1 means fully opaque.

            - -

            color = "#808080" sets the color of the outline. I typically set it to be the same color as the fill color.

            - -

            If you want a static base map then lines 2-10 are all you need, as shown in the image below. I like to add some functionality to my base map so that the individual states become darker when they’re hovered over.

            - -
            - - - Image: static base map - - -
            static base map
            -
            - -

            Lines 11-15 define the map’s behavior when the mouse hovers over the shape. Most of the options are the same as the ones used on the base polygon shapes, so I won’t go into them with much detail.

            - -
            - -
              -
            • line 11:
            • -
            - -
            11    highlight = highlightOptions()
            - -

            highlight = highlightOptions() contains the mouseover specifications. The word before the equal sign has to be either highlight or highlightOptions. I am not sure why you have to declare highlight twice, but you do.

            - -

            highlightOptions() is the actual function call.

            - -
            - -
              -
            • lines 12-14
            • -
            - -
            12    weight = 0.5,
            -13    color = "#000000",
            -14    fillOpacity = 0.7,
            - -

            weight, color, and fillOpacity all operate in the same way as before, but whatever values you specify here will only show up when the mouse hovers over.

            - -
            - -
              -
            • line 15
            • -
            - -
            15    bringToFront = FALSE),
            - -

            bringToFront takes one of two values: TRUE or FALSE. It only really matters when you have multiple layers (like we will in later parts of this series). When bringToFront = TRUE hovering over the state will bring it to the front. When bringToFront = FALSE it will stay in the back.

            - -

            Since the base map has only one layer, this property doesn’t affect anything.

            - -
            - -
              -
            • line 16
            • -
            - -
            16    group = "Base Map")
            - -

            group = "Base Map") lets you group multiple layers together. This argument will come in handy as we add more information to the map. The base map is the default layer and is always visible - though, when you use map tiles you can define multiple base layers. All other layers will be on top of the base layer. When using different groups, you can define functionality that allows users to turn off certain layers.

            - -

            4. Conclusion

            -

            You’ve created your first base map! It’s a boring flat, grey map, but it’s the base we’ll use when adding in the national and state park data. In part III of this series we’ll process and add in the National Parks.

            - -
            -
            United States base map made with Leaflet
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/32/64cce75ff13780ece87dede51e072bf71b04cc300fb1718d8271c2df5ac124 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/32/64cce75ff13780ece87dede51e072bf71b04cc300fb1718d8271c2df5ac124 deleted file mode 100644 index 7c5a78c..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/32/64cce75ff13780ece87dede51e072bf71b04cc300fb1718d8271c2df5ac124 +++ /dev/null @@ -1,621 +0,0 @@ -I"Łç

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            -
            -
            "chaos ensued"
            - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -
              -
            • I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is.
            • -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/32/910e628507cb6b66acad4e530ebffa6de79fdec6a4867ec5ae0b00f6c951b3 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/32/910e628507cb6b66acad4e530ebffa6de79fdec6a4867ec5ae0b00f6c951b3 new file mode 100644 index 0000000..3b9ef2d --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/32/910e628507cb6b66acad4e530ebffa6de79fdec6a4867ec5ae0b00f6c951b3 @@ -0,0 +1,30 @@ +I"’

            welcome

            +
            + +

            Here you’ll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from.

            + +

            I have done my best to keep my views out of it.

            + +

            I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden.

            + +

            If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page]

            + +

            helpful resources:

            +
            + +
              +
            1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
            2. +
            3. Check your registration status: https://voterstatus.sos.ca.gov/
            4. +
            5. Access the official voter guide: https://voterguide.sos.ca.gov/
            6. +
            7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
            8. +
            9. Track your ballot: https://california.ballottrax.net/voter/
            10. +
            11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
            12. +
            + +

            ballot guides

            +
            + +

            primary elections

            +

            2022 Primary California

            + +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/32/9bce6621afdbed25d36ad4845ffc839cda4de83f92c32a0150454aa56037e2 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/32/9bce6621afdbed25d36ad4845ffc839cda4de83f92c32a0150454aa56037e2 new file mode 100644 index 0000000..69432e6 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/32/9bce6621afdbed25d36ad4845ffc839cda4de83f92c32a0150454aa56037e2 @@ -0,0 +1,55 @@ +I"a + +
            It made a difference to that one. + +
            + +

            I have a simple motto in life: Do what you can, where you are, with what you have. As a result, I believe strongly in doing whatever is in my means to make the world a better place. Below are some groups that I have either founded or joined in order to help those around me.</p>

            + +

            Current Groups

            +
            + +
              +
            • Political Science Methodology Group | Co-organizer with Melina Much +University of California, Irvine
            • +
            • Political Science Womxn’s Caucus | Student leader +University of California, Irvine
            • +
            • Political Science Workshop Coordinator +University of California, Irvine
            • +
            • Legal Politics Writing Workshop
              +University of California, Irvine
            • +
            • Center for Democracy: Writing Workshop | Member +University of California, Irvine
            • +
            • +

              UCI Humanities: Writing Workshop | Member +University of California, Irvine

              + +
              <h1>Previous Groups</h1>
              +<hr class = "h-line">
              +<ul>
              +  <li><i>Friends of the San Dimas Dog Park</i> | Ambassador <br/>
              +  San Dimas, California </li><br/>
              +  <li><i>Prisoner Education Project</i> | Volunteer <br/>
              +  Pomona, California</li><br/>
              +  <li><i>Tails of the City</i> | Volunteer Photographer <br/>
              +  Los Angeles, California</li><br/>
              +  <li><i>Philosophy Club</i> | President, Graphic Designer, and Banquet Chair <br/>
              +  California State Polytechnic University, Pomona</li><br/>
              +  <li><i><a href = "https://www.voteamerica.com/">Long Distance Voter</a></i> | Intern <br/>
              +  Social Media Content Creator</li><br/>
              +  <li><i><a href = "https://www.freepress.net/">Free Press</a></i> | Intern <br/>
              +  Social Media Content Creator</li>
              +</ul>   <!-- </div>
              +
              +
              +
              +
            • +
            +

            </div> + –>

            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/33/75b1c677d7b1e5cedf1c6642297bc124aadd75275e2902761f1726023d2982 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/33/75b1c677d7b1e5cedf1c6642297bc124aadd75275e2902761f1726023d2982 deleted file mode 100644 index 1427d40..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/33/75b1c677d7b1e5cedf1c6642297bc124aadd75275e2902761f1726023d2982 +++ /dev/null @@ -1,516 +0,0 @@ -I"˝µ

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/33/e97cb72bdc5c496b8ca1f9f0e618dcd4e447bb6f14af97dc9db8ccd2a616b2 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/33/e97cb72bdc5c496b8ca1f9f0e618dcd4e447bb6f14af97dc9db8ccd2a616b2 deleted file mode 100644 index 4d6ec77..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/33/e97cb72bdc5c496b8ca1f9f0e618dcd4e447bb6f14af97dc9db8ccd2a616b2 +++ /dev/null @@ -1,666 +0,0 @@ -I"fř

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm)
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            Look ma, new code!

            - -
              -
            • line 1
            • -
            - -
            1    split_states <- split(state_parks, f = state_parks$State_Nm) 
            - -

            The split() is part of base R. It takes quite a few arguments, most of which are optional.

            - -

            The first argument is the vector (or data frame) that you want to split into different groups. I want to split the state_parks data into its corresponding states, so it is listed first.

            - -

            The second argument f = is how you want the data split. f in this instance stands for factor. If we run levels(as.factor(state_parks$State_Nm)) in the terminal, it will return a list of the 50 state abbreviations. That is what we're telling R to do here.

            - -
              -
            • line 1
            • -
            - -
            2    all_names <- names(split_states)
            - -

            8. conclusion

            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/3b/6b58befe98cf96ee3db121106977ae65901fe2bfb6518182162b9ae87b94c7 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/3b/6b58befe98cf96ee3db121106977ae65901fe2bfb6518182162b9ae87b94c7 deleted file mode 100644 index c6dae36..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/3b/6b58befe98cf96ee3db121106977ae65901fe2bfb6518182162b9ae87b94c7 +++ /dev/null @@ -1,575 +0,0 @@ -I"Ř

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/3b/909e0bca8843bf850f24f84fd8988cf11d9223b622badf168e677edd95af99 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/3b/909e0bca8843bf850f24f84fd8988cf11d9223b622badf168e677edd95af99 deleted file mode 100644 index a626540..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/3b/909e0bca8843bf850f24f84fd8988cf11d9223b622badf168e677edd95af99 +++ /dev/null @@ -1,596 +0,0 @@ -I"á

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/3b/a55e730b952119a99d46d4d668459bf214f89488ddac9c36f6fdd6a4c41c49 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/3b/a55e730b952119a99d46d4d668459bf214f89488ddac9c36f6fdd6a4c41c49 deleted file mode 100644 index 254da15..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/3b/a55e730b952119a99d46d4d668459bf214f89488ddac9c36f6fdd6a4c41c49 +++ /dev/null @@ -1,305 +0,0 @@ -I"–R

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -
              -
            1. pre-process data in R
            2. -
            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -
                state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories)  %>% 
            -    filter(Own_Type == "State")
            -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/3d/55c222fe065bd92a8571df7ffd2bda186b08b9f0a2ca136ae50058dafdcb77 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/3d/55c222fe065bd92a8571df7ffd2bda186b08b9f0a2ca136ae50058dafdcb77 deleted file mode 100644 index 5c7a9a5..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/3d/55c222fe065bd92a8571df7ffd2bda186b08b9f0a2ca136ae50058dafdcb77 +++ /dev/null @@ -1,603 +0,0 @@ -I"â

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            V. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/3e/ff79ab52499b220f030df1c703a647d87af25118796b29228edaf1246dc988 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/3e/ff79ab52499b220f030df1c703a647d87af25118796b29228edaf1246dc988 new file mode 100644 index 0000000..839f993 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/3e/ff79ab52499b220f030df1c703a647d87af25118796b29228edaf1246dc988 @@ -0,0 +1,34 @@ +I"ă

            welcome

            +
            + +

            Here you’ll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from.

            + +

            I have done my best to keep my views out of it.

            + +

            I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden.

            + +

            If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page]

            + +

            helpful resources

            +
            + +
              +
            1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
            2. +
            3. Check your registration status: https://voterstatus.sos.ca.gov/
            4. +
            5. Access the official voter guide: https://voterguide.sos.ca.gov/
            6. +
            7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
            8. +
            9. Track your ballot: https://california.ballottrax.net/voter/
            10. +
            11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
            12. +
            + +

            ballot guides

            +
            + +

            Clink on a link to download the PDF.

            +

            primary elections

            + +

            2022 Primary California

            + +

            general elections

            +

            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/46/a109f5babc77e5a2df6342eed8238a111403991ebd80bd4ad37b17dbc1a55e b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/46/a109f5babc77e5a2df6342eed8238a111403991ebd80bd4ad37b17dbc1a55e deleted file mode 100644 index 48d0834..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/46/a109f5babc77e5a2df6342eed8238a111403991ebd80bd4ad37b17dbc1a55e +++ /dev/null @@ -1,468 +0,0 @@ -I"ŕ•

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column I know I won’t use in my map.

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/49/dbea98cb249de42b1c8ae86a404143ff0e816cf4c142a746a5675e5e493221 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/49/dbea98cb249de42b1c8ae86a404143ff0e816cf4c142a746a5675e5e493221 new file mode 100644 index 0000000..25b8a85 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/49/dbea98cb249de42b1c8ae86a404143ff0e816cf4c142a746a5675e5e493221 @@ -0,0 +1,55 @@ +I" + +
            It made a difference to that one. + +
            + +

            I have a simple motto in life: Do what you can, where you are, with what you have. As a result, I believe strongly in doing whatever is in my means to make the world a better place. Below are some groups that I have either founded or joined in order to help those around me.</p>

            + +

            Current Groups

            +
            + +
              +
            • Political Science Methodology Group | Co-organizer with Melina Much
              +University of California, Irvine
            • +
            • Political Science Womxn’s Caucus | Student leader
              +University of California, Irvine
            • +
            • Political Science Workshop Coordinator
              +University of California, Irvine
            • +
            • Legal Politics Writing Workshop
              +University of California, Irvine
            • +
            • Center for Democracy: Writing Workshop | Member
              +University of California, Irvine
            • +
            • +

              UCI Humanities: Writing Workshop | Member
              +University of California, Irvine

              + +
              <h1>Previous Groups</h1>
              +<hr class = "h-line">
              +<ul>
              +  <li><i>Friends of the San Dimas Dog Park</i> | Ambassador <br/>
              +  San Dimas, California </li><br/>
              +  <li><i>Prisoner Education Project</i> | Volunteer <br/>
              +  Pomona, California</li><br/>
              +  <li><i>Tails of the City</i> | Volunteer Photographer <br/>
              +  Los Angeles, California</li><br/>
              +  <li><i>Philosophy Club</i> | President, Graphic Designer, and Banquet Chair <br/>
              +  California State Polytechnic University, Pomona</li><br/>
              +  <li><i><a href = "https://www.voteamerica.com/">Long Distance Voter</a></i> | Intern <br/>
              +  Social Media Content Creator</li><br/>
              +  <li><i><a href = "https://www.freepress.net/">Free Press</a></i> | Intern <br/>
              +  Social Media Content Creator</li>
              +</ul>   <!-- </div>
              +
              +
              +
              +
            • +
            +

            </div> + –>

            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4a/98107153859527c24a981ee82410cc3d75b010352704f29da87c5c82d76850 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4a/98107153859527c24a981ee82410cc3d75b010352704f29da87c5c82d76850 deleted file mode 100644 index 85c4818..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4a/98107153859527c24a981ee82410cc3d75b010352704f29da87c5c82d76850 +++ /dev/null @@ -1,298 +0,0 @@ -I"g[

            This is a continuation of my previous post where I walked through how to download and modify shape data. I also showed how to shift Alaska and Hawaii so they are closer to the continental usa. -

            - -

            In this post, I’ll go over how to use Leaflet to map the shapefile we made in the previous post. If you’ve come here from part one of the series, you probably have the libraries and data loaded already. However, if you don’t, be sure to load the libraries and shapefiles before moving to number two.

            - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -
                ## load libraries
            -    library("tidyverse")    # data manipulation & map creation
            -    library("sf")           # loads shapefile
            -    library("leaflet")      # creates the map
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -
                ## load data
            -    states <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp to reflect wherever you saved the shifted shapefile.

            - -

            If your data processing and base map creation are in the same file, you can skip this line, and when you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. create the base map

            -
            - -

            At its most basic, all Leaflet needs to create a map is a base map and data layers. The code below may look intimidating, but it’s mostly style options.

            - -

            This is the map we’re going to create. It’s a simple grey map and each state darkens in color as you hover over it. I’ll show the same map after each style option is added so you can see what effect it has.

            - -
            - - - Image: Final USA basemap - - -
            Final USA basemap
            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line. -
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -
                ## create usa base map using leaflet()
            -    map <- leaflet() %>%
            -    addPolygons(data = states,
            -        smoothFactor = 0.2,
            -        fillColor = "#808080",
            -        fillOpacity = 0.5,
            -        stroke = TRUE,
            -        weight = 0.5,
            -        opacity = 0.5,
            -        color = "#808080",
            -        highlight = highlightOptions(
            -            weight = 0.5,
            -            color = "#000000",
            -            fillOpacity = 0.7,
            -            bringToFront = FALSE),
            -        group = "Base Map")
            -
            - -
              -
            • line 2:
            • -
            - -
            2    map <- leaflet() %>% 
            - -

            leaflet() initializes the map widget. I save it to a variable called map (map <-) so I can run other code in the file without recreating the map each time. When you want to see the map, you can type map (or whatever you want to name your map) in the terminal and hit enter. R will display the map in the viewer.

            - -
            - -
              -
            • line 3:
            • -
            - -
            3    addPolygons(data = states,
            - -

            addPolygons() adds a layer to the map widget. Leaflet has different layer options, including addTiles and addMarkers which do different things. You can read about them on the leaflet website. Since we’re using a previously created shapefile, we’ll add the shapefile to the map using addPolygons().

            - -

            The first argument you need to specify after calling addPolygons is data = [data-source]. [data-source] is whatever variable your data is stored in. For me, it’s called states. This is either the processed data from part I of this series or the saved shapefile loaded above under the section called load data.

            - -

            When you run only the first two lines, Leaflet will use its default styling. The base color will be a light blue and the outlines of the states will be dark blue and fairly thick.

            - -
            - - - Image: USA map with default leaflet styling - - -
            USA map with default leaflet styling
            -
            - -

            You can leave the base map like this if you want, but all additional data will be added as a layer on top</i>* of this map which can become distracting very quickly. I prefer to make my base maps as basic and unobtrusive as possible so the data I add on top of the base map is more prominent.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    smoothFactor = 0.2,
            - -

            smoothFactor controls how much the polygon shape should be smoothed at each zoom level. The lower the number the more accurate your shapes will be. A larger number, on the other hand, will lead to better performance, but can distort the shapes of known areas.

            - -

            I keep the smoothFactor low because I want the United States to appear as a coherent land mass. The image below shows three different maps, each with a different smoothFactor to illustrate what this argument does. On the left, the map’s smoothFactor=0.2, the center map’s smoothFactor=10, and the right’s smoothFactor=100.

            - -
            - - - Image: SmoothFactor = 0.2 (left), 10 (center), 100 (right) - - -
            SmoothFactor = 0.2 (left), 10 (center), 100 (right)
            -
            - -

            As you can see, the higher the smoothFactor the less coherent the United States becomes.

            - -
            - Note: Leaflet - all map programs, really - don't know that continents, countries, states, or any other land mass is a coherent set. It reads the shape data and spits out a map without regard to where things "should" be. It doesn't know California goes on the left or that Alaska is part of the United States. It's important to keep this in mind because each state's shape is rendered individually when using addPolygons(). -
            - -
            - -
              -
            • lines 5-6:
            • -
            - -
            5    fillColor = "#808080",
            -6    fillOpacity = 0.5,
            - -

            fillColor refers to what color is on the inside of the polygons. Since I want a minimal base map, I usually set this value to be some shade of grey. If you want a different color, you only need to replace #808080 with the corresponding hex code for the color you want. Here is a useful hex color picker. If you have a hex value and you want the same color in a different shade, this is a useful site.

            - -

            fillOpacity determines how transparent the color inside the shape should be. I set mine to be 0.5 because I like the way it looks. The number can be between 0 and 1 with 1 being fully opaque and 0 being fully transparent.

            - -
            - -
              -
            • line 7-10:
            • -
            - -
            7    stroke = TRUE,
            -8    weight = 0.5,
            -9    opacity = 0.5,
            -10   color = "#808080",
            - -

            The next four lines define the appearance of the shapes’ outline.

            - -

            The stroke property can be set to either TRUE or FALSE. When true, Leaflet adds an outline around each polygon. When false, the polygons have no outline. In the image below, the map on the left has the default outlines and on the right stroke = FALSE.

            - -
            - - - Image: Default outlines on the left, no outline on the right - - -
            Default outlines on the left, no outline on the right
            -
            - -

            weight = 0.5 sets the thickness of the outlines to be 0.5 pixels. This can be any value you want with higher numbers corresponding to thicker lines. Lower numbers correspond to thinner lines.

            - -

            The opacity property operates in the same way as fill opacity above, but on the outlines. The number can be between 0 and 1. Lower numbers correspond to the lines being more transparent and 1 means fully opaque.

            - -

            color = "#808080" sets the color of the outline. I typically set it to be the same color as the fill color.

            - -

            If you want a static base map then lines 2-10 are all you need, as shown in the image below. I like to add some functionality to my base map so that the individual states become darker when they’re hovered over.

            - -
            - - - Image: static base map - - -
            static base map
            -
            - -

            Lines 11-15 define the map’s behavior when the mouse hovers over the shape. Most of the options are the same as the ones used on the base polygon shapes, so I won’t go into them with much detail.

            - -
            - -
              -
            • line 11:
            • -
            - -
            11    highlight = highlightOptions()
            - -

            highlight = highlightOptions() contains the mouseover specifications. The word before the equal sign has to be either highlight or highlightOptions. I am not sure why you have to declare highlight twice, but you do.

            - -

            highlightOptions() is the actual function call.

            - -
            - -
              -
            • lines 12-14
            • -
            - -
            12    weight = 0.5,
            -13    color = "#000000",
            -14    fillOpacity = 0.7,
            - -

            weight, color, and fillOpacity all operate in the same way as before, but whatever values you specify here will only show up when the mouse hovers over.

            - -
            - -
              -
            • line 15
            • -
            - -
            15    bringToFront = FALSE),
            - -

            bringToFront takes one of two values: TRUE or FALSE. It only really matters when you have multiple layers (like we will in later parts of this series). When bringToFront = TRUE hovering over the state will bring it to the front. When bringToFront = FALSE it will stay in the back.

            - -

            Since the base map has only one layer, this property doesn’t affect anything.

            - -
            - -
              -
            • line 16
            • -
            - -
            16    group = "Base Map")
            - -

            group = "Base Map") lets you group multiple layers together. This argument will come in handy as we add more information to the map. The base map is the default layer and is always visible - though, when you use map tiles you can define multiple base layers. All other layers will be on top of the base layer. When using different groups, you can define functionality that allows users to turn off certain layers.

            - -

            4. Conclusion

            -

            You’ve created your first base map! It’s a boring flat, grey map, but it’s the base we’ll use when adding in the national and state park data. In part III of this series we’ll process and add in the National Parks.

            - -
            -
            United States base map made with Leaflet
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4d/c72004c5f4ff0cdce4229f29166e31c1cd058c705c1b7ac83f349606ae82c3 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4d/c72004c5f4ff0cdce4229f29166e31c1cd058c705c1b7ac83f349606ae82c3 new file mode 100644 index 0000000..19d5421 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4d/c72004c5f4ff0cdce4229f29166e31c1cd058c705c1b7ac83f349606ae82c3 @@ -0,0 +1,32 @@ +I"ŕ

            welcome

            +
            + +

            Here you’ll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from.

            + +

            I have done my best to keep my views out of it.

            + +

            I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden.

            + +

            If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page]

            + +

            helpful resources:

            +
            + +
              +
            1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
            2. +
            3. Check your registration status: https://voterstatus.sos.ca.gov/
            4. +
            5. Access the official voter guide: https://voterguide.sos.ca.gov/
            6. +
            7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
            8. +
            9. Track your ballot: https://california.ballottrax.net/voter/
            10. +
            11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
            12. +
            + +

            ballot guides

            +
            + +

            primary elections

            +

            2022 Primary California

            + +

            2022 Primary California

            + +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4d/ea770e75ecc269bbaa2967e441fc46103324394309778ba29975c7e20de808 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4d/ea770e75ecc269bbaa2967e441fc46103324394309778ba29975c7e20de808 deleted file mode 100644 index ae1a946..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4d/ea770e75ecc269bbaa2967e441fc46103324394309778ba29975c7e20de808 +++ /dev/null @@ -1,327 +0,0 @@ -I"NY

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. pre-process data in R

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -        filter(State_Nm %!in% territories)  %>% 
            -        filter(Own_Type == "State")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 1 -Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()
            • -
            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • line 2 -Line two
            • -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4e/d27e7a8d330db2b889e65c1955b38a32ddfbdb79745daebf47229acef5ac86 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4e/d27e7a8d330db2b889e65c1955b38a32ddfbdb79745daebf47229acef5ac86 deleted file mode 100644 index d15579f..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4e/d27e7a8d330db2b889e65c1955b38a32ddfbdb79745daebf47229acef5ac86 +++ /dev/null @@ -1,482 +0,0 @@ -I"6

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4f/5dd49747d21c2ccacf96984044c38a6113d7287f53c017f7d4b2d5b47cefd6 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4f/5dd49747d21c2ccacf96984044c38a6113d7287f53c017f7d4b2d5b47cefd6 deleted file mode 100644 index a626540..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/4f/5dd49747d21c2ccacf96984044c38a6113d7287f53c017f7d4b2d5b47cefd6 +++ /dev/null @@ -1,596 +0,0 @@ -I"á

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/51/95af1b582ceb645b012f7d11f5dcee0377afcbaa367e883df0bfb4045c1278 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/51/95af1b582ceb645b012f7d11f5dcee0377afcbaa367e883df0bfb4045c1278 deleted file mode 100644 index d4f23d9..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/51/95af1b582ceb645b012f7d11f5dcee0377afcbaa367e883df0bfb4045c1278 +++ /dev/null @@ -1,470 +0,0 @@ -I"-š

            Twitter is a great resource for engaging with the academic community. For example, I saw this Tweet by PhD Genie asking users to name one positive skill learned during their PhD. I love this question for a number of reasons. First, it helps PhDs reframe their experience so it’s applicable outside of academia - which can help when applying to jobs. Second, it’s really cool to see what skills other people have learned during their program.
            -

            - -
            - - - Image: PhD Genie Tweet - - -
            PhD Genie Tweet
            -
            - -

            I responded to the tweet because during my PhD I learned how to create maps in R. I started by recreating a map from the University of North Carolina’s Hussman School of Journalism’s News Deserts project (below). Now, I am working on a personal project mapping the U.S. National and State parks.

            - -
            - - - Image: Map of US newspapers by county - - -
            Map of US newspapers by county
            -
            - -

            There was quite a bit of interest in how to do this, so in this series of posts I will document my process from start to finish.

            - -

            a few notes

            -
            - -

            First, I’m not an expert. I wanted to make a map, so I learned how. There may be easier ways and, if I learn how to do them, I’ll write another post.

            - -

            Second, before starting, I strongly suggest setting up a Github and DVC. I wrote about how to use GitHub, the Github Website, and Github Desktop. You can use any of these methods to manage your repositories. I use all three based purely on whatever mood I’m in.

            - -

            If you do use Git or GitHub, then DVC (data version control) is mandatory. GitHub will warn you that your file is too large if it’s over 50MB and reject your pushes if the files are over 100MB. The total repository size can’t exceed 2GB if you’re using the free version (which I am). DVC is useful because cartography files are large. They contain a lot of coordinates which increases with each location you try to map. DVC will store your data outside of GitHub but allows you to track changes with your data. It’s super useful.

            - -

            Third, there are several ways to make a map. R is capable of making interactive maps and static maps. Static maps are less computationally expensive and better for publication. Interactive maps are prettier and better for displaying on the web.

            - -

            I make interactive maps with Leaflet and Shiny because they offer a lot of functionality. The most common way is to use map tiles. Map tiles use data from sources like Open Street Map and Maps to create map squares (tiles) with custom data on top. A list of available map tiles is available on the Open Street Maps website.

            - -
            - - - Image: Earthquakes near Australia, using map tiles - - -
            Earthquakes near Australia, using map tiles
            -
            - -

            When I make static maps (like the US map pictured above), I use ggplot

            - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. install packages

            -
            - -

            You only need to install the packages once. You can do so by running each line in the terminal. When you rerun the code later, you can skip right to loading the packages using library("package-name")

            - -
            Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -
                ## you only need to install the packages once
            -
            -    install.packages("leaflet")        # interactive maps
            -    install.packages("shiny")          # added map functionality
            -    install.packages("tidyverse")      # data manipulation
            -    install.packages("tigris")         # cartographic boundaries
            -    install.packages("operator.tools") # for the not-in function
            -    install.packages("sf")             # read and write shapefiles
            -
            - -
            package descriptions
            - -
            3    install.packages("leaflet")
            - -
              -
            • Leaflet is an open source mapping library used to create interactive maps. All leaflet maps have the same basic components: -
                -
              • A map widget (created by calling leaflet())
              • -
              • Layers (or features) added using addTiles(), addPolygons(), or addMarkers(). -

                - Leaflet was originally written in Javascript by “Volodymyr Agafonkin, a Ukrainian citizen living in Kyiv.” With the Russian invasion of Ukraine, Volodymyr, his family, and the people he knows are being forced to flee. If you can, make a donation to Come Back Alive or to one of the charities listed at Stand With Ukraine. If you think the invasion is justified, don’t forget to carry some seeds in your pocket
              • -
              -
            • -
            - -
            4    install.packages("shiny")
            - -
              -
            • Shiny is an R package that provides additional functionality. I am primarily using it to create an informative tool box so the map is not overwhelmed by using too many popups.
            • -
            - -
            5    install.packages("tidyverse")
            - -
              -
            • -

              The Tidyverse is a collection of packages used for data manipulation and analysis. Its syntax is more intuitive than base R. Furthermore, you can chain (aka pipe) commands together.

              - -

              For cartography, you don’t need the whole Tidyverse. We’ll mainly use dplyr and ggplot. You can install these packages individually instead of installing the whole tidyverse. Though, when we get to the national park database, we’ll also need purr and tidyr.

              -
            • -
            - -
            6    install.packages("tigris")
            - -
              -
            • Tigris is used to load Census data and shapefiles. You can download the same data from the Census website, but it’s nice that you can do it in R.
            • -
            - -
            7    install.packages("operator.tools")
            - -
              -
            • -

              operator.tools is not required, but it’s recommended.

              - -

              For some unknown reason, base R has a %in% function but not a not-in function. Unfortunately, the United States is still an empire with it’s associated areas, islands, and pseudo-states. I only want to include the 50 states, so I needed a way to easily filter out the non-states. Operator tool’s %!!in% function is perfect for that.

              -
            • -
            - -
            8    install.packages("sf")
            - -
              -
            • sf is used to read and write the shapefiles necessary for cartography.
            • -
            - -

            2. load the packages

            -
            - -

            To start, create and save a new file called usa.r. In it, we’re going to download and modify the United States shape data that we’ll use to create the base map in part two of this series.

            - -

            At the beginning of each file, you have to load the necessary packages. In this file, the only packages we need to load are tidyverse, sf, and tigris. I also load leaflet to make sure the map renders correctly.

            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")
            -    library("sf")
            -    library("tigris")
            -    library("leaflet")
            -
            - -

            3. download usa shapefile

            -
            - -

            There’s two ways to download the USA shape data. First, we can use the R package, tigris. Second, we can download it from the Census website.

            - -
            Note: I show two ways to download the shape data but you only have to choose one method.
            - -

            I prefer using tigris but I’ve been having some problems with it. Sometimes it ignores the Great Lakes and merges Michigan and Wisconsin into a Frankenstate (boxed in red below).

            - -
            - - - Image: Weird Frankenstate that merges Michigan and Wisconsin. - - -
            Weird Frankenstate that merges Michigan and Wisconsin.
            -
            - -
            method 1: using tigris()
            - -

            tigris() downloads the TIGER/Shapefile data directly from the Census and includes a treasure trove of data. Some of the data includes land area, water area, state names, and geometry.

            - -

            Tigris can also download boundaries for counties, divisions, regions, tracts, blocks, congressional and school districts, and a whole host of other groupings. A complete list of available data can be found on the packages’ GitHub.

            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -
                ## download state data using tigris()
            -    us_states <- tigris::states(cb = FALSE, year = 2020)  %>% 
            -        filter(STATEFP < 57)  %>% 
            -        shift_geometry(preserve_area = FALSE,
            -                       position = "below")  %>% 
            -        sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -    
            -    ## save the shifted shapefile
            -    st_write(us_states, "path/to/file/usa.shp")
            -
            - -

            Here we create the us_states variable, save the geographic data to it, move Alaska and Hawaii so they’re beneath the continental US, and save the shifted shapefile.

            - -
            - -
              -
            • line 2:
            • -
            - -
            2   us_states <- tigris::states(cb = FALSE, year = 2020)  %>%
            - -

            R uses the <- operator to define new variables. Here, we’re naming our new variable us_states.

            - -

            In our us_states variable we’re going to store data on the 50 states downloaded using tigris. Within (::) tigris, we’re going to use the states() function.

            - -

            The states() function allows you to pull state-level data from the Census. This function takes several arguments

            - -

            The cb argument can either be TRUE or FALSE. If cb = FALSE tells Tigris() to download the most detailed shapefile. If cb = TRUE it will download a generalized (1:5000k) file. After a lot of trial and error, I found that using cb = TRUE prevents the Frankenstate from happening.

            - -

            If the year argument is omitted it will download the shapefile for the default year (currently 2020). I set out of habit from when I work with county boundaries. When I work with county boundaries I have to set the year because their boundaries change more than states.

            - -

            Finally, the %>% operator is part of the Tidyverse. It basically tells R “Hey! I’m not done, keep going to the next line!”

            - -
            - -
              -
            • line 3:
            • -
            - -
            3    filter(STATEFP < 57)  %>% 
            - -

            tigris::states() downloads data for the 50 states and the United States’ minor outlying islands, Puerto Rico, and its associated territories. Each state and territory is assigned a unique two-digit Federal Information Processing Standard [FIPS] code.

            - -

            They’re mostly consecutive (Alaska is 01) but when they were conceived of in the 1970s a couple were reserved for the US territories (American Samoa was 03), but in the updated version the “reserved codes” were left out and the territories were assigned to new numbers (American Samoa is now 60). The important bit about this is that the last official state (Wyoming) has a FIPS of 56.

            - -

            This line of code uses the filter() function on the STATEFP variable downloaded using Tigris(). All it says is keep any row that has a FIPS of less than 57. This will keep only the 50 states and exclude the United States’ empire associated territories.

            - -
            - -
              -
            • lines 4-5:
            • -
            - -
            4    shift_geometry(preserve_area = FALSE,
            -5                   position = "below")  %>%
            - -

            The shift_geometry() is from the Tigris package. It takes two arguments preserve_area and position.

            - -
            Note: Strictly speaking, these two lines are not required. Delete them if you want Alaska and Hawaii to remain in their original locations.
            - -

            When preserve_area = FALSE tigris will shrink Alaska’s size and increase Hawaii’s so that they are comparable to the size of the other states.

            - -

            The position argument can either be "below" or "outside". When it’s below, both Alaska and Hawaii are moved to be below California. When it’s outside then Alaska is moved to be near Washington and Hawaii is moved to be near California.

            - -
            - -

            Since I’m a born-theorist, I should warn you that messing with maps has inherent normative implications. The most common projection is Mercator which stretches the continents near the poles and squishes the ones near the equator.

            - -
            - - - Image: Mercator vs. Gall-Peters Projection - - -
            Mercator vs. Gall-Peters Projection
            -
            - -

            One of the competing projections is Gall-Peters which claims to be more accurate because it was - at the time it was created in the 1980s - the only “area-correct map.” Though it has now been criticized for skewing the polar continents and the equatorial ones. The above photo shows you just how different the projects are from one another.

            - -

            The problem arises because we’re trying to project a 3D object into 2D space. It’s a classic case of even though we can, maybe we shouldn’t. Computers can do these computations and change the projections to anything we want fairly easily. However, humans think and exist in metaphors. We assume bigger = better and up = good. When we project maps that puts the Northern Hemisphere as both upwards and larger than other parts of the world we are imbuing that projection with metaphorical meaning.

            - -

            I caution you to be careful when creating maps. Think through the implications of something as simple as making Alaska more visually appealing by distorting it to be of similar size as the other states.

            - -

            If you want to read more about map projections this is a good post. If you want to read more about metaphors, I suggest Metaphors We Live By by George Lakoff and Mark Johnson.

            - -
            - -
              -
            • line 6:
            • -
            - -
            6    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            - -

            The sf package includes a function called st_transform() which will reproject the data for us. There are a lot of projects. You can read them at the proj website.

            - -

            Leaflet requires all boundaries use the World Geodetic Service 1984 (WGS84) coordinate system. While making maps I’ve come across two main coordinate systems: WGS84 and North American Datum (1983). WGS84 uses the WGS84 ellipsoid and NAD83 uses the Geodetic Reference System (GRS80). From what I’ve gathered, the differences are slight, but leaflet requires WGS and the Census uses NAD83. As a result, we have to reproject the the data in order to make our map.

            - -

            The st_transform function takes four arguments, each preceded by a +. All four arguments are required to transform the data from NAD83 to WGS84.

            - -

            Briefly, +proj=longlat tells R to use project the code into longitude and latitude [rather than, for example, transverse mercator (tmerc)].

            - -

            +ellps=WGS84 sets the ellipsoid to the WGS84 standard.

            - -

            +datum=WGS84 is a holdover from previous proj releases. It tells R to use the WGS84 data.

            - -

            +no_defs is also a holdover.

            - -

            Essentially, you need to include line 6 before you create the map, but after you do any data manipulation. It might throw some warnings which you can just ignore.

            - -
            - -
              -
            • line 9
            • -
            - -
            9    st_write(us_states, "path/to/file/usa.shp")
            - -

            In the last line, we save the data we manipulated in lines 2-6. Strictly speaking you don’t have to save the shapefile. You can manipulate the data and then skip right to mapping the data. I caution against it because the files can get unreadable once you start using multiple data sets. I usually comment out line 9 after I save the file. That way I’m not saving and re-saving it whenever I need to run the code above it.

            - -

            The st_write() function is part of the sf package and it takes two arguments. The first is the data set you want to save. Since I used us_states to save the data, it will be the first argument in the st_write() function call.

            - -

            The second argument is the path to where you want the file saved and what name you want to give it. I named mine usa. It is mandatory that you add .shp to the end of the filepath so that R knows to save it as a shapefile.

            - -

            Although it’s called a shapefile, it’s actually four files. I usually create a separate folder for each set of shapefiles and store that in one master folder called shapefiles. An example of my folder structure is below. I keep all of this in my GitHub repo and track changes using DVC.

            - -
            - - - Image: Example folder structure - - -
            Example folder structure
            -
            - -

            On my C:// drive is My Documents. In that folder I keep a GitHub folder that holds all my repos, including my nps one. Inside the nps folder I separate my shapefiles into their own folder. For this tutorial I am using original and shifted shapefiles, so I’ve also separated them into two separate folders to keep things neat. I also know I’m going to have multiple shapefiles (one for the USA, one for the National Parks, and a final one for the State Parks) so I created a folder for each set. In the usa folder I saved the shifted states shapefile.

            - -

            Altogether, my line 9 would read:

            - -
            9    st_write(us_states, "~/Documents/GitHub/nps/shapefiles/shifted/usa/usa.shp")
            - -

            Running that line will save the four necessary files that R needs to load the geographic data.

            - -

            That’s it for method 1 using tigris. The next section, method 2, shows how to load and transform a previously downloaded shapefile. If you used method 1, feel free to leave this post and go directly to mapping the shapefile in part II of this series.

            - -
            - -
            method 2: using downloaded shapefiles
            - -

            In this section, I’ll go through the process of downloading the shapefiles from the Census website. If you tried method 1 and tigris caused the weird Frankenstate, you can try using the data downloaded from the Census website. I don’t know why it works, since tigris uses the same data, but it does.

            - -

            Generally, though, finding and using shapefiles created by others is a great way to create cool maps. There are thousands of shapefiles available, many from ArcGis’ Open Data Website.

            - -
              -
            • step 1:
              -On the Census website, select the Data & Maps dropdown. In the dropdown, on the right-hand side you’ll select Mapping Files.
            • -
            - -
            - - - Image: Census website dropdown. - - -
            Census website dropdown.
            -
            - -
              -
            • step 2:
              -From here, about halfway down on the page, there’s a link called TIGER Data Products Guide which will take you to a complete list of the shapefiles available.
            • -
            - -
            - - - Image: TIGER Data Products Download Link - - -
            TIGER Data Products Download Link
            -
            - -
              -
            • step 3:
              -There’s a lot of downloads available on this page, but for now just click on “Cartographic Boundaries Shapefiles.” Make sure you select the shapefiles one and not the Geodatabases or Geopackages link.
            • -
            - -
            - - - Image: Cartographic Boundaries Shapefiles - - -
            Cartographic Boundaries Shapefiles
            -
            - -

            Save the file wherever you want, but I prefer to keep it within the “original” shapefiles folder in a sub-folder called “zips.” Once it downloads, unzip it - again, anywhere is fine. It will download all 30 Census shapefiles. We’re only going to use the one called “cb_2021_us_state_500k.zip”. The rest you can delete, if you want.

            - -
            - - - Image: cb_2021_us_state_500k folder - - -
            cb_2021_us_state_500k folder
            -
            - -

            When you unzip the cb_2021_us_state_500k.zip, it will contain four files. You’ll only ever work with the .shp file, but the other three are used in the background to display the data.

            - -
              -
            • step 4:
            • -
            - -

            Once all the files are unzipped, we can load the .shp file into R.

            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -
                ## load a previously downloaded shapefile
            -    usa <- read_sf("shapefiles/original/usa/states/cb_2021_us_state_500k.shp") %>%
            -        filter(STATEFP < 57) %>%
            -        shift_geometry(preserve_area = FALSE,
            -            position = "below") %>%
            -        sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save the shifted shapefile
            -    st_write(usa, "path/to/file/usa.shp")
            -
            - -

            Everything except line 2 is the same as in method 1. I won’t go over lines 3-9 here, because all the information is above.

            - -
              -
            • line 2:
            • -
            - -

            This line is very similar to the one above. I changed the name of the variable to usa so I could keep both methods in the same R file (each R variable needs to be unique or it will be overwritten).

            - -

            read_sf is part of the sf() package. It’s used to load shapefiles into R. The path to the file is enclosed in quotation marks and parentheses. Simply navigate to wherever you unzipped the cb_2021_us_state_500k file and choose the file with the .shp extension.

            - -

            4. Conclusion

            -
            - -

            Once the shapefiles are downloaded - either using tigris() or by loading the shapefiles from the Census website - you can create the base map. I’ll tackle making the base map in part II of this series.*

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/53/a7d803818d88739e14dbd3513573f7d4d900156de2cbec5a5e032fe1030e2d b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/53/a7d803818d88739e14dbd3513573f7d4d900156de2cbec5a5e032fe1030e2d deleted file mode 100644 index 8261b95..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/53/a7d803818d88739e14dbd3513573f7d4d900156de2cbec5a5e032fe1030e2d +++ /dev/null @@ -1,237 +0,0 @@ -I"ZA

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -

            The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management and the Bureau of Reclamation. Having visited the park, I can tell you there’s no fences blocking these areas off. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area. It will be a good test case to make sure I’m selecting the correct data.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/56/d4488baf2b21b4fc190c36b65a5cb5d5ea9ffa0b8622570ec2f3b23012ca0a b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/56/d4488baf2b21b4fc190c36b65a5cb5d5ea9ffa0b8622570ec2f3b23012ca0a new file mode 100644 index 0000000..2f381cf --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/56/d4488baf2b21b4fc190c36b65a5cb5d5ea9ffa0b8622570ec2f3b23012ca0a @@ -0,0 +1,30 @@ +I"ď
            +
            +
            +
            + + +
            + Here you'll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from. + + I have done my best to keep my views out of it. + + I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden. + + If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [link is below] +

            + Helpful Resources: +
            1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
            2. +
            3. Check your registration status: https://voterstatus.sos.ca.gov/
            4. +
            5. Access the official voter guide: https://voterguide.sos.ca.gov/
            6. +
            7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
            8. +
            9. Track your ballot: https://california.ballottrax.net/voter/
            10. +
            11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
            +
            + +
            +
            + +
            +
            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/58/79b50358f1fa2f50093fd2b10f75482a8d34f9b61bf31a3c5e35ed5848d6fe b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/58/79b50358f1fa2f50093fd2b10f75482a8d34f9b61bf31a3c5e35ed5848d6fe deleted file mode 100644 index bbf1c79..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/58/79b50358f1fa2f50093fd2b10f75482a8d34f9b61bf31a3c5e35ed5848d6fe +++ /dev/null @@ -1,622 +0,0 @@ -I"_é

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            -
            -
            "chaos ensued"
            - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “hello world!”**

            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is.\n -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/58/e9b32001260736370f69ed3cea8db0084cf835f2efa7f947c895ecd97cb5f2 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/58/e9b32001260736370f69ed3cea8db0084cf835f2efa7f947c895ecd97cb5f2 deleted file mode 100644 index 844b763..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/58/e9b32001260736370f69ed3cea8db0084cf835f2efa7f947c895ecd97cb5f2 +++ /dev/null @@ -1,624 +0,0 @@ -I"jé

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            -
            -
            "chaos ensued"
            - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “hello world!”**

            - -
              -
            • I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is. -** print("Hello World!")
            • -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5c/68f51d064ec9c1ae3ef35d2b944ade24346c92a6b5bac9fa717590dc5a313b b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5c/68f51d064ec9c1ae3ef35d2b944ade24346c92a6b5bac9fa717590dc5a313b deleted file mode 100644 index e1c48c2..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5c/68f51d064ec9c1ae3ef35d2b944ade24346c92a6b5bac9fa717590dc5a313b +++ /dev/null @@ -1,470 +0,0 @@ -I"-š

            Twitter is a great resource for engaging with the academic community. For example, I saw this Tweet by PhD Genie asking users to name one positive skill learned during their PhD. I love this question for a number of reasons. First, it helps PhDs reframe their experience so it’s applicable outside of academia - which can help when applying to jobs. Second, it’s really cool to see what skills other people have learned during their program.
            -

            - -
            - - - Image: PhD Genie Tweet - - -
            PhD Genie Tweet
            -
            - -

            I responded to the tweet because during my PhD I learned how to create maps in R. I started by recreating a map from the University of North Carolina’s Hussman School of Journalism’s News Deserts project (below). Now, I am working on a personal project mapping the U.S. National and State parks.

            - -
            - - - Image: Map of US newspapers by county - - -
            Map of US newspapers by county
            -
            - -

            There was quite a bit of interest in how to do this, so in this series of posts I will document my process from start to finish.

            - -

            a few notes

            -
            - -

            First, I’m not an expert. I wanted to make a map, so I learned how. There may be easier ways and, if I learn how to do them, I’ll write another post.

            - -

            Second, before starting, I strongly suggest setting up a Github and DVC. I wrote about how to use GitHub, the Github Website, and Github Desktop. You can use any of these methods to manage your repositories. I use all three based purely on whatever mood I’m in.

            - -

            If you do use Git or GitHub, then DVC (data version control) is mandatory. GitHub will warn you that your file is too large if it’s over 50MB and reject your pushes if the files are over 100MB. The total repository size can’t exceed 2GB if you’re using the free version (which I am). DVC is useful because cartography files are large. They contain a lot of coordinates which increases with each location you try to map. DVC will store your data outside of GitHub but allows you to track changes with your data. It’s super useful.

            - -

            Third, there are several ways to make a map. R is capable of making interactive maps and static maps. Static maps are less computationally expensive and better for publication. Interactive maps are prettier and better for displaying on the web.

            - -

            I make interactive maps with Leaflet and Shiny because they offer a lot of functionality. The most common way is to use map tiles. Map tiles use data from sources like Open Street Map and Maps to create map squares (tiles) with custom data on top. A list of available map tiles is available on the Open Street Maps website.

            - -
            - - - Image: Earthquakes near Australia, using map tiles - - -
            Earthquakes near Australia, using map tiles
            -
            - -

            When I make static maps (like the US map pictured above), I use ggplot

            - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one [this post]

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. install packages

            -
            - -

            You only need to install the packages once. You can do so by running each line in the terminal. When you rerun the code later, you can skip right to loading the packages using library("package-name")

            - -
            Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -
                ## you only need to install the packages once
            -
            -    install.packages("leaflet")        # interactive maps
            -    install.packages("shiny")          # added map functionality
            -    install.packages("tidyverse")      # data manipulation
            -    install.packages("tigris")         # cartographic boundaries
            -    install.packages("operator.tools") # for the not-in function
            -    install.packages("sf")             # read and write shapefiles
            -
            - -
            package descriptions
            - -
            3    install.packages("leaflet")
            - -
              -
            • Leaflet is an open source mapping library used to create interactive maps. All leaflet maps have the same basic components: -
                -
              • A map widget (created by calling leaflet())
              • -
              • Layers (or features) added using addTiles(), addPolygons(), or addMarkers(). -

                - Leaflet was originally written in Javascript by “Volodymyr Agafonkin, a Ukrainian citizen living in Kyiv.” With the Russian invasion of Ukraine, Volodymyr, his family, and the people he knows are being forced to flee. If you can, make a donation to Come Back Alive or to one of the charities listed at Stand With Ukraine. If you think the invasion is justified, don’t forget to carry some seeds in your pocket
              • -
              -
            • -
            - -
            4    install.packages("shiny")
            - -
              -
            • Shiny is an R package that provides additional functionality. I am primarily using it to create an informative tool box so the map is not overwhelmed by using too many popups.
            • -
            - -
            5    install.packages("tidyverse")
            - -
              -
            • -

              The Tidyverse is a collection of packages used for data manipulation and analysis. Its syntax is more intuitive than base R. Furthermore, you can chain (aka pipe) commands together.

              - -

              For cartography, you don’t need the whole Tidyverse. We’ll mainly use dplyr and ggplot. You can install these packages individually instead of installing the whole tidyverse. Though, when we get to the national park database, we’ll also need purr and tidyr.

              -
            • -
            - -
            6    install.packages("tigris")
            - -
              -
            • Tigris is used to load Census data and shapefiles. You can download the same data from the Census website, but it’s nice that you can do it in R.
            • -
            - -
            7    install.packages("operator.tools")
            - -
              -
            • -

              operator.tools is not required, but it’s recommended.

              - -

              For some unknown reason, base R has a %in% function but not a not-in function. Unfortunately, the United States is still an empire with it’s associated areas, islands, and pseudo-states. I only want to include the 50 states, so I needed a way to easily filter out the non-states. Operator tool’s %!!in% function is perfect for that.

              -
            • -
            - -
            8    install.packages("sf")
            - -
              -
            • sf is used to read and write the shapefiles necessary for cartography.
            • -
            - -

            2. load the packages

            -
            - -

            To start, create and save a new file called usa.r. In it, we’re going to download and modify the United States shape data that we’ll use to create the base map in part two of this series.

            - -

            At the beginning of each file, you have to load the necessary packages. In this file, the only packages we need to load are tidyverse, sf, and tigris. I also load leaflet to make sure the map renders correctly.

            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")
            -    library("sf")
            -    library("tigris")
            -    library("leaflet")
            -
            - -

            3. download usa shapefile

            -
            - -

            There’s two ways to download the USA shape data. First, we can use the R package, tigris. Second, we can download it from the Census website.

            - -
            Note: I show two ways to download the shape data but you only have to choose one method.
            - -

            I prefer using tigris but I’ve been having some problems with it. Sometimes it ignores the Great Lakes and merges Michigan and Wisconsin into a Frankenstate (boxed in red below).

            - -
            - - - Image: Weird Frankenstate that merges Michigan and Wisconsin. - - -
            Weird Frankenstate that merges Michigan and Wisconsin.
            -
            - -
            method 1: using tigris()
            - -

            tigris() downloads the TIGER/Shapefile data directly from the Census and includes a treasure trove of data. Some of the data includes land area, water area, state names, and geometry.

            - -

            Tigris can also download boundaries for counties, divisions, regions, tracts, blocks, congressional and school districts, and a whole host of other groupings. A complete list of available data can be found on the packages’ GitHub.

            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -
                ## download state data using tigris()
            -    us_states <- tigris::states(cb = FALSE, year = 2020)  %>% 
            -        filter(STATEFP < 57)  %>% 
            -        shift_geometry(preserve_area = FALSE,
            -                       position = "below")  %>% 
            -        sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -    
            -    ## save the shifted shapefile
            -    st_write(us_states, "path/to/file/usa.shp")
            -
            - -

            Here we create the us_states variable, save the geographic data to it, move Alaska and Hawaii so they’re beneath the continental US, and save the shifted shapefile.

            - -
            - -
              -
            • line 2:
            • -
            - -
            2   us_states <- tigris::states(cb = FALSE, year = 2020)  %>%
            - -

            R uses the <- operator to define new variables. Here, we’re naming our new variable us_states.

            - -

            In our us_states variable we’re going to store data on the 50 states downloaded using tigris. Within (::) tigris, we’re going to use the states() function.

            - -

            The states() function allows you to pull state-level data from the Census. This function takes several arguments

            - -

            The cb argument can either be TRUE or FALSE. If cb = FALSE tells Tigris() to download the most detailed shapefile. If cb = TRUE it will download a generalized (1:5000k) file. After a lot of trial and error, I found that using cb = TRUE prevents the Frankenstate from happening.

            - -

            If the year argument is omitted it will download the shapefile for the default year (currently 2020). I set out of habit from when I work with county boundaries. When I work with county boundaries I have to set the year because their boundaries change more than states.

            - -

            Finally, the %>% operator is part of the Tidyverse. It basically tells R “Hey! I’m not done, keep going to the next line!”

            - -
            - -
              -
            • line 3:
            • -
            - -
            3    filter(STATEFP < 57)  %>% 
            - -

            tigris::states() downloads data for the 50 states and the United States’ minor outlying islands, Puerto Rico, and its associated territories. Each state and territory is assigned a unique two-digit Federal Information Processing Standard [FIPS] code.

            - -

            They’re mostly consecutive (Alaska is 01) but when they were conceived of in the 1970s a couple were reserved for the US territories (American Samoa was 03), but in the updated version the “reserved codes” were left out and the territories were assigned to new numbers (American Samoa is now 60). The important bit about this is that the last official state (Wyoming) has a FIPS of 56.

            - -

            This line of code uses the filter() function on the STATEFP variable downloaded using Tigris(). All it says is keep any row that has a FIPS of less than 57. This will keep only the 50 states and exclude the United States’ empire associated territories.

            - -
            - -
              -
            • lines 4-5:
            • -
            - -
            4    shift_geometry(preserve_area = FALSE,
            -5                   position = "below")  %>%
            - -

            The shift_geometry() is from the Tigris package. It takes two arguments preserve_area and position.

            - -
            Note: Strictly speaking, these two lines are not required. Delete them if you want Alaska and Hawaii to remain in their original locations.
            - -

            When preserve_area = FALSE tigris will shrink Alaska’s size and increase Hawaii’s so that they are comparable to the size of the other states.

            - -

            The position argument can either be "below" or "outside". When it’s below, both Alaska and Hawaii are moved to be below California. When it’s outside then Alaska is moved to be near Washington and Hawaii is moved to be near California.

            - -
            - -

            Since I’m a born-theorist, I should warn you that messing with maps has inherent normative implications. The most common projection is Mercator which stretches the continents near the poles and squishes the ones near the equator.

            - -
            - - - Image: Mercator vs. Gall-Peters Projection - - -
            Mercator vs. Gall-Peters Projection
            -
            - -

            One of the competing projections is Gall-Peters which claims to be more accurate because it was - at the time it was created in the 1980s - the only “area-correct map.” Though it has now been criticized for skewing the polar continents and the equatorial ones. The above photo shows you just how different the projects are from one another.

            - -

            The problem arises because we’re trying to project a 3D object into 2D space. It’s a classic case of even though we can, maybe we shouldn’t. Computers can do these computations and change the projections to anything we want fairly easily. However, humans think and exist in metaphors. We assume bigger = better and up = good. When we project maps that puts the Northern Hemisphere as both upwards and larger than other parts of the world we are imbuing that projection with metaphorical meaning.

            - -

            I caution you to be careful when creating maps. Think through the implications of something as simple as making Alaska more visually appealing by distorting it to be of similar size as the other states.

            - -

            If you want to read more about map projections this is a good post. If you want to read more about metaphors, I suggest Metaphors We Live By by George Lakoff and Mark Johnson.

            - -
            - -
              -
            • line 6:
            • -
            - -
            6    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            - -

            The sf package includes a function called st_transform() which will reproject the data for us. There are a lot of projects. You can read them at the proj website.

            - -

            Leaflet requires all boundaries use the World Geodetic Service 1984 (WGS84) coordinate system. While making maps I’ve come across two main coordinate systems: WGS84 and North American Datum (1983). WGS84 uses the WGS84 ellipsoid and NAD83 uses the Geodetic Reference System (GRS80). From what I’ve gathered, the differences are slight, but leaflet requires WGS and the Census uses NAD83. As a result, we have to reproject the the data in order to make our map.

            - -

            The st_transform function takes four arguments, each preceded by a +. All four arguments are required to transform the data from NAD83 to WGS84.

            - -

            Briefly, +proj=longlat tells R to use project the code into longitude and latitude [rather than, for example, transverse mercator (tmerc)].

            - -

            +ellps=WGS84 sets the ellipsoid to the WGS84 standard.

            - -

            +datum=WGS84 is a holdover from previous proj releases. It tells R to use the WGS84 data.

            - -

            +no_defs is also a holdover.

            - -

            Essentially, you need to include line 6 before you create the map, but after you do any data manipulation. It might throw some warnings which you can just ignore.

            - -
            - -
              -
            • line 9
            • -
            - -
            9    st_write(us_states, "path/to/file/usa.shp")
            - -

            In the last line, we save the data we manipulated in lines 2-6. Strictly speaking you don’t have to save the shapefile. You can manipulate the data and then skip right to mapping the data. I caution against it because the files can get unreadable once you start using multiple data sets. I usually comment out line 9 after I save the file. That way I’m not saving and re-saving it whenever I need to run the code above it.

            - -

            The st_write() function is part of the sf package and it takes two arguments. The first is the data set you want to save. Since I used us_states to save the data, it will be the first argument in the st_write() function call.

            - -

            The second argument is the path to where you want the file saved and what name you want to give it. I named mine usa. It is mandatory that you add .shp to the end of the filepath so that R knows to save it as a shapefile.

            - -

            Although it’s called a shapefile, it’s actually four files. I usually create a separate folder for each set of shapefiles and store that in one master folder called shapefiles. An example of my folder structure is below. I keep all of this in my GitHub repo and track changes using DVC.

            - -
            - - - Image: Example folder structure - - -
            Example folder structure
            -
            - -

            On my C:// drive is My Documents. In that folder I keep a GitHub folder that holds all my repos, including my nps one. Inside the nps folder I separate my shapefiles into their own folder. For this tutorial I am using original and shifted shapefiles, so I’ve also separated them into two separate folders to keep things neat. I also know I’m going to have multiple shapefiles (one for the USA, one for the National Parks, and a final one for the State Parks) so I created a folder for each set. In the usa folder I saved the shifted states shapefile.

            - -

            Altogether, my line 9 would read:

            - -
            9    st_write(us_states, "~/Documents/GitHub/nps/shapefiles/shifted/usa/usa.shp")
            - -

            Running that line will save the four necessary files that R needs to load the geographic data.

            - -

            That’s it for method 1 using tigris. The next section, method 2, shows how to load and transform a previously downloaded shapefile. If you used method 1, feel free to leave this post and go directly to mapping the shapefile in part II of this series.

            - -
            - -
            method 2: using downloaded shapefiles
            - -

            In this section, I’ll go through the process of downloading the shapefiles from the Census website. If you tried method 1 and tigris caused the weird Frankenstate, you can try using the data downloaded from the Census website. I don’t know why it works, since tigris uses the same data, but it does.

            - -

            Generally, though, finding and using shapefiles created by others is a great way to create cool maps. There are thousands of shapefiles available, many from ArcGis’ Open Data Website.

            - -
              -
            • step 1:
              -On the Census website, select the Data & Maps dropdown. In the dropdown, on the right-hand side you’ll select Mapping Files.
            • -
            - -
            - - - Image: Census website dropdown. - - -
            Census website dropdown.
            -
            - -
              -
            • step 2:
              -From here, about halfway down on the page, there’s a link called TIGER Data Products Guide which will take you to a complete list of the shapefiles available.
            • -
            - -
            - - - Image: TIGER Data Products Download Link - - -
            TIGER Data Products Download Link
            -
            - -
              -
            • step 3:
              -There’s a lot of downloads available on this page, but for now just click on “Cartographic Boundaries Shapefiles.” Make sure you select the shapefiles one and not the Geodatabases or Geopackages link.
            • -
            - -
            - - - Image: Cartographic Boundaries Shapefiles - - -
            Cartographic Boundaries Shapefiles
            -
            - -

            Save the file wherever you want, but I prefer to keep it within the “original” shapefiles folder in a sub-folder called “zips.” Once it downloads, unzip it - again, anywhere is fine. It will download all 30 Census shapefiles. We’re only going to use the one called “cb_2021_us_state_500k.zip”. The rest you can delete, if you want.

            - -
            - - - Image: cb_2021_us_state_500k folder - - -
            cb_2021_us_state_500k folder
            -
            - -

            When you unzip the cb_2021_us_state_500k.zip, it will contain four files. You’ll only ever work with the .shp file, but the other three are used in the background to display the data.

            - -
              -
            • step 4:
            • -
            - -

            Once all the files are unzipped, we can load the .shp file into R.

            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -
                ## load a previously downloaded shapefile
            -    usa <- read_sf("shapefiles/original/usa/states/cb_2021_us_state_500k.shp") %>%
            -        filter(STATEFP < 57) %>%
            -        shift_geometry(preserve_area = FALSE,
            -            position = "below") %>%
            -        sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save the shifted shapefile
            -    st_write(usa, "path/to/file/usa.shp")
            -
            - -

            Everything except line 2 is the same as in method 1. I won’t go over lines 3-9 here, because all the information is above.

            - -
              -
            • line 2:
            • -
            - -

            This line is very similar to the one above. I changed the name of the variable to usa so I could keep both methods in the same R file (each R variable needs to be unique or it will be overwritten).

            - -

            read_sf is part of the sf() package. It’s used to load shapefiles into R. The path to the file is enclosed in quotation marks and parentheses. Simply navigate to wherever you unzipped the cb_2021_us_state_500k file and choose the file with the .shp extension.

            - -

            4. Conclusion

            -
            - -

            Once the shapefiles are downloaded - either using tigris() or by loading the shapefiles from the Census website - you can create the base map. I’ll tackle making the base map in part II of this series.*

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5c/bf7f3d5d72a108dc02f59e832fb832167b3d45db5bf3e123a6da07c1c13f23 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5c/bf7f3d5d72a108dc02f59e832fb832167b3d45db5bf3e123a6da07c1c13f23 deleted file mode 100644 index 1ed99fd..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5c/bf7f3d5d72a108dc02f59e832fb832167b3d45db5bf3e123a6da07c1c13f23 +++ /dev/null @@ -1,646 +0,0 @@ -I"Śń

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm) # split the data by state
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            8. conclusion

            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5c/f9cf352151b13eea41a52eac4d3c0142f63d777192c6179a7504839a5726fa b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5c/f9cf352151b13eea41a52eac4d3c0142f63d777192c6179a7504839a5726fa deleted file mode 100644 index af63c46..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5c/f9cf352151b13eea41a52eac4d3c0142f63d777192c6179a7504839a5726fa +++ /dev/null @@ -1,336 +0,0 @@ -I"_

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. pre-process data in R

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -        filter(State_Nm %!in% territories)  %>% 
            -        filter(Own_Type == "State")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
                territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 7.

            - -
            - -
              -
            • line 4
            • -
            - -
                state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5e/041f00171d9b650372f11c54895cb2a30f559157d96cc39f2363830beb7aec b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5e/041f00171d9b650372f11c54895cb2a30f559157d96cc39f2363830beb7aec new file mode 100644 index 0000000..c62487a --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5e/041f00171d9b650372f11c54895cb2a30f559157d96cc39f2363830beb7aec @@ -0,0 +1,36 @@ +I"Đ

            welcome

            +
            + +

            Here you’ll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from.

            + +

            I have done my best to keep my views out of it.

            + +

            I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden.

            + +

            If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page]

            + +

            helpful resources

            +
            + +
              +
            1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
            2. +
            3. Check your registration status: https://voterstatus.sos.ca.gov/
            4. +
            5. Access the official voter guide: https://voterguide.sos.ca.gov/
            6. +
            7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
            8. +
            9. Track your ballot: https://california.ballottrax.net/voter/
            10. +
            11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
            12. +
            + +

            ballot guides

            +
            + +

            Clink on a link to download the PDF.

            +

            primary elections

            + +

            2022 Primary California

            + +

            2022 Primary Los Angeles

            + +

            general elections

            +

            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5e/962320bc357a5f6b0bd2a5d79dcf5c1c561daf64b97e552ef82bd61fbe325d b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5e/962320bc357a5f6b0bd2a5d79dcf5c1c561daf64b97e552ef82bd61fbe325d deleted file mode 100644 index 5e75a7e..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5e/962320bc357a5f6b0bd2a5d79dcf5c1c561daf64b97e552ef82bd61fbe325d +++ /dev/null @@ -1,617 +0,0 @@ -I"ţć

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            -
            -
            "chaos ensued"
            - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps and link them to the map of National Parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5f/571a2bc7ca1fae55f3f2e509b9628eef379f451ad9895961cce87bd375dbd7 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5f/571a2bc7ca1fae55f3f2e509b9628eef379f451ad9895961cce87bd375dbd7 deleted file mode 100644 index d1382d2..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/5f/571a2bc7ca1fae55f3f2e509b9628eef379f451ad9895961cce87bd375dbd7 +++ /dev/null @@ -1,617 +0,0 @@ -I"ůć

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            -
            -
            "chaos ensued"
            - -
            - -

            I tried to map the US Map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save it, and in part VI of this never-ending series* I’ll create individual state maps and link them to the map of National Parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo]

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/60/8fe78e1377ab217a91c1fd12543cdd0874db21b17dad0110e22a31681f4808 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/60/8fe78e1377ab217a91c1fd12543cdd0874db21b17dad0110e22a31681f4808 deleted file mode 100644 index 037d08f..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/60/8fe78e1377ab217a91c1fd12543cdd0874db21b17dad0110e22a31681f4808 +++ /dev/null @@ -1,622 +0,0 @@ -I"cé

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            -
            -
            "chaos ensued"
            - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “hello world!”**

            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/61/7c88714d599efb28a847bbe552dfc50a1b86a86e6f46fb4606c2efd61d7e31 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/61/7c88714d599efb28a847bbe552dfc50a1b86a86e6f46fb4606c2efd61d7e31 deleted file mode 100644 index 9778d3f..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/61/7c88714d599efb28a847bbe552dfc50a1b86a86e6f46fb4606c2efd61d7e31 +++ /dev/null @@ -1,229 +0,0 @@ -I"Ĺ>

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -

            The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management and the Bureau of Reclamation. Having visited the park, I can tell you there’s no fences blocking these areas off. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area. It will be a good test case to make sure I’m selecting the correct data.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            The table below shows a

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/61/e69b138269c656b580f2f597c5983db8321ec193bc95175b6f675ddb065435 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/61/e69b138269c656b580f2f597c5983db8321ec193bc95175b6f675ddb065435 deleted file mode 100644 index 715b456..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/61/e69b138269c656b580f2f597c5983db8321ec193bc95175b6f675ddb065435 +++ /dev/null @@ -1,622 +0,0 @@ -I"cé

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            -
            -
            "chaos ensued"
            - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/64/b9d02d24f0a49527d3dedea9b879b205856483e2386865019f9bd3555765f1 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/64/b9d02d24f0a49527d3dedea9b879b205856483e2386865019f9bd3555765f1 deleted file mode 100644 index f5628b6..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/64/b9d02d24f0a49527d3dedea9b879b205856483e2386865019f9bd3555765f1 +++ /dev/null @@ -1,427 +0,0 @@ -I"ú€

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") 
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/65/5ce5f0fa1d0a9b3e9bb42ab2dc0d78d2df54bc5109760750a7cf2c96514197 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/65/5ce5f0fa1d0a9b3e9bb42ab2dc0d78d2df54bc5109760750a7cf2c96514197 deleted file mode 100644 index b9bf092..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/65/5ce5f0fa1d0a9b3e9bb42ab2dc0d78d2df54bc5109760750a7cf2c96514197 +++ /dev/null @@ -1,647 +0,0 @@ -I"ň

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: - - -
            -
            - -
            -
            "chaos ensued"
            - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -6
            -
                split_states <- split(state_parks, f = state_parks$State_Nm) # split the data by state
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp')) 
            -    }
            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/65/749e73d8393e9b118694899e4f390531ab64cf60f00e05beb2704e10b098aa b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/65/749e73d8393e9b118694899e4f390531ab64cf60f00e05beb2704e10b098aa deleted file mode 100644 index e046ead..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/65/749e73d8393e9b118694899e4f390531ab64cf60f00e05beb2704e10b098aa +++ /dev/null @@ -1,615 +0,0 @@ -I"ąă

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            -
            -
            "chaos ensued"
            - -
            - -

            I tried to map the US Map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/66/1ccf2036c7646d2cdc40ba21b1428777bcfb919680fa851f1456eccd32b8e6 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/66/1ccf2036c7646d2cdc40ba21b1428777bcfb919680fa851f1456eccd32b8e6 deleted file mode 100644 index ccc29fc..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/66/1ccf2036c7646d2cdc40ba21b1428777bcfb919680fa851f1456eccd32b8e6 +++ /dev/null @@ -1,403 +0,0 @@ -I"ys

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW")  
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
                territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines
            • -
            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/66/d47cf6389a6257922af93d2e1243767b134027a69e2edf2ac319655919ec1d b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/66/d47cf6389a6257922af93d2e1243767b134027a69e2edf2ac319655919ec1d deleted file mode 100644 index 7de7c78..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/66/d47cf6389a6257922af93d2e1243767b134027a69e2edf2ac319655919ec1d +++ /dev/null @@ -1,345 +0,0 @@ -I"±b

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. pre-process data in R

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -        filter(State_Nm %!in% territories)  %>% 
            -        filter(Own_Type == "State")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
                territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
                state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • line 5
            • -
            - -
                filter(State_Nm %!in% territories & Own_Type == "State")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/68/83e9087f123bee5672810de95530237cab33b2c70a01c5603f818bb2d26953 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/68/83e9087f123bee5672810de95530237cab33b2c70a01c5603f818bb2d26953 deleted file mode 100644 index 1f90454..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/68/83e9087f123bee5672810de95530237cab33b2c70a01c5603f818bb2d26953 +++ /dev/null @@ -1,306 +0,0 @@ -I"“\

            This is a continuation of my previous post where I walked through how to download and modify shape data. I also showed how to shift Alaska and Hawaii so they are closer to the continental usa. -

            - -

            In this post, I’ll go over how to use Leaflet to map the shapefile we made in the previous post. If you’ve come here from part one of the series, you probably have the libraries and data loaded already. However, if you don’t, be sure to load the libraries and shapefiles before moving to number two.

            - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -
                ## load libraries
            -    library("tidyverse")    # data manipulation & map creation
            -    library("sf")           # loads shapefile
            -    library("leaflet")      # creates the map
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -
                ## load data
            -    states <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp to reflect wherever you saved the shifted shapefile.

            - -

            If your data processing and base map creation are in the same file, you can skip this line, and when you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. create the base map

            -
            - -

            At its most basic, all Leaflet needs to create a map is a base map and data layers. The code below may look intimidating, but it’s mostly style options.

            - -

            This is the map we’re going to create. It’s a simple grey map and each state darkens in color as you hover over it. I’ll show the same map after each style option is added so you can see what effect it has.

            - -
            - - - Image: Final USA basemap - - -
            Final USA basemap
            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line. -
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -
                ## create usa base map using leaflet()
            -    map <- leaflet() %>%
            -    addPolygons(data = states,
            -        smoothFactor = 0.2,
            -        fillColor = "#808080",
            -        fillOpacity = 0.5,
            -        stroke = TRUE,
            -        weight = 0.5,
            -        opacity = 0.5,
            -        color = "#808080",
            -        highlight = highlightOptions(
            -            weight = 0.5,
            -            color = "#000000",
            -            fillOpacity = 0.7,
            -            bringToFront = FALSE),
            -        group = "Base Map")
            -
            - -
              -
            • line 2:
            • -
            - -
            2    map <- leaflet() %>% 
            - -

            leaflet() initializes the map widget. I save it to a variable called map (map <-) so I can run other code in the file without recreating the map each time. When you want to see the map, you can type map (or whatever you want to name your map) in the terminal and hit enter. R will display the map in the viewer.

            - -
            - -
              -
            • line 3:
            • -
            - -
            3    addPolygons(data = states,
            - -

            addPolygons() adds a layer to the map widget. Leaflet has different layer options, including addTiles and addMarkers which do different things. You can read about them on the leaflet website. Since we’re using a previously created shapefile, we’ll add the shapefile to the map using addPolygons().

            - -

            The first argument you need to specify after calling addPolygons is data = [data-source]. [data-source] is whatever variable your data is stored in. For me, it’s called states. This is either the processed data from part I of this series or the saved shapefile loaded above under the section called load data.

            - -

            When you run only the first two lines, Leaflet will use its default styling. The base color will be a light blue and the outlines of the states will be dark blue and fairly thick.

            - -
            - - - Image: USA map with default leaflet styling - - -
            USA map with default leaflet styling
            -
            - -

            You can leave the base map like this if you want, but all additional data will be added as a layer on top</i>* of this map which can become distracting very quickly. I prefer to make my base maps as basic and unobtrusive as possible so the data I add on top of the base map is more prominent.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    smoothFactor = 0.2,
            - -

            smoothFactor controls how much the polygon shape should be smoothed at each zoom level. The lower the number the more accurate your shapes will be. A larger number, on the other hand, will lead to better performance, but can distort the shapes of known areas.

            - -

            I keep the smoothFactor low because I want the United States to appear as a coherent land mass. The image below shows three different maps, each with a different smoothFactor to illustrate what this argument does. On the left, the map’s smoothFactor=0.2, the center map’s smoothFactor=10, and the right’s smoothFactor=100.

            - -
            - - - Image: SmoothFactor = 0.2 (left), 10 (center), 100 (right) - - -
            SmoothFactor = 0.2 (left), 10 (center), 100 (right)
            -
            - -

            As you can see, the higher the smoothFactor the less coherent the United States becomes.

            - -
            - Note: Leaflet - all map programs, really - don't know that continents, countries, states, or any other land mass is a coherent set. It reads the shape data and spits out a map without regard to where things "should" be. It doesn't know California goes on the left or that Alaska is part of the United States. It's important to keep this in mind because each state's shape is rendered individually when using addPolygons(). -
            - -
            - -
              -
            • lines 5-6:
            • -
            - -
            5    fillColor = "#808080",
            -6    fillOpacity = 0.5,
            - -

            fillColor refers to what color is on the inside of the polygons. Since I want a minimal base map, I usually set this value to be some shade of grey. If you want a different color, you only need to replace #808080 with the corresponding hex code for the color you want. Here is a useful hex color picker. If you have a hex value and you want the same color in a different shade, this is a useful site.

            - -

            fillOpacity determines how transparent the color inside the shape should be. I set mine to be 0.5 because I like the way it looks. The number can be between 0 and 1 with 1 being fully opaque and 0 being fully transparent.

            - -
            - -
              -
            • line 7-10:
            • -
            - -
            7    stroke = TRUE,
            -8    weight = 0.5,
            -9    opacity = 0.5,
            -10   color = "#808080",
            - -

            The next four lines define the appearance of the shapes’ outline.

            - -

            The stroke property can be set to either TRUE or FALSE. When true, Leaflet adds an outline around each polygon. When false, the polygons have no outline. In the image below, the map on the left has the default outlines and on the right stroke = FALSE.

            - -
            - - - Image: Default outlines on the left, no outline on the right - - -
            Default outlines on the left, no outline on the right
            -
            - -

            weight = 0.5 sets the thickness of the outlines to be 0.5 pixels. This can be any value you want with higher numbers corresponding to thicker lines. Lower numbers correspond to thinner lines.

            - -

            The opacity property operates in the same way as fill opacity above, but on the outlines. The number can be between 0 and 1. Lower numbers correspond to the lines being more transparent and 1 means fully opaque.

            - -

            color = "#808080" sets the color of the outline. I typically set it to be the same color as the fill color.

            - -

            If you want a static base map then lines 2-10 are all you need, as shown in the image below. I like to add some functionality to my base map so that the individual states become darker when they’re hovered over.

            - -
            - - - Image: static base map - - -
            static base map
            -
            - -

            Lines 11-15 define the map’s behavior when the mouse hovers over the shape. Most of the options are the same as the ones used on the base polygon shapes, so I won’t go into them with much detail.

            - -
            - -
              -
            • line 11:
            • -
            - -
            11    highlight = highlightOptions()
            - -

            highlight = highlightOptions() contains the mouseover specifications. The word before the equal sign has to be either highlight or highlightOptions. I am not sure why you have to declare highlight twice, but you do.

            - -

            highlightOptions() is the actual function call.

            - -
            - -
              -
            • lines 12-14
            • -
            - -
            12    weight = 0.5,
            -13    color = "#000000",
            -14    fillOpacity = 0.7,
            - -

            weight, color, and fillOpacity all operate in the same way as before, but whatever values you specify here will only show up when the mouse hovers over.

            - -
            - -
              -
            • line 15
            • -
            - -
            15    bringToFront = FALSE),
            - -

            bringToFront takes one of two values: TRUE or FALSE. It only really matters when you have multiple layers (like we will in later parts of this series). When bringToFront = TRUE hovering over the state will bring it to the front. When bringToFront = FALSE it will stay in the back.

            - -

            Since the base map has only one layer, this property doesn’t affect anything.

            - -
            - -
              -
            • line 16
            • -
            - -
            16    group = "Base Map")
            - -

            group = "Base Map") lets you group multiple layers together. This argument will come in handy as we add more information to the map. The base map is the default layer and is always visible - though, when you use map tiles you can define multiple base layers. All other layers will be on top of the base layer. When using different groups, you can define functionality that allows users to turn off certain layers.

            - -

            4. Conclusion

            -

            You’ve created your first base map! It’s a boring flat, grey map, but it’s the base we’ll use when adding in the national and state park data. In part III of this series we’ll process and add in the National Parks.

            - -
            -
            United States base map made with Leaflet
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/6b/0340d6d0e09e12280d7bacd55cb2b37abfdd0ee344e1490065cc821e544984 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/6b/0340d6d0e09e12280d7bacd55cb2b37abfdd0ee344e1490065cc821e544984 deleted file mode 100644 index 74b8b2b..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/6b/0340d6d0e09e12280d7bacd55cb2b37abfdd0ee344e1490065cc821e544984 +++ /dev/null @@ -1,423 +0,0 @@ -I"é~

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW")  
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/6b/9ca936cfaf7173e287e67d928e1a2e97b6634fa380535be1cd8fb24c227f46 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/6b/9ca936cfaf7173e287e67d928e1a2e97b6634fa380535be1cd8fb24c227f46 deleted file mode 100644 index 35b4cc4..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/6b/9ca936cfaf7173e287e67d928e1a2e97b6634fa380535be1cd8fb24c227f46 +++ /dev/null @@ -1,686 +0,0 @@ -I"Ŕţ

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm)
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            Look ma, new code!

            - -
              -
            • line 1
            • -
            - -
            1    split_states <- split(state_parks, f = state_parks$State_Nm) 
            - -

            The split() is part of base R. It takes quite a few arguments, most of which are optional.

            - -

            The first argument is the vector (or data frame) that you want to split into different groups. I want to split the state_parks data into its corresponding states, so it is listed first.

            - -

            The second argument f = is how you want the data split. f in this instance stands for factor. If we run levels(as.factor(state_parks$State_Nm)) in the terminal, it will return a list of the 50 state abbreviations. That is what we’re telling R to do here.

            - -

            You can access an individual state using the $ operator. split_states$CA will return the state park data for California.

            - -
            - -
              -
            • line 2
            • -
            - -
            2    all_names <- names(split_states)
            - -

            names is also part of base R. It does what it sounds like - it gets the names of an object. Here, I want to get the names of each split data sets.

            - -
            - -
              -
            • lines 4-5
            • -
            - -
            4    for(name in all_names){            
            -5        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            - -

            Here’s the actual for loop.

            - -

            The for(x in y) { - do something}

            - -

            8. conclusion

            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me an email or a tweet if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/6c/350b247e3e5df804ed28fc0dcfdeaeb98661a0e8f92fbe34cf7d6be42518c3 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/6c/350b247e3e5df804ed28fc0dcfdeaeb98661a0e8f92fbe34cf7d6be42518c3 deleted file mode 100644 index 1aad4de..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/6c/350b247e3e5df804ed28fc0dcfdeaeb98661a0e8f92fbe34cf7d6be42518c3 +++ /dev/null @@ -1,273 +0,0 @@ -I"ŘF

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers.

            - -
            - - - Image: PAD-US Viewer Key - - -
            PAD-US Viewer Key
            -
            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/6e/e2e6438058e870a3597eef7f3a107ae94f22eae3aebce97a0d1d3cb71df603 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/6e/e2e6438058e870a3597eef7f3a107ae94f22eae3aebce97a0d1d3cb71df603 deleted file mode 100644 index b75df04..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/6e/e2e6438058e870a3597eef7f3a107ae94f22eae3aebce97a0d1d3cb71df603 +++ /dev/null @@ -1,615 +0,0 @@ -I"×ă

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - -
            "chaos ensued"
            - -
            - -

            I tried to map the US Map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/6f/a20d79a1cc8eadc33e524d2b9a64394f609eed0b9e399e496049f9f2119dbb b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/6f/a20d79a1cc8eadc33e524d2b9a64394f609eed0b9e399e496049f9f2119dbb deleted file mode 100644 index dad2a22..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/6f/a20d79a1cc8eadc33e524d2b9a64394f609eed0b9e399e496049f9f2119dbb +++ /dev/null @@ -1,624 +0,0 @@ -I"ié

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            -
            -
            "chaos ensued"
            - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet every time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “hello world!”**

            - -
              -
            • I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is. -** print("Hello World!")
            • -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/71/5e898baf629a8d77b9abc982301cd8f279666a8f7b845e1fc47b1461099de7 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/71/5e898baf629a8d77b9abc982301cd8f279666a8f7b845e1fc47b1461099de7 new file mode 100644 index 0000000..5752feb --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/71/5e898baf629a8d77b9abc982301cd8f279666a8f7b845e1fc47b1461099de7 @@ -0,0 +1,33 @@ +I"í
            +
            +
            +
            + + +
            + Here you'll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from. + + I have done my best to keep my views out of it. + + I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden. + + If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page] +

            + Helpful Resources: +
            1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
            2. +
            3. Check your registration status: https://voterstatus.sos.ca.gov/
            4. +
            5. Access the official voter guide: https://voterguide.sos.ca.gov/
            6. +
            7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
            8. +
            9. Track your ballot: https://california.ballottrax.net/voter/
            10. +
            11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
            + +2022 Primary California + +
            + +
            +
            + +
            +
            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/71/c1602d7b6500c15a663e26d121c27e9c83c6c345fd7f52006db04dc37b3966 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/71/c1602d7b6500c15a663e26d121c27e9c83c6c345fd7f52006db04dc37b3966 deleted file mode 100644 index 12bec76..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/71/c1602d7b6500c15a663e26d121c27e9c83c6c345fd7f52006db04dc37b3966 +++ /dev/null @@ -1,355 +0,0 @@ -I"­e

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. pre-process data in R

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -        filter(State_Nm %!in% territories)  %>% 
            -        filter(Own_Type == "State")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
                territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
                state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • line 5
            • -
            - -
                filter(State_Nm %!in% territories & Own_Type == "Stat")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is Stat. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as Stat. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)). -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/72/6df7e46db37ef1176ec5f53dc80e1b83dfe9e99a195ba7216dea4fdfe7d454 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/72/6df7e46db37ef1176ec5f53dc80e1b83dfe9e99a195ba7216dea4fdfe7d454 deleted file mode 100644 index 708bd50..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/72/6df7e46db37ef1176ec5f53dc80e1b83dfe9e99a195ba7216dea4fdfe7d454 +++ /dev/null @@ -1,686 +0,0 @@ -I"Ľţ

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm)
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            Look ma, new code!

            - -
              -
            • line 1
            • -
            - -
            1    split_states <- split(state_parks, f = state_parks$State_Nm) 
            - -

            The split() is part of base R. It takes quite a few arguments, most of which are optional.

            - -

            The first argument is the vector (or data frame) that you want to split into different groups. I want to split the state_parks data into its corresponding states, so it is listed first.

            - -

            The second argument f = is how you want the data split. f in this instance stands for factor. If we run levels(as.factor(state_parks$State_Nm)) in the terminal, it will return a list of the 50 state abbreviations. That is what we're telling R to do here.

            - -

            You can access an individual state using the $ operator. split_states$CA will return the state park data for California.

            - -
            - -
              -
            • line 2
            • -
            - -
            2    all_names <- names(split_states)
            - -

            names is also part of base R. It does what it sounds like - it gets the names of an object. Here, I want to get the names of each split data sets.

            - -
            - -
              -
            • lines 4-5
            • -
            - -
            4    for(name in all_names){            
            -5        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            - -

            Here’s the actual for loop.

            - -

            The for(x in y) { - do something}

            - -

            8. conclusion

            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me an email or a tweet if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/73/6183de5aca99fce5d49c962e6a8babf0ad670b2c93f8f7ae81af5cfdbe3baf b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/73/6183de5aca99fce5d49c962e6a8babf0ad670b2c93f8f7ae81af5cfdbe3baf deleted file mode 100644 index 32cdbb3..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/73/6183de5aca99fce5d49c962e6a8babf0ad670b2c93f8f7ae81af5cfdbe3baf +++ /dev/null @@ -1,687 +0,0 @@ -I"éţ

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm)
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            Look ma, new code!

            - -
              -
            • line 1
            • -
            - -
            1    split_states <- split(state_parks, f = state_parks$State_Nm) 
            - -

            The split() is part of base R. It takes quite a few arguments, most of which are optional.

            - -

            The first argument is the vector (or data frame) that you want to split into different groups. I want to split the state_parks data into its corresponding states, so it is listed first.

            - -

            The second argument f = is how you want the data split. f in this instance stands for factor. If we run levels(as.factor(state_parks$State_Nm)) in the terminal, it will return a list of the 50 state abbreviations. That is what we’re telling R to do here.

            - -

            You can access an individual state using the $ operator. split_states$CA will return the state park data for California.

            - -
            - -
              -
            • line 2
            • -
            - -
            2    all_names <- names(split_states)
            - -

            names is also part of base R. It does what it sounds like - it gets the names of an object. Here, I want to get the names of each split data sets.

            - -
            - -
              -
            • lines 4-5
            • -
            - -
            4    for(name in all_names){            
            -5        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            - -

            Here’s the actual for loop.

            - -

            The basic logic of a for loop is: - for(x in y) {
            - do something}

            - -

            8. conclusion

            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me an email or a tweet if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/74/1bb9441c4a45416889b63630f9b4d1e7de8960bc5c9648b6150bbffef4d3af b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/74/1bb9441c4a45416889b63630f9b4d1e7de8960bc5c9648b6150bbffef4d3af new file mode 100644 index 0000000..f14acb3 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/74/1bb9441c4a45416889b63630f9b4d1e7de8960bc5c9648b6150bbffef4d3af @@ -0,0 +1,24 @@ +I"˝

            welcome

            +

            Here you’ll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from.

            + +

            I have done my best to keep my views out of it.

            + +

            I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden.

            + +

            If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page]

            + +

            helpful resources:

            +
              +
            1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
            2. +
            3. Check your registration status: https://voterstatus.sos.ca.gov/
            4. +
            5. Access the official voter guide: https://voterguide.sos.ca.gov/
            6. +
            7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
            8. +
            9. Track your ballot: https://california.ballottrax.net/voter/
            10. +
            11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
            12. +
            + +

            ballot guides

            + +
                <a href="https://github.com/liz-muehlmann/Election_Guides/raw/main/California/Primary%20Elections/National%20and%20State/2022%20-%20Primary%20-%20California.pdf" download="2022_Primary_CA.pdf">2022 Primary California</a> s
            +
            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/74/c766532bbc8a69acd484635a8d80dc5d759d1b04d4ba61f5e4b37cd81d7525 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/74/c766532bbc8a69acd484635a8d80dc5d759d1b04d4ba61f5e4b37cd81d7525 deleted file mode 100644 index 1240461..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/74/c766532bbc8a69acd484635a8d80dc5d759d1b04d4ba61f5e4b37cd81d7525 +++ /dev/null @@ -1,347 +0,0 @@ -I"%c

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. pre-process data in R

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -        filter(State_Nm %!in% territories)  %>% 
            -        filter(Own_Type == "State")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
                territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
                state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • line 5
            • -
            - -
                filter(State_Nm %!in% territories & Own_Type == "State")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call.

            - -

            Common filter operators include & (and), | (or), < (less than), or </code>></code> (greater than).

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/76/eac0f3fdeb074c8d7a6378b05c43e8b0a52a29eb9306ad679262be44dba6d6 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/76/eac0f3fdeb074c8d7a6378b05c43e8b0a52a29eb9306ad679262be44dba6d6 deleted file mode 100644 index 469e6a4..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/76/eac0f3fdeb074c8d7a6378b05c43e8b0a52a29eb9306ad679262be44dba6d6 +++ /dev/null @@ -1,261 +0,0 @@ -I"E

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences blocking these areas off. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/78/256997bdedae112ce920a2c66baf2f0d450fbc3dd2c7e20070149f8ac457ef b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/78/256997bdedae112ce920a2c66baf2f0d450fbc3dd2c7e20070149f8ac457ef deleted file mode 100644 index 94b7c29..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/78/256997bdedae112ce920a2c66baf2f0d450fbc3dd2c7e20070149f8ac457ef +++ /dev/null @@ -1,503 +0,0 @@ -I"2¦

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7b/87e20d6416c7b12e250013029905f5d945f540f95a248d62dd5ab4d3f0e0e8 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7b/87e20d6416c7b12e250013029905f5d945f540f95a248d62dd5ab4d3f0e0e8 deleted file mode 100644 index a572d4f..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7b/87e20d6416c7b12e250013029905f5d945f540f95a248d62dd5ab4d3f0e0e8 +++ /dev/null @@ -1,413 +0,0 @@ -I"B}

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW")  
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            15    filter(Des_Tp == "ACC" |
            -16           Des_Tp == "HCA" |
            -17           Des_Tp == "REC" |
            -18           Des_Tp == "SCA" |
            -19           Des_Tp == "SHCA" |
            -20           Des_Tp == "SP" |
            -21           Des_Tp == "SREC" |
            -22           Des_Tp == "SRMA" |
            -23           Des_Tp == "SW")  
            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7c/00be513afa746fc3c79a9019e35b3a9f29492303bf58063514a5c8eb9088cb b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7c/00be513afa746fc3c79a9019e35b3a9f29492303bf58063514a5c8eb9088cb new file mode 100644 index 0000000..c8c58fe --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7c/00be513afa746fc3c79a9019e35b3a9f29492303bf58063514a5c8eb9088cb @@ -0,0 +1,33 @@ +I"
            +
            +
            +
            + + +
            + Here you'll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from. + + I have done my best to keep my views out of it. + + I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden. + + If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page] +

            + Helpful Resources: +
            1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
            2. +
            3. Check your registration status: https://voterstatus.sos.ca.gov/
            4. +
            5. Access the official voter guide: https://voterguide.sos.ca.gov/
            6. +
            7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
            8. +
            9. Track your ballot: https://california.ballottrax.net/voter/
            10. +
            11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
            + +2022 Primary California + +
            + +
            +
            + +
            +
            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7d/64d0a1a77390c0b8a1f87c940840e8ae02e78011fadfb54aec8690be275769 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7d/64d0a1a77390c0b8a1f87c940840e8ae02e78011fadfb54aec8690be275769 deleted file mode 100644 index 794c1e6..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7d/64d0a1a77390c0b8a1f87c940840e8ae02e78011fadfb54aec8690be275769 +++ /dev/null @@ -1,301 +0,0 @@ -I"˙P

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -
              -
            1. pre-process data in R
            2. -
            - -
            1
            -2
            -3
            -
                state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories)  %>% 
            -    filter(Own_Type == "State")
            -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7e/30f30756ffe2aac8d9b485a1ca34212849816505628f57068da583a45bcfda b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7e/30f30756ffe2aac8d9b485a1ca34212849816505628f57068da583a45bcfda deleted file mode 100644 index e0b08c3..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7e/30f30756ffe2aac8d9b485a1ca34212849816505628f57068da583a45bcfda +++ /dev/null @@ -1,600 +0,0 @@ -I"iá

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. add state parks to the map

            -
            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7e/43e3b85eed951c7308f256ae90deb98f3e4c3d83d2bea7c3223c6d00448730 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7e/43e3b85eed951c7308f256ae90deb98f3e4c3d83d2bea7c3223c6d00448730 deleted file mode 100644 index 0a5384d..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7e/43e3b85eed951c7308f256ae90deb98f3e4c3d83d2bea7c3223c6d00448730 +++ /dev/null @@ -1,701 +0,0 @@ -I"i

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm)
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            Look ma, new code!

            - -
              -
            • line 1
            • -
            - -
            1    split_states <- split(state_parks, f = state_parks$State_Nm) 
            - -

            The split() is part of base R. It takes quite a few arguments, most of which are optional.

            - -

            The first argument is the vector (or data frame) that you want to split into different groups. I want to split the state_parks data into its corresponding states, so it is listed first.

            - -

            The second argument f = is how you want the data split. f in this instance stands for factor. If we run levels(as.factor(state_parks$State_Nm)) in the terminal, it will return a list of the 50 state abbreviations. That is what we’re telling R to do here.

            - -

            You can access an individual state using the $ operator. split_states$CA will return the state park data for California.

            - -
            - -
              -
            • line 2
            • -
            - -
            2    all_names <- names(split_states)
            - -

            names is also part of base R. It does what it sounds like - it gets the names of an object. Here, I want to get the names of each split data sets.

            - -
            - -
              -
            • lines 4-5
            • -
            - -
            4    for(name in all_names){            
            -5        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            - -

            Here’s the actual for loop.

            - -

            The basic logic of a for loop is:
            - for(x in y){
            - do something}

            - -

            Inside the parenthesis is the condition that must evaluate to TRUE if the content in the curly braces is to run.

            - -

            In line 4, for(name in all_names){ says as long as there’s a name in the list of all names, do whatever is inside the curly braces. name can be whatever you want. It’s a placeholder value. I can have it say for(dogs in all_names){ it will still do the exact same thing. A lot of time you’ll see it as an i for item. I like to use more descriptive language because, again, for loops are my Achilles’ heel.

            - -

            The all_names part is where ever you want R to look for the data. It will change based on your data set and variable naming conventions.

            - -

            In line 5, I save the split data sets.

            - -

            st_write() is part of the sf package which allows us to create shapefiles. This can be any saving function (eg. write_csv() if you want to save CSVs). The function takes several arguments. In line 43 above I showed the basic structure: st_write(data, path/to/file.shp). This is good if you only have one file, but since I’m saving them in a loop I don’t want all of the files to have the same name. R will error out after the first and tell you the file already exists.

            - -

            The first part split_states[[name]] is still telling R what data to save, but using an index instead of a specific data frame name. To access an index you use data[[some-value]] where some-value is the index location. In my code, R will take the split_states data and go alright the first index location in [[name]] is 1 and return whatever value is stored in that index (here, AK). It will then do that for every index location as it loops through the split_states data.

            - -

            paste0() is also part of base R.

            - -

            8. conclusion

            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me an email or a tweet if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7f/efba38bb79efc816743af381249476c31fed29185e0f496af40d00f8367f55 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7f/efba38bb79efc816743af381249476c31fed29185e0f496af40d00f8367f55 deleted file mode 100644 index 4cec042..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/7f/efba38bb79efc816743af381249476c31fed29185e0f496af40d00f8367f55 +++ /dev/null @@ -1,705 +0,0 @@ -I" - -

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm)
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            Look ma, new code!

            - -
              -
            • line 1
            • -
            - -
            1    split_states <- split(state_parks, f = state_parks$State_Nm) 
            - -

            The split() is part of base R. It takes quite a few arguments, most of which are optional.

            - -

            The first argument is the vector (or data frame) that you want to split into different groups. I want to split the state_parks data into its corresponding states, so it is listed first.

            - -

            The second argument f = is how you want the data split. f in this instance stands for factor. If we run levels(as.factor(state_parks$State_Nm)) in the terminal, it will return a list of the 50 state abbreviations. That is what we’re telling R to do here.

            - -

            You can access an individual state using the $ operator. split_states$CA will return the state park data for California.

            - -
            - -
              -
            • line 2
            • -
            - -
            2    all_names <- names(split_states)
            - -

            names is also part of base R. It does what it sounds like - it gets the names of an object. Here, I want to get the names of each split data sets.

            - -
            - -
              -
            • lines 4-5
            • -
            - -
            4    for(name in all_names){            
            -5        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            - -

            Here’s the actual for loop.

            - -

            The basic logic of a for loop is:
            - for(x in y){
            - do something}

            - -

            Inside the parenthesis is the condition that must evaluate to TRUE if the content in the curly braces is to run.

            - -

            In line 4, for(name in all_names){ says as long as there’s a name in the list of all names, do whatever is inside the curly braces. name can be whatever you want. It’s a placeholder value. I can have it say for(dogs in all_names){ it will still do the exact same thing. A lot of time you’ll see it as an i for item. I like to use more descriptive language because, again, for loops are my Achilles’ heel.

            - -

            The all_names part is where ever you want R to look for the data. It will change based on your data set and variable naming conventions.

            - -

            In line 5, I save the split data sets.

            - -

            st_write() is part of the sf package which allows us to create shapefiles. This can be any saving function (eg. write_csv() if you want to save CSVs). The function takes several arguments. In line 43 above I showed the basic structure: st_write(data, path/to/file.shp). This is good if you only have one file, but since I’m saving them in a loop I don’t want all of the files to have the same name. R will error out after the first and tell you the file already exists.

            - -

            The first part split_states[[name]] is still telling R what data to save, but using an index instead of a specific data frame name. To access an index you use data[[some-value]] where some-value is the index location. In my code, R will take the split_states data and go alright the first index location in [[name]] is 1 and return whatever value is stored in that index (here, AK). It will then do that for every index location as it loops through the split_states data.

            - -

            paste0() is also part of base R - it’s apparently faster than paste(). It concatenates (or links together) different pieces into one. I’m using it to create the filename. Within the paste0 call anything within quotation marks is static. So every file will be saved to "shapefiles/shifted/states/individual/" and every file will have the extension .shp. What will change with each loop is the name of the file. One by one, R will loop through and save each file using the name it pulled from all_names.

            - -

            st_write() automatically creates the other three files that each “shapefile” needs. When the loop is done, you should have a folder of 200 files (50 states * 4 files each). Which is why I strongly recommend using DVC if you’re doing any kind of version control.

            - -

            8. conclusion

            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me an email or a tweet if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/82/25b629fb01a6c27e819f696097862d2eb3d160d8436096a96f2843a846924a b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/82/25b629fb01a6c27e819f696097862d2eb3d160d8436096a96f2843a846924a deleted file mode 100644 index a7980d9..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/82/25b629fb01a6c27e819f696097862d2eb3d160d8436096a96f2843a846924a +++ /dev/null @@ -1,238 +0,0 @@ -I"[A

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -

            The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management and the Bureau of Reclamation. Having visited the park, I can tell you there’s no fences blocking these areas off. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area. It will be a good test case to make sure I’m selecting the correct data.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/83/2e45bda82b83128d8bbca6447247838157f869e47fc3d61a8b4ee0d2aa6530 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/83/2e45bda82b83128d8bbca6447247838157f869e47fc3d61a8b4ee0d2aa6530 deleted file mode 100644 index 3dd8759..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/83/2e45bda82b83128d8bbca6447247838157f869e47fc3d61a8b4ee0d2aa6530 +++ /dev/null @@ -1,701 +0,0 @@ -I"Q

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm)
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            Look ma, new code!

            - -
              -
            • line 1
            • -
            - -
            1    split_states <- split(state_parks, f = state_parks$State_Nm) 
            - -

            The split() is part of base R. It takes quite a few arguments, most of which are optional.

            - -

            The first argument is the vector (or data frame) that you want to split into different groups. I want to split the state_parks data into its corresponding states, so it is listed first.

            - -

            The second argument f = is how you want the data split. f in this instance stands for factor. If we run levels(as.factor(state_parks$State_Nm)) in the terminal, it will return a list of the 50 state abbreviations. That is what we’re telling R to do here.

            - -

            You can access an individual state using the $ operator. split_states$CA will return the state park data for California.

            - -
            - -
              -
            • line 2
            • -
            - -
            2    all_names <- names(split_states)
            - -

            names is also part of base R. It does what it sounds like - it gets the names of an object. Here, I want to get the names of each split data sets.

            - -
            - -
              -
            • lines 4-5
            • -
            - -
            4    for(name in all_names){            
            -5        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            - -

            Here’s the actual for loop.

            - -

            The basic logic of a for loop is:
            - for(x in y){
            - do something}

            - -

            Inside the parenthesis is the condition that must evaluate to TRUE if the content in the curly braces is to run.

            - -

            In line 4, for(name in all_names){ says as long as there’s a name in the list of all names, do whatever is inside the curly braces. name can be whatever you want. It’s a placeholder value. I can have it say for(dogs in all_names){ it will still do the exact same thing. A lot of time you’ll see it as an i for item. I like to use more descriptive language because, again, for loops are my Achilles’ heel.

            - -

            The all_names part is where ever you want R to look for the data. It will change based on your data set and variable naming conventions.

            - -

            In line 5, I save the split data sets.

            - -

            st_write() is part of the sf package which allows us to create shapefiles. This can be any saving function (eg. write_csv() if you want to save CSVs). The function takes several arguments. In line 43 above I showed the basic structure: st_write(data, path/to/file.shp). This is good if you only have one file, but since I’m saving them in a loop I don’t want all of the files to have the same name. R will error out after the first and tell you the file already exists.

            - -

            The first part split_states[[name]] is still telling R what data to save, but using an index instead of a specific data frame name. To access an index you use data[[some-value]] where some-value is the index location. In my code, R will take the split_states data and go alright the first index location in [[name]] is 1 and return whatever value is stored in that index (here, AK). It will then do that for every index location as it loops through the split_states data.

            - -

            paste0()

            - -

            8. conclusion

            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me an email or a tweet if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/84/16f54dc8f1119e102c811f3b6ddcf3977aae456ee1c7539666388b88bbde00 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/84/16f54dc8f1119e102c811f3b6ddcf3977aae456ee1c7539666388b88bbde00 new file mode 100644 index 0000000..f58aa06 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/84/16f54dc8f1119e102c811f3b6ddcf3977aae456ee1c7539666388b88bbde00 @@ -0,0 +1,28 @@ +I"˝

            welcome

            +
            + +

            Here you’ll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from.

            + +

            I have done my best to keep my views out of it.

            + +

            I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden.

            + +

            If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page]

            + +

            helpful resources

            +
            + +
              +
            1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
            2. +
            3. Check your registration status: https://voterstatus.sos.ca.gov/
            4. +
            5. Access the official voter guide: https://voterguide.sos.ca.gov/
            6. +
            7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
            8. +
            9. Track your ballot: https://california.ballottrax.net/voter/
            10. +
            11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
            12. +
            + +

            ballot guides

            +
            + +

            Clink on a link to download the PDF.

            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/8b/3729b9dce6676a5ec02ede434da75a73b848be8c9aaeb49a63b163ca1eba47 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/8b/3729b9dce6676a5ec02ede434da75a73b848be8c9aaeb49a63b163ca1eba47 deleted file mode 100644 index 6ee89f5..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/8b/3729b9dce6676a5ec02ede434da75a73b848be8c9aaeb49a63b163ca1eba47 +++ /dev/null @@ -1,595 +0,0 @@ -I"¦ŕ

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/8c/47bbc7fff350ec7a9c0030a6c61ffbc57f6adf24b8c92ec49bd6727cf32edb b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/8c/47bbc7fff350ec7a9c0030a6c61ffbc57f6adf24b8c92ec49bd6727cf32edb deleted file mode 100644 index 04dc3af..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/8c/47bbc7fff350ec7a9c0030a6c61ffbc57f6adf24b8c92ec49bd6727cf32edb +++ /dev/null @@ -1,281 +0,0 @@ -I"tJ

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/8d/92e7954d80f33b4f97a12f3270341f738f014ee3cc4d1f9fd44eba9ebb2024 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/8d/92e7954d80f33b4f97a12f3270341f738f014ee3cc4d1f9fd44eba9ebb2024 deleted file mode 100644 index 0856258..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/8d/92e7954d80f33b4f97a12f3270341f738f014ee3cc4d1f9fd44eba9ebb2024 +++ /dev/null @@ -1,413 +0,0 @@ -I"9}

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW")  
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/92/24401fdcaa8d6dfa73d18246c74e314c8d64880a99436c323c7d49fb700454 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/92/24401fdcaa8d6dfa73d18246c74e314c8d64880a99436c323c7d49fb700454 deleted file mode 100644 index 2f8afa6..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/92/24401fdcaa8d6dfa73d18246c74e314c8d64880a99436c323c7d49fb700454 +++ /dev/null @@ -1,417 +0,0 @@ -I"î}

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW")  
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/92/680fec0ac6d780bd58cc43f6ffbd33e670c8406a5f5ff8047a6084165f1e0e b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/92/680fec0ac6d780bd58cc43f6ffbd33e670c8406a5f5ff8047a6084165f1e0e deleted file mode 100644 index 7c73f6e..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/92/680fec0ac6d780bd58cc43f6ffbd33e670c8406a5f5ff8047a6084165f1e0e +++ /dev/null @@ -1,177 +0,0 @@ -I"+[ -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    ## load and process state park data
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>% 
            -    filter(State_Nm %!in% territories)  %>% 
            -    filter(d_Own_Type == "State" & d_Des_Tp == "Recreation Management Area" | 
            -           d_Des_Tp == "State Historic or Cultural Area"  | 
            -           d_Des_Tp == "State Park" | 
            -           d_Des_Tp == "State Wilderness")  %>% 
            -    select(d_Des_Tp, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres) %>% 
            -    mutate(type = case_when(d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area", 
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Park" ~ "State Park",
            -                            d_Des_Tp == "State Wilderness" ~ "State Wilderness"))   %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               TRUE ~ "not visited"))  %>% 
            -      shift_geometry(preserve_area = FALSE, # resizes alaska to fit with the size of the other states
            -                    position = "below") %>% # moves alaska so it's near hawaii
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs") # changes the geographic data from NAD83 to WGS84 
            -
            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 here is the same as it was when I processed the National Park data in part III. It creates a list of the postal codes associated with the United States’ associated territories and lands. In line 6 we use it to filter these areas out of the data set because I am only interested in the 50 states.

            - -
            - -
              -
            • line 5
            • -
            - -
            5   state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>% 
            - -

            Line 5 is where we actually load the data into R. Before, when I used st_layers() I was only getting the layer information from the geopackage. Here, I use the st_read() function which takes two arguments.

            - -

            The first argument is the path to the geopackage. Geopackage files contain a lot of information, so make sure you choose the file with the .gpkg extension.

            - -

            The second argument layer tells R which layer from the geopackage to open. Here, I load the Fee layer because it has all the data I’m interested in.

            - -

            The %>% is part of the tidyverse and it tells R to continue processing the data on the next line.

            - -
            - -
              -
            • line 6
            • -
            - -
            6   filter(State_Nm %!in% territories)  %>% 
            - -

            I went over the logic of this line when I did the National Park data. You can read about it more in detail in Part III. All this line does is filter out the US’ associated islands and territories.

            - -
            - -
              -
            • line 7-10
            • -
            - -
            7    filter(d_Own_Type == "State" & d_Des_Tp == "Recreation Management Area" |
            -8           d_Des_Tp == "State Historic or Cultural Area"  | 
            -9           d_Des_Tp == "State Park" | 
            -10          d_Des_Tp == "State Wilderness")  %>% 
            - -

            The filter function in the tidyverse is very powerful. It can do complex operations by using certain operators. Here, I’m using it to filter for ownership type. The PAD-US data includes national parks, Native American land, and military installations. I didn’t want any of these areas on my map so I filtered for "State" ownership.

            - -

            However, I didn’t want just state ownership because that also includes a lot of land areas that aren’t necessarily state parks - like watersheds or mineral areas. I use the & operator to tell R I want land owned by the state and of a certain designation type.

            - -

            The d_Des_Type column contains the land’s designation code. I am selecting for state ownership and a designation type of “Recreation Management Area” or | “State Historic or Cultural Area” or “State Park” or “State Wilderness” in lines 7-10.

            - -

            This line will give me state owned land that is probably a state park.

            - -
            - -
              -
            • line 11
            • -
            - -
            11   select(d_Des_Tp, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres) %>% 
            - -

            select() lets me choose the columns I want to keep by name, rather than by index number. I want to reduce PAD-US data size and get rid of any information I don’t need for my map. You’ll need to inspect the data and look at the documentation if you want to use different columns.

            - -

            I wanted the land designation type (state park, wilderness, etc) in the d_Des_Type column. The Park Name (Unit_Name), state abbreviation (State_Nm), long form of the state name (d_State_Nm), and size of the park (GIS).

            - -
            - Note: R will automatically keep the geometry column. In the PAD-US data it's the SHAPE column. To drop the geometry, use st_drop_geometry()
            - -
            - -
              -
            • line 12-15
            • -
            - -
            12     mutate(type = case_when(d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area", 
            -13                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historic or Cultural Area", 
            -14                            d_Des_Tp == "State Park" ~ "State Park",
            -15                            d_Des_Tp == "State Wilderness" ~ "State Wilderness"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 16-17
            • -
            - -
            16  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -17                             TRUE ~ "not visited"))  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks.

            - -

            For the state parks, though, it is unwieldy. Even in its reduced size the PAD-US data has over 5,000 parks. It would be cumbersome to sort through all 5,000 parks and individually code “visited” or “not visited.”

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was a lot.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            Instead, I am going to focus on the parks I know I’ve visited and have photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            In line 16, I create the visited column using the tidyverse's mutate() function. I populate the column using case_when() which looks in the Unit_Nm column for the park named Valley of Fire State Park. Once found, it puts *visited* in the visited column.

            - -

            Line 17 operates in the same way as an else statement. It enters not visited into the visited column for every park not specifically listed using case_when().

            - -
            - -
              -
            • lines 18-21
            • -
            - -
            18      shift_geometry(preserve_area = FALSE, # resizes alaska to fit with the size of the other states
            -19                     position = "below") %>% # moves alaska so it's near hawaii
            -20      sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs") # changes the geographic data from NAD83 to WGS84 
            -21      st_write(state_parks, "./Documents/GitHub/nps/shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 18-19 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 20 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 21 saves the shifted shapefile to the hard drive.

            - -

            6. add state parks to the map

            -
            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/94/b1e8d79659cf402e2d8be09a8d5f855cd3b09d26d1a465465e4d4ccb906b15 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/94/b1e8d79659cf402e2d8be09a8d5f855cd3b09d26d1a465465e4d4ccb906b15 deleted file mode 100644 index df84e80..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/94/b1e8d79659cf402e2d8be09a8d5f855cd3b09d26d1a465465e4d4ccb906b15 +++ /dev/null @@ -1,681 +0,0 @@ -I"]ţ

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm)
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            Look ma, new code!

            - -
              -
            • line 1
            • -
            - -
            1    split_states <- split(state_parks, f = state_parks$State_Nm) 
            - -

            The split() is part of base R. It takes quite a few arguments, most of which are optional.

            - -

            The first argument is the vector (or data frame) that you want to split into different groups. I want to split the state_parks data into its corresponding states, so it is listed first.

            - -

            The second argument f = is how you want the data split. f in this instance stands for factor. If we run levels(as.factor(state_parks$State_Nm)) in the terminal, it will return a list of the 50 state abbreviations. That is what we're telling R to do here.

            - -

            You can access an individual state using the $ operator. split_states$CA will return the state park data for California.

            - -
            - -
              -
            • line 2
            • -
            - -
            2    all_names <- names(split_states)
            - -

            names is also part of base R. It does what it sounds like - it gets the names of an object. Here, I want to get the names of each split data sets.

            - -
            - -
              -
            • lines 4-5
            • -
            - -
            4    for(name in all_names){            
            -5        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            - -

            8. conclusion

            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me an email or a tweet if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/96/c00cb466ef89a19b59e7796c763fd7b8f9957d37ec7b672032898c3219a4f7 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/96/c00cb466ef89a19b59e7796c763fd7b8f9957d37ec7b672032898c3219a4f7 deleted file mode 100644 index 9b5550c..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/96/c00cb466ef89a19b59e7796c763fd7b8f9957d37ec7b672032898c3219a4f7 +++ /dev/null @@ -1,402 +0,0 @@ -I"s

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. pre-process data in R

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW")  
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
                territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
                state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • line 5
            • -
            - -
                filter(State_Nm %!in% territories & Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines
            • -
            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/97/63333d2b147d1c76b17395c7e304f897053193a681a660fe8c7c95bcff87af b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/97/63333d2b147d1c76b17395c7e304f897053193a681a660fe8c7c95bcff87af deleted file mode 100644 index a941dc5..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/97/63333d2b147d1c76b17395c7e304f897053193a681a660fe8c7c95bcff87af +++ /dev/null @@ -1,608 +0,0 @@ -I"â

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/97/b0ff21c5a67d9b4254f7e4d45434273abc597c7c705e196fa6a0d0efb2ac63 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/97/b0ff21c5a67d9b4254f7e4d45434273abc597c7c705e196fa6a0d0efb2ac63 deleted file mode 100644 index 7fd4cb1..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/97/b0ff21c5a67d9b4254f7e4d45434273abc597c7c705e196fa6a0d0efb2ac63 +++ /dev/null @@ -1,597 +0,0 @@ -I"á

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/97/fc0c6ce173ae4f31dd35976b331bc83dbac3b11b20169caf95114d3d716130 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/97/fc0c6ce173ae4f31dd35976b331bc83dbac3b11b20169caf95114d3d716130 deleted file mode 100644 index ab6ec1b..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/97/fc0c6ce173ae4f31dd35976b331bc83dbac3b11b20169caf95114d3d716130 +++ /dev/null @@ -1,305 +0,0 @@ -I"®S

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. pre-process data in R

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -
                state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories)  %>% 
            -    filter(Own_Type == "State")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part I of this series.

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/99/8b4af99d68d1a7a3e2762c1652d219fe8062ad67d3f13ade057a9aa63e7b01 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/99/8b4af99d68d1a7a3e2762c1652d219fe8062ad67d3f13ade057a9aa63e7b01 deleted file mode 100644 index 9a2778c..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/99/8b4af99d68d1a7a3e2762c1652d219fe8062ad67d3f13ade057a9aa63e7b01 +++ /dev/null @@ -1,595 +0,0 @@ -I"Żĺ

            This is part three of my cartography in R series. If you are just finding this, I suggest taking a look at part I and part II first.

            - -

            In this post, I will download and process the National Park data. Once that’s done, I’ll add it to the base map I created in part II.

            - - - -
            - Note: I know that my posts are verbose. I am sorry that I am not more concise but one of the biggest problems I have with code documentation is that it assumes you already know what you're doing. I'm writing these for absolute beginners that want to know what each part of the code does. If you want more concise explanations, I suggest looking at the annotated r files in the project's GitHub. -
            - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load usa data

            -
            - -
            1
            -2
            -
                ## load data
            -    states <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp to reflect wherever you saved the shifted shapefile.

            - -

            If your data processing and base map creation are in the same file, you can skip this line, and when you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download national park data

            -
            - -

            The National Park Service provides all the data we’ll need to make the map. The data is accessible on the ArcGIS’ Open Data website. Once you click on the link you’ll see a bunch of icons that lead to different data that’s available for download. Click on the one for boundaries.

            - -
            - - - Image: NPS GIS website - - -
            NPS GIS website
            -
            - -

            From here, you’ll be taken to a list of available National Park data. The second link should be nps boundary which contains the shape data for all the National Parks in the United States. The file contains all the data for the park outlines along with hiking trails, rest areas, and lots of other data.

            - -
            - - - Image: NPS download page - - -
            NPS download page
            -
            - -

            The nps boundary link will take you to a map showing the national parks. On the left, there will be a download link on the left.

            - -
            - - - Image: NPS map and download link - - -
            NPS map and download link
            -
            - -

            From here, you’ll have a few download options. The National Park Service provides the data in different formats including CSV and Shapefile. You’ll want to download the shapefile version.

            - -
            - - - Image: download nps shapefile - - -
            download nps shapefile
            -
            - -

            Be sure to save the file somewhere on your hard drive that is easy to find. When it finishes downloading, be sure to unzip the file. There will be four files inside the folder. All of them need to be kept in the same location. Even though we’ll only load the .shp file, R uses the three others to create the necessary shapes.

            - -

            4. process national park data

            -
            - -

            The code below may look intimidating, but it’s fairly straight forward. I’ll go over each line below.

            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    ## load and process nps data
            -    nps <- read_sf("./shapefiles/original/nps/NPS_-_Land_Resources_Division_Boundary_and_Tract_Data_Service.shp")  %>% 
            -    select(STATE, UNIT_TYPE, PARKNAME, Shape__Are, geometry) %>% 
            -    filter(STATE %!in% territories) %>%  
            -    mutate(type = case_when(UNIT_TYPE == "International Historic Site" ~ "International Historic Site", # there's 23 types of national park, I wanted to reduce this number.
            -                UNIT_TYPE == "National Battlefield Site" ~ "National Military or Battlefield", # lines 56-77 reduce the number of park types
            -                UNIT_TYPE == "National Military Park" ~ "National Military or Battlefield", 
            -                UNIT_TYPE == "National Battlefield" ~ "National Military or Battlefield",
            -                UNIT_TYPE == "National Historical Park" ~ "National Historical Park, Site, Monument, or Memorial",
            -                UNIT_TYPE == "National Historic Site" ~ "National Historical Park, Site, Monument, or Memorial",
            -                UNIT_TYPE == "National Historic Trail" ~ "National Historical Park, Site, Monument, or Memorial",
            -                UNIT_TYPE == "National Memorial" ~ "National Historical Park, Site, Monument, or Memorial",
            -                UNIT_TYPE == "National Monument" ~ "National Historical Park, Site, Monument, or Memorial",
            -                UNIT_TYPE == "National Preserve" ~ "National Preserve, Reserve, or Recreation Area",
            -                UNIT_TYPE == "National Reserve" ~ "National Preserve, Reserve, or Recreation Area",
            -                UNIT_TYPE == "National Recreation Area" ~ "National Preserve, Reserve, or Recreation Area",
            -                UNIT_TYPE == "National River" ~ "National River, Lakeshore, or Seashore",
            -                UNIT_TYPE == "National Lakeshore" ~ "National River, Lakeshore, or Seashore",
            -                UNIT_TYPE == "National Wild & Scenic River" ~ "National River, Lakeshore, or Seashore",
            -                UNIT_TYPE == "National Seashore" ~ "National River, Lakeshore, or Seashore",
            -                UNIT_TYPE == "National Trails Syste" ~ "National Trail",
            -                UNIT_TYPE == "National Scenic Trail" ~ "National Trail",
            -                UNIT_TYPE == "National Park" ~ "National Park or Parkway",
            -                UNIT_TYPE == "Park" ~ "National Park or Parkway",
            -                UNIT_TYPE == "Parkway" ~ "National Park or Parkway",
            -                UNIT_TYPE == "Other Designation" ~ "Other National Land Area"))  %>%  
            -    mutate(visited = case_when(PARKNAME == "Joshua Tree" ~ "visited", 
            -                                PARKNAME == "Redwood" ~ "visited", 
            -                                PARKNAME == "Santa Monica Mountains" ~ "visited", 
            -                                PARKNAME == "Sequoia" ~ "visited", 
            -                                PARKNAME == "Kings Canyon" ~ "visited",
            -                                PARKNAME == "Lewis and Clark" ~ "visited",
            -                                PARKNAME == "Mount Rainier" ~ "visited",
            -                                TRUE ~ "not visited")) %>%  
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save the shifted national park data
            -    st_write(nps, "~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            In part I of this series I talked about how R has an %in% function, but not a %!in% function. Here’s where the latter function shines.

            - -

            The United States is still an empire with its associated territories and islands. In this project I am interested in the 50 states - without these other areas. As a result, I need to filter them out. Using base R’s %in% function I would have to create a variable that contains the postal abbreviations for all 50 states. That is annoying. Instead, I want to use the shorter list that only includes the US’ associated islands and territories. To do so, however, I need to use the operator tools’ %!in% function.

            - -

            Line 2 creates the list of US territories that I filter out in line 7. The c() function in R means combine or concatenate. Inside the parenthesis are the five postal codes for the American Samoa, Guam, the Northern Mariana Islands, Puerto Rico, and the Virgin Islands.

            - -
            - -
              -
            • line 5
            • -
            - -
            5   nps <- read_sf("path/to/file/NPS_-_Land_Resources_Division_Boundary_and_Tract_Data_Service.shp")  %>% 
            - -

            nps <- read_sf("path/to/file.shp") loads the National Park data set to a variable called nps using the read_sf() function that is part of the sf package. You will need to change the file path so it reflects where you saved the data on your hard drive.

            - -

            The %>% operator is part of the tidyverse package. It tells R to go to the next line and process the next command. It has to go at the end of a line, rather than the beginning.

            - -
            - -
              -
            • line 6
            • -
            - -
            6    select(STATE, UNIT_TYPE, PARKNAME, Shape__Are, geometry) %>%
            - -

            select is part of the tidyverse package. With it, we can select columns by their name rather than their associated number. Large data sets take more computing power because the computer has to iterate over more rows. Unfortunately, rendering maps also takes a lot of computing power so I like to discard any unnecessary columns to reduce the amount of effort my computer has to exert.

            - -

            Deciding on which columns to keep will depend on the data you’re using and what you want to map (or analyze). I know for my project I want to include a few things:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type (park, battlefield, memorial, etc)
            • -
            • and the park’s size.
            • -
            - -

            There’s a couple ways to inspect the data to see what kind of information is available.

            - -
              -
            1. You can view the entire data set by using view(nps) but as the number of data points increases, so does R’s struggle with opening it. I’ve found that VSCode doesn’t throw as big of a fit as R Studio when opening large data sets.
            2. -
            3. Another way is to just look at the column names using data.frame(colnames(nps)). This will return a list of the data set’s column names. This is my preferred method. I then go to the documentation to see what each column contains. This isn’t fool-proof because it really depends on if the data has good documentation.
            4. -
            - -

            The National Park data includes a lot of information about who created the data and maintains the property. I’m not interested in this, so in line 6 I select the following columns:

            -
              -
            • STATE [the two character state abbreviation]
            • -
            • UNIT_TYPE [if it’s a national park, battlefield, etc]
            • -
            • PARKNAME [the name of the park]
            • -
            • Shape__Are [the size of the park]
            • -
            • geometry
            • -
            - -
            Note: The column name must be exact for select() to work. For example, in the NPS data, the park area is stored the Shape__Are column - two underscores and no a in area - and I have to use that exact format when selecting the column.
            - -

            The geometry column is specific to shapefiles and it includes the coordinates of the shape. It will be kept automatically - unless you use the st_drop_geometry() function. I like to specifically select so I remember it’s there.

            - -
            - -
              -
            • line 7
            • -
            - -
            7    filter(STATE %!in% territories) %>%  
            - -

            In line 7 I use the territories list I created in line 2 to filter out the United States’ associated areas. Since the nps data uses the two character state abbreviation, I have to use the two character abbreviation for the territories. Searching for “Guam,” for example, won’t work.

            - -

            filter() is part of the tidyverse and it uses conditional language. In the parentheses is a condition that must be true if the tidyverse is going to keep the row. Starting at the top of the data, R goes “alright, does the value in the STATE column match any of the values in the territories list?” If the condition is TRUE, R adds the row to the new data frame.

            - -
            Note: Since we're using the %!in% operator, any row that evaluates as TRUE will be kept because the value is NOT found in the territories list. If I wanted to keep only the territories, I would use the %in% operator and only the rows with STATE abbreviations found in the territories list would be kept.
            - -

            For example, if the STATE value in row 1 is CA, filter looks at it and goes “is CA NOT IN territories?” If that is TRUE, keep it because we want only the values that are NOT IN the territories list.

            - -
            - -
              -
            • lines 8-29
            • -
            - -
            8    mutate(type = case_when(UNIT_TYPE == "International Historic Site" ~ "International Historic Site",
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            The NPS data set has 23 different types of National Parks listed (you can view all of them by running levels(as.factor(nps$UNIT_TYPE))). I know that in later posts, I’m going to color code the land by type (blue for rivers, green for national parks, etc) so I wanted to reduce the number of colors I would have to use.

            - -

            mutate()’s first argument, type = creates a new column called type. R will populate the newly created column with whatever comes after the first (singular) equal = sign. For example, I can put type = NA and every row in the column will say NA.

            - -

            Here, I am using the case_when() function, which is also part of the tidyverse. The logic of case_when is fairly straight forward. The first value is the name of the column you want R to look in (here: UNIT_TYPE). Next, is a conditional. Here I am looking for an exact match (==) to the string (words) inside the first set of quotation marks (in line 8: "International Historic Site"). The last part of the argument is what I want R to put in the type column when it finds a row where the UNIT_TYPE is "International Historic Site".

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value")

            - -

            Lines 9-29 do the same thing for the other park types. You can reduce the parks however you want or use all 23 types. Just remember that the value before the tilde ~ has to match the values found in the data exactly. For example, in line 24 I change the NPS data’s National Trail Syste value to be National Trail. Whomever created the data set did not spell system correctly, so for R to match the value I also have to omit the last letter in system.

            - -
            - -
              -
            • lines 30-37
            • -
            - -
            30    mutate(visited = case_when(PARKNAME == "Joshua Tree" ~ "visited",
            - -

            Lines 30-37 use the same mutate() and case_when logic as above. Instead of reducing the number of park types, I use it to mark the different parks I have visited.

            - -

            Line 30 creates the new column, visited and uses case_when to look for the names of the parks that I’ve been to. If I have visited them, it adds visited to the column of the same name.

            - -

            The last line, TRUE ~ "not_visited)), acts as an else statement. For any park not listed above, it will put not visited in the visited column I created.

            - -

            This feels like a very brute-force method of tracking which parks I’ve visited, but I haven’t spend much time trying to find another way.

            - -
            - -
              -
            • line 38-39
            • -
            - -
            38    shift_geometry(preserve_area = FALSE, 
            -39                   position = "below") %>%
            - -

            In part I, when I made the base map, I moved Alaska and Hawaii so they were of similar size and closer to the continental USA. For the map to display the parks correctly, I have to shift them as well.

            - -

            I went over these two lines in part II, so I won’t go over them again here. If you want to read more about them, check out that post.

            - -
            - -
              -
            • line 40
            • -
            - -
            40  sf::st_transform('+proj=longlat +datum=WGS84')
            - -

            The last line uses the st_transform() function from the sf package to covert the data set from NAD83 to WGS84. Leaflet requires WGS84, so be sure to include this line at the end of your data manipulation.

            - -

            I covered the WGS84 ellipsoid in part I, if you want to read more about it.

            - -
            - -
              -
            • line 43
            • -
            - -
            43  st_write(nps, "~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            - -

            Strictly speaking, this line isn’t necessary. You can do all your data processing in the same file where you make your map, but I prefer to separate the steps into different files.

            - -

            As a result, I save the shifted data to my hard drive so it’s easier to load later. I usually have this line commented out (by placing # at the start of the line) after I save it the first time. I don’t want it to save every time I run the rest of the code.

            - -

            5. add national parks to base map

            -
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -
            ## create usa Base Map using leaflet()
            -    map <- leaflet() %>%
            -    addPolygons(data = states,
            -        smoothFactor = 0.2,
            -        fillColor = "#808080",
            -        fillOpacity = 0.5,
            -        stroke = TRUE,
            -        weight = 0.5,
            -        opacity = 0.5,
            -        color = "#808080",
            -        highlight = highlightOptions(
            -            weight = 0.5,
            -            color = "#000000",
            -            fillOpacity = 0.7,
            -            bringToFront = FALSE),
            -        group = "Base Map")  %>% 
            -    addPolygons(data = nps,
            -        smoothFactor = 0.2,                 
            -        fillColor = "#354f52",
            -        fillOpacity = 1,
            -        stroke = TRUE,
            -        weight = 1,     
            -        opacity = 0.5,                       
            -        color = "#354f52",             
            -        highlight = highlightOptions(
            -            weight = 3,
            -            color = "#fff",
            -            fillOpacity = 0.8,
            -            bringToFront = TRUE),
            -        group = "National Parks")  %>%
            -    addLayersControl(
            -        baseGroups = "Base Map",
            -        overlayGroups = "National Parks",
            -        options = layersControlOptions(collapsed = FALSE))
            -
            - -
              -
            • lines 2-16
            • -
            - -

            Lines 2-16 are identical to those in part II where I created the base map. I am not going to cover these sections in detail, because I covered it previously.

            - -
            - -
              -
            • line 17
            • -
            - -
            17    addPolygons(data = nps,
            - -

            To add the National Park data to the base map, we call addPolygons() again. The arguments are the same as before - color, opacity, outline style - just with different values. By changing those values, we can differentiate the base map from the national park data.

            - -

            Since we’re mapping the National Parks and not the states, we have to tell R where the data is located using data = nps.

            - -
            - -
              -
            • line 18
            • -
            - -
            18   smoothFactor = 0.2,
            - -

            smoothFactor() determines how detailed the park boundaries should be. The lower the number, the more detailed the shape. The higher the number, the smoother the parks will render. I usually match this to whatever I set for the base map for consistency.

            - -
            - -
              -
            • lines 19-20
            • -
            - -
            19   fillColor = "#354f52",
            -20   fillOpacity = 1,
            - -

            Define the color and transparency of the National Parks. In a future post, I am going to change the color of each type of public land, but for now, I’ll make them all a nice sage green color #354f52. I also want to make the parks to be fully opaque.

            - -
            - -
              -
            • lines 21-24
            • -
            - -
            21    stroke = TRUE,
            -22    weight = 1,     
            -23    opacity = 0.5,                       
            -24    color = "#354f52",
            - -

            The next four lines (21-24) define what kind of outline the National Parks will have. I detail each of these arguments in part II of this series.

            - -

            Briefly, I want there to be an outline to each park (stroke = TRUE) that’s thicker weight = 1 than the outline used on the base map. I do not like the way it looks at full opacity, so I make it half-transparent (opacity = 0.5). Finally, I want the outline color = "#354f52 to be the same color as the fill. This will matter more when I change the fill color of the parks later on.

            - -
            - -
              -
            • lines 25-28
            • -
            - -
            25  highlight = highlightOptions(
            -26              weight = 3,
            -27              color = "#fff",
            -28              fillOpacity = 0.8,
            - -

            Lines 25-28 define the National Park’s behavior on mouseover. First we have to define and initialize the highlightOptions() function. The function take similar arguments as the addPolygons function - both of which I go over in detail in part II.

            - -

            I want to keep the mouseover behavior noticeable, but simple. To do so, I set the outline’s thickness to be weight = 3. This will give the shape a nice border that differentiates it from the rest of the map.

            - -

            color = "#fff sets the outline’s color on mouseover only. So, when inactive, the outline color will match the fill color, but on mouseover the outline color switches to white (#fff).

            - -
            - Note: When the hex code repeats (like, white - #ffffff, black - #000000, or grey #808080) you only need to include the first three characters.
            - -
            - -
              -
            • line 29
            • -
            - -
            29    bringToFront = TRUE),
            - -

            bringToFront can either be TRUE or FALSE. If TRUE, Leaflet will bring the park to the forefront on mouseover. This is useful later when we add in the state parks because national and state parks tend to be close together.

            - -

            When FALSE the shape will remain static.

            - -
            - -
              -
            • line 30
            • -
            - -
            30    group = "National Parks")  %>%
            - -

            Since Leaflet adds all new data to the top of the base map, I think it’s useful to group the layers together. In the next block of code, we add in some layer functionality. For now, though, I want to add the National Parks to their own group so I can hide the National Parks if I want.

            - -
            - -
              -
            • line 31-34
            • -
            - -
            31    addLayersControl(
            -32        baseGroups = "Base Map",
            -33        overlayGroups = "National Parks",
            -34        options = layersControlOptions(collapsed = FALSE))
            - -

            addLayersControl defines how layers are displayed on the final map. The function takes three arguments.

            - -

            First, we have to tell Leaflet which layer should be used as the base map: baseGroups = "Base Map". The name in the quotations (here: "Base Map") has to match the name given to the layer you set in the addPolygons() call. In line 14, I put the 50 states into a group called "Base Map", but you can name it anything you like.

            - -

            There can be more than one base map, too. It’s not super helpful here since I shifted Alaska and Hawaii, but when using map tiles you can add multiple types of base maps that users can switch between.

            - -

            Next, we have to define the layers that are shown on top of the base group: overlayGroups = "National Parks". Just like the base map, this is defined in the corresponding addPolygons call. Here, I called the layer National Parks in line 30.

            - -

            Finally, on the map I don’t want the layers to be collapsed, so I set options = layersControlOptions(collapsed = FALSE). When TRUE the map will display an icon in the top right that, when clicked, will show the available layers.

            - -
            - - - Image: Layers collapsed - - -
            Layers collapsed
            -
            - -

            6. conclusion

            -

            Hey, look at that! You made a base map and you added some National Park data to it. You’re a certified cartographer now!

            - -

            In the next part IV post we’ll download and process the state park data before adding it to the map. Part V of this series we’ll add Shiny functionality and some additional markers.

            - - -
            Map showing US National Parks
            -

            </figure>

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/9a/80d109580bc466f81cddb2c7337a7dd7020f19aeee595dd2d803ee8d6d7e7c b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/9a/80d109580bc466f81cddb2c7337a7dd7020f19aeee595dd2d803ee8d6d7e7c deleted file mode 100644 index e9a6d48..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/9a/80d109580bc466f81cddb2c7337a7dd7020f19aeee595dd2d803ee8d6d7e7c +++ /dev/null @@ -1,306 +0,0 @@ -I"’\

            This is a continuation of my previous post where I walked through how to download and modify shape data. I also showed how to shift Alaska and Hawaii so they are closer to the continental usa. -

            - -

            In this post, I’ll go over how to use Leaflet to map the shapefile we made in the previous post. If you’ve come here from part one of the series, you probably have the libraries and data loaded already. However, if you don’t, be sure to load the libraries and shapefiles before moving to number two.

            - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            V. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -
                ## load libraries
            -    library("tidyverse")    # data manipulation & map creation
            -    library("sf")           # loads shapefile
            -    library("leaflet")      # creates the map
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -
                ## load data
            -    states <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp to reflect wherever you saved the shifted shapefile.

            - -

            If your data processing and base map creation are in the same file, you can skip this line, and when you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. create the base map

            -
            - -

            At its most basic, all Leaflet needs to create a map is a base map and data layers. The code below may look intimidating, but it’s mostly style options.

            - -

            This is the map we’re going to create. It’s a simple grey map and each state darkens in color as you hover over it. I’ll show the same map after each style option is added so you can see what effect it has.

            - -
            - - - Image: Final USA basemap - - -
            Final USA basemap
            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line. -
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -
                ## create usa base map using leaflet()
            -    map <- leaflet() %>%
            -    addPolygons(data = states,
            -        smoothFactor = 0.2,
            -        fillColor = "#808080",
            -        fillOpacity = 0.5,
            -        stroke = TRUE,
            -        weight = 0.5,
            -        opacity = 0.5,
            -        color = "#808080",
            -        highlight = highlightOptions(
            -            weight = 0.5,
            -            color = "#000000",
            -            fillOpacity = 0.7,
            -            bringToFront = FALSE),
            -        group = "Base Map")
            -
            - -
              -
            • line 2:
            • -
            - -
            2    map <- leaflet() %>% 
            - -

            leaflet() initializes the map widget. I save it to a variable called map (map <-) so I can run other code in the file without recreating the map each time. When you want to see the map, you can type map (or whatever you want to name your map) in the terminal and hit enter. R will display the map in the viewer.

            - -
            - -
              -
            • line 3:
            • -
            - -
            3    addPolygons(data = states,
            - -

            addPolygons() adds a layer to the map widget. Leaflet has different layer options, including addTiles and addMarkers which do different things. You can read about them on the leaflet website. Since we’re using a previously created shapefile, we’ll add the shapefile to the map using addPolygons().

            - -

            The first argument you need to specify after calling addPolygons is data = [data-source]. [data-source] is whatever variable your data is stored in. For me, it’s called states. This is either the processed data from part I of this series or the saved shapefile loaded above under the section called load data.

            - -

            When you run only the first two lines, Leaflet will use its default styling. The base color will be a light blue and the outlines of the states will be dark blue and fairly thick.

            - -
            - - - Image: USA map with default leaflet styling - - -
            USA map with default leaflet styling
            -
            - -

            You can leave the base map like this if you want, but all additional data will be added as a layer on top</i>* of this map which can become distracting very quickly. I prefer to make my base maps as basic and unobtrusive as possible so the data I add on top of the base map is more prominent.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    smoothFactor = 0.2,
            - -

            smoothFactor controls how much the polygon shape should be smoothed at each zoom level. The lower the number the more accurate your shapes will be. A larger number, on the other hand, will lead to better performance, but can distort the shapes of known areas.

            - -

            I keep the smoothFactor low because I want the United States to appear as a coherent land mass. The image below shows three different maps, each with a different smoothFactor to illustrate what this argument does. On the left, the map’s smoothFactor=0.2, the center map’s smoothFactor=10, and the right’s smoothFactor=100.

            - -
            - - - Image: SmoothFactor = 0.2 (left), 10 (center), 100 (right) - - -
            SmoothFactor = 0.2 (left), 10 (center), 100 (right)
            -
            - -

            As you can see, the higher the smoothFactor the less coherent the United States becomes.

            - -
            - Note: Leaflet - all map programs, really - don't know that continents, countries, states, or any other land mass is a coherent set. It reads the shape data and spits out a map without regard to where things "should" be. It doesn't know California goes on the left or that Alaska is part of the United States. It's important to keep this in mind because each state's shape is rendered individually when using addPolygons(). -
            - -
            - -
              -
            • lines 5-6:
            • -
            - -
            5    fillColor = "#808080",
            -6    fillOpacity = 0.5,
            - -

            fillColor refers to what color is on the inside of the polygons. Since I want a minimal base map, I usually set this value to be some shade of grey. If you want a different color, you only need to replace #808080 with the corresponding hex code for the color you want. Here is a useful hex color picker. If you have a hex value and you want the same color in a different shade, this is a useful site.

            - -

            fillOpacity determines how transparent the color inside the shape should be. I set mine to be 0.5 because I like the way it looks. The number can be between 0 and 1 with 1 being fully opaque and 0 being fully transparent.

            - -
            - -
              -
            • line 7-10:
            • -
            - -
            7    stroke = TRUE,
            -8    weight = 0.5,
            -9    opacity = 0.5,
            -10   color = "#808080",
            - -

            The next four lines define the appearance of the shapes’ outline.

            - -

            The stroke property can be set to either TRUE or FALSE. When true, Leaflet adds an outline around each polygon. When false, the polygons have no outline. In the image below, the map on the left has the default outlines and on the right stroke = FALSE.

            - -
            - - - Image: Default outlines on the left, no outline on the right - - -
            Default outlines on the left, no outline on the right
            -
            - -

            weight = 0.5 sets the thickness of the outlines to be 0.5 pixels. This can be any value you want with higher numbers corresponding to thicker lines. Lower numbers correspond to thinner lines.

            - -

            The opacity property operates in the same way as fill opacity above, but on the outlines. The number can be between 0 and 1. Lower numbers correspond to the lines being more transparent and 1 means fully opaque.

            - -

            color = "#808080" sets the color of the outline. I typically set it to be the same color as the fill color.

            - -

            If you want a static base map then lines 2-10 are all you need, as shown in the image below. I like to add some functionality to my base map so that the individual states become darker when they’re hovered over.

            - -
            - - - Image: static base map - - -
            static base map
            -
            - -

            Lines 11-15 define the map’s behavior when the mouse hovers over the shape. Most of the options are the same as the ones used on the base polygon shapes, so I won’t go into them with much detail.

            - -
            - -
              -
            • line 11:
            • -
            - -
            11    highlight = highlightOptions()
            - -

            highlight = highlightOptions() contains the mouseover specifications. The word before the equal sign has to be either highlight or highlightOptions. I am not sure why you have to declare highlight twice, but you do.

            - -

            highlightOptions() is the actual function call.

            - -
            - -
              -
            • lines 12-14
            • -
            - -
            12    weight = 0.5,
            -13    color = "#000000",
            -14    fillOpacity = 0.7,
            - -

            weight, color, and fillOpacity all operate in the same way as before, but whatever values you specify here will only show up when the mouse hovers over.

            - -
            - -
              -
            • line 15
            • -
            - -
            15    bringToFront = FALSE),
            - -

            bringToFront takes one of two values: TRUE or FALSE. It only really matters when you have multiple layers (like we will in later parts of this series). When bringToFront = TRUE hovering over the state will bring it to the front. When bringToFront = FALSE it will stay in the back.

            - -

            Since the base map has only one layer, this property doesn’t affect anything.

            - -
            - -
              -
            • line 16
            • -
            - -
            16    group = "Base Map")
            - -

            group = "Base Map") lets you group multiple layers together. This argument will come in handy as we add more information to the map. The base map is the default layer and is always visible - though, when you use map tiles you can define multiple base layers. All other layers will be on top of the base layer. When using different groups, you can define functionality that allows users to turn off certain layers.

            - -

            4. Conclusion

            -

            You’ve created your first base map! It’s a boring flat, grey map, but it’s the base we’ll use when adding in the national and state park data. In part III of this series we’ll process and add in the National Parks.

            - -
            -
            United States base map made with Leaflet
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/9c/e82b25b3111f2efecad4d980ce1ff3bcbe0eeec53504f56bf7e8e6dbf822e1 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/9c/e82b25b3111f2efecad4d980ce1ff3bcbe0eeec53504f56bf7e8e6dbf822e1 deleted file mode 100644 index 9267bbb..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/9c/e82b25b3111f2efecad4d980ce1ff3bcbe0eeec53504f56bf7e8e6dbf822e1 +++ /dev/null @@ -1,562 +0,0 @@ -I"Ě

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "	Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/9c/fe46a5db861bc21c0996d7f247b81e8b6f6ef95543271a9092265410edcbf6 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/9c/fe46a5db861bc21c0996d7f247b81e8b6f6ef95543271a9092265410edcbf6 deleted file mode 100644 index 2a60048..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/9c/fe46a5db861bc21c0996d7f247b81e8b6f6ef95543271a9092265410edcbf6 +++ /dev/null @@ -1,638 +0,0 @@ -I" ń

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            -
            -
            "chaos ensued"
            - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -
                ## split the data by state
            -    split_states <- split(state_parks, f = state_parks$State_Nm) # split the data by state
            -    all_names <- names(split_states)   
            -    
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp')) 
            -    }
            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/9d/a34dcab5e0577ccbaea6f9f9193e97033e31b893809a3b8065ebfca1bbd855 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/9d/a34dcab5e0577ccbaea6f9f9193e97033e31b893809a3b8065ebfca1bbd855 deleted file mode 100644 index bc563bd..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/9d/a34dcab5e0577ccbaea6f9f9193e97033e31b893809a3b8065ebfca1bbd855 +++ /dev/null @@ -1,705 +0,0 @@ -I"ţ 

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm)
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            Look ma, new code!

            - -
              -
            • line 1
            • -
            - -
            1    split_states <- split(state_parks, f = state_parks$State_Nm) 
            - -

            The split() is part of base R. It takes quite a few arguments, most of which are optional.

            - -

            The first argument is the vector (or data frame) that you want to split into different groups. I want to split the state_parks data into its corresponding states, so it is listed first.

            - -

            The second argument f = is how you want the data split. f in this instance stands for factor. If we run levels(as.factor(state_parks$State_Nm)) in the terminal, it will return a list of the 50 state abbreviations. That is what we’re telling R to do here.

            - -

            You can access an individual state using the $ operator. split_states$CA will return the state park data for California.

            - -
            - -
              -
            • line 2
            • -
            - -
            2    all_names <- names(split_states)
            - -

            names is also part of base R. It does what it sounds like - it gets the names of an object. Here, I want to get the names of each split data sets.

            - -
            - -
              -
            • lines 4-5
            • -
            - -
            4    for(name in all_names){            
            -5        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            - -

            Here’s the actual for loop.

            - -

            The basic logic of a for loop is:
            - for(x in y){
            - do something}

            - -

            Inside the parenthesis is the condition that must evaluate to TRUE if the content in the curly braces is to run.

            - -

            In line 4, for(name in all_names){ says as long as there’s a name in the list of all names, do whatever is inside the curly braces. name can be whatever you want. It’s a placeholder value. I can have it say for(dogs in all_names){ it will still do the exact same thing. A lot of time you’ll see it as an i for item. I like to use more descriptive language because, again, for loops are my Achilles’ heel.

            - -

            The all_names part is where ever you want R to look for the data. It will change based on your data set and variable naming conventions.

            - -

            In line 5, I save the split data sets.

            - -

            st_write() is part of the sf package which allows us to create shapefiles. This can be any saving function (eg. write_csv() if you want to save CSVs). The function takes several arguments. In line 43 above I showed the basic structure: st_write(data, path/to/file.shp). This is good if you only have one file, but since I’m saving them in a loop I don’t want all of the files to have the same name. R will error out after the first and tell you the file already exists.

            - -

            The first part split_states[[name]] is still telling R what data to save, but using an index instead of a specific data frame name. To access an index you use data[[some-value]] where some-value is the index location. In my code, R will take the split_states data and go alright the first index location in [[name]] is 1 and return whatever value is stored in that index (here, AK). It will then do that for every index location as it loops through the split_states data.

            - -

            paste0() is also part of base R - it’s apparently faster than paste(). It concatenates (or links together) different pieces into one. I’m using it to create the filename. Within the paste0 call anything within quotation marks is static. So every file will be saved to "shapefiles/shifted/states/individual/" and every file will have the extension .shp. What will change with each loop is the name of the file. One by one, R will loop through and save each file using the name it pulled from all_names.

            - -

            st_write() automatically creates the other three files that each “shapefile” needs. When the loop is done, you should have a folder of 200 files (50 states * 4 files each). Which is why I strongly recommend using DVC if you’re doing any kind of version control.

            - -

            8. conclusion

            -
            - -

            That’s all the processing done for the state files… for now. In part VI I’ll return to the states to create each state’s own map. Next up, in part V, I’m going back to my base map with the National Parks to add in some informational tool tips and interactivity.

            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me an email or a tweet if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a1/0d7d7ef6a7e6f03e4cbfcf2066b9dd10ff9b56e955396c5b247ccfec238878 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a1/0d7d7ef6a7e6f03e4cbfcf2066b9dd10ff9b56e955396c5b247ccfec238878 deleted file mode 100644 index f52a45e..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a1/0d7d7ef6a7e6f03e4cbfcf2066b9dd10ff9b56e955396c5b247ccfec238878 +++ /dev/null @@ -1,594 +0,0 @@ -I"ĺ

            This is part three of my cartography in R series. If you are just finding this, I suggest taking a look at part I and part II first.

            - -

            In this post, I will download and process the National Park data. Once that’s done, I’ll add it to the base map I created in part II.

            - - - -
            - Note: I know that my posts are verbose. I am sorry that I am not more concise but one of the biggest problems I have with code documentation is that it assumes you already know what you're doing. I'm writing these for absolute beginners that want to know what each part of the code does. If you want more concise explanations, I suggest looking at the annotated r files in the project's GitHub. -
            - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            V. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load usa data

            -
            - -
            1
            -2
            -
                ## load data
            -    states <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp to reflect wherever you saved the shifted shapefile.

            - -

            If your data processing and base map creation are in the same file, you can skip this line, and when you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download national park data

            -
            - -

            The National Park Service provides all the data we’ll need to make the map. The data is accessible on the ArcGIS’ Open Data website. Once you click on the link you’ll see a bunch of icons that lead to different data that’s available for download. Click on the one for boundaries.

            - -
            - - - Image: NPS GIS website - - -
            NPS GIS website
            -
            - -

            From here, you’ll be taken to a list of available National Park data. The second link should be nps boundary which contains the shape data for all the National Parks in the United States. The file contains all the data for the park outlines along with hiking trails, rest areas, and lots of other data.

            - -
            - - - Image: NPS download page - - -
            NPS download page
            -
            - -

            The nps boundary link will take you to a map showing the national parks. On the left, there will be a download link on the left.

            - -
            - - - Image: NPS map and download link - - -
            NPS map and download link
            -
            - -

            From here, you’ll have a few download options. The National Park Service provides the data in different formats including CSV and Shapefile. You’ll want to download the shapefile version.

            - -
            - - - Image: download nps shapefile - - -
            download nps shapefile
            -
            - -

            Be sure to save the file somewhere on your hard drive that is easy to find. When it finishes downloading, be sure to unzip the file. There will be four files inside the folder. All of them need to be kept in the same location. Even though we’ll only load the .shp file, R uses the three others to create the necessary shapes.

            - -

            4. process national park data

            -
            - -

            The code below may look intimidating, but it’s fairly straight forward. I’ll go over each line below.

            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    ## load and process nps data
            -    nps <- read_sf("./shapefiles/original/nps/NPS_-_Land_Resources_Division_Boundary_and_Tract_Data_Service.shp")  %>% 
            -    select(STATE, UNIT_TYPE, PARKNAME, Shape__Are, geometry) %>% 
            -    filter(STATE %!in% territories) %>%  
            -    mutate(type = case_when(UNIT_TYPE == "International Historic Site" ~ "International Historic Site", # there's 23 types of national park, I wanted to reduce this number.
            -                UNIT_TYPE == "National Battlefield Site" ~ "National Military or Battlefield", # lines 56-77 reduce the number of park types
            -                UNIT_TYPE == "National Military Park" ~ "National Military or Battlefield", 
            -                UNIT_TYPE == "National Battlefield" ~ "National Military or Battlefield",
            -                UNIT_TYPE == "National Historical Park" ~ "National Historical Park, Site, Monument, or Memorial",
            -                UNIT_TYPE == "National Historic Site" ~ "National Historical Park, Site, Monument, or Memorial",
            -                UNIT_TYPE == "National Historic Trail" ~ "National Historical Park, Site, Monument, or Memorial",
            -                UNIT_TYPE == "National Memorial" ~ "National Historical Park, Site, Monument, or Memorial",
            -                UNIT_TYPE == "National Monument" ~ "National Historical Park, Site, Monument, or Memorial",
            -                UNIT_TYPE == "National Preserve" ~ "National Preserve, Reserve, or Recreation Area",
            -                UNIT_TYPE == "National Reserve" ~ "National Preserve, Reserve, or Recreation Area",
            -                UNIT_TYPE == "National Recreation Area" ~ "National Preserve, Reserve, or Recreation Area",
            -                UNIT_TYPE == "National River" ~ "National River, Lakeshore, or Seashore",
            -                UNIT_TYPE == "National Lakeshore" ~ "National River, Lakeshore, or Seashore",
            -                UNIT_TYPE == "National Wild & Scenic River" ~ "National River, Lakeshore, or Seashore",
            -                UNIT_TYPE == "National Seashore" ~ "National River, Lakeshore, or Seashore",
            -                UNIT_TYPE == "National Trails Syste" ~ "National Trail",
            -                UNIT_TYPE == "National Scenic Trail" ~ "National Trail",
            -                UNIT_TYPE == "National Park" ~ "National Park or Parkway",
            -                UNIT_TYPE == "Park" ~ "National Park or Parkway",
            -                UNIT_TYPE == "Parkway" ~ "National Park or Parkway",
            -                UNIT_TYPE == "Other Designation" ~ "Other National Land Area"))  %>%  
            -    mutate(visited = case_when(PARKNAME == "Joshua Tree" ~ "visited", 
            -                                PARKNAME == "Redwood" ~ "visited", 
            -                                PARKNAME == "Santa Monica Mountains" ~ "visited", 
            -                                PARKNAME == "Sequoia" ~ "visited", 
            -                                PARKNAME == "Kings Canyon" ~ "visited",
            -                                PARKNAME == "Lewis and Clark" ~ "visited",
            -                                PARKNAME == "Mount Rainier" ~ "visited",
            -                                TRUE ~ "not visited")) %>%  
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save the shifted national park data
            -    st_write(nps, "~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            In part I of this series I talked about how R has an %in% function, but not a %!in% function. Here’s where the latter function shines.

            - -

            The United States is still an empire with its associated territories and islands. In this project I am interested in the 50 states - without these other areas. As a result, I need to filter them out. Using base R’s %in% function I would have to create a variable that contains the postal abbreviations for all 50 states. That is annoying. Instead, I want to use the shorter list that only includes the US’ associated islands and territories. To do so, however, I need to use the operator tools’ %!in% function.

            - -

            Line 2 creates the list of US territories that I filter out in line 7. The c() function in R means combine or concatenate. Inside the parenthesis are the five postal codes for the American Samoa, Guam, the Northern Mariana Islands, Puerto Rico, and the Virgin Islands.

            - -
            - -
              -
            • line 5
            • -
            - -
            5   nps <- read_sf("path/to/file/NPS_-_Land_Resources_Division_Boundary_and_Tract_Data_Service.shp")  %>% 
            - -

            nps <- read_sf("path/to/file.shp") loads the National Park data set to a variable called nps using the read_sf() function that is part of the sf package. You will need to change the file path so it reflects where you saved the data on your hard drive.

            - -

            The %>% operator is part of the tidyverse package. It tells R to go to the next line and process the next command. It has to go at the end of a line, rather than the beginning.

            - -
            - -
              -
            • line 6
            • -
            - -
            6    select(STATE, UNIT_TYPE, PARKNAME, Shape__Are, geometry) %>%
            - -

            select is part of the tidyverse package. With it, we can select columns by their name rather than their associated number. Large data sets take more computing power because the computer has to iterate over more rows. Unfortunately, rendering maps also takes a lot of computing power so I like to discard any unnecessary columns to reduce the amount of effort my computer has to exert.

            - -

            Deciding on which columns to keep will depend on the data you’re using and what you want to map (or analyze). I know for my project I want to include a few things:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type (park, battlefield, memorial, etc)
            • -
            • and the park’s size.
            • -
            - -

            There’s a couple ways to inspect the data to see what kind of information is available.

            - -
              -
            1. You can view the entire data set by using view(nps) but as the number of data points increases, so does R’s struggle with opening it. I’ve found that VSCode doesn’t throw as big of a fit as R Studio when opening large data sets.
            2. -
            3. Another way is to just look at the column names using data.frame(colnames(nps)). This will return a list of the data set’s column names. This is my preferred method. I then go to the documentation to see what each column contains. This isn’t fool-proof because it really depends on if the data has good documentation.
            4. -
            - -

            The National Park data includes a lot of information about who created the data and maintains the property. I’m not interested in this, so in line 6 I select the following columns:

            -
              -
            • STATE [the two character state abbreviation]
            • -
            • UNIT_TYPE [if it’s a national park, battlefield, etc]
            • -
            • PARKNAME [the name of the park]
            • -
            • Shape__Are [the size of the park]
            • -
            • geometry
            • -
            - -
            Note: The column name must be exact for select() to work. For example, in the NPS data, the park area is stored the Shape__Are column - two underscores and no a in area - and I have to use that exact format when selecting the column.
            - -

            The geometry column is specific to shapefiles and it includes the coordinates of the shape. It will be kept automatically - unless you use the st_drop_geometry() function. I like to specifically select so I remember it’s there.

            - -
            - -
              -
            • line 7
            • -
            - -
            7    filter(STATE %!in% territories) %>%  
            - -

            In line 7 I use the territories list I created in line 2 to filter out the United States’ associated areas. Since the nps data uses the two character state abbreviation, I have to use the two character abbreviation for the territories. Searching for “Guam,” for example, won’t work.

            - -

            filter() is part of the tidyverse and it uses conditional language. In the parentheses is a condition that must be true if the tidyverse is going to keep the row. Starting at the top of the data, R goes “alright, does the value in the STATE column match any of the values in the territories list?” If the condition is TRUE, R adds the row to the new data frame.

            - -
            Note: Since we're using the %!in% operator, any row that evaluates as TRUE will be kept because the value is NOT found in the territories list. If I wanted to keep only the territories, I would use the %in% operator and only the rows with STATE abbreviations found in the territories list would be kept.
            - -

            For example, if the STATE value in row 1 is CA, filter looks at it and goes “is CA NOT IN territories?” If that is TRUE, keep it because we want only the values that are NOT IN the territories list.

            - -
            - -
              -
            • lines 8-29
            • -
            - -
            8    mutate(type = case_when(UNIT_TYPE == "International Historic Site" ~ "International Historic Site",
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            The NPS data set has 23 different types of National Parks listed (you can view all of them by running levels(as.factor(nps$UNIT_TYPE))). I know that in later posts, I’m going to color code the land by type (blue for rivers, green for national parks, etc) so I wanted to reduce the number of colors I would have to use.

            - -

            mutate()’s first argument, type = creates a new column called type. R will populate the newly created column with whatever comes after the first (singular) equal = sign. For example, I can put type = NA and every row in the column will say NA.

            - -

            Here, I am using the case_when() function, which is also part of the tidyverse. The logic of case_when is fairly straight forward. The first value is the name of the column you want R to look in (here: UNIT_TYPE). Next, is a conditional. Here I am looking for an exact match (==) to the string (words) inside the first set of quotation marks (in line 8: "International Historic Site"). The last part of the argument is what I want R to put in the type column when it finds a row where the UNIT_TYPE is "International Historic Site".

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value")

            - -

            Lines 9-29 do the same thing for the other park types. You can reduce the parks however you want or use all 23 types. Just remember that the value before the tilde ~ has to match the values found in the data exactly. For example, in line 24 I change the NPS data’s National Trail Syste value to be National Trail. Whomever created the data set did not spell system correctly, so for R to match the value I also have to omit the last letter in system.

            - -
            - -
              -
            • lines 30-37
            • -
            - -
            30    mutate(visited = case_when(PARKNAME == "Joshua Tree" ~ "visited",
            - -

            Lines 30-37 use the same mutate() and case_when logic as above. Instead of reducing the number of park types, I use it to mark the different parks I have visited.

            - -

            Line 30 creates the new column, visited and uses case_when to look for the names of the parks that I’ve been to. If I have visited them, it adds visited to the column of the same name.

            - -

            The last line, TRUE ~ "not_visited)), acts as an else statement. For any park not listed above, it will put not visited in the visited column I created.

            - -

            This feels like a very brute-force method of tracking which parks I’ve visited, but I haven’t spend much time trying to find another way.

            - -
            - -
              -
            • line 38-39
            • -
            - -
            38    shift_geometry(preserve_area = FALSE, 
            -39                   position = "below") %>%
            - -

            In part I, when I made the base map, I moved Alaska and Hawaii so they were of similar size and closer to the continental USA. For the map to display the parks correctly, I have to shift them as well.

            - -

            I went over these two lines in part II, so I won’t go over them again here. If you want to read more about them, check out that post.

            - -
            - -
              -
            • line 40
            • -
            - -
            40  sf::st_transform('+proj=longlat +datum=WGS84')
            - -

            The last line uses the st_transform() function from the sf package to covert the data set from NAD83 to WGS84. Leaflet requires WGS84, so be sure to include this line at the end of your data manipulation.

            - -

            I covered the WGS84 ellipsoid in part I, if you want to read more about it.

            - -
            - -
              -
            • line 43
            • -
            - -
            43  st_write(nps, "~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            - -

            Strictly speaking, this line isn’t necessary. You can do all your data processing in the same file where you make your map, but I prefer to separate the steps into different files.

            - -

            As a result, I save the shifted data to my hard drive so it’s easier to load later. I usually have this line commented out (by placing # at the start of the line) after I save it the first time. I don’t want it to save every time I run the rest of the code.

            - -

            5. add national parks to base map

            -
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -
            ## create usa Base Map using leaflet()
            -    map <- leaflet() %>%
            -    addPolygons(data = states,
            -        smoothFactor = 0.2,
            -        fillColor = "#808080",
            -        fillOpacity = 0.5,
            -        stroke = TRUE,
            -        weight = 0.5,
            -        opacity = 0.5,
            -        color = "#808080",
            -        highlight = highlightOptions(
            -            weight = 0.5,
            -            color = "#000000",
            -            fillOpacity = 0.7,
            -            bringToFront = FALSE),
            -        group = "Base Map")  %>% 
            -    addPolygons(data = nps,
            -        smoothFactor = 0.2,                 
            -        fillColor = "#354f52",
            -        fillOpacity = 1,
            -        stroke = TRUE,
            -        weight = 1,     
            -        opacity = 0.5,                       
            -        color = "#354f52",             
            -        highlight = highlightOptions(
            -            weight = 3,
            -            color = "#fff",
            -            fillOpacity = 0.8,
            -            bringToFront = TRUE),
            -        group = "National Parks")  %>%
            -    addLayersControl(
            -        baseGroups = "Base Map",
            -        overlayGroups = "National Parks",
            -        options = layersControlOptions(collapsed = FALSE))
            -
            - -
              -
            • lines 2-16
            • -
            - -

            Lines 2-16 are identical to those in part II where I created the base map. I am not going to cover these sections in detail, because I covered it previously.

            - -
            - -
              -
            • line 17
            • -
            - -
            17    addPolygons(data = nps,
            - -

            To add the National Park data to the base map, we call addPolygons() again. The arguments are the same as before - color, opacity, outline style - just with different values. By changing those values, we can differentiate the base map from the national park data.

            - -

            Since we’re mapping the National Parks and not the states, we have to tell R where the data is located using data = nps.

            - -
            - -
              -
            • line 18
            • -
            - -
            18   smoothFactor = 0.2,
            - -

            smoothFactor() determines how detailed the park boundaries should be. The lower the number, the more detailed the shape. The higher the number, the smoother the parks will render. I usually match this to whatever I set for the base map for consistency.

            - -
            - -
              -
            • lines 19-20
            • -
            - -
            19   fillColor = "#354f52",
            -20   fillOpacity = 1,
            - -

            Define the color and transparency of the National Parks. In a future post, I am going to change the color of each type of public land, but for now, I’ll make them all a nice sage green color #354f52. I also want to make the parks to be fully opaque.

            - -
            - -
              -
            • lines 21-24
            • -
            - -
            21    stroke = TRUE,
            -22    weight = 1,     
            -23    opacity = 0.5,                       
            -24    color = "#354f52",
            - -

            The next four lines (21-24) define what kind of outline the National Parks will have. I detail each of these arguments in part II of this series.

            - -

            Briefly, I want there to be an outline to each park (stroke = TRUE) that’s thicker weight = 1 than the outline used on the base map. I do not like the way it looks at full opacity, so I make it half-transparent (opacity = 0.5). Finally, I want the outline color = "#354f52 to be the same color as the fill. This will matter more when I change the fill color of the parks later on.

            - -
            - -
              -
            • lines 25-28
            • -
            - -
            25  highlight = highlightOptions(
            -26              weight = 3,
            -27              color = "#fff",
            -28              fillOpacity = 0.8,
            - -

            Lines 25-28 define the National Park’s behavior on mouseover. First we have to define and initialize the highlightOptions() function. The function take similar arguments as the addPolygons function - both of which I go over in detail in part II.

            - -

            I want to keep the mouseover behavior noticeable, but simple. To do so, I set the outline’s thickness to be weight = 3. This will give the shape a nice border that differentiates it from the rest of the map.

            - -

            color = "#fff sets the outline’s color on mouseover only. So, when inactive, the outline color will match the fill color, but on mouseover the outline color switches to white (#fff).

            - -
            - Note: When the hex code repeats (like, white - #ffffff, black - #000000, or grey #808080) you only need to include the first three characters.
            - -
            - -
              -
            • line 29
            • -
            - -
            29    bringToFront = TRUE),
            - -

            bringToFront can either be TRUE or FALSE. If TRUE, Leaflet will bring the park to the forefront on mouseover. This is useful later when we add in the state parks because national and state parks tend to be close together.

            - -

            When FALSE the shape will remain static.

            - -
            - -
              -
            • line 30
            • -
            - -
            30    group = "National Parks")  %>%
            - -

            Since Leaflet adds all new data to the top of the base map, I think it’s useful to group the layers together. In the next block of code, we add in some layer functionality. For now, though, I want to add the National Parks to their own group so I can hide the National Parks if I want.

            - -
            - -
              -
            • line 31-34
            • -
            - -
            31    addLayersControl(
            -32        baseGroups = "Base Map",
            -33        overlayGroups = "National Parks",
            -34        options = layersControlOptions(collapsed = FALSE))
            - -

            addLayersControl defines how layers are displayed on the final map. The function takes three arguments.

            - -

            First, we have to tell Leaflet which layer should be used as the base map: baseGroups = "Base Map". The name in the quotations (here: "Base Map") has to match the name given to the layer you set in the addPolygons() call. In line 14, I put the 50 states into a group called "Base Map", but you can name it anything you like.

            - -

            There can be more than one base map, too. It’s not super helpful here since I shifted Alaska and Hawaii, but when using map tiles you can add multiple types of base maps that users can switch between.

            - -

            Next, we have to define the layers that are shown on top of the base group: overlayGroups = "National Parks". Just like the base map, this is defined in the corresponding addPolygons call. Here, I called the layer National Parks in line 30.

            - -

            Finally, on the map I don’t want the layers to be collapsed, so I set options = layersControlOptions(collapsed = FALSE). When TRUE the map will display an icon in the top right that, when clicked, will show the available layers.

            - -
            - - - Image: Layers collapsed - - -
            Layers collapsed
            -
            - -

            6. conclusion

            -

            Hey, look at that! You made a base map and you added some National Park data to it. You’re a certified cartographer now!

            - -

            In the next part IV post we’ll download and process the state park data before adding it to the map. Part V of this series we’ll add Shiny functionality and some additional markers.

            - - -
            Map showing US National Parks
            -

            </figure>

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a3/241ac7d82a37476b5b317d45db535d57d52603968fc785ff6de752973ed5fe b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a3/241ac7d82a37476b5b317d45db535d57d52603968fc785ff6de752973ed5fe deleted file mode 100644 index 2c8ec14..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a3/241ac7d82a37476b5b317d45db535d57d52603968fc785ff6de752973ed5fe +++ /dev/null @@ -1,251 +0,0 @@ -I"ůC

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management and the Bureau of Reclamation. Having visited the park, I can tell you there’s no fences blocking these areas off. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area. It will be a good test case to make sure I’m selecting the correct data.

            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a4/fbfa5eaec02d5b5448b303c888653bd58dad21516f549d9773ff86ab66a570 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a4/fbfa5eaec02d5b5448b303c888653bd58dad21516f549d9773ff86ab66a570 deleted file mode 100644 index c006fe3..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a4/fbfa5eaec02d5b5448b303c888653bd58dad21516f549d9773ff86ab66a570 +++ /dev/null @@ -1,646 +0,0 @@ -I"‹ń

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm) # split the data by state
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            8 conclusion

            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a5/49660ea41ea0fcdd50feff4fe64449520443e926e8f446f76aea51637f6b18 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a5/49660ea41ea0fcdd50feff4fe64449520443e926e8f446f76aea51637f6b18 deleted file mode 100644 index a75dc7d..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a5/49660ea41ea0fcdd50feff4fe64449520443e926e8f446f76aea51637f6b18 +++ /dev/null @@ -1,477 +0,0 @@ -I"2›

            Twitter is a great resource for engaging with the academic community. For example, I saw this Tweet by PhD Genie asking users to name one positive skill learned during their PhD. I love this question for a number of reasons. First, it helps PhDs reframe their experience so it’s applicable outside of academia - which can help when applying to jobs. Second, it’s really cool to see what skills other people have learned during their program.
            -

            - -
            - - - Image: PhD Genie Tweet - - -
            PhD Genie Tweet
            -
            - -

            I responded to the tweet because during my PhD I learned how to create maps in R. I started by recreating a map from the University of North Carolina’s Hussman School of Journalism’s News Deserts project (below). Now, I am working on a personal project mapping the U.S. National and State parks.

            - -
            - - - Image: Map of US newspapers by county - - -
            Map of US newspapers by county
            -
            - -

            There was quite a bit of interest in how to do this, so in this series of posts I will document my process from start to finish.

            - -

            a few notes

            -
            - -

            First, I’m not an expert. I wanted to make a map, so I learned how. There may be easier ways and, if I learn how to do them, I’ll write another post.

            - -

            Second, before starting, I strongly suggest setting up a Github and DVC. I wrote about how to use GitHub, the Github Website, and Github Desktop. You can use any of these methods to manage your repositories. I use all three based purely on whatever mood I’m in.

            - -

            If you do use Git or GitHub, then DVC (data version control) is mandatory. GitHub will warn you that your file is too large if it’s over 50MB and reject your pushes if the files are over 100MB. The total repository size can’t exceed 2GB if you’re using the free version (which I am). DVC is useful because cartography files are large. They contain a lot of coordinates which increases with each location you try to map. DVC will store your data outside of GitHub but allows you to track changes with your data. It’s super useful.

            - -

            Third, there are several ways to make a map. R is capable of making interactive maps and static maps. Static maps are less computationally expensive and better for publication. Interactive maps are prettier and better for displaying on the web.

            - -

            I make interactive maps with Leaflet and Shiny because they offer a lot of functionality. The most common way is to use map tiles. Map tiles use data from sources like Open Street Map and Maps to create map squares (tiles) with custom data on top. A list of available map tiles is available on the Open Street Maps website.

            - -
            - - - Image: Earthquakes near Australia, using map tiles - - -
            Earthquakes near Australia, using map tiles
            -
            - -

            When I make static maps (like the US map pictured above), I use ggplot

            - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            V. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. install packages

            -
            - -

            You only need to install the packages once. You can do so by running each line in the terminal. When you rerun the code later, you can skip right to loading the packages using library("package-name")

            - -
            Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -
                ## you only need to install the packages once
            -
            -    install.packages("leaflet")        # interactive maps
            -    install.packages("shiny")          # added map functionality
            -    install.packages("tidyverse")      # data manipulation
            -    install.packages("tigris")         # cartographic boundaries
            -    install.packages("operator.tools") # for the not-in function
            -    install.packages("sf")             # read and write shapefiles
            -
            - -
            package descriptions
            - -
            3    install.packages("leaflet")
            - -
              -
            • Leaflet is an open source mapping library used to create interactive maps. All leaflet maps have the same basic components: -
                -
              • A map widget (created by calling leaflet())
              • -
              • Layers (or features) added using addTiles(), addPolygons(), or addMarkers(). -

                - Leaflet was originally written in Javascript by “Volodymyr Agafonkin, a Ukrainian citizen living in Kyiv.” With the Russian invasion of Ukraine, Volodymyr, his family, and the people he knows are being forced to flee. If you can, make a donation to Come Back Alive or to one of the charities listed at Stand With Ukraine. If you think the invasion is justified, don’t forget to carry some seeds in your pocket
              • -
              -
            • -
            - -
            4    install.packages("shiny")
            - -
              -
            • Shiny is an R package that provides additional functionality. I am primarily using it to create an informative tool box so the map is not overwhelmed by using too many popups.
            • -
            - -
            5    install.packages("tidyverse")
            - -
              -
            • -

              The Tidyverse is a collection of packages used for data manipulation and analysis. Its syntax is more intuitive than base R. Furthermore, you can chain (aka pipe) commands together.

              - -

              For cartography, you don’t need the whole Tidyverse. We’ll mainly use dplyr and ggplot. You can install these packages individually instead of installing the whole tidyverse. Though, when we get to the national park database, we’ll also need purr and tidyr.

              -
            • -
            - -
            6    install.packages("tigris")
            - -
              -
            • Tigris is used to load Census data and shapefiles. You can download the same data from the Census website, but it’s nice that you can do it in R.
            • -
            - -
            7    install.packages("operator.tools")
            - -
              -
            • -

              operator.tools is not required, but it’s recommended.

              - -

              For some unknown reason, base R has a %in% function but not a not-in function. Unfortunately, the United States is still an empire with it’s associated areas, islands, and pseudo-states. I only want to include the 50 states, so I needed a way to easily filter out the non-states. Operator tool’s %!!in% function is perfect for that.

              -
            • -
            - -
            8    install.packages("sf")
            - -
              -
            • sf is used to read and write the shapefiles necessary for cartography.
            • -
            - -

            2. load the packages

            -
            - -

            To start, create and save a new file called usa.r. In it, we’re going to download and modify the United States shape data that we’ll use to create the base map in part two of this series.

            - -

            At the beginning of each file, you have to load the necessary packages. In this file, the only packages we need to load are tidyverse, sf, and tigris. I also load leaflet to make sure the map renders correctly.

            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")
            -    library("sf")
            -    library("tigris")
            -    library("leaflet")
            -
            - -

            3. download usa shapefile

            -
            - -

            There’s two ways to download the USA shape data. First, we can use the R package, tigris. Second, we can download it from the Census website.

            - -
            Note: I show two ways to download the shape data but you only have to choose one method.
            - -

            I prefer using tigris but I’ve been having some problems with it. Sometimes it ignores the Great Lakes and merges Michigan and Wisconsin into a Frankenstate (boxed in red below).

            - -
            - - - Image: Weird Frankenstate that merges Michigan and Wisconsin. - - -
            Weird Frankenstate that merges Michigan and Wisconsin.
            -
            - -
            method 1: using tigris()
            - -

            tigris() downloads the TIGER/Shapefile data directly from the Census and includes a treasure trove of data. Some of the data includes land area, water area, state names, and geometry.

            - -

            Tigris can also download boundaries for counties, divisions, regions, tracts, blocks, congressional and school districts, and a whole host of other groupings. A complete list of available data can be found on the packages’ GitHub.

            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -
                ## download state data using tigris()
            -    us_states <- tigris::states(cb = FALSE, year = 2020)  %>% 
            -        filter(STATEFP < 57)  %>% 
            -        shift_geometry(preserve_area = FALSE,
            -                       position = "below")  %>% 
            -        sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -    
            -    ## save the shifted shapefile
            -    st_write(us_states, "path/to/file/usa.shp")
            -
            - -

            Here we create the us_states variable, save the geographic data to it, move Alaska and Hawaii so they’re beneath the continental US, and save the shifted shapefile.

            - -
            - -
              -
            • line 2:
            • -
            - -
            2   us_states <- tigris::states(cb = FALSE, year = 2020)  %>%
            - -

            R uses the <- operator to define new variables. Here, we’re naming our new variable us_states.

            - -

            In our us_states variable we’re going to store data on the 50 states downloaded using tigris. Within (::) tigris, we’re going to use the states() function.

            - -

            The states() function allows you to pull state-level data from the Census. This function takes several arguments

            - -

            The cb argument can either be TRUE or FALSE. If cb = FALSE tells Tigris() to download the most detailed shapefile. If cb = TRUE it will download a generalized (1:5000k) file. After a lot of trial and error, I found that using cb = TRUE prevents the Frankenstate from happening.

            - -

            If the year argument is omitted it will download the shapefile for the default year (currently 2020). I set out of habit from when I work with county boundaries. When I work with county boundaries I have to set the year because their boundaries change more than states.

            - -

            Finally, the %>% operator is part of the Tidyverse. It basically tells R “Hey! I’m not done, keep going to the next line!”

            - -
            - -
              -
            • line 3:
            • -
            - -
            3    filter(STATEFP < 57)  %>% 
            - -

            tigris::states() downloads data for the 50 states and the United States’ minor outlying islands, Puerto Rico, and its associated territories. Each state and territory is assigned a unique two-digit Federal Information Processing Standard [FIPS] code.

            - -

            They’re mostly consecutive (Alaska is 01) but when they were conceived of in the 1970s a couple were reserved for the US territories (American Samoa was 03), but in the updated version the “reserved codes” were left out and the territories were assigned to new numbers (American Samoa is now 60). The important bit about this is that the last official state (Wyoming) has a FIPS of 56.

            - -

            This line of code uses the filter() function on the STATEFP variable downloaded using Tigris(). All it says is keep any row that has a FIPS of less than 57. This will keep only the 50 states and exclude the United States’ empire associated territories.

            - -
            - -
              -
            • lines 4-5:
            • -
            - -
            4    shift_geometry(preserve_area = FALSE,
            -5                   position = "below")  %>%
            - -

            The shift_geometry() is from the Tigris package. It takes two arguments preserve_area and position.

            - -
            Note: Strictly speaking, these two lines are not required. Delete them if you want Alaska and Hawaii to remain in their original locations.
            - -

            When preserve_area = FALSE tigris will shrink Alaska’s size and increase Hawaii’s so that they are comparable to the size of the other states.

            - -

            The position argument can either be "below" or "outside". When it’s below, both Alaska and Hawaii are moved to be below California. When it’s outside then Alaska is moved to be near Washington and Hawaii is moved to be near California.

            - -
            - -

            Since I’m a born-theorist, I should warn you that messing with maps has inherent normative implications. The most common projection is Mercator which stretches the continents near the poles and squishes the ones near the equator.

            - -
            - - - Image: Mercator vs. Gall-Peters Projection - - -
            Mercator vs. Gall-Peters Projection
            -
            - -

            One of the competing projections is Gall-Peters which claims to be more accurate because it was - at the time it was created in the 1980s - the only “area-correct map.” Though it has now been criticized for skewing the polar continents and the equatorial ones. The above photo shows you just how different the projects are from one another.

            - -

            The problem arises because we’re trying to project a 3D object into 2D space. It’s a classic case of even though we can, maybe we shouldn’t. Computers can do these computations and change the projections to anything we want fairly easily. However, humans think and exist in metaphors. We assume bigger = better and up = good. When we project maps that puts the Northern Hemisphere as both upwards and larger than other parts of the world we are imbuing that projection with metaphorical meaning.

            - -

            I caution you to be careful when creating maps. Think through the implications of something as simple as making Alaska more visually appealing by distorting it to be of similar size as the other states.

            - -

            If you want to read more about map projections this is a good post. If you want to read more about metaphors, I suggest Metaphors We Live By by George Lakoff and Mark Johnson.

            - -
            - -
              -
            • line 6:
            • -
            - -
            6    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            - -

            The sf package includes a function called st_transform() which will reproject the data for us. There are a lot of projects. You can read them at the proj website.

            - -

            Leaflet requires all boundaries use the World Geodetic Service 1984 (WGS84) coordinate system. While making maps I’ve come across two main coordinate systems: WGS84 and North American Datum (1983). WGS84 uses the WGS84 ellipsoid and NAD83 uses the Geodetic Reference System (GRS80). From what I’ve gathered, the differences are slight, but leaflet requires WGS and the Census uses NAD83. As a result, we have to reproject the the data in order to make our map.

            - -

            The st_transform function takes four arguments, each preceded by a +. All four arguments are required to transform the data from NAD83 to WGS84.

            - -

            Briefly, +proj=longlat tells R to use project the code into longitude and latitude [rather than, for example, transverse mercator (tmerc)].

            - -

            +ellps=WGS84 sets the ellipsoid to the WGS84 standard.

            - -

            +datum=WGS84 is a holdover from previous proj releases. It tells R to use the WGS84 data.

            - -

            +no_defs is also a holdover.

            - -

            Essentially, you need to include line 6 before you create the map, but after you do any data manipulation. It might throw some warnings which you can just ignore.

            - -
            - -
              -
            • line 9
            • -
            - -
            9    st_write(us_states, "path/to/file/usa.shp")
            - -

            In the last line, we save the data we manipulated in lines 2-6. Strictly speaking you don’t have to save the shapefile. You can manipulate the data and then skip right to mapping the data. I caution against it because the files can get unreadable once you start using multiple data sets. I usually comment out line 9 after I save the file. That way I’m not saving and re-saving it whenever I need to run the code above it.

            - -

            The st_write() function is part of the sf package and it takes two arguments. The first is the data set you want to save. Since I used us_states to save the data, it will be the first argument in the st_write() function call.

            - -

            The second argument is the path to where you want the file saved and what name you want to give it. I named mine usa. It is mandatory that you add .shp to the end of the filepath so that R knows to save it as a shapefile.

            - -

            Although it’s called a shapefile, it’s actually four files. I usually create a separate folder for each set of shapefiles and store that in one master folder called shapefiles. An example of my folder structure is below. I keep all of this in my GitHub repo and track changes using DVC.

            - -
            - - - Image: Example folder structure - - -
            Example folder structure
            -
            - -

            On my C:// drive is My Documents. In that folder I keep a GitHub folder that holds all my repos, including my nps one. Inside the nps folder I separate my shapefiles into their own folder. For this tutorial I am using original and shifted shapefiles, so I’ve also separated them into two separate folders to keep things neat. I also know I’m going to have multiple shapefiles (one for the USA, one for the National Parks, and a final one for the State Parks) so I created a folder for each set. In the usa folder I saved the shifted states shapefile.

            - -

            Altogether, my line 9 would read:

            - -
            9    st_write(us_states, "~/Documents/GitHub/nps/shapefiles/shifted/usa/usa.shp")
            - -

            Running that line will save the four necessary files that R needs to load the geographic data.

            - -

            That’s it for method 1 using tigris. The next section, method 2, shows how to load and transform a previously downloaded shapefile. If you used method 1, feel free to leave this post and go directly to mapping the shapefile in part II of this series.

            - -
            - -
            method 2: using downloaded shapefiles
            - -

            In this section, I’ll go through the process of downloading the shapefiles from the Census website. If you tried method 1 and tigris caused the weird Frankenstate, you can try using the data downloaded from the Census website. I don’t know why it works, since tigris uses the same data, but it does.

            - -

            Generally, though, finding and using shapefiles created by others is a great way to create cool maps. There are thousands of shapefiles available, many from ArcGis’ Open Data Website.

            - -
              -
            • step 1:
              -On the Census website, select the Data & Maps dropdown. In the dropdown, on the right-hand side you’ll select Mapping Files.
            • -
            - -
            - - - Image: Census website dropdown. - - -
            Census website dropdown.
            -
            - -
              -
            • step 2:
              -From here, about halfway down on the page, there’s a link called TIGER Data Products Guide which will take you to a complete list of the shapefiles available.
            • -
            - -
            - - - Image: TIGER Data Products Download Link - - -
            TIGER Data Products Download Link
            -
            - -
              -
            • step 3:
              -There’s a lot of downloads available on this page, but for now just click on “Cartographic Boundaries Shapefiles.” Make sure you select the shapefiles one and not the Geodatabases or Geopackages link.
            • -
            - -
            - - - Image: Cartographic Boundaries Shapefiles - - -
            Cartographic Boundaries Shapefiles
            -
            - -

            Save the file wherever you want, but I prefer to keep it within the “original” shapefiles folder in a sub-folder called “zips.” Once it downloads, unzip it - again, anywhere is fine. It will download all 30 Census shapefiles. We’re only going to use the one called “cb_2021_us_state_500k.zip”. The rest you can delete, if you want.

            - -
            - - - Image: cb_2021_us_state_500k folder - - -
            cb_2021_us_state_500k folder
            -
            - -

            When you unzip the cb_2021_us_state_500k.zip, it will contain four files. You’ll only ever work with the .shp file, but the other three are used in the background to display the data.

            - -
              -
            • step 4:
            • -
            - -

            Once all the files are unzipped, we can load the .shp file into R.

            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -
                ## load a previously downloaded shapefile
            -    usa <- read_sf("shapefiles/original/usa/states/cb_2021_us_state_500k.shp") %>%
            -        filter(STATEFP < 57) %>%
            -        shift_geometry(preserve_area = FALSE,
            -            position = "below") %>%
            -        sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save the shifted shapefile
            -    st_write(usa, "path/to/file/usa.shp")
            -
            - -

            Everything except line 2 is the same as in method 1. I won’t go over lines 3-9 here, because all the information is above.

            - -
              -
            • line 2:
            • -
            - -

            This line is very similar to the one above. I changed the name of the variable to usa so I could keep both methods in the same R file (each R variable needs to be unique or it will be overwritten).

            - -

            read_sf is part of the sf() package. It’s used to load shapefiles into R. The path to the file is enclosed in quotation marks and parentheses. Simply navigate to wherever you unzipped the cb_2021_us_state_500k file and choose the file with the .shp extension.

            - -

            4. Conclusion

            -
            - -

            Once the shapefiles are downloaded - either using tigris() or by loading the shapefiles from the Census website - you can create the base map. I’ll tackle making the base map in part II of this series.*

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a8/0c4a320e4314e67d878a4cbf17de723fa15b8025c7213e4a7c6d4b44a8a37c b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a8/0c4a320e4314e67d878a4cbf17de723fa15b8025c7213e4a7c6d4b44a8a37c deleted file mode 100644 index 82cce46..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a8/0c4a320e4314e67d878a4cbf17de723fa15b8025c7213e4a7c6d4b44a8a37c +++ /dev/null @@ -1,686 +0,0 @@ -I"Ćţ

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm)
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            Look ma, new code!

            - -
              -
            • line 1
            • -
            - -
            1    split_states <- split(state_parks, f = state_parks$State_Nm) 
            - -

            The split() is part of base R. It takes quite a few arguments, most of which are optional.

            - -

            The first argument is the vector (or data frame) that you want to split into different groups. I want to split the state_parks data into its corresponding states, so it is listed first.

            - -

            The second argument f = is how you want the data split. f in this instance stands for factor. If we run levels(as.factor(state_parks$State_Nm)) in the terminal, it will return a list of the 50 state abbreviations. That is what we’re telling R to do here.

            - -

            You can access an individual state using the $ operator. split_states$CA will return the state park data for California.

            - -
            - -
              -
            • line 2
            • -
            - -
            2    all_names <- names(split_states)
            - -

            names is also part of base R. It does what it sounds like - it gets the names of an object. Here, I want to get the names of each split data sets.

            - -
            - -
              -
            • lines 4-5
            • -
            - -
            4    for(name in all_names){            
            -5        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            - -

            Here’s the actual for loop.

            - -

            The for(x in y) {
            - do something}

            - -

            8. conclusion

            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me an email or a tweet if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a9/d7572a1f6e02cd3f633742f8eca583be5e14f03d28245aba7004060a49db1b b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a9/d7572a1f6e02cd3f633742f8eca583be5e14f03d28245aba7004060a49db1b deleted file mode 100644 index 1dc0b00..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/a9/d7572a1f6e02cd3f633742f8eca583be5e14f03d28245aba7004060a49db1b +++ /dev/null @@ -1,478 +0,0 @@ -I"X›

            Twitter is a great resource for engaging with the academic community. For example, I saw this Tweet by PhD Genie asking users to name one positive skill learned during their PhD. I love this question for a number of reasons. First, it helps PhDs reframe their experience so it’s applicable outside of academia - which can help when applying to jobs. Second, it’s really cool to see what skills other people have learned during their program.
            -

            - -
            - - - Image: PhD Genie Tweet - - -
            PhD Genie Tweet
            -
            - -

            I responded to the tweet because during my PhD I learned how to create maps in R. I started by recreating a map from the University of North Carolina’s Hussman School of Journalism’s News Deserts project (below). Now, I am working on a personal project mapping the U.S. National and State parks.

            - -
            - - - Image: Map of US newspapers by county - - -
            Map of US newspapers by county
            -
            - -

            There was quite a bit of interest in how to do this, so in this series of posts I will document my process from start to finish.

            - -

            a few notes

            -
            - -

            First, I’m not an expert. I wanted to make a map, so I learned how. There may be easier ways and, if I learn how to do them, I’ll write another post.

            - -

            Second, before starting, I strongly suggest setting up a Github and DVC. I wrote about how to use GitHub, the Github Website, and Github Desktop. You can use any of these methods to manage your repositories. I use all three based purely on whatever mood I’m in.

            - -

            If you do use Git or GitHub, then DVC (data version control) is mandatory. GitHub will warn you that your file is too large if it’s over 50MB and reject your pushes if the files are over 100MB. The total repository size can’t exceed 2GB if you’re using the free version (which I am). DVC is useful because cartography files are large. They contain a lot of coordinates which increases with each location you try to map. DVC will store your data outside of GitHub but allows you to track changes with your data. It’s super useful.

            - -

            Third, there are several ways to make a map. R is capable of making interactive maps and static maps. Static maps are less computationally expensive and better for publication. Interactive maps are prettier and better for displaying on the web.

            - -

            I make interactive maps with Leaflet and Shiny because they offer a lot of functionality. The most common way is to use map tiles. Map tiles use data from sources like Open Street Map and Maps to create map squares (tiles) with custom data on top. A list of available map tiles is available on the Open Street Maps website.

            - -
            - - - Image: Earthquakes near Australia, using map tiles - - -
            Earthquakes near Australia, using map tiles
            -
            - -

            When I make static maps (like the US map pictured above), I use ggplot

            - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            V. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. install packages

            -
            - -

            You only need to install the packages once. You can do so by running each line in the terminal. When you rerun the code later, you can skip right to loading the packages using library("package-name")

            - -
            Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -
                ## you only need to install the packages once
            -
            -    install.packages("leaflet")        # interactive maps
            -    install.packages("shiny")          # added map functionality
            -    install.packages("tidyverse")      # data manipulation
            -    install.packages("tigris")         # cartographic boundaries
            -    install.packages("operator.tools") # for the not-in function
            -    install.packages("sf")             # read and write shapefiles
            -
            - -
            package descriptions
            - -
            3    install.packages("leaflet")
            - -
              -
            • Leaflet is an open source mapping library used to create interactive maps. All leaflet maps have the same basic components: -
                -
              • A map widget (created by calling leaflet())
              • -
              • Layers (or features) added using addTiles(), addPolygons(), or addMarkers(). -

                - Leaflet was originally written in Javascript by “Volodymyr Agafonkin, a Ukrainian citizen living in Kyiv.” With the Russian invasion of Ukraine, Volodymyr, his family, and the people he knows are being forced to flee. If you can, make a donation to Come Back Alive or to one of the charities listed at Stand With Ukraine. If you think the invasion is justified, don’t forget to carry some seeds in your pocket
              • -
              -
            • -
            - -
            4    install.packages("shiny")
            - -
              -
            • Shiny is an R package that provides additional functionality. I am primarily using it to create an informative tool box so the map is not overwhelmed by using too many popups.
            • -
            - -
            5    install.packages("tidyverse")
            - -
              -
            • -

              The Tidyverse is a collection of packages used for data manipulation and analysis. Its syntax is more intuitive than base R. Furthermore, you can chain (aka pipe) commands together.

              - -

              For cartography, you don’t need the whole Tidyverse. We’ll mainly use dplyr and ggplot. You can install these packages individually instead of installing the whole tidyverse. Though, when we get to the national park database, we’ll also need purr and tidyr.

              -
            • -
            - -
            6    install.packages("tigris")
            - -
              -
            • Tigris is used to load Census data and shapefiles. You can download the same data from the Census website, but it’s nice that you can do it in R.
            • -
            - -
            7    install.packages("operator.tools")
            - -
              -
            • -

              operator.tools is not required, but it’s recommended.

              - -

              For some unknown reason, base R has a %in% function but not a not-in function. Unfortunately, the United States is still an empire with it’s associated areas, islands, and pseudo-states. I only want to include the 50 states, so I needed a way to easily filter out the non-states. Operator tool’s %!!in% function is perfect for that.

              -
            • -
            - -
            8    install.packages("sf")
            - -
              -
            • sf is used to read and write the shapefiles necessary for cartography.
            • -
            - -

            2. load the packages

            -
            - -

            To start, create and save a new file called usa.r. In it, we’re going to download and modify the United States shape data that we’ll use to create the base map in part two of this series.

            - -

            At the beginning of each file, you have to load the necessary packages. In this file, the only packages we need to load are tidyverse, sf, and tigris. I also load leaflet to make sure the map renders correctly.

            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")
            -    library("sf")
            -    library("tigris")
            -    library("leaflet")
            -
            - -

            3. download usa shapefile

            -
            - -

            There’s two ways to download the USA shape data. First, we can use the R package, tigris. Second, we can download it from the Census website.

            - -
            Note: I show two ways to download the shape data but you only have to choose one method.
            - -

            I prefer using tigris but I’ve been having some problems with it. Sometimes it ignores the Great Lakes and merges Michigan and Wisconsin into a Frankenstate (boxed in red below).

            - -
            - - - Image: Weird Frankenstate that merges Michigan and Wisconsin. - - -
            Weird Frankenstate that merges Michigan and Wisconsin.
            -
            - -
            method 1: using tigris()
            - -

            tigris() downloads the TIGER/Shapefile data directly from the Census and includes a treasure trove of data. Some of the data includes land area, water area, state names, and geometry.

            - -

            Tigris can also download boundaries for counties, divisions, regions, tracts, blocks, congressional and school districts, and a whole host of other groupings. A complete list of available data can be found on the packages’ GitHub.

            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -
                ## download state data using tigris()
            -    us_states <- tigris::states(cb = FALSE, year = 2020)  %>% 
            -        filter(STATEFP < 57)  %>% 
            -        shift_geometry(preserve_area = FALSE,
            -                       position = "below")  %>% 
            -        sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -    
            -    ## save the shifted shapefile
            -    st_write(us_states, "path/to/file/usa.shp")
            -
            - -

            Here we create the us_states variable, save the geographic data to it, move Alaska and Hawaii so they’re beneath the continental US, and save the shifted shapefile.

            - -
            - -
              -
            • line 2:
            • -
            - -
            2   us_states <- tigris::states(cb = FALSE, year = 2020)  %>%
            - -

            R uses the <- operator to define new variables. Here, we’re naming our new variable us_states.

            - -

            In our us_states variable we’re going to store data on the 50 states downloaded using tigris. Within (::) tigris, we’re going to use the states() function.

            - -

            The states() function allows you to pull state-level data from the Census. This function takes several arguments

            - -

            The cb argument can either be TRUE or FALSE. If cb = FALSE tells Tigris() to download the most detailed shapefile. If cb = TRUE it will download a generalized (1:5000k) file. After a lot of trial and error, I found that using cb = TRUE prevents the Frankenstate from happening.

            - -

            If the year argument is omitted it will download the shapefile for the default year (currently 2020). I set out of habit from when I work with county boundaries. When I work with county boundaries I have to set the year because their boundaries change more than states.

            - -

            Finally, the %>% operator is part of the Tidyverse. It basically tells R “Hey! I’m not done, keep going to the next line!”

            - -
            - -
              -
            • line 3:
            • -
            - -
            3    filter(STATEFP < 57)  %>% 
            - -

            tigris::states() downloads data for the 50 states and the United States’ minor outlying islands, Puerto Rico, and its associated territories. Each state and territory is assigned a unique two-digit Federal Information Processing Standard [FIPS] code.

            - -

            They’re mostly consecutive (Alaska is 01) but when they were conceived of in the 1970s a couple were reserved for the US territories (American Samoa was 03), but in the updated version the “reserved codes” were left out and the territories were assigned to new numbers (American Samoa is now 60). The important bit about this is that the last official state (Wyoming) has a FIPS of 56.

            - -

            This line of code uses the filter() function on the STATEFP variable downloaded using Tigris(). All it says is keep any row that has a FIPS of less than 57. This will keep only the 50 states and exclude the United States’ empire associated territories.

            - -
            - -
              -
            • lines 4-5:
            • -
            - -
            4    shift_geometry(preserve_area = FALSE,
            -5                   position = "below")  %>%
            - -

            The shift_geometry() is from the Tigris package. It takes two arguments preserve_area and position.

            - -
            Note: Strictly speaking, these two lines are not required. Delete them if you want Alaska and Hawaii to remain in their original locations.
            - -

            When preserve_area = FALSE tigris will shrink Alaska’s size and increase Hawaii’s so that they are comparable to the size of the other states.

            - -

            The position argument can either be "below" or "outside". When it’s below, both Alaska and Hawaii are moved to be below California. When it’s outside then Alaska is moved to be near Washington and Hawaii is moved to be near California.

            - -
            - -

            Since I’m a born-theorist, I should warn you that messing with maps has inherent normative implications. The most common projection is Mercator which stretches the continents near the poles and squishes the ones near the equator.

            - -
            - - - Image: Mercator vs. Gall-Peters Projection - - -
            Mercator vs. Gall-Peters Projection
            -
            - -

            One of the competing projections is Gall-Peters which claims to be more accurate because it was - at the time it was created in the 1980s - the only “area-correct map.” Though it has now been criticized for skewing the polar continents and the equatorial ones. The above photo shows you just how different the projects are from one another.

            - -

            The problem arises because we’re trying to project a 3D object into 2D space. It’s a classic case of even though we can, maybe we shouldn’t. Computers can do these computations and change the projections to anything we want fairly easily. However, humans think and exist in metaphors. We assume bigger = better and up = good. When we project maps that puts the Northern Hemisphere as both upwards and larger than other parts of the world we are imbuing that projection with metaphorical meaning.

            - -

            I caution you to be careful when creating maps. Think through the implications of something as simple as making Alaska more visually appealing by distorting it to be of similar size as the other states.

            - -

            If you want to read more about map projections this is a good post. If you want to read more about metaphors, I suggest Metaphors We Live By by George Lakoff and Mark Johnson.

            - -
            - -
              -
            • line 6:
            • -
            - -
            6    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            - -

            The sf package includes a function called st_transform() which will reproject the data for us. There are a lot of projects. You can read them at the proj website.

            - -

            Leaflet requires all boundaries use the World Geodetic Service 1984 (WGS84) coordinate system. While making maps I’ve come across two main coordinate systems: WGS84 and North American Datum (1983). WGS84 uses the WGS84 ellipsoid and NAD83 uses the Geodetic Reference System (GRS80). From what I’ve gathered, the differences are slight, but leaflet requires WGS and the Census uses NAD83. As a result, we have to reproject the the data in order to make our map.

            - -

            The st_transform function takes four arguments, each preceded by a +. All four arguments are required to transform the data from NAD83 to WGS84.

            - -

            Briefly, +proj=longlat tells R to use project the code into longitude and latitude [rather than, for example, transverse mercator (tmerc)].

            - -

            +ellps=WGS84 sets the ellipsoid to the WGS84 standard.

            - -

            +datum=WGS84 is a holdover from previous proj releases. It tells R to use the WGS84 data.

            - -

            +no_defs is also a holdover.

            - -

            Essentially, you need to include line 6 before you create the map, but after you do any data manipulation. It might throw some warnings which you can just ignore.

            - -
            - -
              -
            • line 9
            • -
            - -
            9    st_write(us_states, "path/to/file/usa.shp")
            - -

            In the last line, we save the data we manipulated in lines 2-6. Strictly speaking you don’t have to save the shapefile. You can manipulate the data and then skip right to mapping the data. I caution against it because the files can get unreadable once you start using multiple data sets. I usually comment out line 9 after I save the file. That way I’m not saving and re-saving it whenever I need to run the code above it.

            - -

            The st_write() function is part of the sf package and it takes two arguments. The first is the data set you want to save. Since I used us_states to save the data, it will be the first argument in the st_write() function call.

            - -

            The second argument is the path to where you want the file saved and what name you want to give it. I named mine usa. It is mandatory that you add .shp to the end of the filepath so that R knows to save it as a shapefile.

            - -

            Although it’s called a shapefile, it’s actually four files. I usually create a separate folder for each set of shapefiles and store that in one master folder called shapefiles. An example of my folder structure is below. I keep all of this in my GitHub repo and track changes using DVC.

            - -
            - - - Image: Example folder structure - - -
            Example folder structure
            -
            - -

            On my C:// drive is My Documents. In that folder I keep a GitHub folder that holds all my repos, including my nps one. Inside the nps folder I separate my shapefiles into their own folder. For this tutorial I am using original and shifted shapefiles, so I’ve also separated them into two separate folders to keep things neat. I also know I’m going to have multiple shapefiles (one for the USA, one for the National Parks, and a final one for the State Parks) so I created a folder for each set. In the usa folder I saved the shifted states shapefile.

            - -

            Altogether, my line 9 would read:

            - -
            9    st_write(us_states, "~/Documents/GitHub/nps/shapefiles/shifted/usa/usa.shp")
            - -

            Running that line will save the four necessary files that R needs to load the geographic data.

            - -

            That’s it for method 1 using tigris. The next section, method 2, shows how to load and transform a previously downloaded shapefile. If you used method 1, feel free to leave this post and go directly to mapping the shapefile in part II of this series.

            - -
            - -
            method 2: using downloaded shapefiles
            - -

            In this section, I’ll go through the process of downloading the shapefiles from the Census website. If you tried method 1 and tigris caused the weird Frankenstate, you can try using the data downloaded from the Census website. I don’t know why it works, since tigris uses the same data, but it does.

            - -

            Generally, though, finding and using shapefiles created by others is a great way to create cool maps. There are thousands of shapefiles available, many from ArcGis’ Open Data Website.

            - -
              -
            • step 1:
              -On the Census website, select the Data & Maps dropdown. In the dropdown, on the right-hand side you’ll select Mapping Files.
            • -
            - -
            - - - Image: Census website dropdown. - - -
            Census website dropdown.
            -
            - -
              -
            • step 2:
              -From here, about halfway down on the page, there’s a link called TIGER Data Products Guide which will take you to a complete list of the shapefiles available.
            • -
            - -
            - - - Image: TIGER Data Products Download Link - - -
            TIGER Data Products Download Link
            -
            - -
              -
            • step 3:
              -There’s a lot of downloads available on this page, but for now just click on “Cartographic Boundaries Shapefiles.” Make sure you select the shapefiles one and not the Geodatabases or Geopackages link.
            • -
            - -
            - - - Image: Cartographic Boundaries Shapefiles - - -
            Cartographic Boundaries Shapefiles
            -
            - -

            Save the file wherever you want, but I prefer to keep it within the “original” shapefiles folder in a sub-folder called “zips.” Once it downloads, unzip it - again, anywhere is fine. It will download all 30 Census shapefiles. We’re only going to use the one called “cb_2021_us_state_500k.zip”. The rest you can delete, if you want.

            - -
            - - - Image: cb_2021_us_state_500k folder - - -
            cb_2021_us_state_500k folder
            -
            - -

            When you unzip the cb_2021_us_state_500k.zip, it will contain four files. You’ll only ever work with the .shp file, but the other three are used in the background to display the data.

            - -
              -
            • step 4:
            • -
            - -

            Once all the files are unzipped, we can load the .shp file into R.

            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -
                ## load a previously downloaded shapefile
            -    usa <- read_sf("shapefiles/original/usa/states/cb_2021_us_state_500k.shp") %>%
            -        filter(STATEFP < 57) %>%
            -        shift_geometry(preserve_area = FALSE,
            -            position = "below") %>%
            -        sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save the shifted shapefile
            -    st_write(usa, "path/to/file/usa.shp")
            -
            - -

            Everything except line 2 is the same as in method 1. I won’t go over lines 3-9 here, because all the information is above.

            - -
              -
            • line 2:
            • -
            - -

            This line is very similar to the one above. I changed the name of the variable to usa so I could keep both methods in the same R file (each R variable needs to be unique or it will be overwritten).

            - -

            read_sf is part of the sf() package. It’s used to load shapefiles into R. The path to the file is enclosed in quotation marks and parentheses. Simply navigate to wherever you unzipped the cb_2021_us_state_500k file and choose the file with the .shp extension.

            - -

            4. Conclusion

            -
            - -

            Once the shapefiles are downloaded - either using tigris() or by loading the shapefiles from the Census website - you can create the base map. I’ll tackle making the base map in part II of this series.*

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/aa/e02f2fed1aed3fdc5c5f1df529d07469c5652c3f47cdd1dc8d4cca4a44a8d6 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/aa/e02f2fed1aed3fdc5c5f1df529d07469c5652c3f47cdd1dc8d4cca4a44a8d6 new file mode 100644 index 0000000..9c80bbe --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/aa/e02f2fed1aed3fdc5c5f1df529d07469c5652c3f47cdd1dc8d4cca4a44a8d6 @@ -0,0 +1,30 @@ +I"[
            +
            +
            +
            + + +
            + Here you'll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from. + + I have done my best to keep my views out of it. + + I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden. + + If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [link is below] + + Helpful Resources: + 1. Register to vote (Deadline May 23): [https://registertovote.ca.gov/](https://registertovote.ca.gov/){:target="_blank" rel="noopener noreferrer"} + 2. Check your registration status: [https://voterstatus.sos.ca.gov/](https://voterstatus.sos.ca.gov/){:target="_blank" rel="noopener noreferrer"} + 3. Access the official voter guide: [https://voterguide.sos.ca.gov/](https://voterguide.sos.ca.gov/){:target="_blank" rel="noopener noreferrer"} + 4. Early voting & ballot drop off locations: [https://caearlyvoting.sos.ca.gov/](https://caearlyvoting.sos.ca.gov/){:target="_blank" rel="noopener noreferrer"} + 5. Track your ballot: [https://california.ballottrax.net/voter/](https://california.ballottrax.net/voter/){:target="_blank" rel="noopener noreferrer"} + 6. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone. +
            + +
            +
            + +
            +
            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ab/6798ad48b99d5f0975b90614d1cdad713415ae2cecd73df1b3e15ec052553e b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ab/6798ad48b99d5f0975b90614d1cdad713415ae2cecd73df1b3e15ec052553e deleted file mode 100644 index d37a14d..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ab/6798ad48b99d5f0975b90614d1cdad713415ae2cecd73df1b3e15ec052553e +++ /dev/null @@ -1,478 +0,0 @@ -I"Y›

            Twitter is a great resource for engaging with the academic community. For example, I saw this Tweet by PhD Genie asking users to name one positive skill learned during their PhD. I love this question for a number of reasons. First, it helps PhDs reframe their experience so it’s applicable outside of academia - which can help when applying to jobs. Second, it’s really cool to see what skills other people have learned during their program.
            -

            - -
            - - - Image: PhD Genie Tweet - - -
            PhD Genie Tweet
            -
            - -

            I responded to the tweet because during my PhD I learned how to create maps in R. I started by recreating a map from the University of North Carolina’s Hussman School of Journalism’s News Deserts project (below). Now, I am working on a personal project mapping the U.S. National and State parks.

            - -
            - - - Image: Map of US newspapers by county - - -
            Map of US newspapers by county
            -
            - -

            There was quite a bit of interest in how to do this, so in this series of posts I will document my process from start to finish.

            - -

            a few notes

            -
            - -

            First, I’m not an expert. I wanted to make a map, so I learned how. There may be easier ways and, if I learn how to do them, I’ll write another post.

            - -

            Second, before starting, I strongly suggest setting up a Github and DVC. I wrote about how to use GitHub, the Github Website, and Github Desktop. You can use any of these methods to manage your repositories. I use all three based purely on whatever mood I’m in.

            - -

            If you do use Git or GitHub, then DVC (data version control) is mandatory. GitHub will warn you that your file is too large if it’s over 50MB and reject your pushes if the files are over 100MB. The total repository size can’t exceed 2GB if you’re using the free version (which I am). DVC is useful because cartography files are large. They contain a lot of coordinates which increases with each location you try to map. DVC will store your data outside of GitHub but allows you to track changes with your data. It’s super useful.

            - -

            Third, there are several ways to make a map. R is capable of making interactive maps and static maps. Static maps are less computationally expensive and better for publication. Interactive maps are prettier and better for displaying on the web.

            - -

            I make interactive maps with Leaflet and Shiny because they offer a lot of functionality. The most common way is to use map tiles. Map tiles use data from sources like Open Street Map and Maps to create map squares (tiles) with custom data on top. A list of available map tiles is available on the Open Street Maps website.

            - -
            - - - Image: Earthquakes near Australia, using map tiles - - -
            Earthquakes near Australia, using map tiles
            -
            - -

            When I make static maps (like the US map pictured above), I use ggplot

            - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. install packages

            -
            - -

            You only need to install the packages once. You can do so by running each line in the terminal. When you rerun the code later, you can skip right to loading the packages using library("package-name")

            - -
            Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -
                ## you only need to install the packages once
            -
            -    install.packages("leaflet")        # interactive maps
            -    install.packages("shiny")          # added map functionality
            -    install.packages("tidyverse")      # data manipulation
            -    install.packages("tigris")         # cartographic boundaries
            -    install.packages("operator.tools") # for the not-in function
            -    install.packages("sf")             # read and write shapefiles
            -
            - -
            package descriptions
            - -
            3    install.packages("leaflet")
            - -
              -
            • Leaflet is an open source mapping library used to create interactive maps. All leaflet maps have the same basic components: -
                -
              • A map widget (created by calling leaflet())
              • -
              • Layers (or features) added using addTiles(), addPolygons(), or addMarkers(). -

                - Leaflet was originally written in Javascript by “Volodymyr Agafonkin, a Ukrainian citizen living in Kyiv.” With the Russian invasion of Ukraine, Volodymyr, his family, and the people he knows are being forced to flee. If you can, make a donation to Come Back Alive or to one of the charities listed at Stand With Ukraine. If you think the invasion is justified, don’t forget to carry some seeds in your pocket
              • -
              -
            • -
            - -
            4    install.packages("shiny")
            - -
              -
            • Shiny is an R package that provides additional functionality. I am primarily using it to create an informative tool box so the map is not overwhelmed by using too many popups.
            • -
            - -
            5    install.packages("tidyverse")
            - -
              -
            • -

              The Tidyverse is a collection of packages used for data manipulation and analysis. Its syntax is more intuitive than base R. Furthermore, you can chain (aka pipe) commands together.

              - -

              For cartography, you don’t need the whole Tidyverse. We’ll mainly use dplyr and ggplot. You can install these packages individually instead of installing the whole tidyverse. Though, when we get to the national park database, we’ll also need purr and tidyr.

              -
            • -
            - -
            6    install.packages("tigris")
            - -
              -
            • Tigris is used to load Census data and shapefiles. You can download the same data from the Census website, but it’s nice that you can do it in R.
            • -
            - -
            7    install.packages("operator.tools")
            - -
              -
            • -

              operator.tools is not required, but it’s recommended.

              - -

              For some unknown reason, base R has a %in% function but not a not-in function. Unfortunately, the United States is still an empire with it’s associated areas, islands, and pseudo-states. I only want to include the 50 states, so I needed a way to easily filter out the non-states. Operator tool’s %!!in% function is perfect for that.

              -
            • -
            - -
            8    install.packages("sf")
            - -
              -
            • sf is used to read and write the shapefiles necessary for cartography.
            • -
            - -

            2. load the packages

            -
            - -

            To start, create and save a new file called usa.r. In it, we’re going to download and modify the United States shape data that we’ll use to create the base map in part two of this series.

            - -

            At the beginning of each file, you have to load the necessary packages. In this file, the only packages we need to load are tidyverse, sf, and tigris. I also load leaflet to make sure the map renders correctly.

            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")
            -    library("sf")
            -    library("tigris")
            -    library("leaflet")
            -
            - -

            3. download usa shapefile

            -
            - -

            There’s two ways to download the USA shape data. First, we can use the R package, tigris. Second, we can download it from the Census website.

            - -
            Note: I show two ways to download the shape data but you only have to choose one method.
            - -

            I prefer using tigris but I’ve been having some problems with it. Sometimes it ignores the Great Lakes and merges Michigan and Wisconsin into a Frankenstate (boxed in red below).

            - -
            - - - Image: Weird Frankenstate that merges Michigan and Wisconsin. - - -
            Weird Frankenstate that merges Michigan and Wisconsin.
            -
            - -
            method 1: using tigris()
            - -

            tigris() downloads the TIGER/Shapefile data directly from the Census and includes a treasure trove of data. Some of the data includes land area, water area, state names, and geometry.

            - -

            Tigris can also download boundaries for counties, divisions, regions, tracts, blocks, congressional and school districts, and a whole host of other groupings. A complete list of available data can be found on the packages’ GitHub.

            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -
                ## download state data using tigris()
            -    us_states <- tigris::states(cb = FALSE, year = 2020)  %>% 
            -        filter(STATEFP < 57)  %>% 
            -        shift_geometry(preserve_area = FALSE,
            -                       position = "below")  %>% 
            -        sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -    
            -    ## save the shifted shapefile
            -    st_write(us_states, "path/to/file/usa.shp")
            -
            - -

            Here we create the us_states variable, save the geographic data to it, move Alaska and Hawaii so they’re beneath the continental US, and save the shifted shapefile.

            - -
            - -
              -
            • line 2:
            • -
            - -
            2   us_states <- tigris::states(cb = FALSE, year = 2020)  %>%
            - -

            R uses the <- operator to define new variables. Here, we’re naming our new variable us_states.

            - -

            In our us_states variable we’re going to store data on the 50 states downloaded using tigris. Within (::) tigris, we’re going to use the states() function.

            - -

            The states() function allows you to pull state-level data from the Census. This function takes several arguments

            - -

            The cb argument can either be TRUE or FALSE. If cb = FALSE tells Tigris() to download the most detailed shapefile. If cb = TRUE it will download a generalized (1:5000k) file. After a lot of trial and error, I found that using cb = TRUE prevents the Frankenstate from happening.

            - -

            If the year argument is omitted it will download the shapefile for the default year (currently 2020). I set out of habit from when I work with county boundaries. When I work with county boundaries I have to set the year because their boundaries change more than states.

            - -

            Finally, the %>% operator is part of the Tidyverse. It basically tells R “Hey! I’m not done, keep going to the next line!”

            - -
            - -
              -
            • line 3:
            • -
            - -
            3    filter(STATEFP < 57)  %>% 
            - -

            tigris::states() downloads data for the 50 states and the United States’ minor outlying islands, Puerto Rico, and its associated territories. Each state and territory is assigned a unique two-digit Federal Information Processing Standard [FIPS] code.

            - -

            They’re mostly consecutive (Alaska is 01) but when they were conceived of in the 1970s a couple were reserved for the US territories (American Samoa was 03), but in the updated version the “reserved codes” were left out and the territories were assigned to new numbers (American Samoa is now 60). The important bit about this is that the last official state (Wyoming) has a FIPS of 56.

            - -

            This line of code uses the filter() function on the STATEFP variable downloaded using Tigris(). All it says is keep any row that has a FIPS of less than 57. This will keep only the 50 states and exclude the United States’ empire associated territories.

            - -
            - -
              -
            • lines 4-5:
            • -
            - -
            4    shift_geometry(preserve_area = FALSE,
            -5                   position = "below")  %>%
            - -

            The shift_geometry() is from the Tigris package. It takes two arguments preserve_area and position.

            - -
            Note: Strictly speaking, these two lines are not required. Delete them if you want Alaska and Hawaii to remain in their original locations.
            - -

            When preserve_area = FALSE tigris will shrink Alaska’s size and increase Hawaii’s so that they are comparable to the size of the other states.

            - -

            The position argument can either be "below" or "outside". When it’s below, both Alaska and Hawaii are moved to be below California. When it’s outside then Alaska is moved to be near Washington and Hawaii is moved to be near California.

            - -
            - -

            Since I’m a born-theorist, I should warn you that messing with maps has inherent normative implications. The most common projection is Mercator which stretches the continents near the poles and squishes the ones near the equator.

            - -
            - - - Image: Mercator vs. Gall-Peters Projection - - -
            Mercator vs. Gall-Peters Projection
            -
            - -

            One of the competing projections is Gall-Peters which claims to be more accurate because it was - at the time it was created in the 1980s - the only “area-correct map.” Though it has now been criticized for skewing the polar continents and the equatorial ones. The above photo shows you just how different the projects are from one another.

            - -

            The problem arises because we’re trying to project a 3D object into 2D space. It’s a classic case of even though we can, maybe we shouldn’t. Computers can do these computations and change the projections to anything we want fairly easily. However, humans think and exist in metaphors. We assume bigger = better and up = good. When we project maps that puts the Northern Hemisphere as both upwards and larger than other parts of the world we are imbuing that projection with metaphorical meaning.

            - -

            I caution you to be careful when creating maps. Think through the implications of something as simple as making Alaska more visually appealing by distorting it to be of similar size as the other states.

            - -

            If you want to read more about map projections this is a good post. If you want to read more about metaphors, I suggest Metaphors We Live By by George Lakoff and Mark Johnson.

            - -
            - -
              -
            • line 6:
            • -
            - -
            6    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            - -

            The sf package includes a function called st_transform() which will reproject the data for us. There are a lot of projects. You can read them at the proj website.

            - -

            Leaflet requires all boundaries use the World Geodetic Service 1984 (WGS84) coordinate system. While making maps I’ve come across two main coordinate systems: WGS84 and North American Datum (1983). WGS84 uses the WGS84 ellipsoid and NAD83 uses the Geodetic Reference System (GRS80). From what I’ve gathered, the differences are slight, but leaflet requires WGS and the Census uses NAD83. As a result, we have to reproject the the data in order to make our map.

            - -

            The st_transform function takes four arguments, each preceded by a +. All four arguments are required to transform the data from NAD83 to WGS84.

            - -

            Briefly, +proj=longlat tells R to use project the code into longitude and latitude [rather than, for example, transverse mercator (tmerc)].

            - -

            +ellps=WGS84 sets the ellipsoid to the WGS84 standard.

            - -

            +datum=WGS84 is a holdover from previous proj releases. It tells R to use the WGS84 data.

            - -

            +no_defs is also a holdover.

            - -

            Essentially, you need to include line 6 before you create the map, but after you do any data manipulation. It might throw some warnings which you can just ignore.

            - -
            - -
              -
            • line 9
            • -
            - -
            9    st_write(us_states, "path/to/file/usa.shp")
            - -

            In the last line, we save the data we manipulated in lines 2-6. Strictly speaking you don’t have to save the shapefile. You can manipulate the data and then skip right to mapping the data. I caution against it because the files can get unreadable once you start using multiple data sets. I usually comment out line 9 after I save the file. That way I’m not saving and re-saving it whenever I need to run the code above it.

            - -

            The st_write() function is part of the sf package and it takes two arguments. The first is the data set you want to save. Since I used us_states to save the data, it will be the first argument in the st_write() function call.

            - -

            The second argument is the path to where you want the file saved and what name you want to give it. I named mine usa. It is mandatory that you add .shp to the end of the filepath so that R knows to save it as a shapefile.

            - -

            Although it’s called a shapefile, it’s actually four files. I usually create a separate folder for each set of shapefiles and store that in one master folder called shapefiles. An example of my folder structure is below. I keep all of this in my GitHub repo and track changes using DVC.

            - -
            - - - Image: Example folder structure - - -
            Example folder structure
            -
            - -

            On my C:// drive is My Documents. In that folder I keep a GitHub folder that holds all my repos, including my nps one. Inside the nps folder I separate my shapefiles into their own folder. For this tutorial I am using original and shifted shapefiles, so I’ve also separated them into two separate folders to keep things neat. I also know I’m going to have multiple shapefiles (one for the USA, one for the National Parks, and a final one for the State Parks) so I created a folder for each set. In the usa folder I saved the shifted states shapefile.

            - -

            Altogether, my line 9 would read:

            - -
            9    st_write(us_states, "~/Documents/GitHub/nps/shapefiles/shifted/usa/usa.shp")
            - -

            Running that line will save the four necessary files that R needs to load the geographic data.

            - -

            That’s it for method 1 using tigris. The next section, method 2, shows how to load and transform a previously downloaded shapefile. If you used method 1, feel free to leave this post and go directly to mapping the shapefile in part II of this series.

            - -
            - -
            method 2: using downloaded shapefiles
            - -

            In this section, I’ll go through the process of downloading the shapefiles from the Census website. If you tried method 1 and tigris caused the weird Frankenstate, you can try using the data downloaded from the Census website. I don’t know why it works, since tigris uses the same data, but it does.

            - -

            Generally, though, finding and using shapefiles created by others is a great way to create cool maps. There are thousands of shapefiles available, many from ArcGis’ Open Data Website.

            - -
              -
            • step 1:
              -On the Census website, select the Data & Maps dropdown. In the dropdown, on the right-hand side you’ll select Mapping Files.
            • -
            - -
            - - - Image: Census website dropdown. - - -
            Census website dropdown.
            -
            - -
              -
            • step 2:
              -From here, about halfway down on the page, there’s a link called TIGER Data Products Guide which will take you to a complete list of the shapefiles available.
            • -
            - -
            - - - Image: TIGER Data Products Download Link - - -
            TIGER Data Products Download Link
            -
            - -
              -
            • step 3:
              -There’s a lot of downloads available on this page, but for now just click on “Cartographic Boundaries Shapefiles.” Make sure you select the shapefiles one and not the Geodatabases or Geopackages link.
            • -
            - -
            - - - Image: Cartographic Boundaries Shapefiles - - -
            Cartographic Boundaries Shapefiles
            -
            - -

            Save the file wherever you want, but I prefer to keep it within the “original” shapefiles folder in a sub-folder called “zips.” Once it downloads, unzip it - again, anywhere is fine. It will download all 30 Census shapefiles. We’re only going to use the one called “cb_2021_us_state_500k.zip”. The rest you can delete, if you want.

            - -
            - - - Image: cb_2021_us_state_500k folder - - -
            cb_2021_us_state_500k folder
            -
            - -

            When you unzip the cb_2021_us_state_500k.zip, it will contain four files. You’ll only ever work with the .shp file, but the other three are used in the background to display the data.

            - -
              -
            • step 4:
            • -
            - -

            Once all the files are unzipped, we can load the .shp file into R.

            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -
                ## load a previously downloaded shapefile
            -    usa <- read_sf("shapefiles/original/usa/states/cb_2021_us_state_500k.shp") %>%
            -        filter(STATEFP < 57) %>%
            -        shift_geometry(preserve_area = FALSE,
            -            position = "below") %>%
            -        sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save the shifted shapefile
            -    st_write(usa, "path/to/file/usa.shp")
            -
            - -

            Everything except line 2 is the same as in method 1. I won’t go over lines 3-9 here, because all the information is above.

            - -
              -
            • line 2:
            • -
            - -

            This line is very similar to the one above. I changed the name of the variable to usa so I could keep both methods in the same R file (each R variable needs to be unique or it will be overwritten).

            - -

            read_sf is part of the sf() package. It’s used to load shapefiles into R. The path to the file is enclosed in quotation marks and parentheses. Simply navigate to wherever you unzipped the cb_2021_us_state_500k file and choose the file with the .shp extension.

            - -

            4. Conclusion

            -
            - -

            Once the shapefiles are downloaded - either using tigris() or by loading the shapefiles from the Census website - you can create the base map. I’ll tackle making the base map in part II of this series.*

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ac/f1d7a04d93d63e1cfd38f18f51f377993675ece83f0e5253bd794637fb9188 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ac/f1d7a04d93d63e1cfd38f18f51f377993675ece83f0e5253bd794637fb9188 deleted file mode 100644 index 886abda..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ac/f1d7a04d93d63e1cfd38f18f51f377993675ece83f0e5253bd794637fb9188 +++ /dev/null @@ -1,701 +0,0 @@ -I"ś

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm)
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            Look ma, new code!

            - -
              -
            • line 1
            • -
            - -
            1    split_states <- split(state_parks, f = state_parks$State_Nm) 
            - -

            The split() is part of base R. It takes quite a few arguments, most of which are optional.

            - -

            The first argument is the vector (or data frame) that you want to split into different groups. I want to split the state_parks data into its corresponding states, so it is listed first.

            - -

            The second argument f = is how you want the data split. f in this instance stands for factor. If we run levels(as.factor(state_parks$State_Nm)) in the terminal, it will return a list of the 50 state abbreviations. That is what we’re telling R to do here.

            - -

            You can access an individual state using the $ operator. split_states$CA will return the state park data for California.

            - -
            - -
              -
            • line 2
            • -
            - -
            2    all_names <- names(split_states)
            - -

            names is also part of base R. It does what it sounds like - it gets the names of an object. Here, I want to get the names of each split data sets.

            - -
            - -
              -
            • lines 4-5
            • -
            - -
            4    for(name in all_names){            
            -5        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            - -

            Here’s the actual for loop.

            - -

            The basic logic of a for loop is:
            - for(x in y){
            - do something}

            - -

            Inside the parenthesis is the condition that must evaluate to TRUE if the content in the curly braces is to run.

            - -

            In line 4, for(name in all_names){ says as long as there’s a name in the list of all names, do whatever is inside the curly braces. name can be whatever you want. It’s a placeholder value. I can have it say for(dogs in all_names){ it will still do the exact same thing. A lot of time you’ll see it as an i for item. I like to use more descriptive language because, again, for loops are my Achilles’ heel.

            - -

            The all_names part is where ever you want R to look for the data. It will change based on your data set and variable naming conventions.

            - -

            In line 5, I save the split data sets.

            - -

            st_write() is part of the sf package which allows us to create shapefiles. This can be any saving function (eg. write_csv() if you want to save CSVs). The function takes several arguments. In line 43 above I showed the basic structure: st_write(data, path/to/file.shp). This is good if you only have one file, but since I’m saving them in a loop I don’t want all of the files to have the same name. R will error out after the first and tell you the file already exists.

            - -

            The first part split_states[[name]] is still telling R what data to save, but using an index instead of a specific data frame name. To access an index you use data[[some-value]] where some-value is the index location. In my code, R will take the split_states data and go alright the first index location in [[name]] is 1 and return whatever value is stored in that index (here, AK). It will then do that for every index location as it loops through the split_states data.

            - -

            paste0() is also part of base R - it’s apparently faster than paste(). It concatenates (or links together) different pieces into one. I’m using it to create the filename. Within the paste0 call anything within quotation marks is static. So every file will be saved to "shapefiles/shifted/states/individual/" and every file will have the extension .shp. What will change with each loop is the name of the file. One by one, R will loop through and save each file using the name it pulled from all_names

            - -

            8. conclusion

            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me an email or a tweet if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ae/01182a9b38687fb517be48f52ebb237601228289a3d4bd9eea4534cc517659 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ae/01182a9b38687fb517be48f52ebb237601228289a3d4bd9eea4534cc517659 new file mode 100644 index 0000000..1e29328 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ae/01182a9b38687fb517be48f52ebb237601228289a3d4bd9eea4534cc517659 @@ -0,0 +1,49 @@ +I" +
             <blockquote class = "blockquote">It made a difference to that one.
            +  <footer class="blockquote-footer">The Star Thrower | <cite title="Source Title"><a href = "https://mrjakeparker.tumblr.com/post/87041680432/star-thrower-is-based-off-of-this-story-which-was">Mr. Jake Parker</a></cite> (my favorite version)</footer>
            +  </blockquote>
            +  <p>I have a simple motto in life: Do what you can, where you are, with what you have. As a result, I believe strongly in doing whatever is in my means to make the world a better place. Below are some groups that I have either founded or joined in order to help those around me.</p>
            +  <h1>Current Groups</h1>     
            +  <hr class = "h-line">     
            +  <ul>
            +    <li><i>Political Science Methodology Group</i> | Co-organizer with Melina Much <br/>
            +    University of California, Irvine </li><br/>
            +    <li><i>Political Science Womxn's Caucus</i> | Student leader <br/>
            +    University of California, Irvine </li><br/>
            +    <li><i>Political Science Workshop Coordinator</i> <br/>
            +    University of California, Irvine </li><br/>
            +    <li><i>Legal Politics Writing Workshop</i><br/>
            +    University of California, Irvine </li><br/>
            +    <li><i>Center for Democracy: Writing Workshop</i> | Member <br/>
            +    University of California, Irvine</li><br/>
            +    <li><i>UCI Humanities: Writing Workshop</i> | Member <br/>
            +    University of California, Irvine</li><br/> 
            +  </ul>
            +  <h1>Previous Groups</h1>
            +  <hr class = "h-line">
            +  <ul>
            +    <li><i>Friends of the San Dimas Dog Park</i> | Ambassador <br/>
            +    San Dimas, California </li><br/>
            +    <li><i>Prisoner Education Project</i> | Volunteer <br/>
            +    Pomona, California</li><br/>
            +    <li><i>Tails of the City</i> | Volunteer Photographer <br/>
            +    Los Angeles, California</li><br/>
            +    <li><i>Philosophy Club</i> | President, Graphic Designer, and Banquet Chair <br/>
            +    California State Polytechnic University, Pomona</li><br/>
            +    <li><i><a href = "https://www.voteamerica.com/">Long Distance Voter</a></i> | Intern <br/>
            +    Social Media Content Creator</li><br/>
            +    <li><i><a href = "https://www.freepress.net/">Free Press</a></i> | Intern <br/>
            +    Social Media Content Creator</li>
            +  </ul>
            +<!-- </div>
            +
            +
            +
            +

            </div> + –>

            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ae/aac6de25a3018957bfd207974447f44e3a40a693e6ae6ee571cc91f9458d20 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ae/aac6de25a3018957bfd207974447f44e3a40a693e6ae6ee571cc91f9458d20 deleted file mode 100644 index 118a437..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ae/aac6de25a3018957bfd207974447f44e3a40a693e6ae6ee571cc91f9458d20 +++ /dev/null @@ -1,290 +0,0 @@ -I"ˇK

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/b1/c273246ffff1cc2367ff0905edc798542314fcf0c6eb6c1d05695c4328ee51 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/b1/c273246ffff1cc2367ff0905edc798542314fcf0c6eb6c1d05695c4328ee51 new file mode 100644 index 0000000..8426c9d --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/b1/c273246ffff1cc2367ff0905edc798542314fcf0c6eb6c1d05695c4328ee51 @@ -0,0 +1,22 @@ +I"ń

            welcome

            +
                Here you'll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from.
            +
            +    I have done my best to keep my views out of it.
            +
            +    I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden. 
            +
            +    If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page]  <br><br>
            +    <h3>helpful resources:</h3>
            +    <ol><li>Register to vote (Deadline May 23): <a href="https://registertovote.ca.gov/">https://registertovote.ca.gov/</a> </li>
            +    <li>Check your registration status: <a href="https://voterstatus.sos.ca.gov/">https://voterstatus.sos.ca.gov/</a></li>
            +    <li>Access the official voter guide: <a href="https://voterguide.sos.ca.gov/">https://voterguide.sos.ca.gov/</a> </li>
            +    <li>Early voting & ballot drop off locations: <a href="https://caearlyvoting.sos.ca.gov/">https://caearlyvoting.sos.ca.gov/</a></li>
            +    <li>Track your ballot: <a href="https://california.ballottrax.net/voter/">https://california.ballottrax.net/voter/</a></li>
            +    <li>If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.</li></ol>
            +
            + +

            ballot guides

            + +
                <a href="https://github.com/liz-muehlmann/Election_Guides/raw/main/California/Primary%20Elections/National%20and%20State/2022%20-%20Primary%20-%20California.pdf" download="2022_Primary_CA.pdf">2022 Primary California</a> s
            +
            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/b2/5fcbf3a338730435edc59a4a2399d09dcb2916ce0fffba1986576f57ec7c8f b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/b2/5fcbf3a338730435edc59a4a2399d09dcb2916ce0fffba1986576f57ec7c8f deleted file mode 100644 index 21e7aba..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/b2/5fcbf3a338730435edc59a4a2399d09dcb2916ce0fffba1986576f57ec7c8f +++ /dev/null @@ -1,261 +0,0 @@ -I")E

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/b5/5111852e32e243664db5e600e86a8b2fed27fdc7aa672860341969c1e934bd b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/b5/5111852e32e243664db5e600e86a8b2fed27fdc7aa672860341969c1e934bd deleted file mode 100644 index f17ed4c..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/b5/5111852e32e243664db5e600e86a8b2fed27fdc7aa672860341969c1e934bd +++ /dev/null @@ -1,622 +0,0 @@ -I"]é

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            -
            -
            "chaos ensued"
            - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “hello world!”**

            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is. -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/bd/b108b4a0139d548f122f9653a7abd4f85cddddf0a2a4e66f29ee0be02d2062 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/bd/b108b4a0139d548f122f9653a7abd4f85cddddf0a2a4e66f29ee0be02d2062 new file mode 100644 index 0000000..1613ccb --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/bd/b108b4a0139d548f122f9653a7abd4f85cddddf0a2a4e66f29ee0be02d2062 @@ -0,0 +1,33 @@ +I"!
            +
            +
            +
            + + +
            + Here you'll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from. + + I have done my best to keep my views out of it. + + I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden. + + If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page] +

            + Helpful Resources: +
            1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
            2. +
            3. Check your registration status: https://voterstatus.sos.ca.gov/
            4. +
            5. Access the official voter guide: https://voterguide.sos.ca.gov/
            6. +
            7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
            8. +
            9. Track your ballot: https://california.ballottrax.net/voter/
            10. +
            11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
            + +<a href="> + +
            + +
            +
            + +
            +
            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/71/d52ac8bd1a40f67c0ed29dd196965c4ab9a501a6c349feb440d694c3703a56 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c2/10c995bd811a36efc25dd141167d46a81b3655bd1c767134f34eafff1f0af4 similarity index 88% rename from .jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/71/d52ac8bd1a40f67c0ed29dd196965c4ab9a501a6c349feb440d694c3703a56 rename to .jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c2/10c995bd811a36efc25dd141167d46a81b3655bd1c767134f34eafff1f0af4 index f228ad6..cf3694c 100644 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/71/d52ac8bd1a40f67c0ed29dd196965c4ab9a501a6c349feb440d694c3703a56 +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c2/10c995bd811a36efc25dd141167d46a81b3655bd1c767134f34eafff1f0af4 @@ -1,4 +1,4 @@ -I"ě%
            +I"c*
            @@ -7,6 +7,23 @@ +

            cartography in R part five

            +
            +

            Welcome to part five of my cartography in R series. In this post I’ll return to the maps created in part II and part III to include a Shiny information box and popups linking to posts about my adventures in the National Parks.

            + + + + + + | + + + +read more +
            + + +

            cartography in R part four


            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c2/96502ccbab1bca4bf4cf08a9653f7d70e029cf9cbc9e33fb1366f539bfe950 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c2/96502ccbab1bca4bf4cf08a9653f7d70e029cf9cbc9e33fb1366f539bfe950 deleted file mode 100644 index 22dfcbd..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c2/96502ccbab1bca4bf4cf08a9653f7d70e029cf9cbc9e33fb1366f539bfe950 +++ /dev/null @@ -1,645 +0,0 @@ -I"‘ń

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -6
            -
                split_states <- split(state_parks, f = state_parks$State_Nm) # split the data by state
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp')) 
            -    }
            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c2/9bc4de57ad75b2070f251cb01a979da6fbcc141831e1e27626e2fd61a05ed7 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c2/9bc4de57ad75b2070f251cb01a979da6fbcc141831e1e27626e2fd61a05ed7 deleted file mode 100644 index 84bca2b..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c2/9bc4de57ad75b2070f251cb01a979da6fbcc141831e1e27626e2fd61a05ed7 +++ /dev/null @@ -1,643 +0,0 @@ -I"[ń

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm) # split the data by state
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c3/978d62fb8a5850d62bbca92a9aa15f9534dcb8c1742b7f212a828ba6bc46e6 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c3/978d62fb8a5850d62bbca92a9aa15f9534dcb8c1742b7f212a828ba6bc46e6 deleted file mode 100644 index 1240461..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c3/978d62fb8a5850d62bbca92a9aa15f9534dcb8c1742b7f212a828ba6bc46e6 +++ /dev/null @@ -1,347 +0,0 @@ -I"%c

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. pre-process data in R

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -        filter(State_Nm %!in% territories)  %>% 
            -        filter(Own_Type == "State")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
                territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
                state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • line 5
            • -
            - -
                filter(State_Nm %!in% territories & Own_Type == "State")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call.

            - -

            Common filter operators include & (and), | (or), < (less than), or </code>></code> (greater than).

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c3/de00c416733c2ca42fb3f9cd109569d9ec7ed9d9d5e1c91103b119486709ad b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c3/de00c416733c2ca42fb3f9cd109569d9ec7ed9d9d5e1c91103b119486709ad deleted file mode 100644 index 36e3211..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c3/de00c416733c2ca42fb3f9cd109569d9ec7ed9d9d5e1c91103b119486709ad +++ /dev/null @@ -1,259 +0,0 @@ -I"%E

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management and the Bureau of Reclamation. Having visited the park, I can tell you there’s no fences blocking these areas off. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area. It will be a good test case to make sure I’m selecting the correct data.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c4/6bed8a13a9c81f479eb4ef15428eb80d902b19ffe2ebc564cccfca34712f6f b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c4/6bed8a13a9c81f479eb4ef15428eb80d902b19ffe2ebc564cccfca34712f6f deleted file mode 100644 index 6ca5aa5..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c4/6bed8a13a9c81f479eb4ef15428eb80d902b19ffe2ebc564cccfca34712f6f +++ /dev/null @@ -1,617 +0,0 @@ -I"úć

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            -
            -
            "chaos ensued"
            - -
            - -

            I tried to map the US Map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save it, and in part VI of this never-ending series* I’ll create individual state maps and link them to the map of National Parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c8/0cc3a134f4680bf2a260397c76eab4663c3b8059e8443254c3186567d18289 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c8/0cc3a134f4680bf2a260397c76eab4663c3b8059e8443254c3186567d18289 deleted file mode 100644 index d613beb..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c8/0cc3a134f4680bf2a260397c76eab4663c3b8059e8443254c3186567d18289 +++ /dev/null @@ -1,699 +0,0 @@ -I"Ů

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm)
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            Look ma, new code!

            - -
              -
            • line 1
            • -
            - -
            1    split_states <- split(state_parks, f = state_parks$State_Nm) 
            - -

            The split() is part of base R. It takes quite a few arguments, most of which are optional.

            - -

            The first argument is the vector (or data frame) that you want to split into different groups. I want to split the state_parks data into its corresponding states, so it is listed first.

            - -

            The second argument f = is how you want the data split. f in this instance stands for factor. If we run levels(as.factor(state_parks$State_Nm)) in the terminal, it will return a list of the 50 state abbreviations. That is what we’re telling R to do here.

            - -

            You can access an individual state using the $ operator. split_states$CA will return the state park data for California.

            - -
            - -
              -
            • line 2
            • -
            - -
            2    all_names <- names(split_states)
            - -

            names is also part of base R. It does what it sounds like - it gets the names of an object. Here, I want to get the names of each split data sets.

            - -
            - -
              -
            • lines 4-5
            • -
            - -
            4    for(name in all_names){            
            -5        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            - -

            Here’s the actual for loop.

            - -

            The basic logic of a for loop is:
            - for(x in y){
            - do something}

            - -

            Inside the parenthesis is the condition that must evaluate to TRUE if the content in the curly braces is to run.

            - -

            In line 4, for(name in all_names){ says as long as there’s a name in the list of all names, do whatever is inside the curly braces. name can be whatever you want. It’s a placeholder value. I can have it say for(dogs in all_names){ it will still do the exact same thing. A lot of time you’ll see it as an i for item. I like to use more descriptive language because, again, for loops are my Achilles’ heel.

            - -

            The all_names part is where ever you want R to look for the data. It will change based on your data set and variable naming conventions.

            - -

            In line 5, I save the split data sets.

            - -

            st_write() is part of the sf package which allows us to create shapefiles. This can be any saving function (eg. write_csv() if you want to save CSVs). The function takes several arguments. In line 43 above I showed the basic structure: st_write(data, path/to/file.shp). This is good if you only have one file, but since I’m saving them in a loop I don’t want all of the files to have the same name. R will error out after the first and tell you the file already exists.

            - -

            The first part split_states[[name]] is still telling R what data to save, but using an index instead of a specific data frame name. To access an index you use data[[some-value]]

            - -

            8. conclusion

            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me an email or a tweet if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c9/41114281a8a69f4eb960c2c4d486d4b2e6e72105b0fae9f58bfaa43bc7ac77 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c9/41114281a8a69f4eb960c2c4d486d4b2e6e72105b0fae9f58bfaa43bc7ac77 deleted file mode 100644 index edaa42c..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/c9/41114281a8a69f4eb960c2c4d486d4b2e6e72105b0fae9f58bfaa43bc7ac77 +++ /dev/null @@ -1,355 +0,0 @@ -I"«e

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. pre-process data in R

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -        filter(State_Nm %!in% territories)  %>% 
            -        filter(Own_Type == "State")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
                territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
                state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • line 5
            • -
            - -
                filter(State_Nm %!in% territories & Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/cd/5c4213badccd372ba4332e95112b9279ad1fd8f181545ac1bbdb8b6e89edd0 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/cd/5c4213badccd372ba4332e95112b9279ad1fd8f181545ac1bbdb8b6e89edd0 deleted file mode 100644 index c87c55d..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/cd/5c4213badccd372ba4332e95112b9279ad1fd8f181545ac1bbdb8b6e89edd0 +++ /dev/null @@ -1,248 +0,0 @@ -I"C

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -

            The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management and the Bureau of Reclamation. Having visited the park, I can tell you there’s no fences blocking these areas off. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area. It will be a good test case to make sure I’m selecting the correct data.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ce/79712b1ddac46abbccafc0439b2dee5df051ecc72c5e7c907e3a4e96fb728c b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ce/79712b1ddac46abbccafc0439b2dee5df051ecc72c5e7c907e3a4e96fb728c new file mode 100644 index 0000000..c022024 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ce/79712b1ddac46abbccafc0439b2dee5df051ecc72c5e7c907e3a4e96fb728c @@ -0,0 +1,22 @@ +I"ń

            welcome

            +
                Here you'll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from.
            +
            +    I have done my best to keep my views out of it.
            +
            +    I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden. 
            +
            +    If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page]  <br><br>
            +    <h3>helpful resources:</h3>
            +    <ol><li>Register to vote (Deadline May 23): <a href="https://registertovote.ca.gov/">https://registertovote.ca.gov/</a> </li>
            +    <li>Check your registration status: <a href="https://voterstatus.sos.ca.gov/">https://voterstatus.sos.ca.gov/</a></li>
            +    <li>Access the official voter guide: <a href="https://voterguide.sos.ca.gov/">https://voterguide.sos.ca.gov/</a> </li>
            +    <li>Early voting & ballot drop off locations: <a href="https://caearlyvoting.sos.ca.gov/">https://caearlyvoting.sos.ca.gov/</a></li>
            +    <li>Track your ballot: <a href="https://california.ballottrax.net/voter/">https://california.ballottrax.net/voter/</a></li>
            +    <li>If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.</li></ol>
            +
            + +

            ballot guides

            + +
                <a href="https://github.com/liz-muehlmann/Election_Guides/raw/main/California/Primary%20Elections/National%20and%20State/2022%20-%20Primary%20-%20California.pdf" download="2022_Primary_CA.pdf">2022 Primary California</a> s
            +
            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/cf/bc2b418d2b3ae055ddad3b7996d1e01b9215faca2fecddfbab67c88d754f2d b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/cf/bc2b418d2b3ae055ddad3b7996d1e01b9215faca2fecddfbab67c88d754f2d deleted file mode 100644 index 6abf3dd..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/cf/bc2b418d2b3ae055ddad3b7996d1e01b9215faca2fecddfbab67c88d754f2d +++ /dev/null @@ -1,461 +0,0 @@ -I"$’

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d0/7a5eebefd0c47d098e6d5eb13de0e9f29b6d52ed945b64db999cf8c43c39de b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d0/7a5eebefd0c47d098e6d5eb13de0e9f29b6d52ed945b64db999cf8c43c39de deleted file mode 100644 index 8b77f41..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d0/7a5eebefd0c47d098e6d5eb13de0e9f29b6d52ed945b64db999cf8c43c39de +++ /dev/null @@ -1,562 +0,0 @@ -I"ŹĚ

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d0/a6f77e1daa791ba9d7a5532cf976e5125033df09a5d9fbd4a6590fa7bbff30 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d0/a6f77e1daa791ba9d7a5532cf976e5125033df09a5d9fbd4a6590fa7bbff30 new file mode 100644 index 0000000..e01cc83 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d0/a6f77e1daa791ba9d7a5532cf976e5125033df09a5d9fbd4a6590fa7bbff30 @@ -0,0 +1,46 @@ +I"w

            welcome

            +
            + +

            Here you’ll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from.

            + +

            I have done my best to keep my views out of it.

            + +

            I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden.

            + +

            If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page]

            + +

            helpful resources:

            +
            + +
              +
            1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
            2. +
            3. Check your registration status: https://voterstatus.sos.ca.gov/
            4. +
            5. Access the official voter guide: https://voterguide.sos.ca.gov/
            6. +
            7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
            8. +
            9. Track your ballot: https://california.ballottrax.net/voter/
            10. +
            11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
            12. +
            + +

            ballot guides

            +
            + + + + + + + + + + + + + + + + + + +
            Primary ElectionsGeneral Elections
            2022 Primary CaliforniaTitle
            ParagraphText
            + +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d1/0ebcafb4fdee64325a112bf977924139773b8a605e9b6b39b09fa25f1daaa1 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d1/0ebcafb4fdee64325a112bf977924139773b8a605e9b6b39b09fa25f1daaa1 deleted file mode 100644 index 4f1d754..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d1/0ebcafb4fdee64325a112bf977924139773b8a605e9b6b39b09fa25f1daaa1 +++ /dev/null @@ -1,706 +0,0 @@ -I" 

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm)
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            Look ma, new code!

            - -
              -
            • line 1
            • -
            - -
            1    split_states <- split(state_parks, f = state_parks$State_Nm) 
            - -

            The split() is part of base R. It takes quite a few arguments, most of which are optional.

            - -

            The first argument is the vector (or data frame) that you want to split into different groups. I want to split the state_parks data into its corresponding states, so it is listed first.

            - -

            The second argument f = is how you want the data split. f in this instance stands for factor. If we run levels(as.factor(state_parks$State_Nm)) in the terminal, it will return a list of the 50 state abbreviations. That is what we’re telling R to do here.

            - -

            You can access an individual state using the $ operator. split_states$CA will return the state park data for California.

            - -
            - -
              -
            • line 2
            • -
            - -
            2    all_names <- names(split_states)
            - -

            names is also part of base R. It does what it sounds like - it gets the names of an object. Here, I want to get the names of each split data sets.

            - -
            - -
              -
            • lines 4-5
            • -
            - -
            4    for(name in all_names){            
            -5        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            - -

            Here’s the actual for loop.

            - -

            The basic logic of a for loop is:
            - for(x in y){
            - do something}

            - -

            Inside the parenthesis is the condition that must evaluate to TRUE if the content in the curly braces is to run.

            - -

            In line 4, for(name in all_names){ says as long as there’s a name in the list of all names, do whatever is inside the curly braces. name can be whatever you want. It’s a placeholder value. I can have it say for(dogs in all_names){ it will still do the exact same thing. A lot of time you’ll see it as an i for item. I like to use more descriptive language because, again, for loops are my Achilles’ heel.

            - -

            The all_names part is where ever you want R to look for the data. It will change based on your data set and variable naming conventions.

            - -

            In line 5, I save the split data sets.

            - -

            st_write() is part of the sf package which allows us to create shapefiles. This can be any saving function (eg. write_csv() if you want to save CSVs). The function takes several arguments. In line 43 above I showed the basic structure: st_write(data, path/to/file.shp). This is good if you only have one file, but since I’m saving them in a loop I don’t want all of the files to have the same name. R will error out after the first and tell you the file already exists.

            - -

            The first part split_states[[name]] is still telling R what data to save, but using an index instead of a specific data frame name. To access an index you use data[[some-value]] where some-value is the index location. In my code, R will take the split_states data and go alright the first index location in [[name]] is 1 and return whatever value is stored in that index (here, AK). It will then do that for every index location as it loops through the split_states data.

            - -

            paste0() is also part of base R - it’s apparently faster than paste(). It concatenates (or links together) different pieces into one. I’m using it to create the filename. Within the paste0 call anything within quotation marks is static. So every file will be saved to "shapefiles/shifted/states/individual/" and every file will have the extension .shp. What will change with each loop is the name of the file. One by one, R will loop through and save each file using the name it pulled from all_names.

            - -

            st_write() automatically creates the other three files that each “shapefile” needs. When the loop is done, you should have a folder of 200 files (50 states * 4 files each). Which is why I strongly recommend using DVC if you’re doing any kind of version control.

            - -

            8. conclusion

            -
            - -

            That’s all the processing done for the state files… for now. In part VI I’ll return to the states to create each state’s own map. Next up, in part V, I’m going back to my base map with the National Parks to add in some informational tool tips and interactivity.

            - -




            -* I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me an email or a tweet if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d1/3e8b109cac8303e102822ccba5a0a1d8abd69c308d0c7dbdc7415ff59fd712 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d1/3e8b109cac8303e102822ccba5a0a1d8abd69c308d0c7dbdc7415ff59fd712 deleted file mode 100644 index 2d97605..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d1/3e8b109cac8303e102822ccba5a0a1d8abd69c308d0c7dbdc7415ff59fd712 +++ /dev/null @@ -1,314 +0,0 @@ -I"AV

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. pre-process data in R

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -
                state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -        filter(State_Nm %!in% territories)  %>% 
            -        filter(Own_Type == "State")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 1 -Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()
            • -
            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d3/4c1d0b502a9ca370fde8a1fe675859efd13f4ffe97a50c6c762b44a9ed0139 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d3/4c1d0b502a9ca370fde8a1fe675859efd13f4ffe97a50c6c762b44a9ed0139 deleted file mode 100644 index e847556..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d3/4c1d0b502a9ca370fde8a1fe675859efd13f4ffe97a50c6c762b44a9ed0139 +++ /dev/null @@ -1,290 +0,0 @@ -I"ŻK

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: PAD-US Viewer Key - - -
            PAD-US Viewer Key
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d6/a1cb20e5bbdb0b76ecbeb575e9149cc4d888ee5a7816310179966208de5ab8 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d6/a1cb20e5bbdb0b76ecbeb575e9149cc4d888ee5a7816310179966208de5ab8 new file mode 100644 index 0000000..6383c37 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d6/a1cb20e5bbdb0b76ecbeb575e9149cc4d888ee5a7816310179966208de5ab8 @@ -0,0 +1,180 @@ +I"Ë3

            Welcome to part five of my cartography in R series. In this post I’ll return to the maps created in part II and part III to include a Shiny information box and popups linking to posts about my adventures in the National Parks.

            + + + +

            project outline

            +
            + +

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            + +

            I. cartography in r part one

            +
              +
            • install packages
            • +
            • download usa shapefile
            • +
            • shift alaska and hawaii
            • +
            • save shapefile
            • +
            + +

            II. cartography in r part two

            +
              +
            • create usa base map
            • +
            + +

            III. cartography in r part three

            +
              +
            • download national park data
            • +
            • process national park data
            • +
            • shift alaska and hawaii national parks
            • +
            • save shapefile
            • +
            • add national parks to map
            • +
            + +

            IV. cartography in r part four

            +
              +
            • download state park data
            • +
            • process state park data
            • +
            • shift alaska and hawaii state parks
            • +
            • separate state park data
            • +
            • save state park data
            • +
            + +

            V. cartography in r part five

            +
              +
            • add in shiny functionality
            • +
            • add markers to visited parks
            • +
            • save and embed map
            • +
            + +

            VI. cartography in r part six

            +
              +
            • create individual state maps
            • +
            • add in shiny functionality
            • +
            • add markers to visited parks
            • +
            • save and embed maps
            • +
            + +

            1. load libraries

            +
            + +

            Create a new file called app.r which we’ll use to build in the Shiny functionality. Keep in mind Shiny requires the filename to be app.

            + +
            + Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            + +
            1
            +2
            +3
            +4
            +5
            +6
            +
                library(tidyverse)      # useful data manipulation tools
            +    library(sf)             # read and write shapefiles
            +    library(tigris)         # downloading shapefiles for Method 1
            +    library(leaflet)        # map creation
            +    library(operator.tools) # not-in function
            +    library(shiny)          # interactivity
            +
            + +

            I am not going to explain in detail what the packages in lines 1-5 do because I already covered it in part one.

            + +
              +
            • Shiny is an R-Studio package, which let’s “users interact with your data and your analysis” (from the Shiny website). The package let’s you create interactive web apps in R. The web apps can be hosted as stand-alone pages or can be embedded into R. And, according to the website, they can be modified and extended with CSS, HTML, and JavaScript.
            • +
            + +

            In a previous map I made I used labels to create a pop up which contained information about the number of newspapers in each county. In that map, I was only interested in showing the population and number of newspapers.

            + +
            + + + Image: Map showing information pop ups + + +
            Map showing information pop ups
            +
            + +

            In this map I want to display more information but since National and State Parks are close together using pop ups created a lot of overlap and quickly became unreadable. I want to move a lot of the basic information about park name and its size to a box in the corner and only use the pop ups to display a small photo of the park that leads to a blog about my adventure in the park.

            + +

            2. load data

            +
            + +
            1
            +2
            +3
            +
                ## load data
            +    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            +    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            +
            + +

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            + +

            3. the complete code

            +
            + +

            The customization aspects of the map - using special colors, adding in Shiny functionality, etc - are all declared before the map widget call. That creates some difficulty in how to best present the code.

            + +

            First, I’ll give all the code then cover what’s different in each section. If you’re using the code we created in part III be mindful of where the new lines appear so you can avoid any errors.

            + +

            4. define national park colors

            +
            + +

            I like to use different colors on my maps to indicate different things. In the map above, the warmer colors indicate less newspapers and the cooler colors indicate more newspapers with green being areas with the most newspapers.

            + +

            In this map, I want to use colors that reflect the land’s designation types. I want the rivers to show up blue, the parks to be green, and other areas to be colored as I dictate.

            + +
            1
            +2
            +3
            +4
            +5
            +6
            +7
            +8
            +
                nps_color <- colorFactor(c("#B2AC88", # national historical
            +                            "#F9F2CB", # international historical       
            +                            "#99941A", # military
            +                            "#006C5F", # park
            +                            "#568762", # preserves and rec areas
            +                            "#31B8E6", # lakes and rivers
            +                            "#899B7C", # trails
            +                            "#AFAC99"), nps$type) # other
            +
            + +

            Line 1 creates a variable which I’ll use later in the leaflet call. It makes the leaflet call cleaner and less cluttered, though I imagine you could declare the colors directly in the addPolygons() call.

            + +

            colorFactor() is part of the Leaflet package. It assigns colors to factors (categories) - here the factors are the types of National Public Lands. It takes several arguments which you can read about on the R website.

            + +

            The first argument is the palette. This can be one of the palettes built into R Color Brewer or like here (the hex codes in lines 1-8), one that the user specifies.

            + +
            + +Note: + A helpful tool specifically for picking map colors Color Brewer 2. I also use mycolor.space to create palettes that I like. +
            + +

            nps$type is the domain of the data. It’s the categories that R will map the colors to. colorFactor requires categorical data.

            + +

            If you’re creating your own palette, the order you list the colors has to match the order of the domain. To easily check what order the domain is in run levels(as.factor(nps$type)) in the terminal. This will return a list of values which you can use to map the colors.

            + +

            I include a comment of which colors will be mapped to which category so that I can easily change them if necessary.

            + +
            + + + Image: List of National Land Categories + + +
            List of National Land Categories
            +
            + +

            5. create information box

            +
            + +

            Shiny apps have three components:

            + +
              +
            1. A User-Interface app (ui) which controls the “layout and appearance” of the app.
            2. +
            3. A server function which tells the computer how to create the app.
            4. +
            5. A call to the shinyApp function creates the Shiny objects using the first two components.
            6. +
            + +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d8/aab162ecfe381cd6a09438874b13740cda0577ffacacdb38baebffd7e97d6f b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d8/aab162ecfe381cd6a09438874b13740cda0577ffacacdb38baebffd7e97d6f deleted file mode 100644 index 2f8a373..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d8/aab162ecfe381cd6a09438874b13740cda0577ffacacdb38baebffd7e97d6f +++ /dev/null @@ -1,403 +0,0 @@ -I"‘s

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW")  
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines
            • -
            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d8/aec8c8b8702c007ec78410000efc55f1adf56acec4b5b1e037f029e90c3f0d b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d8/aec8c8b8702c007ec78410000efc55f1adf56acec4b5b1e037f029e90c3f0d deleted file mode 100644 index 4f674a5..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/d8/aec8c8b8702c007ec78410000efc55f1adf56acec4b5b1e037f029e90c3f0d +++ /dev/null @@ -1,211 +0,0 @@ -I";

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -

            The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management and the Bureau of Reclamation. Having visited the park, I can tell you there’s no fences blocking these areas off. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area. It will be a good test case to make sure I’m selecting the correct data.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/da/3507f7fe69d048357c9cffcb9f966b5dc4e0c7f2e30eafc8361ceaa3f1c1f6 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/da/3507f7fe69d048357c9cffcb9f966b5dc4e0c7f2e30eafc8361ceaa3f1c1f6 deleted file mode 100644 index e060b5c..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/da/3507f7fe69d048357c9cffcb9f966b5dc4e0c7f2e30eafc8361ceaa3f1c1f6 +++ /dev/null @@ -1,303 +0,0 @@ -I"©R

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. pre-process data in R

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -
                state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories)  %>% 
            -    filter(Own_Type == "State")
            -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/da/f40e516cb416846057e11d5e40954fe2212dd0b050480d95129fa11bd42ea0 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/da/f40e516cb416846057e11d5e40954fe2212dd0b050480d95129fa11bd42ea0 deleted file mode 100644 index 69cf429..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/da/f40e516cb416846057e11d5e40954fe2212dd0b050480d95129fa11bd42ea0 +++ /dev/null @@ -1,656 +0,0 @@ -I"ô

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            - - -
            - - - Image: chaos ensues - - -
            chaos ensues
            -
            - - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -
            1
            -2
            -3
            -4
            -5
            -
                split_states <- split(state_parks, f = state_parks$State_Nm)
            -    all_names <- names(split_states)   
            -
            -    for(name in all_names){            
            -        st_write(split_states[[name]], paste0("shapefiles/shifted/states/individual/", name, '.shp'))}
            -
            - -

            Look ma, new code!

            - -
              -
            • line 1
            • -
            - -
            1    split_states <- split(state_parks, f = state_parks$State_Nm) 
            - -

            The split() is part of base R.

            - -

            8. conclusion

            -
            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/db/18e34460ea061f3a8c9937b8b7624634e298cc79ef6f615e23e3e222a5e026 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/db/18e34460ea061f3a8c9937b8b7624634e298cc79ef6f615e23e3e222a5e026 deleted file mode 100644 index 2f8a06c..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/db/18e34460ea061f3a8c9937b8b7624634e298cc79ef6f615e23e3e222a5e026 +++ /dev/null @@ -1,597 +0,0 @@ -I"·ĺ

            This is part three of my cartography in R series. If you are just finding this, I suggest taking a look at part I and part II first.

            - -

            In this post, I will download and process the National Park data. Once that’s done, I’ll add it to the base map I created in part II.

            - - - -
            - Note: I know that my posts are verbose. I am sorry that I am not more concise but one of the biggest problems I have with code documentation is that it assumes you already know what you're doing. I'm writing these for absolute beginners that want to know what each part of the code does. If you want more concise explanations, I suggest looking at the annotated r files in the project's GitHub. -
            - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load usa data

            -
            - -
            1
            -2
            -
                ## load data
            -    states <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp to reflect wherever you saved the shifted shapefile.

            - -

            If your data processing and base map creation are in the same file, you can skip this line, and when you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download national park data

            -
            - -

            The National Park Service provides all the data we’ll need to make the map. The data is accessible on the ArcGIS’ Open Data website. Once you click on the link you’ll see a bunch of icons that lead to different data that’s available for download. Click on the one for boundaries.

            - -
            - - - Image: NPS GIS website - - -
            NPS GIS website
            -
            - -

            From here, you’ll be taken to a list of available National Park data. The second link should be nps boundary which contains the shape data for all the National Parks in the United States. The file contains all the data for the park outlines along with hiking trails, rest areas, and lots of other data.

            - -
            - - - Image: NPS download page - - -
            NPS download page
            -
            - -

            The nps boundary link will take you to a map showing the national parks. On the left, there will be a download link on the left.

            - -
            - - - Image: NPS map and download link - - -
            NPS map and download link
            -
            - -

            From here, you’ll have a few download options. The National Park Service provides the data in different formats including CSV and Shapefile. You’ll want to download the shapefile version.

            - -
            - - - Image: download nps shapefile - - -
            download nps shapefile
            -
            - -

            Be sure to save the file somewhere on your hard drive that is easy to find. When it finishes downloading, be sure to unzip the file. There will be four files inside the folder. All of them need to be kept in the same location. Even though we’ll only load the .shp file, R uses the three others to create the necessary shapes.

            - -

            4. process national park data

            -
            - -

            The code below may look intimidating, but it’s fairly straight forward. I’ll go over each line below.

            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    ## load and process nps data
            -    nps <- read_sf("./shapefiles/original/nps/NPS_-_Land_Resources_Division_Boundary_and_Tract_Data_Service.shp")  %>% 
            -    select(STATE, UNIT_TYPE, PARKNAME, Shape__Are, geometry) %>% 
            -    filter(STATE %!in% territories) %>%  
            -    mutate(type = case_when(UNIT_TYPE == "International Historic Site" ~ "International Historic Site", # there's 23 types of national park, I wanted to reduce this number.
            -                UNIT_TYPE == "National Battlefield Site" ~ "National Military or Battlefield", # lines 56-77 reduce the number of park types
            -                UNIT_TYPE == "National Military Park" ~ "National Military or Battlefield", 
            -                UNIT_TYPE == "National Battlefield" ~ "National Military or Battlefield",
            -                UNIT_TYPE == "National Historical Park" ~ "National Historical Park, Site, Monument, or Memorial",
            -                UNIT_TYPE == "National Historic Site" ~ "National Historical Park, Site, Monument, or Memorial",
            -                UNIT_TYPE == "National Historic Trail" ~ "National Historical Park, Site, Monument, or Memorial",
            -                UNIT_TYPE == "National Memorial" ~ "National Historical Park, Site, Monument, or Memorial",
            -                UNIT_TYPE == "National Monument" ~ "National Historical Park, Site, Monument, or Memorial",
            -                UNIT_TYPE == "National Preserve" ~ "National Preserve, Reserve, or Recreation Area",
            -                UNIT_TYPE == "National Reserve" ~ "National Preserve, Reserve, or Recreation Area",
            -                UNIT_TYPE == "National Recreation Area" ~ "National Preserve, Reserve, or Recreation Area",
            -                UNIT_TYPE == "National River" ~ "National River, Lakeshore, or Seashore",
            -                UNIT_TYPE == "National Lakeshore" ~ "National River, Lakeshore, or Seashore",
            -                UNIT_TYPE == "National Wild & Scenic River" ~ "National River, Lakeshore, or Seashore",
            -                UNIT_TYPE == "National Seashore" ~ "National River, Lakeshore, or Seashore",
            -                UNIT_TYPE == "National Trails Syste" ~ "National Trail",
            -                UNIT_TYPE == "National Scenic Trail" ~ "National Trail",
            -                UNIT_TYPE == "National Park" ~ "National Park or Parkway",
            -                UNIT_TYPE == "Park" ~ "National Park or Parkway",
            -                UNIT_TYPE == "Parkway" ~ "National Park or Parkway",
            -                UNIT_TYPE == "Other Designation" ~ "Other National Land Area"))  %>%  
            -    mutate(visited = case_when(PARKNAME == "Joshua Tree" ~ "visited", 
            -                                PARKNAME == "Redwood" ~ "visited", 
            -                                PARKNAME == "Santa Monica Mountains" ~ "visited", 
            -                                PARKNAME == "Sequoia" ~ "visited", 
            -                                PARKNAME == "Kings Canyon" ~ "visited",
            -                                PARKNAME == "Lewis and Clark" ~ "visited",
            -                                PARKNAME == "Mount Rainier" ~ "visited",
            -                                TRUE ~ "not visited")) %>%  
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save the shifted national park data
            -    st_write(nps, "~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            In part I of this series I talked about how R has an %in% function, but not a %!in% function. Here’s where the latter function shines.

            - -

            The United States is still an empire with its associated territories and islands. In this project I am interested in the 50 states - without these other areas. As a result, I need to filter them out. Using base R’s %in% function I would have to create a variable that contains the postal abbreviations for all 50 states. That is annoying. Instead, I want to use the shorter list that only includes the US’ associated islands and territories. To do so, however, I need to use the operator tools’ %!in% function.

            - -

            Line 2 creates the list of US territories that I filter out in line 7. The c() function in R means combine or concatenate. Inside the parenthesis are the five postal codes for the American Samoa, Guam, the Northern Mariana Islands, Puerto Rico, and the Virgin Islands.

            - -
            - -
              -
            • line 5
            • -
            - -
            5   nps <- read_sf("path/to/file/NPS_-_Land_Resources_Division_Boundary_and_Tract_Data_Service.shp")  %>% 
            - -

            nps <- read_sf("path/to/file.shp") loads the National Park data set to a variable called nps using the read_sf() function that is part of the sf package. You will need to change the file path so it reflects where you saved the data on your hard drive.

            - -

            The %>% operator is part of the tidyverse package. It tells R to go to the next line and process the next command. It has to go at the end of a line, rather than the beginning.

            - -
            - -
              -
            • line 6
            • -
            - -
            6    select(STATE, UNIT_TYPE, PARKNAME, Shape__Are, geometry) %>%
            - -

            select is part of the tidyverse package. With it, we can select columns by their name rather than their associated number. Large data sets take more computing power because the computer has to iterate over more rows. Unfortunately, rendering maps also takes a lot of computing power so I like to discard any unnecessary columns to reduce the amount of effort my computer has to exert.

            - -

            Deciding on which columns to keep will depend on the data you’re using and what you want to map (or analyze). I know for my project I want to include a few things:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type (park, battlefield, memorial, etc)
            • -
            • and the park’s size.
            • -
            - -

            There’s a couple ways to inspect the data to see what kind of information is available.

            - -
              -
            1. You can view the entire data set by using view(nps) but as the number of data points increases, so does R’s struggle with opening it. I’ve found that VSCode doesn’t throw as big of a fit as R Studio when opening large data sets.
            2. -
            3. Another way is to just look at the column names using data.frame(colnames(nps)). This will return a list of the data set’s column names. This is my preferred method. I then go to the documentation to see what each column contains. This isn’t fool-proof because it really depends on if the data has good documentation.
            4. -
            - -

            The National Park data includes a lot of information about who created the data and maintains the property. I’m not interested in this, so in line 6 I select the following columns:

            -
              -
            • STATE [the two character state abbreviation]
            • -
            • UNIT_TYPE [if it’s a national park, battlefield, etc]
            • -
            • PARKNAME [the name of the park]
            • -
            • Shape__Are [the size of the park]
            • -
            • geometry
            • -
            - -
            Note: The column name must be exact for select() to work. For example, in the NPS data, the park area is stored the Shape__Are column - two underscores and no a in area - and I have to use that exact format when selecting the column.
            - -

            The geometry column is specific to shapefiles and it includes the coordinates of the shape. It will be kept automatically - unless you use the st_drop_geometry() function. I like to specifically select so I remember it’s there.

            - -
            - -
              -
            • line 7
            • -
            - -
            7    filter(STATE %!in% territories) %>%  
            - -

            In line 7 I use the territories list I created in line 2 to filter out the United States’ associated areas. Since the nps data uses the two character state abbreviation, I have to use the two character abbreviation for the territories. Searching for “Guam,” for example, won’t work.

            - -

            filter() is part of the tidyverse and it uses conditional language. In the parentheses is a condition that must be true if the tidyverse is going to keep the row. Starting at the top of the data, R goes “alright, does the value in the STATE column match any of the values in the territories list?” If the condition is TRUE, R adds the row to the new data frame.

            - -
            Note: Since we're using the %!in% operator, any row that evaluates as TRUE will be kept because the value is NOT found in the territories list. If I wanted to keep only the territories, I would use the %in% operator and only the rows with STATE abbreviations found in the territories list would be kept.
            - -

            For example, if the STATE value in row 1 is CA, filter looks at it and goes “is CA NOT IN territories?” If that is TRUE, keep it because we want only the values that are NOT IN the territories list.

            - -
            - -
              -
            • lines 8-29
            • -
            - -
            8    mutate(type = case_when(UNIT_TYPE == "International Historic Site" ~ "International Historic Site",
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            The NPS data set has 23 different types of National Parks listed (you can view all of them by running levels(as.factor(nps$UNIT_TYPE))). I know that in later posts, I’m going to color code the land by type (blue for rivers, green for national parks, etc) so I wanted to reduce the number of colors I would have to use.

            - -

            mutate()’s first argument, type = creates a new column called type. R will populate the newly created column with whatever comes after the first (singular) equal = sign. For example, I can put type = NA and every row in the column will say NA.

            - -

            Here, I am using the case_when() function, which is also part of the tidyverse. The logic of case_when is fairly straight forward. The first value is the name of the column you want R to look in (here: UNIT_TYPE). Next, is a conditional. Here I am looking for an exact match (==) to the string (words) inside the first set of quotation marks (in line 8: "International Historic Site"). The last part of the argument is what I want R to put in the type column when it finds a row where the UNIT_TYPE is "International Historic Site".

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value")

            - -

            Lines 9-29 do the same thing for the other park types. You can reduce the parks however you want or use all 23 types. Just remember that the value before the tilde ~ has to match the values found in the data exactly. For example, in line 24 I change the NPS data’s National Trail Syste value to be National Trail. Whomever created the data set did not spell system correctly, so for R to match the value I also have to omit the last letter in system.

            - -
            - -
              -
            • lines 30-37
            • -
            - -
            30    mutate(visited = case_when(PARKNAME == "Joshua Tree" ~ "visited",
            - -

            Lines 30-37 use the same mutate() and case_when logic as above. Instead of reducing the number of park types, I use it to mark the different parks I have visited.

            - -

            Line 30 creates the new column, visited and uses case_when to look for the names of the parks that I’ve been to. If I have visited them, it adds visited to the column of the same name.

            - -

            The last line, TRUE ~ "not_visited)), acts as an else statement. For any park not listed above, it will put not visited in the visited column I created.

            - -

            This feels like a very brute-force method of tracking which parks I’ve visited, but I haven’t spend much time trying to find another way.

            - -
            - -
              -
            • line 38-39
            • -
            - -
            38    shift_geometry(preserve_area = FALSE, 
            -39                   position = "below") %>%
            - -

            In part I, when I made the base map, I moved Alaska and Hawaii so they were of similar size and closer to the continental USA. For the map to display the parks correctly, I have to shift them as well.

            - -

            I went over these two lines in part II, so I won’t go over them again here. If you want to read more about them, check out that post.

            - -
            - -
              -
            • line 40
            • -
            - -
            40  sf::st_transform('+proj=longlat +datum=WGS84')
            - -

            The last line uses the st_transform() function from the sf package to covert the data set from NAD83 to WGS84. Leaflet requires WGS84, so be sure to include this line at the end of your data manipulation.

            - -

            I covered the WGS84 ellipsoid in part I, if you want to read more about it.

            - -
            - -
              -
            • line 43
            • -
            - -
            43  st_write(nps, "~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            - -

            Strictly speaking, this line isn’t necessary. You can do all your data processing in the same file where you make your map, but I prefer to separate the steps into different files.

            - -

            As a result, I save the shifted data to my hard drive so it’s easier to load later. I usually have this line commented out (by placing # at the start of the line) after I save it the first time. I don’t want it to save every time I run the rest of the code.

            - -

            5. add national parks to base map

            -
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -
            ## create usa Base Map using leaflet()
            -    map <- leaflet() %>%
            -    addPolygons(data = states,
            -        smoothFactor = 0.2,
            -        fillColor = "#808080",
            -        fillOpacity = 0.5,
            -        stroke = TRUE,
            -        weight = 0.5,
            -        opacity = 0.5,
            -        color = "#808080",
            -        highlight = highlightOptions(
            -            weight = 0.5,
            -            color = "#000000",
            -            fillOpacity = 0.7,
            -            bringToFront = FALSE),
            -        group = "Base Map")  %>% 
            -    addPolygons(data = nps,
            -        smoothFactor = 0.2,                 
            -        fillColor = "#354f52",
            -        fillOpacity = 1,
            -        stroke = TRUE,
            -        weight = 1,     
            -        opacity = 0.5,                       
            -        color = "#354f52",             
            -        highlight = highlightOptions(
            -            weight = 3,
            -            color = "#fff",
            -            fillOpacity = 0.8,
            -            bringToFront = TRUE),
            -        group = "National Parks")  %>%
            -    addLayersControl(
            -        baseGroups = "Base Map",
            -        overlayGroups = "National Parks",
            -        options = layersControlOptions(collapsed = FALSE))
            -
            - -
              -
            • lines 2-16
            • -
            - -

            Lines 2-16 are identical to those in part II where I created the base map. I am not going to cover these sections in detail, because I covered it previously.

            - -
            - -
              -
            • line 17
            • -
            - -
            17    addPolygons(data = nps,
            - -

            To add the National Park data to the base map, we call addPolygons() again. The arguments are the same as before - color, opacity, outline style - just with different values. By changing those values, we can differentiate the base map from the national park data.

            - -

            Since we’re mapping the National Parks and not the states, we have to tell R where the data is located using data = nps.

            - -
            - -
              -
            • line 18
            • -
            - -
            18   smoothFactor = 0.2,
            - -

            smoothFactor() determines how detailed the park boundaries should be. The lower the number, the more detailed the shape. The higher the number, the smoother the parks will render. I usually match this to whatever I set for the base map for consistency.

            - -
            - -
              -
            • lines 19-20
            • -
            - -
            19   fillColor = "#354f52",
            -20   fillOpacity = 1,
            - -

            Define the color and transparency of the National Parks. In a future post, I am going to change the color of each type of public land, but for now, I’ll make them all a nice sage green color #354f52. I also want to make the parks to be fully opaque.

            - -
            - -
              -
            • lines 21-24
            • -
            - -
            21    stroke = TRUE,
            -22    weight = 1,     
            -23    opacity = 0.5,                       
            -24    color = "#354f52",
            - -

            The next four lines (21-24) define what kind of outline the National Parks will have. I detail each of these arguments in part II of this series.

            - -

            Briefly, I want there to be an outline to each park (stroke = TRUE) that’s thicker weight = 1 than the outline used on the base map. I do not like the way it looks at full opacity, so I make it half-transparent (opacity = 0.5). Finally, I want the outline color = "#354f52 to be the same color as the fill. This will matter more when I change the fill color of the parks later on.

            - -
            - -
              -
            • lines 25-28
            • -
            - -
            25  highlight = highlightOptions(
            -26              weight = 3,
            -27              color = "#fff",
            -28              fillOpacity = 0.8,
            - -

            Lines 25-28 define the National Park’s behavior on mouseover. First we have to define and initialize the highlightOptions() function. The function take similar arguments as the addPolygons function - both of which I go over in detail in part II.

            - -

            I want to keep the mouseover behavior noticeable, but simple. To do so, I set the outline’s thickness to be weight = 3. This will give the shape a nice border that differentiates it from the rest of the map.

            - -

            color = "#fff sets the outline’s color on mouseover only. So, when inactive, the outline color will match the fill color, but on mouseover the outline color switches to white (#fff).

            - -
            - Note: When the hex code repeats (like, white - #ffffff, black - #000000, or grey #808080) you only need to include the first three characters.
            - -
            - -
              -
            • line 29
            • -
            - -
            29    bringToFront = TRUE),
            - -

            bringToFront can either be TRUE or FALSE. If TRUE, Leaflet will bring the park to the forefront on mouseover. This is useful later when we add in the state parks because national and state parks tend to be close together.

            - -

            When FALSE the shape will remain static.

            - -
            - -
              -
            • line 30
            • -
            - -
            30    group = "National Parks")  %>%
            - -

            Since Leaflet adds all new data to the top of the base map, I think it’s useful to group the layers together. In the next block of code, we add in some layer functionality. For now, though, I want to add the National Parks to their own group so I can hide the National Parks if I want.

            - -
            - -
              -
            • line 31-34
            • -
            - -
            31    addLayersControl(
            -32        baseGroups = "Base Map",
            -33        overlayGroups = "National Parks",
            -34        options = layersControlOptions(collapsed = FALSE))
            - -

            addLayersControl defines how layers are displayed on the final map. The function takes three arguments.

            - -

            First, we have to tell Leaflet which layer should be used as the base map: baseGroups = "Base Map". The name in the quotations (here: "Base Map") has to match the name given to the layer you set in the addPolygons() call. In line 14, I put the 50 states into a group called "Base Map", but you can name it anything you like.

            - -

            There can be more than one base map, too. It’s not super helpful here since I shifted Alaska and Hawaii, but when using map tiles you can add multiple types of base maps that users can switch between.

            - -

            Next, we have to define the layers that are shown on top of the base group: overlayGroups = "National Parks". Just like the base map, this is defined in the corresponding addPolygons call. Here, I called the layer National Parks in line 30.

            - -

            Finally, on the map I don’t want the layers to be collapsed, so I set options = layersControlOptions(collapsed = FALSE). When TRUE the map will display an icon in the top right that, when clicked, will show the available layers.

            - -
            - - - Image: Layers collapsed - - -
            Layers collapsed
            -
            - -

            6. conclusion

            -
            - -

            Hey, look at that! You made a base map and you added some National Park data to it. You’re a certified cartographer now!

            - -

            In the next part IV post we’ll download and process the state park data before adding it to the map. Part V of this series we’ll add Shiny functionality and some additional markers.

            - - -
            Map showing US National Parks
            -

            </figure>

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/dc/2a65920806ec1e45e9b5614492fea639f7bde969e20899ba9b31c6121b7f84 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/dc/2a65920806ec1e45e9b5614492fea639f7bde969e20899ba9b31c6121b7f84 deleted file mode 100644 index 9ceb33b..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/dc/2a65920806ec1e45e9b5614492fea639f7bde969e20899ba9b31c6121b7f84 +++ /dev/null @@ -1,355 +0,0 @@ -I"«e

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. pre-process data in R

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -        filter(State_Nm %!in% territories)  %>% 
            -        filter(Own_Type == "State")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
                territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
                state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • line 5
            • -
            - -
                filter(State_Nm %!in% territories & Own_Type == "Stat")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is Stat. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as Stat. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/dd/c89e4885b12cef33557c5d7a6706d9c1be85d9863eba423643654c41604804 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/dd/c89e4885b12cef33557c5d7a6706d9c1be85d9863eba423643654c41604804 deleted file mode 100644 index 7d3d70e..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/dd/c89e4885b12cef33557c5d7a6706d9c1be85d9863eba423643654c41604804 +++ /dev/null @@ -1,7 +0,0 @@ -I"` -
            1
            -2
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/df/c939395daed16219456055a06e20b577d6420f0ed956348699d34b904dbdc6 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/df/c939395daed16219456055a06e20b577d6420f0ed956348699d34b904dbdc6 new file mode 100644 index 0000000..3980806 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/df/c939395daed16219456055a06e20b577d6420f0ed956348699d34b904dbdc6 @@ -0,0 +1,65 @@ +I"Ě + +
            It made a difference to that one. + +
            + +

            I have a simple motto in life: Do what you can, where you are, with what you have. As a result, I believe strongly in doing whatever is in my means to make the world a better place. Below are some groups that I have either founded or joined in order to help those around me.</p>

            + +

            Current Groups

            +
            + +
              +
            • +

              Political Science Methodology Group | Co-organizer with Melina Much
              +University of California, Irvine

              +
            • +
            • +

              Political Science Womxn’s Caucus | Student leader
              +University of California, Irvine

              +
            • +
            • +

              Political Science Workshop Coordinator
              +University of California, Irvine

              +
            • +
            • +

              Legal Politics Writing Workshop
              +University of California, Irvine

              +
            • +
            • +

              Center for Democracy: Writing Workshop | Member
              +University of California, Irvine

              +
            • +
            • +

              UCI Humanities: Writing Workshop | Member
              +University of California, Irvine

              + +
              <h1>Previous Groups</h1>
              +<hr class = "h-line">
              +<ul>
              +  <li><i>Friends of the San Dimas Dog Park</i> | Ambassador <br/>
              +  San Dimas, California </li><br/>
              +  <li><i>Prisoner Education Project</i> | Volunteer <br/>
              +  Pomona, California</li><br/>
              +  <li><i>Tails of the City</i> | Volunteer Photographer <br/>
              +  Los Angeles, California</li><br/>
              +  <li><i>Philosophy Club</i> | President, Graphic Designer, and Banquet Chair <br/>
              +  California State Polytechnic University, Pomona</li><br/>
              +  <li><i><a href = "https://www.voteamerica.com/">Long Distance Voter</a></i> | Intern <br/>
              +  Social Media Content Creator</li><br/>
              +  <li><i><a href = "https://www.freepress.net/">Free Press</a></i> | Intern <br/>
              +  Social Media Content Creator</li>
              +</ul>   <!-- </div>
              +
              +
              +
              +
            • +
            +

            </div> + –>

            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/e1/0320cc69ef23d825da6b357bcb29829b0569da7716ee336cb6ae2e24f8c902 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/e1/0320cc69ef23d825da6b357bcb29829b0569da7716ee336cb6ae2e24f8c902 deleted file mode 100644 index 0e56673..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/e1/0320cc69ef23d825da6b357bcb29829b0569da7716ee336cb6ae2e24f8c902 +++ /dev/null @@ -1,624 +0,0 @@ -I"sé

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            -
            -
            "chaos ensued"
            - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something,” but they are my nemesis. Every time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “hello world!”**

            - -
              -
            • I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is. -** print("Hello World!")
            • -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/e5/900ad780a372fb397db6a5bfb0de57c05ceb760b1ac82430dcbe9e65927c27 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/e5/900ad780a372fb397db6a5bfb0de57c05ceb760b1ac82430dcbe9e65927c27 new file mode 100644 index 0000000..decfb51 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/e5/900ad780a372fb397db6a5bfb0de57c05ceb760b1ac82430dcbe9e65927c27 @@ -0,0 +1,49 @@ +I"Ö +
            It made a difference to that one. + +
            +
              <p>I have a simple motto in life: Do what you can, where you are, with what you have. As a result, I believe strongly in doing whatever is in my means to make the world a better place. Below are some groups that I have either founded or joined in order to help those around me.</p>
            +  <h1>Current Groups</h1>     
            +  <hr class = "h-line">     
            +  <ul>
            +    <li><i>Political Science Methodology Group</i> | Co-organizer with Melina Much <br/>
            +    University of California, Irvine </li><br/>
            +    <li><i>Political Science Womxn's Caucus</i> | Student leader <br/>
            +    University of California, Irvine </li><br/>
            +    <li><i>Political Science Workshop Coordinator</i> <br/>
            +    University of California, Irvine </li><br/>
            +    <li><i>Legal Politics Writing Workshop</i><br/>
            +    University of California, Irvine </li><br/>
            +    <li><i>Center for Democracy: Writing Workshop</i> | Member <br/>
            +    University of California, Irvine</li><br/>
            +    <li><i>UCI Humanities: Writing Workshop</i> | Member <br/>
            +    University of California, Irvine</li><br/> 
            +  </ul>
            +  <h1>Previous Groups</h1>
            +  <hr class = "h-line">
            +  <ul>
            +    <li><i>Friends of the San Dimas Dog Park</i> | Ambassador <br/>
            +    San Dimas, California </li><br/>
            +    <li><i>Prisoner Education Project</i> | Volunteer <br/>
            +    Pomona, California</li><br/>
            +    <li><i>Tails of the City</i> | Volunteer Photographer <br/>
            +    Los Angeles, California</li><br/>
            +    <li><i>Philosophy Club</i> | President, Graphic Designer, and Banquet Chair <br/>
            +    California State Polytechnic University, Pomona</li><br/>
            +    <li><i><a href = "https://www.voteamerica.com/">Long Distance Voter</a></i> | Intern <br/>
            +    Social Media Content Creator</li><br/>
            +    <li><i><a href = "https://www.freepress.net/">Free Press</a></i> | Intern <br/>
            +    Social Media Content Creator</li>
            +  </ul>
            +<!-- </div>
            +
            +
            +
            +

            </div> + –>

            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/e6/41e43138169733738be4bdd182fe68fa4abfd8d4409cda2af526a94156c186 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/e6/41e43138169733738be4bdd182fe68fa4abfd8d4409cda2af526a94156c186 deleted file mode 100644 index 4007fc4..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/e6/41e43138169733738be4bdd182fe68fa4abfd8d4409cda2af526a94156c186 +++ /dev/null @@ -1,604 +0,0 @@ -I"Câ

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/e7/cd4c686d5b5c42ddd3a833f646d111f8ecb2ce2900a174679e2c74f195b6e1 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/e7/cd4c686d5b5c42ddd3a833f646d111f8ecb2ce2900a174679e2c74f195b6e1 new file mode 100644 index 0000000..fe91ddb --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/e7/cd4c686d5b5c42ddd3a833f646d111f8ecb2ce2900a174679e2c74f195b6e1 @@ -0,0 +1,33 @@ +I"7
            +
            +
            +
            + + +
            + Here you'll find several non-partisan ballot guides. I try to include as much information as possible without directly recreating the official voter guide. The information is sourced from the ballot guide, calmatters, ballotpedia, LA Times, voter’s edge, and Mercury News. I have included links to the campaign website or wherever most of the information came from. + + I have done my best to keep my views out of it. + + I started this to help my two aunts because they would ask me to simplify their ballots for them. Democracy relies on an informed and participatory citizenry, but it’s not always easy. This is meant to alleviate some of the burden. + + If you notice any errors, you feel like I’ve missed something, or you found this guide helpful feel free to send me an email [click the envelope at the bottom of the page] +

            + Helpful Resources: +
            1. Register to vote (Deadline May 23): https://registertovote.ca.gov/
            2. +
            3. Check your registration status: https://voterstatus.sos.ca.gov/
            4. +
            5. Access the official voter guide: https://voterguide.sos.ca.gov/
            6. +
            7. Early voting & ballot drop off locations: https://caearlyvoting.sos.ca.gov/
            8. +
            9. Track your ballot: https://california.ballottrax.net/voter/
            10. +
            11. If you are in the Los Angeles, San Bernardino, Orange County area and need help getting to your polling place I will either find you resources or help you get there. I also offer to go with you to vote (and I will bring my two large German Shepherds) if you feel unsafe going to vote alone.
            + +2022 Primary California + +
            + +
            +
            + +
            +
            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/eb/0ee9b9633b8c78957a295b9a7b5722a1a2766e0613d8be0a5ac61be1647309 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/eb/0ee9b9633b8c78957a295b9a7b5722a1a2766e0613d8be0a5ac61be1647309 new file mode 100644 index 0000000..0badcf9 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/eb/0ee9b9633b8c78957a295b9a7b5722a1a2766e0613d8be0a5ac61be1647309 @@ -0,0 +1,55 @@ +I"† + +
            It made a difference to that one. + +
            + +

            I have a simple motto in life: Do what you can, where you are, with what you have. As a result, I believe strongly in doing whatever is in my means to make the world a better place. Below are some groups that I have either founded or joined in order to help those around me.</p>

            + +

            Current Groups

            +
            + +
              +
            • Political Science Methodology Group | Co-organizer with Melina Much +
              University of California, Irvine
            • +
            • Political Science Womxn’s Caucus | Student leader +
              University of California, Irvine
            • +
            • Political Science Workshop Coordinator +
              University of California, Irvine
            • +
            • Legal Politics Writing Workshop
              +
              University of California, Irvine
            • +
            • Center for Democracy: Writing Workshop | Member +
              University of California, Irvine
            • +
            • +

              UCI Humanities: Writing Workshop | Member +
              University of California, Irvine

              + +
              <h1>Previous Groups</h1>
              +<hr class = "h-line">
              +<ul>
              +  <li><i>Friends of the San Dimas Dog Park</i> | Ambassador <br/>
              +  San Dimas, California </li><br/>
              +  <li><i>Prisoner Education Project</i> | Volunteer <br/>
              +  Pomona, California</li><br/>
              +  <li><i>Tails of the City</i> | Volunteer Photographer <br/>
              +  Los Angeles, California</li><br/>
              +  <li><i>Philosophy Club</i> | President, Graphic Designer, and Banquet Chair <br/>
              +  California State Polytechnic University, Pomona</li><br/>
              +  <li><i><a href = "https://www.voteamerica.com/">Long Distance Voter</a></i> | Intern <br/>
              +  Social Media Content Creator</li><br/>
              +  <li><i><a href = "https://www.freepress.net/">Free Press</a></i> | Intern <br/>
              +  Social Media Content Creator</li>
              +</ul>   <!-- </div>
              +
              +
              +
              +
            • +
            +

            </div> + –>

            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ed/f3ca0c1065299402acb160458fe40acd5ac1b5b65cd02bbec44cd448e7db7c b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ed/f3ca0c1065299402acb160458fe40acd5ac1b5b65cd02bbec44cd448e7db7c deleted file mode 100644 index 23b91b8..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ed/f3ca0c1065299402acb160458fe40acd5ac1b5b65cd02bbec44cd448e7db7c +++ /dev/null @@ -1,617 +0,0 @@ -I"üć

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            -
            -
            "chaos ensued"
            - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save it, and in part VI of this never-ending series* I’ll create individual state maps and link them to the map of National Parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f1/53988ae2a401cf27c8bcf9a5bb4b2fa67a6dc4e1724c707f45b61b01f8323d b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f1/53988ae2a401cf27c8bcf9a5bb4b2fa67a6dc4e1724c707f45b61b01f8323d deleted file mode 100644 index e76d786..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f1/53988ae2a401cf27c8bcf9a5bb4b2fa67a6dc4e1724c707f45b61b01f8323d +++ /dev/null @@ -1,285 +0,0 @@ -I"ßI

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: PAD-US Viewer Key - - -
            PAD-US Viewer Key
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f3/b968f2da9f2e228b0f2f61ad11e95ff4a96a2429de956a54c231a0c12af28e b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f3/b968f2da9f2e228b0f2f61ad11e95ff4a96a2429de956a54c231a0c12af28e deleted file mode 100644 index 2a3f917..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f3/b968f2da9f2e228b0f2f61ad11e95ff4a96a2429de956a54c231a0c12af28e +++ /dev/null @@ -1,454 +0,0 @@ -I"™Ť

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines

            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f6/39707a350f09783dc957f58819d951ed69368412a34905946b94a4d58e86e6 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f6/39707a350f09783dc957f58819d951ed69368412a34905946b94a4d58e86e6 deleted file mode 100644 index 752aed3..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f6/39707a350f09783dc957f58819d951ed69368412a34905946b94a4d58e86e6 +++ /dev/null @@ -1,305 +0,0 @@ -I"l\

            This is a continuation of my previous post where I walked through how to download and modify shape data. I also showed how to shift Alaska and Hawaii so they are closer to the continental usa. -

            - -

            In this post, I’ll go over how to use Leaflet to map the shapefile we made in the previous post. If you’ve come here from part one of the series, you probably have the libraries and data loaded already. However, if you don’t, be sure to load the libraries and shapefiles before moving to number two.

            - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            V. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -
                ## load libraries
            -    library("tidyverse")    # data manipulation & map creation
            -    library("sf")           # loads shapefile
            -    library("leaflet")      # creates the map
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -
                ## load data
            -    states <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp to reflect wherever you saved the shifted shapefile.

            - -

            If your data processing and base map creation are in the same file, you can skip this line, and when you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. create the base map

            -
            - -

            At its most basic, all Leaflet needs to create a map is a base map and data layers. The code below may look intimidating, but it’s mostly style options.

            - -

            This is the map we’re going to create. It’s a simple grey map and each state darkens in color as you hover over it. I’ll show the same map after each style option is added so you can see what effect it has.

            - -
            - - - Image: Final USA basemap - - -
            Final USA basemap
            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line. -
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -
                ## create usa base map using leaflet()
            -    map <- leaflet() %>%
            -    addPolygons(data = states,
            -        smoothFactor = 0.2,
            -        fillColor = "#808080",
            -        fillOpacity = 0.5,
            -        stroke = TRUE,
            -        weight = 0.5,
            -        opacity = 0.5,
            -        color = "#808080",
            -        highlight = highlightOptions(
            -            weight = 0.5,
            -            color = "#000000",
            -            fillOpacity = 0.7,
            -            bringToFront = FALSE),
            -        group = "Base Map")
            -
            - -
              -
            • line 2:
            • -
            - -
            2    map <- leaflet() %>% 
            - -

            leaflet() initializes the map widget. I save it to a variable called map (map <-) so I can run other code in the file without recreating the map each time. When you want to see the map, you can type map (or whatever you want to name your map) in the terminal and hit enter. R will display the map in the viewer.

            - -
            - -
              -
            • line 3:
            • -
            - -
            3    addPolygons(data = states,
            - -

            addPolygons() adds a layer to the map widget. Leaflet has different layer options, including addTiles and addMarkers which do different things. You can read about them on the leaflet website. Since we’re using a previously created shapefile, we’ll add the shapefile to the map using addPolygons().

            - -

            The first argument you need to specify after calling addPolygons is data = [data-source]. [data-source] is whatever variable your data is stored in. For me, it’s called states. This is either the processed data from part I of this series or the saved shapefile loaded above under the section called load data.

            - -

            When you run only the first two lines, Leaflet will use its default styling. The base color will be a light blue and the outlines of the states will be dark blue and fairly thick.

            - -
            - - - Image: USA map with default leaflet styling - - -
            USA map with default leaflet styling
            -
            - -

            You can leave the base map like this if you want, but all additional data will be added as a layer on top</i>* of this map which can become distracting very quickly. I prefer to make my base maps as basic and unobtrusive as possible so the data I add on top of the base map is more prominent.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    smoothFactor = 0.2,
            - -

            smoothFactor controls how much the polygon shape should be smoothed at each zoom level. The lower the number the more accurate your shapes will be. A larger number, on the other hand, will lead to better performance, but can distort the shapes of known areas.

            - -

            I keep the smoothFactor low because I want the United States to appear as a coherent land mass. The image below shows three different maps, each with a different smoothFactor to illustrate what this argument does. On the left, the map’s smoothFactor=0.2, the center map’s smoothFactor=10, and the right’s smoothFactor=100.

            - -
            - - - Image: SmoothFactor = 0.2 (left), 10 (center), 100 (right) - - -
            SmoothFactor = 0.2 (left), 10 (center), 100 (right)
            -
            - -

            As you can see, the higher the smoothFactor the less coherent the United States becomes.

            - -
            - Note: Leaflet - all map programs, really - don't know that continents, countries, states, or any other land mass is a coherent set. It reads the shape data and spits out a map without regard to where things "should" be. It doesn't know California goes on the left or that Alaska is part of the United States. It's important to keep this in mind because each state's shape is rendered individually when using addPolygons(). -
            - -
            - -
              -
            • lines 5-6:
            • -
            - -
            5    fillColor = "#808080",
            -6    fillOpacity = 0.5,
            - -

            fillColor refers to what color is on the inside of the polygons. Since I want a minimal base map, I usually set this value to be some shade of grey. If you want a different color, you only need to replace #808080 with the corresponding hex code for the color you want. Here is a useful hex color picker. If you have a hex value and you want the same color in a different shade, this is a useful site.

            - -

            fillOpacity determines how transparent the color inside the shape should be. I set mine to be 0.5 because I like the way it looks. The number can be between 0 and 1 with 1 being fully opaque and 0 being fully transparent.

            - -
            - -
              -
            • line 7-10:
            • -
            - -
            7    stroke = TRUE,
            -8    weight = 0.5,
            -9    opacity = 0.5,
            -10   color = "#808080",
            - -

            The next four lines define the appearance of the shapes’ outline.

            - -

            The stroke property can be set to either TRUE or FALSE. When true, Leaflet adds an outline around each polygon. When false, the polygons have no outline. In the image below, the map on the left has the default outlines and on the right stroke = FALSE.

            - -
            - - - Image: Default outlines on the left, no outline on the right - - -
            Default outlines on the left, no outline on the right
            -
            - -

            weight = 0.5 sets the thickness of the outlines to be 0.5 pixels. This can be any value you want with higher numbers corresponding to thicker lines. Lower numbers correspond to thinner lines.

            - -

            The opacity property operates in the same way as fill opacity above, but on the outlines. The number can be between 0 and 1. Lower numbers correspond to the lines being more transparent and 1 means fully opaque.

            - -

            color = "#808080" sets the color of the outline. I typically set it to be the same color as the fill color.

            - -

            If you want a static base map then lines 2-10 are all you need, as shown in the image below. I like to add some functionality to my base map so that the individual states become darker when they’re hovered over.

            - -
            - - - Image: static base map - - -
            static base map
            -
            - -

            Lines 11-15 define the map’s behavior when the mouse hovers over the shape. Most of the options are the same as the ones used on the base polygon shapes, so I won’t go into them with much detail.

            - -
            - -
              -
            • line 11:
            • -
            - -
            11    highlight = highlightOptions()
            - -

            highlight = highlightOptions() contains the mouseover specifications. The word before the equal sign has to be either highlight or highlightOptions. I am not sure why you have to declare highlight twice, but you do.

            - -

            highlightOptions() is the actual function call.

            - -
            - -
              -
            • lines 12-14
            • -
            - -
            12    weight = 0.5,
            -13    color = "#000000",
            -14    fillOpacity = 0.7,
            - -

            weight, color, and fillOpacity all operate in the same way as before, but whatever values you specify here will only show up when the mouse hovers over.

            - -
            - -
              -
            • line 15
            • -
            - -
            15    bringToFront = FALSE),
            - -

            bringToFront takes one of two values: TRUE or FALSE. It only really matters when you have multiple layers (like we will in later parts of this series). When bringToFront = TRUE hovering over the state will bring it to the front. When bringToFront = FALSE it will stay in the back.

            - -

            Since the base map has only one layer, this property doesn’t affect anything.

            - -
            - -
              -
            • line 16
            • -
            - -
            16    group = "Base Map")
            - -

            group = "Base Map") lets you group multiple layers together. This argument will come in handy as we add more information to the map. The base map is the default layer and is always visible - though, when you use map tiles you can define multiple base layers. All other layers will be on top of the base layer. When using different groups, you can define functionality that allows users to turn off certain layers.

            - -

            4. Conclusion

            -

            You’ve created your first base map! It’s a boring flat, grey map, but it’s the base we’ll use when adding in the national and state park data. In part III of this series we’ll process and add in the National Parks.

            - -
            -
            United States base map made with Leaflet
            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f6/c8ec1b66f423ee8286319faf5ef6b916a8b62f8640e8c82b1975f8009b7062 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f6/c8ec1b66f423ee8286319faf5ef6b916a8b62f8640e8c82b1975f8009b7062 deleted file mode 100644 index 715b456..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f6/c8ec1b66f423ee8286319faf5ef6b916a8b62f8640e8c82b1975f8009b7062 +++ /dev/null @@ -1,622 +0,0 @@ -I"cé

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • separate state park data
            • -
            • save state park data
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            VI. cartography in r part six

            -
              -
            • create individual state maps
            • -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed maps
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -16
            -17
            -18
            -19
            -20
            -21
            -22
            -23
            -24
            -25
            -26
            -27
            -28
            -29
            -30
            -31
            -32
            -33
            -34
            -35
            -36
            -37
            -38
            -39
            -40
            -41
            -42
            -43
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW") %>% 
            -    filter(d_Pub_Acce != "Closed" & 
            -           d_Pub_Acce != "Unknown") %>% 
            -    filter(Loc_Ds != "ACC" &
            -           Loc_Ds != "Hunter Access",
            -           Loc_Ds != "Public Boat Ramp")  %>% 
            -    select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres)  %>% 
            -    mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -                            d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -                            d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -                            d_Des_Tp == "State Park" ~ "State Park or Parkway"))  %>% 
            -    mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -                               Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -                               Unit_Nm == "Salton Sea" ~ "visited",
            -                               Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -                               Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -                               Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -                               TRUE ~ "not visited")  %>% 
            -    shift_geometry(preserve_area = FALSE, 
            -                    position = "below") %>% 
            -    sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -
            -    ## save shifted park data
            -    # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10          Des_Tp == "SCA" |
            -11          Des_Tp == "SHCA" |
            -12          Des_Tp == "SP" |
            -13          Des_Tp == "SREC" |
            -14          Des_Tp == "SRMA" |
            -15          Des_Tp == "SW")  
            - -

            Theoretically, lines 7-15 can be included with the first filter() call in line 5, but I couldn’t get it to work.

            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -

            This will leave me with 50,102 rows.

            - -
            - -Note: -I use the number of rows as a simple gut-check that the code is doing what I want it to. You can get the number of rows by running nrow(state_parks) -
            - -
            - -
              -
            • Lines 16-17
            • -
            - -
            16    filter(d_Pub_Acce != "Closed" & 
            -17           d_Pub_Acce != "Unknown")  %>% 
            - -

            Yet another filter() call. These two lines tell R to exclude any row whose d_Pub_Acce is not Closed or Unknown

            - -

            The data has four types of access: Closed, Unknown, Open Access, and Restricted Access. I’m only interested in land that I can visit, so I want to keep only the parks with Open or Restricted Access. In the filter() call, I chose to use != solely because months or years from now when I look at this code it will be easier for me to figure out what I was doing. I know myself and if I saw d_Pub_Acce == "Open Access" my first thought would be: “What are the other types?” and then I’ll try and find out and waste a bunch of time.

            - -

            This last filter brings the total number of state parks down to 49,719. I don’t think I can reduce that number more without removing places that should be kept.

            - -
            - -

            *lines 18-20

            - -
            18    filter(Loc_Ds != "ACC" &
            -19           Loc_Ds != "Hunter Access",
            -20           Loc_Ds != "Public Boat Ramp")
            - -

            Lines 18-20 have the same logic as lines 16-17 except here I want to filter out the Hunter Access areas and Boat Ramps.

            - -
            - -
              -
            • Line 21
            • -
            - -
            21      select(d_Own_Type, d_Des_Tp, Loc_Ds, Unit_Nm, State_Nm, d_State_Nm, GIS_Acres, SHAPE)  %>% 
            - -

            Now that I’ve pared down the data a little bit, I want discard any column don’t need.

            - -

            select() lets me choose the columns I want to keep by name, rather than by index number.

            - -

            I decided to keep:

            -
              -
            • d_Own_Type (owner type)
            • -
            • d_Des_Tp (designation type - state park, resource area, etc)
            • -
            • Loc_Ds (location description - I like this because it tells me where ponds are)
            • -
            • Unit_Nm (park name)
            • -
            • State_Nm (two character state abbreviation)
            • -
            • d_State_Nm (long state name)
            • -
            • GIS_Acres (park size)
            • -
            • SHAPE (geometry column)
            • -
            - -
            - -
              -
            • lines 22-30
            • -
            - -
            22  mutate(type = case_when(d_Des_Tp == "Access Area" ~ "State Trail",
            -23                          d_Des_Tp == "Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -24                          d_Des_Tp == "State Historic or Cultural Area" ~ "State Historical Park, Site, Monument, or Memorial",
            -25                          d_Des_Tp == "Recreation Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -26                          d_Des_Tp == "State Resource Management Area" ~ "State Preserve, Reserve, or Recreation Area",
            -27                          d_Des_Tp == "State Wilderness" ~ "State Preserve, Reserve, or Recreation Area",
            -28                          d_Des_Tp == "State Recreation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -29                          d_Des_Tp == "State Conservation Area" ~ "State Preserve, Reserve, or Recreation Area",
            -30                          d_Des_Tp == "State Park" ~ "State Park or Parkway"))
            - -

            mutate() is part of the tidyverse package and it’s extremely versatile. It is mainly used to create new variables or modify existing ones.

            - -

            I wanted the state park designations to match closely with the types I used in the National Park data.

            - -

            I went over the logic of using mutate() and case_when() in Part III of this series, so I won’t cover it again here.

            - -

            In its general form, the format is case_when(COLUMN_NAME == "original_value" ~ "new_value"). I only needed to change the values for "Recreation Management Areas, the rest I just populated the new column with the old values.

            - -
            - -
              -
            • lines 31-37
            • -
            - -
            31  mutate(visited = case_when(Unit_Nm == "Valley of Fire State Park" ~ "visited",
            -32                             Unit_Nm == "Crissey Field State Recreation Site" ~ "visited",
            -33                             Unit_Nm == "Salton Sea" ~ "visited",
            -34                             Unit_Nm == "Anza-Borrego Desert State Park" ~ "visited",
            -35                             Unit_Nm == "Jedediah Smith Redwoods State Park" ~ "visited",
            -36                             Unit_Nm == "Del Norte Coast Redwoods State Park" ~ "visited",
            -37                             TRUE ~ "not visited")  %>% 
            - -

            Here is where I ran into some issues. In part III of the series when I processed the National Park data I included a mutate() and case_when() call to mark whether I’ve visited the park or not. It’s not a very elegant solution since I have to modify each park individually, but it was passable since I’ve only been to a handful of National Parks. For the state parks, though, it is unwieldy.

            - -

            I had original wanted to drop the geometry and download the parks as a CSV, but even that was overwhelming.

            - -
            - Note: The code to drop the geometry and save some of the park data as csv can be found in the r files on this project's repo
            - -

            In the end, I decided to focus on the parks that I know I’ve visited and have taken photos at. I’ve visited many, many state parks, but until I have the photos to add to the markers (covered in part five), I’m omitting them from this code. Hopefully in the mean time I’ll figure out a better way to keep track of the parks I’ve been to.

            - -

            The logic is the same as the National Park data. mutate() created a new column type and populated it by using case_when().

            - -
            - -
              -
            • lines 38-43
            • -
            - -
            38   shift_geometry(preserve_area = FALSE, 
            -39                  position = "below") %>% 
            -40   sf::st_transform("+proj=longlat +ellps=WGS84 +datum=WGS84 +no_defs")
            -41
            -42   ## save shifted park data
            -43   # st_write(state_parks, "./shapefiles/shifted/states/state_parks.shp")
            - -

            I’ve covered these lines extensively in part II and part III of this series.

            - -

            Lines 38-39 shift the state park data from Alaska and Hawaii so it appears under the continental US and of comparable size.

            - -

            Line 40 is required to change the coordinate system from Albers to WGS84 - the latter of which is required by Leaflet.

            - -

            Line 43 saves the shifted shapefile to the hard drive. Delete the # from the start of the line to save the file.

            - -

            7. divide by state

            -
            - -
            -
            -
            "chaos ensued"
            - -
            - -

            I tried to map the base map, National Parks, and the state parks. It did not go well. R froze, my computer screamed, and chaos ensued. As a result, I had to rethink my map. I decided to separate the state parks by state, save them, and in part VI of this never-ending series* I’ll create individual state maps. When you click on a state it’ll take you to a map that includes the state parks.

            - -

            Unfortunately, this also means I need to separate the National Parks by state so they also appear on the individual maps. The logic will be the same so I am not going to update part III to reflect that change. If you want to see that code it’s available on the project repo].

            - -

            I don’t want to manually separate and save each state, so I’m going to use a loop! I hate loops. The logic is simple enough “as long as condition X is true, do something.” So simple, yet esvery time I’ve tried to learn a programming language I have struggled with loops. That’s pretty sad considering it’s like day 2 of any programming class. Day 1 is learning how to write “Hello World!”**

            - -

            * I have annoyed myself with how long this series is. Hopefully it is helpful. Drop me a line if it is.
            -** print("Hello World!")

            -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f8/670086fa4eb8040131aef2fdb4df8c109889770773d317a05350eb1f5273b2 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f8/670086fa4eb8040131aef2fdb4df8c109889770773d317a05350eb1f5273b2 deleted file mode 100644 index fc191d3..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f8/670086fa4eb8040131aef2fdb4df8c109889770773d317a05350eb1f5273b2 +++ /dev/null @@ -1,338 +0,0 @@ -I"Â_

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. pre-process data in R

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -        filter(State_Nm %!in% territories)  %>% 
            -        filter(Own_Type == "State")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
                territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
                state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f9/2d13cafe61d3ba6d9cda0743f957b5f87936f46057b982564b4f357debd96d b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f9/2d13cafe61d3ba6d9cda0743f957b5f87936f46057b982564b4f357debd96d deleted file mode 100644 index b799a9e..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/f9/2d13cafe61d3ba6d9cda0743f957b5f87936f46057b982564b4f357debd96d +++ /dev/null @@ -1,290 +0,0 @@ -I"ˇK

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/fc/a1e49b7030b0eeb6fc84e0634cc1b76a3f1fee7713a3d212618c8994dbc925 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/fc/a1e49b7030b0eeb6fc84e0634cc1b76a3f1fee7713a3d212618c8994dbc925 new file mode 100644 index 0000000..96e2724 --- /dev/null +++ b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/fc/a1e49b7030b0eeb6fc84e0634cc1b76a3f1fee7713a3d212618c8994dbc925 @@ -0,0 +1,53 @@ +I"ł +
            It made a difference to that one. + +
            + +

            I have a simple motto in life: Do what you can, where you are, with what you have. As a result, I believe strongly in doing whatever is in my means to make the world a better place. Below are some groups that I have either founded or joined in order to help those around me.</p>

            + +

            Current Groups

            +
            + +
              +
            • Political Science Methodology Group | Co-organizer with Melina Much
              +University of California, Irvine
            • +
            • Political Science Womxn’s Caucus | Student leader
              + University of California, Irvine + Political Science Workshop Coordinator
              + University of California, Irvine + Legal Politics Writing Workshop
              + University of California, Irvine + Center for Democracy: Writing Workshop | Member
              + University of California, Irvine + UCI Humanities: Writing Workshop | Member
              + University of California, Irvine + </ul> + <h1>Previous Groups</h1> + <hr class = "h-line"> + <ul> + <li>Friends of the San Dimas Dog Park | Ambassador
              + San Dimas, California </li>
              + <li>Prisoner Education Project | Volunteer
              + Pomona, California</li>
              + <li>Tails of the City | Volunteer Photographer
              + Los Angeles, California</li>
              + <li>Philosophy Club | President, Graphic Designer, and Banquet Chair
              + California State Polytechnic University, Pomona</li>
              + <li>Long Distance Voter | Intern
              + Social Media Content Creator</li>
              + <li>Free Press | Intern
              + Social Media Content Creator</li> + </ul> + <!– </div> +
              +
              +
            • +
            +

            </div> + –>

            +:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/fd/7b44ab61e2ae44ac2a6c69d0e4981eec0860d7b2bc00536015e43f1af4b1f4 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/fd/7b44ab61e2ae44ac2a6c69d0e4981eec0860d7b2bc00536015e43f1af4b1f4 deleted file mode 100644 index 1dbdd57..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/fd/7b44ab61e2ae44ac2a6c69d0e4981eec0860d7b2bc00536015e43f1af4b1f4 +++ /dev/null @@ -1,333 +0,0 @@ -I"\

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. pre-process data in R

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -        filter(State_Nm %!in% territories)  %>% 
            -        filter(Own_Type == "State")
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
                territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 7.

            - -
            - -
              -
            • line 4 -Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()
            • -
            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -:ET \ No newline at end of file diff --git a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ff/fb6f26b6b14f953c10f1e421752efa0f3339ca9d7dc86ce5aac1327839d3c6 b/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ff/fb6f26b6b14f953c10f1e421752efa0f3339ca9d7dc86ce5aac1327839d3c6 deleted file mode 100644 index 93bb3a5..0000000 --- a/.jekyll-cache/Jekyll/Cache/Jekyll--Converters--Markdown/ff/fb6f26b6b14f953c10f1e421752efa0f3339ca9d7dc86ce5aac1327839d3c6 +++ /dev/null @@ -1,413 +0,0 @@ -I"?}

            Welcome to part four of my cartography in R series. In this post, we’ll download and process the state park data before adding it to the base map created in part II.

            - - - -

            project outline

            -
            - -

            I had to break the tutorial into different parts because it became unwieldy. I list the component parts below. The annotated version of the code can be found in this project’s repository in the folder called r files

            - -

            I. cartography in r part one

            -
              -
            • install packages
            • -
            • download usa shapefile
            • -
            • shift alaska and hawaii
            • -
            • save shapefile
            • -
            - -

            II. cartography in r part two

            -
              -
            • create usa base map
            • -
            - -

            III. cartography in r part three

            -
              -
            • download national park data
            • -
            • process national park data
            • -
            • shift alaska and hawaii national parks
            • -
            • save shapefile
            • -
            • add national parks to map
            • -
            - -

            IV. cartography in r part four [this post]

            -
              -
            • download state park data
            • -
            • process state park data
            • -
            • shift alaska and hawaii state parks
            • -
            • save shapefile
            • -
            • add state parks to map
            • -
            - -

            V. cartography in r part five

            -
              -
            • add in shiny functionality
            • -
            • add markers to visited parks
            • -
            • save and embed map
            • -
            - -

            1. load libraries

            -
            - -
            1
            -2
            -3
            -4
            -5
            -
                ## load libraries
            -    library("tidyverse")        # data manipulation & map creation
            -    library("sf")               # loads shapefile
            -    library("leaflet")          # creates the map
            -    library("operator.tools")   # not-in function
            -
            - -

            I am not going to explain in detail what each of these packages do because I already covered it in part one.

            - -

            2. load data

            -
            - -
            1
            -2
            -3
            -
                ## load data
            -    usa <- read_sf("~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp")
            -    nps <- read_sf("~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp")
            -
            - -

            Be sure to change ~/Documents/Github/nps/shapefiles/shifted/usa/usa.shp and ~/Documents/GitHub/nps/shapefiles/shifted/nps/nps.shp to reflect where you saved the shifted shapefiles.

            - -

            If your data processing and base map creation are in the same file, you can skip these lines. When you make the Leaflet call below, you’ll use the name of the variable where the shape data is stored.

            - -

            3. download state park data

            -
            - -

            Unlike the National Park data, the state data is harder to come by. After quite a bit of searching I found the Protected Areas Database of the United States (PAD-US) created by the United States Geological Survey. The data they collect is amazing. It is a “national inventory of U.S. terrestrial and marine protected areas.”

            - -

            When they say national inventory, they mean it. It includes a detailed accounting of nearly every piece of public land “preserved through legal or other effective means” in the United States. For my project, it’s overkill and it will take some investigating to determine what information I want to keep and what to discard.

            - -

            The first step is to navigate to the PAD-US website and download the data. On the PAD-US website, click View Data Download.

            - -
            - - - Image: PAD-US download - - -
            PAD-US download
            -
            - -

            On the next page, you can download the National PAD-US data, the data by Census region, and by state. Click on National Geopackage. Make sure you select the geopackage one rather than the geodatabase version. On the next page, you’ll confirm you’re not a robot before the download link appears.

            - -
            - - - Image: National data download - PAD-US - - -
            National data download - PAD-US
            -
            - -

            When it downloads, unzip it somewhere on your hard drive. The zip folder contains 35 files, so I suggest you create a folder before unzipping it so they stay together.

            - -

            4. find the right layer

            -
            - -

            Since the geopackage is so large, R has a difficult time opening and displaying the data. There’s a couple of ways to view the data that does not rely on R. First, is to load the geopackage layers. This doesn’t load the complete data set. Instead, it just loads the layer names to use in conjunction with the documentation and the PAD-US Viewer.

            - -
            1
            -
                layers <- st_layers("./shapefiles/original/state parks/padus_national_gpkg/PADUS3_0Geopackage.gpkg")
            -
            - -

            I save the layer names to a variable called layers so I can load them whenever I want. You don’t have to, the st_layers() function will return the list regardless of whether it’s saved.

            - -

            st_layers() returns the name of the layer, then what kind of layer it is (Multi Polygon), how many features it contains, and what coordinate system it uses.

            - -
            - - - Image: PAD-US Layer List - - -
            PAD-US Layer List
            -
            - -

            The PAD-US geopackage contains seven layers which contain different kinds of information. The type of map you want to create will determine which layer you will use. For my map I want the same information for the state parks as I have for the national parks. I want:

            -
              -
            • the state where the park is located
            • -
            • the park name
            • -
            • the park type
            • -
            • and the park’s size.
            • -
            - -

            I have to figure out which of the seven layers listed above includes this information. You can load each layer (using the st_read() code below), but this method usually causes R to crash because there’s too much information for it to handle.

            - -

            Instead, I go to the PAD-US Viewer and search for a state park. I am going to choose the Valley of Fire State Park in Nevada because 1) I know it exists, 2) I have been there. You can choose any state park you’ve been to, or choose one from a list of parks by state on Wikipedia.

            - -
            - - - Image: PAD-US Viewer - - -
            PAD-US Viewer
            -
            - -

            In the search bar on the left, type in Valley of Fire (1) and select it from under the Official Place Names (2) panel that appears after you start typing. The map will adjust to show the park and you can click Done (3).

            - -
            - - - Image: Search for Valley of Fire State Park - - -
            Search for Valley of Fire State Park
            -
            - -

            The default view on the PAD-US Viewer includes the first layer’s information, Fee Managers in the viewer and PADUS3_0Fee in the geopackage. According to the documentation, most public land is “owned in fee” though some land is held in “long-term easements, leases, agreements” or by Congressional designation. To own land “in fee” means to own it completely, without limitations or conditions. The Fee layer lists what agency owns the land in its entirety.

            - -

            I don’t actually care whether it’s in fee or easement (allowed to use someone else’s land), my interest is solely in whether the Park’s polygon is visible when this layer is selected in the viewer.

            - -
            - - - Image: Valley of Fire State Park Polygon - - -
            Valley of Fire State Park Polygon
            -
            - -

            The purple-ish polygon on the map shows the geographic boundaries of the Valley of Fire State Park. This is what I want to see. Not all of the layers include the shape data for the state parks.

            - -

            For example, if I uncheck the Fee Manager layer and select the Federal Fee Managers (Authoritative Data) layer the polygon for the Valley of Fire disappears. This is because it is not federal land - it’s state land. Be careful when choosing the layers because not all of them contain the correct information.

            - -
            - - - Image: Federal Fee Managers Layer - - -
            Federal Fee Managers Layer
            -
            - -

            5. find the right data

            -
            - -

            The USGS is interested in fundamentally different information than I am. Finding the right information for my map is going to take some data manipulation. Since the USGS is interested in the more technical aspects of land management, their categories are more detailed than what I need.

            - -

            Using the PAD-US Viewer, I can investigate what information is available without having to load the entire data set into R’s viewer.

            - -

            For this section, I am going to use the Valley of Fire State Park in Nevada, Crissey Field State Recreation Site in Oregon, and the Salton Sea State Recreation Area in California to make sure that I am getting the data I want from the Fee layer. I chose these parks because I have been to them. I know where they are and that they are all (mostly) state land.

            - -
            - - - Image: Valley of Fire (left), Crissey Field (center), Salton Sea (right) - - -
            Valley of Fire (left), Crissey Field (center), Salton Sea (right)
            -
            - -
            - -Note: -Clicking on an image will open it in a new tab. -
            - -

            First, I need a good way to filter out national, tribal, or military land. On the PAD-US Viewer I will look at the Valley of Fire State Park information. When you search for it (described above) a marker will pop up. If you click on the marker a table will show on the map that includes all the information available on that layer.

            - -
            - - - Image: Valley of Fire State Park Table - - -
            Valley of Fire State Park Table
            -
            - -

            In the image the values on the left are the variable names - I’ll use these later to select certain variables. On the right is the value for the specific park. I’ve boxed a few of the properties which I’ll check across the different parks to make sure I can get all the information I need.

            - -
              -
            1. Own_Type designates who owns the land. For Valley of Fire it’s state owned land.
            2. -
            3. Mang_Type is the name of the agency that manages the land.
            4. -
            5. Mang_Name is the type of agency that owns the land, here state.
            6. -
            7. Des_Tp is the designation type.
            8. -
            9. Unit_Nm is the land’s name.
            10. -
            - -

            When we look at Crissey Field, many of these fields have different values.

            - -
            - - - Image: Crissey Field State Recreation Site Table - - -
            Crissey Field State Recreation Site Table
            -
            - -

            The ownership type (Own_Type) is the same for both: State. The Mang_Type, Mang_Name, Des_Tp, and (obviously) Unit_Nm are different.

            - -

            Next, we need to check the Salton Sea. The Salton Sea demonstrates the problem with finding appropriate data. Since the Salton Sea is toxic, it’s owned in part by the State of California, the Bureau of Land Management, and the Bureau of Reclamation - each represented by a different color on the map.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            Having visited the park, I can tell you there’s no fences or markers designating the different owners. On paper the Salton Sea may have many different owners, in real life you wouldn’t know it just by visiting the area.

            - -

            On the left hand side of the viewer there’s a color-coded key that indicates the different Fee managers. It’s not very user-friendly because the the colors are very similar.

            - -
            - - - Image: Salton Sea - - -
            Salton Sea
            -
            - -

            I don’t actually care who the fee manager of the Salton Sea is. What I am looking for, though, is whether any part of the Salton Sea includes Own_Type as State. If it does, I can use this column to separate the state data from any other kind.

            - -

            In the viewer, I’ll look for the blueish violet color (boxed in red above).

            - -
            - - - Image: Salton Sea State Recreation Area Table - - -
            Salton Sea State Recreation Area Table
            -
            - -

            After some trial and error I found a location about a quarter of the way down the lake’s right hand side. I am fairly certain I’ve actually been to that exact location. When looking at the table, I can see that the Own_Type matches the Valley of Fire and Crissey Fields which means I can select the state data using this property.

            - -

            Now we’ll move back to R to load the data layer, filter for the 50 states, and for state owned land.

            - -

            6. process state park data

            -
            - -
            - Note: I will give the entire codeblock first. I also include the individual line before describing what it does. I find this to be cleaner than including a bunch of comments in the code itself. If you want to copy and paste the code to your own file, you only need to copy the code block and not each individual line.
            - -
            1
            -2
            -3
            -4
            -5
            -6
            -7
            -8
            -9
            -10
            -11
            -12
            -13
            -14
            -15
            -
                ## create territories list
            -    territories <- c("AS", "GU", "MP", "PR", "VI")
            -
            -    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            -    filter(State_Nm %!in% territories & 
            -           Own_Type == "STAT")  %>% 
            -    filter(Des_Tp == "ACC" |
            -           Des_Tp == "HCA" |
            -           Des_Tp == "REC" |
            -           Des_Tp == "SCA" |
            -           Des_Tp == "SHCA" |
            -           Des_Tp == "SP" |
            -           Des_Tp == "SREC" |
            -           Des_Tp == "SRMA" |
            -           Des_Tp == "SW")  
            -
            - -

            These lines should be familiar by now, so I will briefly cover them here. For a detailed description of each line, check out part III of this series.

            - -
              -
            • line 2
            • -
            - -
            2    territories <- c("AS", "GU", "MP", "PR", "VI")
            - -

            Line 2 creates the list of US territories that I filter out in line 5.

            - -

            If you want to read more about this line visit part III of the series.

            - -
            - -
              -
            • line 4
            • -
            - -
            4    state_parks <- st_read("./shapefiles/original/state_parks_padus/PADUS3_0Geopackage.gpkg", layer = "PADUS3_0Fee")  %>%
            - -

            Since we’re working with a geopackage (instead of a shapefile) we have to load the data using st_read() instead of read_sf()

            - -

            st_read() takes two arguments. The first is the path to the geopackage. When we downloaded the PAD-US data and unzipped it, the folder contained 35 files. From those 35 make sure you choose the one with the .gpkg extension.

            - -

            The second argument layer = specifies which layer R should load. Here, I am selecting the layer PADUS3_0Fee layer because I know it contains the shape data from the state parks.

            - -
            - -
              -
            • lines 5-6
            • -
            - -
            5    filter(State_Nm %!in% territories  &
            -6           Own_Type == "STAT")
            - -

            R and the Tidyverse’s filter() function allows you to combine conditions within one filter call by using or | or and &.

            - -

            The logic in this line says filter the data for rows where the State_Nm is not in the territories list (discard all but the 50 states) and the Own_Type is STAT. For the row to be selected, both conditions must evaluate to true.

            - -
            - -Note: -The data lists state ownership as STAT. I found this out the hard way by running the filter with "State" and it returning no data. You can check the different values in a single column by running levels(as.factor(state_parks$Own_Type)) -
            - -

            The unfiltered data set had 247,507 rows. After these the two conditions in this line the data set has 53,139 rows. That’s a significant reduction but still a substantial number of rows.

            - -
            - -
              -
            • lines 7-15
            • -
            - -
            7    filter(Des_Tp == "ACC" |
            -8           Des_Tp == "HCA" |
            -9           Des_Tp == "REC" |
            -10           Des_Tp == "SCA" |
            -11           Des_Tp == "SHCA" |
            -12           Des_Tp == "SP" |
            -13           Des_Tp == "SREC" |
            -14           Des_Tp == "SRMA" |
            -15           Des_Tp == "SW")  
            - -

            Next, I want to choose certain types of state owned land. For that, I am going to look at the Des_Tp column. According to the PAD-US documentation, the Des_Tp column holds information about the Designation Type. It contains 37 different land designations.

            - -
            - -Note: -You can view the land designation types by referencing the PAD-US documentation. -
            - -

            I am going to restrict my data to include the following designations:

            -
              -
            • ACC (Access Areas) - after much searching this includes Hunter Access areas, Boat Ramps, and Trails. I’m most interested in the Trails.
            • -
            • HCA (Historic or Cultural Area)
            • -
            • REC (Recreation Management Area)
            • -
            • SCA (State Conservation Area)
            • -
            • SHCA (State Historical or Cultural Area)
            • -
            • SP (State Park)
            • -
            • SREC (State Recreation Area)
            • -
            • SRMA (State Recreation Management Area)
            • -
            • SW (State Wilderness)
            • -
            - -
            -:ET \ No newline at end of file diff --git a/_includes/general.html b/_includes/general.html new file mode 100644 index 0000000..50c661f --- /dev/null +++ b/_includes/general.html @@ -0,0 +1,9 @@ +

            general elections

            +
            + +

            national & state

            +2022 Primary California + +

            counties

            + +

            city

            \ No newline at end of file diff --git a/_includes/navbar.html b/_includes/navbar.html index 0a29b33..57c6d18 100644 --- a/_includes/navbar.html +++ b/_includes/navbar.html @@ -20,6 +20,10 @@ + +