Skip to content

Summary of Biodiversity Informatics Resources I have contributed too.

Notifications You must be signed in to change notification settings

mgaynor1/BiodiversityResources

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Biodiversity Informatics Resources

Michelle L. Gaynor

Introduction

Here I provide a round-up of biodiversity informatic resources I have created or contributed to.

Contributions

Contributions are indicated on each script/repository, but to summarize, contributions to these script or presentations have been made by many, including but not limited to: Natalie Patten, Mike Belitz, Rhett Rautsaw, Maria Cortez, Andre Naranjo, Lauren Whitehurst, Anthony Melton, Johanna Jantzen, Blaine Marchant, Charlotte Germain-Aubrey, Grant Godden, Doug Soltis, and Pam Soltis.

Where to start:

If you are new to R-based coding, I suggest you start with my Biodiversity Literacy in Undergraduate Education (BLUE) module here. My coding style is a mix of Data Carpentry and tidyverse, both of which have additional tutorials that can be used to learn this material.

Resources

soltislab/BotanyENMWorkshop

This repository contains workshop material for "Using Digitized Collections-Based Data in Research: Applications for Ecology, Phylogenetics, and Biogeography" which the Soltis lab presents at the annual Botany meetings. This contains the material used in the 2020, 2021, 2022, and 2023. This workshop was often done in conjuction with Biotaphy, so this repository contains material outside of the scope of my work. As an undergraduate in 2017, I learned all about ENMs with biodiversity occurrence data from Blaine Marchant. When I started as a graduate research assistant with iDigBio in Fall 2018, I started to modify this material to include new methods I was learning. With every additional workshop/course/office-hour I helped lead, I learned something new that I would incorporate into this material. In Spring 2020 and Summer 2021, I heavily revised this material so that it could be used by undergraduate researchers. In Summer 2022 and 2023, I modified this material to include our package gatoRs. This workshop will likely be updated every summer, as we present the "Using Digitized Collections-Based Data in Research: Applications for Ecology, Phylogenetics, and Biogeography" workshop at the annual Botany meeting each year. You can see the latest workshop version here.

A version of the workshops can also be found in mgaynor1/ENMCrashCourse, however this repository is now outdated and likely will be archieved.

General Flowchart

nataliepatten/gatoRs

This repository contains the R package gatoRs: Geographic and Taxonomic Occurrence R-Based Scrubbing which provides users with tools for downloading and processing biodiversity data. This package is also avaliable on CRAN.

See publication: Patten NN, Gaynor ML, Soltis DE, and Soltis PS. 2024. Geographic and Taxonomic Occurrence R-Based Scrubbing (gatoRs): An R package and reproducible workflow for processing biodiversity data.

mgaynor1/CURE-FL-Plants
In Spring 2020, I helped teach a Course-based undergraduate research experience at the University of Florida titled 'CURE: Florida Plants and Climate Change'. This class was a 3 credit undergraduate course and it was co-taught with Pam and Doug Soltis. The goal of this research was to look at how climate change would impact the distribution of rare and endangered species currently residing at Archbold Biological Station. This repository contains demo and project scripts, which are written in a way to loop through a long list of species. Much of these methods are still acceptable, however since this class, I improved this pipeline to defined accessible area with alpha hulls + buffers, rather than convex hulls. I also modified my methods for selecting climatic layers, ecological niche generation, and ecological niche model evaluation. Some of these scripts have also been shortened or simplified by an amazing undergraduate researcher NN Patten.

mgaynor1/R4NaturalHistoryCollections-BCEENET2021
In 2021, I taught a workshop with BCEENET on "Using R for dNHC (Digitized Natural History Collections) Research". This workshop reviews cleaning, mapping, and analyzing natural history collections data in R. It also reviewed point sampling and extracting elevation data from occurrence records, as well as how to run an ANOVA based on this data.

mgaynor1/long-winded-scripts
Over the last few years, I generated many scripts related to using Digitized Natural History Collections data in research. This repository is a collection of some of those scripts, specifically those that took me a large amount of time to put together in R and python.

  • Occurrence Data (R). - This is the first draft of a function that will be part of Natalie's R package gatoRs - this script has since been completely rewritten.
  • SoilGridDownload (python3). - Back when Max Gebhart was an undergrad at the University of Central Florida, we wanted soil layers included in our ENMs. With the help of Mike Belitz, this code was generated. You can see some of Max's work here.
  • Convex Hulls (R). - This code was generated for a post doc in the Soltis lab who had a question about convex hulls.
  • iDigBio Search Bar (R). - Using ridigbio, I wanted to match a simple search from the iDigBio web-portal. This sounds simple, right? Well, it took a long time, but this simple line of code is now avaliable.

mgaynor1/BLUE-Intro2RwithBiodiversityData.
In 2020, I participated in a BIOME Fall Working Group and generated this module. This is an open educational resource which can be cited as followed: Gaynor, M. (2020). Introduction to R with Biodiversity Data. Biodiversity Literacy in Undergraduate Education, QUBES Educational Resources. doi:10.25334/84FC-TE88

With this material, students will learn R basics while downloading biodiversity data from multiple data repositories. This module will walk students through installing R, navigating R, reproducibility in R, and using R to download biodiversity data.

mgaynor1/BCEENET-DataCleaning
In Summer 2020, I helped lead a Data Cleaning Workshop led with Pam Soltis for BCEENET. With the help of Molly Philips, this can be cited as an open educational resource: Gaynor, M. (2020). Cleaning Biodiversity Data: A Botanical Example Using Excel or RStudio. Biodiversity Literacy in Undergraduate Education, QUBES Educational Resources. doi:10.25334/DRGD-F069.

Upon completion of this module, each student should be able to: (1) Access biodiversity data from open sources. (2) Use descriptive, retrievable, and consistent file names to manage datasets. (3) Identify common problems with digital datasets. (4) Rectify common problems with digital datasets. (5) Apply disciplinary knowledge for smart data cleaning. (6) Explain the importance of reproducible data and cleaning steps. (7) Document data cleaning steps to provide reproducibility.

iDigBio API Working Group
From 2019 - 2021, I helped host the API Office Hours with Erica Krimmel and Ron Canepa (iDigBio). There are a lot of resources avaliable on this repository, but some of my contributions include:

Ron and Erica also created some great demos/resources. Here are just some of my favorite:

Gists
Short snibits of code I generated related to using Digitized Natural History Collections data in research.

  • Searching the ScientificName Field
    • Compares three approaches for retrieving records from iDigBio based on the scientificname field.
  • Milton Tan
    - Milton wanted to know how to download records with the ridigbio API based on Institution ID and Catelog Number.
  • spocc_combine loop
    - One of my lab mates wanted to know how to use the spocc_combine function (which is an early edition of the gator_download() function) for a list of species. This gist shows how.
  • SERNEC-R
    - We wanted to combined data downloaded via SERNEC with data download using early functions of the gators R package functions.

Folk and Gaynor et al. 2023

I had the opportunity to work with Ryan Folk on this fun project which resulted in many repositories related to biodiversity informatics:

Folk RA*, Gaynor ML*, Engle-Wrye NJ, O’Meara BC, Soltis PS, Soltis DS, Guralnick RP, Smith SA, Grady CJ, & Okuyama Y. 2023. Identifying climatic drivers of hybridization with a new ancestral niche reconstruction method. Systematic Biology, syad018

Others Resources

About

Summary of Biodiversity Informatics Resources I have contributed too.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published