oai - an OAI-PMH client

oai is a general purpose client to work with any ‘OAI-PMH’ service. The ‘OAI-PMH’ protocol is described at https://www.openarchives.org/OAI/openarchivesprotocol.html. The main functions follow the OAI-PMH verbs: GetRecord Identify ListIdentifiers ListMetadataFormats ListRecords ListSets The repo is at https://github.com/sckott/oai I will be using this in a number of packages I maintain that use OAI-PMH data services. If you try it, let me know what you think. This package is heading to rOpenSci soon: https://github....

September 11, 2015 · 3 min · Scott Chamberlain

Weecology can has new mammal dataset

So the Weecology folks have published a large dataset on mammal communities in a data paper in Ecology. I know nothing about mammal communities, but that doesn’t mean one can’t play with the data… Their dataset consists of five csv files: communities, references, sites, species, and trapping data Where are these sites, and by the way, do they vary much in altitude? Let’s zoom in on just the states What phylogenies can we get for the species in this dataset?...

December 29, 2011 · 1 min · Scott Chamberlain

My little presentation on getting web data through R

With examples from rOpenSci R packages. p.s. I am no expert at this... Web data from R View more presentations from schamber

October 28, 2011 · 1 min · Scott Chamberlain

rnpn: An R interface for the National Phenology Network

The team at rOpenSci and I have been working on a wrapper for the USA National Phenology Network API. The following is a demo of some of the current possibilities. We will have more functions down the road. Get the publicly available code, and contribute, at Github here. If you try this out look at the Description file for the required R packages to run rnpn. Let us know at Github (here) or at our website http://ropensci....

August 31, 2011 · 2 min · Scott Chamberlain

(#ESA11) rOpenSci: a collaborative effort to develop R-based tools for facilitating Open Science

Our development team would like to announce the launch of rOpenSci. As the title states, this project aims to create R packages to make open science more available to researchers. http://ropensci.org/ What this means is that we seek to connect researchers using R with as much open data as possible, mainly through APIs. There are a number of R packages that already do this (e.g., infochimps, twitteR), but we are making more packages, e....

August 8, 2011 · 1 min · Scott Chamberlain

Archiving ecology/evolution data sets online

We now have many options for archiving data sets online: Dryad, KNB, Ecological Archives, Ecology Data Papers, Ecological Data, etc. However, these portals largely do not communicate with one another as far as I know, and there is no way to search over all data set sources, again, as far as I know. So, I wonder if it would ease finding of all these different data sets to get these different sites to get their data sets cloned on a site like Infochimps, or have links from Infochimps....

July 15, 2011 · 2 min · Scott Chamberlain

rbold: An R Interface for Bold Systems barcode repository

Have you ever wanted to search and fetch barcode data from Bold Systems? I am developing functions to interface with Bold from R. I just started, but hopefully folks will find it useful. The code is at Github here. The two functions are still very buggy, so please bring up issues below, or in the Issues area on Github. For example, some searches work and other similar searches don’t. Apologies in advance for the bugs....

June 28, 2011 · 1 min · Scott Chamberlain

iEvoBio 2011 Synopsis

We just wrapped up the 2011 iEvoBio meeting. It was awesome! If you didn’t go this year or last year, definitely think about going next year. Here is a list of the cool projects that were discussed at the meeting (apologies if I left some out): Vistrails: workflow tool, awesome project by Claudio Silva Commplish: purpose is to use via API’s, not with the web UI Phylopic: a database of life-form silouhettes, including an API for remote access, sweet!...

June 22, 2011 · 2 min · Scott Chamberlain

OpenStates from R via API: watch your elected representatives

I am writing some functions to acquire data from the OpenStates project, via their API. They have a great support community at Google Groups as well. On its face this post is not obviously about ecology or evolution, but well, our elected representatives do, so to speak, hold our environment in a noose, ready to let the Earth hang any day. Code I am developing is over at Github. Here is an example of its use in R, in this case using the Bill Search option (billsearch....

June 10, 2011 · 2 min · Scott Chamberlain

Farmer's markets data

I combined USDA data on farmer’s markets in the US with population census data to get an idea of the disparity in farmers markets by state, and then also expressed per capita. Download USDA data here. The formatted file I used below is here (in excel format, although I read into R as csv file). The census data is read from url as below. California has a ton of absolute number of farmer’s markets, but Vermont takes the cake by far with number of markets per capita....

February 16, 2011 · 2 min · Scott Chamberlain