rnpn: An R interface for the National Phenology Network

The team at rOpenSci and I have been working on a wrapper for the USA National Phenology Network API. The following is a demo of some of the current possibilities. We will have more functions down the road. Get the publicly available code, and contribute, at Github here. If you try this out look at the Description file for the required R packages to run rnpn. Let us know at Github (here) or at our website http://ropensci.org/, or in the comments below, or on twitter (@rOpenSci), what use cases you would like to see with the rnpn package. ...

August 31, 2011 · 2 min · Scott Chamberlain

Tenure track position in systematics at the University of Vermont

There is an awesome position opening up for an assistant professor in systematics at the University of Vermont. Below is the announcement, and see the original post at the Distributed Ecology blog. Why is this related to R? One can do a lot of systematics work in R, including retrieving scientific collections data through an upcoming package handshaking with VertNet (part of the rOpenSci project), managing large data sets, retrieval of GenBank data through the ape package (see fxn read.genbank), phylogenetic reconstruction and analysis, and more. So I am sure a systematist with R ninja skills will surely have a head up on the rest of the field. ...

August 22, 2011 · 3 min · Scott Chamberlain

Thursday at #ESA11

Interesting talks/posters: Richard Lankau presented research on trade-offs and competitive ability. He suggests that during range expansion selection for increased intraspecific competitive ability in older populations leads to loss of traits for interspecific competitive traits due to trade-offs between these traits. Ellner emphatically states that rapid evolution DOES matter for ecological responses, and longer-term evolutionary patterns as well. [His paper on the talk he was giving came out prior to his talk, which he pointed out, good form sir] ...

August 12, 2011 · 2 min · Scott Chamberlain

Wednesday at #ESA11

Interesting talks/posters: Ethan White’s poster describing EcologicalData.org was of course awesome given my interest in getting data into the hands of ecologists over at rOpenSci.org. Ethan also has software you can download on your machine to get the datasets you want easily - EcoData Retriever. [rOpenSci will try to take advantage of their work and allow you to call the retriever from R] Carl Boettiger’s talk was awesome. He explained how we need better tools to be able to predict collapses using early warning signals. He developed a way to estimate the statistical distribution of probabilities of system collapse. Jennifer Dunne: Explained how she put together an ancient network from Germany. Bravo. Carlos Melian explained his model of network buildup that starts from individuals, allows speciation, and other evolutionary processes. Rachel Winfree told us that in two sets of mutualistic plant-pollinator networks in New Jersey and California, that the least connected pollinator species were the most likely to be lost from the network with increasing agricultural intensity. Dan Cariveau suggests that pollination crop services can be stabilized even with increasing agriculture intensity if in fact pollinator species respond in different ways. That is, some pollinators may decrease in abundance with increasing ag intensity, while other species may increase - retaining overall pollination services to crops.

August 11, 2011 · 2 min · Scott Chamberlain

Monday at ESA11

Monday was a good day at ESA in Austin. There were a few topics I promised to report on in my blogging/tweeting. …focused on open source data. Carly Strasser’s presentation on guidelines for data management was awesome (including other talks in the symposium on Creating Effective Data Management Plans for Ecological Research). Although this was a good session, I can’t help but wish that they had hammered home the need for open science more. Oh well. Also, they talked a lot about how, and not a lot of why we should properly curate data. Still, a good session. One issue Carly and I talked about was tracking code in versioning systems such as Github. There doesn’t seem to be a culture of versioning code for analyses/simulations in ecology. But when we get there…it will be easier to share/track/collaborate on code. ...

August 8, 2011 · 2 min · Scott Chamberlain

(#ESA11) rOpenSci: a collaborative effort to develop R-based tools for facilitating Open Science

Our development team would like to announce the launch of rOpenSci. As the title states, this project aims to create R packages to make open science more available to researchers. http://ropensci.org/ What this means is that we seek to connect researchers using R with as much open data as possible, mainly through APIs. There are a number of R packages that already do this (e.g., infochimps, twitteR), but we are making more packages, e.g., for Mendeley, PLoS Journals, and taxonomic sources (ITIS, EOL, TNRS, Phylomatic, UBio). ...

August 8, 2011 · 1 min · Scott Chamberlain

Blogging/tweeting from ESA11

I will be blogging about the upcoming Ecological Society of America meeting in Austin, TX. I will focus on discussing talks/posters that: Have taken a cool approach to using data, or Have focused on open science/data, or Done something cool with R software, or Are just exciting in general I will also tweet throughout the meeting from @recology_ (yes the underscore is part of the name, recology was already taken). ...

July 31, 2011 · 1 min · Scott Chamberlain

Models in Evolutionary Ecology seminar, organized by Timothee Poisot

Here is one of the talks by Thomas Brouquet, and see the rest here. Thomas Broquet by mez_en_video

July 18, 2011 · 1 min · Scott Chamberlain

Archiving ecology/evolution data sets online

We now have many options for archiving data sets online: Dryad, KNB, Ecological Archives, Ecology Data Papers, Ecological Data, etc. However, these portals largely do not communicate with one another as far as I know, and there is no way to search over all data set sources, again, as far as I know. So, I wonder if it would ease finding of all these different data sets to get these different sites to get their data sets cloned on a site like Infochimps, or have links from Infochimps. Infochimps already has APIs (and there’s an R wrapper for the Infochimps API already set up here: http://cran.r-project.org/web/packages/infochimps/index.html by Drew Conway), and they have discussions set up there, etc. ...

July 15, 2011 · 2 min · Scott Chamberlain

CRdata vs. Cloudnumbers

Cloudnumbers and CRdata are two new cloud computing services. I tested the two services with a very simple script. The script simply creates a dataframe of 10000 numbers via rnorm, and assigns them to a factor of one of two levels (a or b). I then take the mean of the two factor levels with the aggregate function. In CRdata you need to put in some extra code to format the output in a browser window. For example, the last line below needs to have ‘<crdata_object>’ on both sides of the output object so it can be rendered in a browser. And etc. for other things that one would print to a console. Whereas you don’t need this extra code for using Cloudnumbers. ...

July 14, 2011 · 3 min · Scott Chamberlain