Getting data from figures in published papers

The problem: Published papers in the scholarly literature contain a lot of figures, like the one below from (Attwood et al. 2012). At some point, a scientist wants to answer a question by synthesizing what is already known about it, which means collecting data from the published literature. This often requires something like the following workflow: Search for relevant papers (e.g., via Google Scholar). Collect the papers. Decide which are appropriate for inclusion....

September 18, 2012 · 5 min · Scott Chamberlain

Scholarly metadata from R

Metadata! Metadata is very cool. It’s super hot right now - everybody is talking about it. Okay, maybe not everyone, but it’s an important part of archiving scholarly work. We are working on a GitHub repo, rmetadata, meant to be a one-stop shop for querying metadata from around the web. Several repos we have already started on GitHub - rpmc, rdatacite, rdryad, rpensoft, rhindawi - will at least in part be folded into rmetadata....

September 17, 2012 · 6 min · Scott Chamberlain