Article Type

EScience in Action

Publication Date

February 2012


Deposition of biomedical data sets is on the rise as more scientists submit experimental data to accompany their publications. Scientists are also increasingly reusing these publicly available data sets in their own work. Despite these developments, lack of both context and metadata can create barriers to understanding and repurposing these data sets. Researchers from the Bioinformatics Core Group in the Harvard School of Public Health attempted to address this issue by assembling a team of data curators who used the open source software suite ISA tools to annotate and contextualize microarray data sets.

This paper describes the workflow and software used in curating these data sets, discusses similarities and differences in the approaches of team members to the work, and suggests possible roles for librarians in similar data curation projects.


curation, microarray, escience, ISA, gene expression

Rights and Permissions

Copyright © 2012 The Author(s).

Creative Commons License

Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 License.

jasfjeslibFigure1.tif (2496 kB)

jasfjeslibFigure2.tif (1866 kB)



To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.