|Wednesday, April 4th|
Limor Peer, Yale University
1:00 PM - 2:00 PM
Objective: To describe the process and challenges of creating a replication data archive at the Institution for Social and Policy Studies (ISPS) at Yale University. The Archive provides open access to research data, links data to publications, and ultimately facilitates reproducibility.
Description: The ISPS Data Archive is a digital repository for research produced by scholars affiliated with ISPS, with special focus on experimental design and methods. The primary goal of the Archive is to be used for replicating research results, i.e. by using author-provided code and data. The Archive was launched in September 2010 as a pilot for Yale’s Office of Digital Assets and Infrastructure (ODAI) to find solutions relating to storage, persistent linking, long-term preservation, and integration with a developing institutional repository.
Results: Before data publication, Archive staff processes data and code files, including verifying replication, adding metadata, and converting to CSV and R. To date, the ISPS Data Archive has published over 750 files for about 45 studies.
Conclusions: The development and implementation of the ISPS Data Archive, though outside the library, raises issues familiar to librarians: the need for clear policies from the institution; the challenge of finding support for the provision of high quality services; the complexity of working in close partnership with IT; the need to keep up with fast-paced changes in technology and in user expectations; and the challenge of bringing about change in community norms and practices. Alongside these practical issues, fundamental questions arise about the appropriate role of the university vs. the disciplines when it comes to data archiving, especially in light of the need to comply with requirements from funders and journals.
John F. Furfey, MBLWHOI Library
1:00 PM - 2:00 PM
Purpose: This poster focuses on the methods, tools and outcomes involved in creating two targeted research networks to support large, long-running research programs in the Woods Hole scientific community.
Participants: These efforts are managed by librarians from the Marine Biological Laboratory/Woods Hole Oceanographic Institution (MBLWHOI) Library in collaboration with administrators and researchers from two programs: The Whitman Center for Research and Discovery at the Marine Biological Laboratory and the National Ocean Sciences Accelerator Mass Spectrometry Facility (NOSAMS) at the Woods Hole Oceanographic Institution.
Description: In 2008, the MBLWHOI Library launched Connected Village (http://bibapp.mbl.edu), a research networking and discovery service for the Woods Hole Science Community1. The community has begun to recognize the Library as experts in promoting up-to-date information about researchers and research activities in Woods Hole. This year, NOSAMS and The Whitman Center have upcoming 10-year reviews from their respective funding agencies and governing boards. The ability to provide analytics regarding publication output, and demonstrate networking and outreach efforts is critical to the success of each review. This poster describes the planning, technical implementation, data gathering, deliverables, ongoing support and future directions of the Whitman (http://bibapp.mbl.edu/groups/51-MBL_Whitman_Center) and NOSAMS (http://nosams.mblwhoilibrary.org) research network tools.
Results/Outcome: Whitman Center – 611 researcher profiles created, 2064 published works harvested. NOSAMS – 1030 research profiles created, 1899 published works harvested. Administrators now have the ability to quantify and visualize the research output and impact of their programs. The researchers, who are from institutions all over the globe, now have the ability to discover potential collaborators in their field and get a much better sense of the collective scientific trends and contributions of their affiliated program. The Library is working with each program to develop workflows to systematically harvest new publications and maintain contact with their researchers on a continuous basis. Ongoing development includes refining our process for automated deposits of full-text and supporting data for publications into our Institutional Repository, the Woods Hole Open Access Server.
1 Connected Village runs on the open source software Bibapp (http://www.bibapp.org), developed by the University of Wisconsin and the University of Illinois.
Matt Sheridan, University of Massachusetts - Amherst
1:00 PM - 2:00 PM
Objective: To describe the challenges and outcomes of the University of Massachusetts Amherst Libraries' Data Working Group's series of training workshops for graduate students on the subject of data management and preservation, with specific regard to the data management requirements of the National Science Foundation and National Institutes of Health.
Participants: The Libraries' Data Working Group is composed of six members with expertise in project management, systems and web development, scholarly communication, digital archives and metadata, and science and social science librarianship. The Data Working Group is one of three subgroups of the Digital Strategies Group at the University Libraries.
Description: The University of Massachusetts Amherst Libraries provides a number of services to faculty and graduate students in support of research at an institution classified as a Research University with Very High research activity (RU/VH) by the Carnegie Foundation. Recognizing a high demand for greater data education, the Libraries' Data Working Group has conducted workshops for graduate students in specific disciplines -- humanities, social sciences, and sciences -- designed to address their data needs and highlight smart data management practices. Graduate students were also guided through the data management requirements of national funding agencies and potential solutions.
Results: In its current capacity the Data Working Group provides educational workshops and individual consulting sessions for faculty and graduate students. The Data Working Group observed a significant portion of graduate students who had no prior experience with smart data practices or useful data management resources. This process has identified a clear need for wider, more intensive education for graduate students on data practices and the data management requirements of national funding agencies.
Sarah J. Wright, Cornell University
1:00 PM - 2:00 PM
OBJECTIVE: All NSF grant proposals must now include a data management plan, and many other funding agencies either already have, or are expected to follow suit. Cornell’s Research Data Management Service Group (RDMSG) can help researchers prepare their data management plans, as well as being available for more general data management help. Our objective is to deliver research data management planning information to researchers.
METHODS: We delivered one hour information sessions at several times during the semester, and in several different locations on campus. In addition, individual consultations by email or in person were initiated by researchers.
RESULTS: We held nine open sessions in 2011 (3 in January, 2 in May, 2 in October, and 2 in December), and one session specifically for engineering faculty submitting CAREER proposals to NSF. Total attendance for all sessions was just over 300. We solicited feedback after the December sessions, which was uniformly positive. When asked whether the session made them better prepared to address the NSF's data management plan requirement, 21 of 23 respondents answered “yes” and 2 answered “unsure.”
CONCLUSIONS: Based on information session attendance and use of consultants, we consider this an important service for researchers. We will continue to offer NSF-specific information sessions as long as attendance warrants this; monitor emerging data management policies of funders such as NIH, and plan additional outreach accordingly; and determine need for additional data management training opportunities for faculty and graduate students, and develop training as need arises.
Lisa Raymond, Woods Hole Oceanographic Institution
1:00 PM - 2:00 PM
Purpose: This poster demonstrates the procedures and tools developed to deposit datasets in an Institutional Repository (IR) and assign Digital Object Identifiers (DOIs).
Setting/Participants/Resources: This research is being conducted by a team of librarians, data managers and scientists that are collaborating with representatives from the Scientific Committee on Oceanic Research (SCOR) and the International Oceanographic Data and Information Exchange (IODE) of the Intergovernmental Oceanographic Commission (IOC). The goal is to identify best practices for tracking data provenance and clearly attributing credit to data collectors/providers.
Description: Current literature on the topic of data publication suggests that success is best achieved when there is a partnership between scientists, data managers, and librarians. The Marine Biological Laboratory/Woods Hole Oceanographic Institution (MBLWHOI) Library and the Biological and Chemical Oceanography Data Management Office (BCO-DMO) have developed tools and processes to automate the ingestion of metadata from BCO-DMO for deposit with datasets into the Institutional Repository (IR) Woods Hole Open Access Server (WHOAS). The system also incorporates functionality for BCO-DMO to request a Digital Object Identifier (DOI) from the Library. This partnership allows the Library to work with a trusted data repository to ensure high quality data while the data repository utilizes library services and is assured of a permanent archive of the copy of the data extracted from the repository database.
While the WHOI Data Library and Archives has long been recognized as a valuable local resource, this project is an example of an important new role for the facility in providing the essential service of scientific data publication. The diversity and volume of modern research data sets precludes their publication in the peer reviewed journals that are the currency of scientific research. The process of developing the initial publication system required active engagement of the research community and the working system provides a valuable resource that supports networked science by improving access to data, enabling accurate reuse of data and facilitating proper citation of data resources.
Results/Outcome: The assignment of persistent identifiers enables accurate data citation. The Library can assign a DOI to appropriate datasets deposited in WHOAS. A primary activity is working with authors to deposit datasets associated with published articles. The DOI would ideally be assigned before submission and be included in the published paper so readers can link directly to the dataset, but DOIs are also being assigned to datasets related to articles after publication. WHOAS metadata records link the article to the datasets and the datasets to the article.
The assignment of DOIs has enabled another important collaboration with Elsevier, publisher of educational and professional science journals. Elsevier can now link from articles in the Science Direct database to the datasets available from WHOAS that are related to that article. The data associated with the article are freely available from WHOAS and accompanied by a Dublin Core metadata record.
In addition, the Library has worked with researchers to deposit datasets in WHOAS that are not appropriate for national, international, or domain specific data repositories. These datasets currently include audio, text and image files.
Donna Kafel, University of Massachusetts Medical School
1:00 PM - 2:00 PM
Objective: This poster illustrates the Frameworks for a Data Management Curriculum that was developed by the Lamar Soutter Library of UMASS Medical School and the George C. Gordon Library at Worcester Polytechnic Institute through an IMLS National Leadership Planning Grant. These frameworks are intended undergraduate and graduate students studying science, health sciences, and engineering disciplines.
Methods: An Education Committee composed of librarians, faculty, a curriculum consultant, an evaluation consultant, and an instructional design consultant collaborated in the development of these frameworks. At both schools, consultants collected data from students regarding their current data management practices and interviewed faculty about their students’ data management skills and learning needs. A literature review of current data management courses was conducted. From these resources learning objectives were identified, a simplified data management plan was developed, and a lesson plans for seven course modules were created. The evaluation consultant and a WPI librarian interviewed faculty to develop real-life research case scenarios that illustrate data management practices in the lab and clinical settings.
Results: The curriculum frameworks are mapped to the data management plan requirements of the National Science Foundation and include lesson plans for seven instructional modules, a simplified data management plan, course readings, research cases in medicine, biomedical lab research, clinical behavioral health, and aerospace engineering. The Education Committee fully developed course content including readings, activities, research case excerpts, and assessment questions and answers for the fifth module, “Legal and Ethical Considerations for Research Data” as proof of concept.
Conclusion: Faculty and librarians have responded that the curriculum frameworks, when fully developed, will be a useful tool for providing data management instruction to students at the undergraduate and graduate levels who are enrolled in diverse science, health science and engineering courses. The modular format of the curriculum and variety of research cases is flexible ; allowing faculty to select modules that are relevant to their course programs. When completed, the curriculum can be delivered in multiple ways: face-to-face, as online interactive modules, or hybrid. Implementation funding for full development of the course modules, additional research cases, and piloting the curriculum modules is proposed.
Jessica Adamick, University of Massachusetts - Amherst
1:00 PM - 2:00 PM
Objective: This poster describes the methods by which InterNano, the information service and virtual home of the National Nanomanufacturing Network1 (NNN), supports the information needs of the nanomanufacturing cohort: multi‐institutional and organizational stakeholders across diverse domains.
Methods: InterNano2 strives to support the nanomanufacturing research and development community through collection, organization, and dissemination of information on the nanomanufacturing domain. Services include a monthly newsletter and weekly mailer distributed to 4,500 subscribers, a directory of 600 organizations and experts, a nanomanufacturing process database, expert reviews of topical peerreviewed articles, “highlight” articles that contextualize nanomanufacturing trends and news, an eprints research repository, and an events calendar, among other unique features and content. All content is tagged with terms from an original interactive nanomanufacturing taxonomy.
Results: Since the internano.org launch in 2008, the NNN core team has seen an increase of user buy‐in and participation; users create and self‐manage directory entries, submit press releases for dissemination, and self‐subscribe to the newsletter. InterNano site traffic continues to increase as the NNN develops more content.
Conclusions: The NNN team is currently refining and developing its services to better serve the diverse nanomanufacturing community, with plans underway to build a dashboard interface to better direct users to relevant information and to develop content across the board with a team of new contributing editors.
1 Funded by the National Science Foundation, CMMI‐1025020
Jen Ferguson, Northeastern University
1:00 PM - 2:00 PM
Experimental data files found on molecular biology laboratory instruments were examined in order to: (1) gain insight into current data management practices, and (2) evaluate possible curation and preservation challenges with this type of data.
A faculty member granted me access to several instruments in her molecular biology teaching lab. Files and metadata on the hard drives were captured using Xplorer2 file management software. Data file formats were sorted and analyzed with Xplorer2 and Microsoft Excel, and formats were categorized as proprietary or open. Informal discussions with the faculty member and research staff during the course of the work also informed the findings.
Files in both proprietary and open formats were found on the instruments' hard drives. 62% of the experimental data files were in proprietary formats. Image files in various formats accounted for the most prevalent types of data found. Faculty and research staff mentioned several challenges in managing this data, including inconsistent practices in data storage locations and file naming conventions. They noted that students found working with and sharing experimental data frustrating at times, largely due to proprietary format issues.
This study found lack of a consistent approach to data management on laboratory instruments. Prevalence of proprietary file formats is a concern with this type of data. Students express frustration in working with this data now, and files in these proprietary formats may well pose curation and preservation challenges in the future.
Gail Steinhart, Cornell University
1:00 PM - 2:00 PM
OBJECTIVE: Cornell University Library (CUL) and the School of Information Studies at Syracuse University (SU) are nearing the completion of a partnership to support SU’s eScience Fellows Program by offering a mentorship program for enrolled students. The program introduces students to the practice of science and eScience librarianship, and provides students with opportunities to be exposed to world-class science libraries as well as innovative eScience projects.
METHODS: Each student is matched with a librarian-mentor at Cornell for one-on-one mentoring. Other elements of the program include: In-person events at Syracuse and at Cornell; Opportunities for students to participate in the life of CUL at large via in-person and remote participation in CUL events and programming, and regular communication to students on issues of interest at CUL; Virtual interaction and collaboration, including blogging by program participants on their work and experiences; Possible project and internship opportunities.
RESULTS: After the first year, students responded quite favorably to the mentorship program. Distance between the two institutions is a challenge, and students requested more opportunities for face-to-face and virtual interaction. The program has been most successful in familiarizing the students with the work of eScience librarianship and the library profession as a whole.
CONCLUSIONS: Mentoring students for careers in eScience, careers for which we ourselves may not have been formally trained, presents both challenges and opportunities. Students gain valuable perspective from practicing librarians, while librarians are exposed to emerging the technologies and practices associated with the new discipline.
Raquel Abad, University of Massachusetts Medical School
1:00 PM - 2:00 PM
Objective: To describe the planning process and activities of the University of Massachusetts Medical School's Lamar Soutter Library around the publication of the new Journal of eScience Librarianship (JESLIB).
Methods: The University of Massachusetts Medical School’s Lamar Soutter Library through funding from the National Network of Libraries of Medicine has been a leader in educating librarians about eScience and its impact on librarianship. In spring 2011 the Library began to explore the idea of publishing a peer-reviewed, open access electronic journal about eScience and data management for librarians. Planning and implementation considerations included: choosing a unique and appropriate name; infrastructure and hosting options; organizational and governance structure; roles and responsibilities; journal structure and content; aims and scope; editorial, peer review and other policies and procedures; and dissemination.
Results: The inaugural issue of the Journal of eScience Librarianship (http://escholarship.umassmed.edu/jeslib/) was published on February 15, 2012 via the journal management platform of the Library's institutional repository, eScholarship@UMMS. JESLIB has been assigned ISSN 2161-3974. The medical school joined CrossRef so that article metadata could be deposited into their system and each article assigned a DOI (Digital Object Identifier).
Conclusion: Libraries can successfully publish as well as host online journals. Helpful planning guides and other resources are available to assist libraries and academic groups in publishing open access peer-reviewed journals. Lessons learned include: consider professional copy editing services to assist the Editorial Board; Editorial Team roles and responsibilities should be clearly defined but allow room for flexibility; and have a clear marketing communication and promotion strategy.
Sally A. Gore, University of Massachusetts Medical School
1:00 PM - 2:00 PM
Objective: Provide current librarians with a variety of options to become adept at the knowledge and skills needed to successfully work in the area of e-Science.
Methods: In 2009, the Lamar Soutter Library, University of Massachusetts Medical School, hosted its first e-Science Symposium for librarians in New England. The event marked the beginning of an ongoing initiative to prepare librarians to take active roles in an emerging trend in research that will impact the futures of academic, research and medical libraries. Based upon feedback from attendees at the Symposium, other events were developed to address the needs expressed. These include single-day professional development workshops that provide a combined lecture and hands-on work in relevant topics; 3-day science boot camps that bring researchers and librarians together to learn about both the basics of scientific disciplines, as well as how the science is applied in today's world; a web-based portal that provides a collection of resources for librarians to improve their knowledge of e-Science issues such as data management or developing metadata, plus primers on scientific disciplines; and a recently launched eJournal, The Journal of eScience Librarianship, that aims to bring together the theory and practice of librarianship in the area of e-Science.
Results: The Lamar Soutter Library's e-Science Initiative is currently in its fourth year of programming. Hundreds of librarians from New England, as well as other parts of the country, have taken advantage of the different events over the years. The symposium and professional development days have been replicated by attendees in their respective institutions and/or regions.
Conclusions: The need for librarians to gain new skills to remain relevant in their role(s) is ongoing, yet difficult to achieve once ensconced in a position and/or career. By providing different ways to grow professionally, the e-Science Initiative of the Lamar Soutter Library is leading a national trend in continuing education and improved awareness of e-Science Librarianship.