Bridging Data Lifecycles: Tracking Data Use via Data Citations

Bridging Data Lifecycles: Tracking Data Use via Data Citations Data Workshop

University Corporation for Atmospheric Research (UCAR)
Center Green Campus
Boulder, CO
April 5-6, 2012

The UCAR workshop on data citation and data use brought together over 75 people from data centers, research libraries, federal agencies, and research organizations. Presentations focused on data lifecycles, current options for data citation and data reuse, and highlighted pilot projects in implementing citation standards and practices. Mark Parsons of the National Snow and Ice Data Center (NSIDC), Joan Starr of the California Digital Library, and Bill Cook, the Director of Publications with the American Geophysical Union (AGU), provided the opening keynote presentations. Dr. Tim Killeen, Assistant Director for Geosciences at the National Science Foundation (NSF) led off the second day with a keynote address that discussed multiple facets of the NSF's approach to promoting data management and citation.

Workshop speakers presented multiple perspectives on the development of data citation policies and procedures, as well as technologies for assigning DOIs and promoting data citations. Topics ranged across multiple stages of the "lifecycle of data": data conceptualization, collection, evaluation, use, management, and preservation. The widespread interest amongst data specialists and librarians in these issues led to many good discussions about effective strategies to support curation of, and access to, diverse kinds of data. Data citations were discussed as being critical to enabling scientific results to be traced back to their underlying data, and to enable scientists, data managers, and data centers to be credited for producing useful data.

Thanks to all of the presenters and attendees. The presentations for each speaker are posted below.

Final Workshop Report

Workshop Agenda

Thursday, April 5th

8:30am - 9:00am – Registration check in; Coffee, tea, breakfast rolls


9:00am - 9:10amIntroduction

Mary Marlino, Director, NCAR Library


9:10am - 9:20amWelcome

Dr. Maura Hagan, NCAR Deputy Director


9:20am - 10:20amData publication and citation – What are the big issues around data publication and citation? Why is this difficult? Are we thinking about this in the right way? What work is being done currently in this area?

Mark Parsons (NSIDC - National Snow and Ice Data Center)


10:20am - 10:30am – Break


10:30am - 11:30amData citations and identifiers – What is the role of unique web identifiers? How do different identifiers compare? How should different identifiers be used across data life cycles?

Joan Starr (California Digital Library, EZID, DataCite)


11:30am - 12:00pm - Publisher perspective on data publication and citation

Bill Cook (American Geophysical Union)


12:00pm - 1:30pm – Buffet lunch provided


1:30pm - 3:15pmData citation and identifier implementation issues – How can/should data citations and unique identifier schemes be implemented? How should a data citation initiative be organized within particular institutional settings? What challenges must be addressed in those settings? How are data citations created, managed, and linked within, and across, data life cycles?

Matt Mayernik (NCAR)
Mike Daniels (NCAR)
Gary Strand (NCAR)
Nicole Kaplan (CSU/LTER)
Mike Wright (NCAR) - Moderator


3:15pm - 3:30pm – Break


3:30pm - 5:00pm - Data Curation Service Models – How do data citations fit within larger data curation services? How do data citations and identifiers fit within other metadata structures that are used for data management and curation?

Barbara Losoff (CU Library)
Ruth Duerr (NSIDC)
Lynn Yarmey (NSIDC)
Ted Habermann (NOAA)
Matt Mayernik (NCAR) - Moderator


5:00pm - 6:30pm – Reception: Light appetizers


Friday, April 6

8:30am - 9:00am – Coffee, tea, breakfast rolls


9:00am - 9:15amIntroduction

Mary Marlino, Director, NCAR Library


9:15am - 10:15am Exploring the Life Cycle, Citation and Integration of Geo-Data

Dr. Tim Killeen (Assistant Director NSF/GEO)


10:15am - 10:30am – Break


10:30am - 11:45amTracking data use: Current practices – How do data archives/repositories currently track data use? How would data citations (with or without unique web identifiers) help to make tracking data use easier? Are there challenges related to tracking data use that data citations do not help to address?

Steve Worley (NCAR)
Dan Kowal (NOAA)
Leonard Sitongia (NCAR)
Matt Mayernik (NCAR) - Moderator

11:45am - 12:15pmClosing "How to get started without solving every detailed problem"

Matt Mayernik (NCAR)


Funding for this workshop is provided by the National Oceanic and Atmospheric Administration (NOAA) through the UCAR Joint Office for Science Support (JOSS) program.

For Program questions contact Karon Kelly at
For Logistical questions contact Pat Steinkamp at

Meeting location is at UCAR Center Green campus in the CG 1 building - see map/directions.

3080 Center Green Drive
Boulder, CO 80301
Reception phone 303-497-2525