Having trouble viewing this message? Links disabled? Click or copy:
This issue of DDI Directions provides a look at some of the events and developments taking place in the DDI community over the past few months. As always, thanks go to all of the hardworking people who support DDI and are helping to spread its adoption.
Volume VII, Number 3, July 2014
In May, the DDI Alliance chose officers to oversee the Scientific Board, the arm of the Alliance that has responsibility for shaping the DDI specifications. The new officers are Adam Brown, Statistics New Zealand, Chair, and Steve McEachern, Australian Data Archive, Vice Chair.
The DDI Alliance recently welcomed two new member institutions into the membership:
Three new DDI representatives have recently taken over this position for their organizations: Nicole Kirgis is now the representative for University of Michigan, Survey Research Operations, and Anne Sofie Fink Kjeldgaard is now representing the Danish Data Archive. Vigdis Kvalheim is the new representative for the Norwegian Social Science Data Service.
The DDI Alliance met in Toronto on June 2 with the morning devoted to administrative matters and the afternoon focused on substantive concerns. Chair Gillian Nicoll (Australian Bureau of Statistics) led the Meeting of Members and Scientific Board Vice Chair Steve McEachern (Australian Data Archive) chaired the first official meeting of the Scientific Board.
Scene from DDI Meeting of Members: From left, Catharina Wasner (GESIS),
The 2nd Annual North American DDI Users Conference took place in Vancouver on March 31-April 2, 2014. Organized by the University of British Columbia, Simon Fraser University, University of Alberta Libraries, and the Institute for Policy and Social Research at the University of Kansas, the meeting was held at Simon Fraser University, with the theme “Documenting Reproducible Research.”
The Keynote Speaker for the conference was Ann Green. Ann spoke about committing to data quality and the need for “intelligent openness” with respect to providing access to data. The conference program featured 20 sessions on a wide range of topics. Training workshops were held on Monday, March 31: Jane Fry, Carleton University, provided an introduction to DDI in the morning, showcasing the power of DDI metadata; in the afternoon Barry Radler, University of Wisconsin, and David Johnson, University of Kansas, led a session on the use of DDI to document health-related data.
The conference hosted around 40 participants from five Canadian provinces, eight U.S. states, and three countries.
NADDI presentations are now online.
DDI 4 Project Manager Thérèse Lalor of the Australian Bureau of Statistics has implemented a communication platform for the DDI community to encourage progress on DDI 4. There is a great deal of information on the site, including a slide deck for lay audiences called “Explain DDI 4 to Me,” information on all of the sprints, and plans for future work.
A “sprint” was held in Vancouver, BC, Canada, the week of March 24, just before the NADDI conference, with the goal of accelerating progress on DDI 4 content modeling. Led by Project Manager Thérèse Lalor, the sprint participants accomplished a lot. One important outcome of the meeting was that a new architecture for DDI 4 was specified. This innovative approach involves a Library of DDI elements and Functional Views on the Library. Functional Views are similar to DDI profiles in that they are subsets of the full Library. They focus on common use cases – for example, the set of DDI elements needed for data discovery, a simple codebook, a simple instrument, etc.
Participants wrote a paper that describes this approach and further develops the technical framework for generating XML and RDF representations of DDI 4. More about the sprint is available on the DDI collaboration site.
Olof Olsson of the Swedish National Data Service (SND) has set up a Web site for DDI 4 modelers to enter information about DDI elements (see lion.ddialliance.org). Based on the properties and relationships entered, UML model graphs can be generated. The Web site, implemented using the Drupal content management system, makes the development of DDI more transparent and open.
Section of a graph representing data discovery elements in DDI 4 on the lion.ddialliance.org site
A session on “No Tools, No Standard – Software from the DDI Community” was held during the IASSIST conference, which took place in Toronto, Ontario, June 3-6. This session, which led off with a presentation on the DDI developers community, provided a summary of some of the tools and functionality now available for DDI.
Two new tools (XSLT stylesheets) developed by Joachim Wackerow are now available for DDI Profiles. One creates a DDI Profile template on the basis of an existing DDI instance. The other renders a DDI Profile in HTML, including links to the field-level documentation of the used elements. Versions for DDI 3.1 and 3.2 are available.
Also, a set of DDI Profiles has been developed by the community working with the Generic Statistical Information Model (GSIM). These profiles are intended to harmonize the use of DDI in statistical organizations and to help ensure that GSIM implementations using DDI will interoperate. As part of the GSIM work, DDI Profiles were developed for Basic Technical Objects, Variable, Represented Variable, Questionnaire, and Codelist.
Three new RDF vocabularies – DDI-RDF Discovery (Disco), Physical Data Description (PHDD), and Extended Knowledge Organization System (XKOS) – have been created and are now out for public technical review. The vocabularies are scheduled to be finalized by the end of the year.
The Copenhagen Mapping is an effort to implement GSIM 1.1 Statistical Classifications using the new DDI 3.2. This work started at a Classifications workshop hosted by Statistics Denmark in December, and also included participants from Statistics Sweden, the Danish Data Archive, and Colectica. A first draft of the Copenhagen Mapping has been completed, and the authors are looking for feedback.
The DDI Alliance Controlled Vocabularies Working Group (DDI-CVG) is pleased to announce the publication of a new controlled vocabulary for describing the Mode of (data) Collection. For more information regarding the use of the controlled vocabularies published by the DDI Alliance, and the work of the DDI-CVG please see the Controlled Vocabularies section on the DDI Alliance Web site.
Organized by Joachim Wackerow and supported by GESIS-Leibniz Institute for the Social Sciences, a group met in Cologne during the week of November 11-15, 2013, to launch a new project with the aim of producing a collection of best practices for using DDI. These best practice descriptions will be modular with a homogeneous format, allowing reorganization in multiple ways. The primary structure for the collection will be organized in alignment with the DDI Lifecycle. A goal will be to involve the DDI community in producing a shared body of resources for all organizations and individuals using the DDI specification. Best practices will be reviewed by a team of editors and reviewers and published on a dedicated Web site.
The project will produce guidelines for institutions introducing DDI into their workflows and for other institutions already using DDI Codebook and shifting some of their workflows to DDI Lifecycle, particularly in the area of archival processing activities. Others contributing to the activity include Peter Granda (ICPSR), Larry Hoyle (University of Kansas, Institute for Policy and Social Research), Catharina Wasner (GESIS), and Wolfgang Zenk-Möltgen (GESIS).
Joachim Wackerow gave a presentation on the Handbook at IASSIST in June.
On June 9-10, a group of international experts met in Ann Arbor, MI, to discuss requirements for tools to document data transformations in DDI. With support from NSF and the World Bank, the meeting focused on defining a standard language for describing data transformations. While a system-independent language currently under development was considered, the group decided to create a new standard appropriate for variable-level transformations. To that end, a list of the fundamental data transformation commands in each of the four main statistical packages (SAS, SPSS, Stata, and R) was compiled and consolidated. The group intends to continue on as a DDI working group and the group's output will be integrated into the DDI modeling efforts. Tools will be developed in a future phase of the project, and they will support both DDI Codebook and DDI Lifecycle.
Colectica is partnering with the Institution for Social and Policy Studies (ISPS) at Yale University and Innovations for Poverty Action (IPA) to develop a repository for research data from randomized controlled trials in the social sciences. The repository will be an expansion – and major upgrade – of the existing ISPS Data Archive. The main objective of this project is the technical integration of the various curation processes with inventory management, metadata workflow, and Web access.
“DDI 3.2 Lifecycle allows precise description of curation and data archival operations over time,” said Jeremy Iverson, a partner at Colectica. “We are pleased to be working with ISPS and IPA on this exciting project.”
Colectica and the partners will develop a software platform that allows archives to leverage the DDI 3.2 standard for data documentation and to structure the curation workflow, which includes checking data for confidentiality and completeness, creating preservation formats, and reviewing and verifying code. The new integrated system will combine several open source and off-the-shelf components with a new, web-based ISPS-IPA Data Pipeline application, and will enable a seamless framework for collecting, processing, archiving, and publishing data.