Data citation and attribution in Linguistics: Toward a common standard
A short introduction describing the activities and the scope of the group:
The Linguistics Data Interest Group (LDIG) plans to identify, prioritize, and get to work on data challenges across the Linguistics domain. As a first step, this new group will focus its P10 time on developing discipline-wide adoption of common standards for data citation and attribution, and on improving research data management training in the discipline. In our parlance, citation refers to the practice of identifying the source of linguistic data, and attribution refers to mechanisms for assessing the intellectual and academic value of creating, managing, storing, sharing, and citing primary data.
The LDIG is for data at all linguistic levels (from individual sounds or words to video recordings of conversations to experimental data) and data for all of the world’s languages, and acknowledges that many of the world’s languages have high cultural value and are underrepresented with regard to the amount of information that is available about them.
This interest group is aligned with the RDA mission to improve open sharing of data through forming transparent discipline-specific data citation and attribution conventions to be adopted by the international research community. Linguistics is a discipline that straddles social/behavioral sciences and the humanities, and thus we have a great deal to contribute to the general RDA discussion on a multiplicity of data types.
Additional links to informative material related to the group, e.g. group page, case statement, working documents, etc.:
- Linguistics Data Interest Group page: https://www.rd-alliance.org/groups/linguistics-data-ig
- Linguistics Data Interest Group Charter Statement: https://www.rd-alliance.org/group/linguistics-data-interest-group/case-s...
- The Austin Principles of Data Citation in Linguistics (draft): https://sites.google.com/a/hawaii.edu/data-citation/austin-principles-of...
- The RDA Data Citation Working Group final report: https://www.rd-alliance.org/groups/data-citation-wg.html
- The FORCE11 Joint Declaration of Data Citation Principles: https://www.force11.org/group/joint-declaration-data-citation-principles...
- Generate and synthesise feedback on the draft of the Austin Principles of Data Citation in Linguistics (circulated prior to meeting)
- Identify possible synergies, gaps and clashes between RDA DCWG and Austin draft principles, and determine how to resolve those
- Build list of points that need refinement
- Establish a WG or a plan with the LDIG to revise before P11
The aim of this meeting is to revise the draft version of the Austin Principles of Data Citation in Linguistics, which is one result of a two-year National Science Foundation project aiming to start discussion in linguistics about the value of research data (https://sites.google.com/a/hawaii.edu/data-citation/welcome). The purpose of the Austin Principles document is to carefully and comprehensively articulate the values of linguists with respect to the citation of digital data, and to serve as a guidepost in the future development of specific formats for citing digital linguistics data. The field currently has no such formats, and the development of a set of citation formats is the likely topic of a future RDA WG. This RDA meeting session aims to synthesise feedback on the draft from linguists in the RDA LDIG and more broadly, as well as feedback from participants at P10 (including non-linguists).
The Austin Principles document is based on the FORCE11 Joint Declaration of Data Citation Principles (https://www.force11.org/group/joint-declaration-data-citation-principles...), annotated for linguistic data. The draft document will be made available to the public with a commenting facility by July 2017, and these comments will be brought to the RDA meeting in September for integration. Feedback will be presented by emergent theme, and the group will then translate these themes into specific goals for improving the current draft. Drafting suggestions can be submitted before the plenary, and will be worked on during the session in a shared Google document (or equivalent). Should there be too much to work on directly in the meeting, the meeting will instead be used to structure the further development of the draft.
The next steps will be decided based on the extent of changes needed: either a dedicated working group will be created, with its aims decided at the meeting, or a timeline for work to continue within the current LDIG structure will be proposed.
The session will be chaired by Lauren Gawne (La Trobe University), Andrea L. Berez-Kroeker (University of Hawai‘i at Mānoa) and Helene N. Andreassen (UiT The Arctic University of Norway), the LDIG co-chairs.
Outline of meeting plan
0-5 minutes: An introduction to the LDIG, goals, and directions
5-10 minutes: A high-level introduction to Linguistics data issues and starting points
10-20 minutes: Overview of the principles and the motivation behind them with summary of current feedback
20-40 minutes: Feedback from floor, comments from DCWG and others
40-60 minutes: Real-time text updates and drafting
60-90 minutes: How to proceed next, forming a WG if necessary
We will try to offer remote participation, and those who cannot attend in person will be asked to submit feedback prior to the session. The meeting will be audio recorded and all written presentations and notes made available through the LDIG RDA page. If a working group is formed, the details of the structure and timeline will be decided in an LDIG meeting after the session.
1) Linguists, both those who collect primary data and those who use their own or other researchers’ data in their analysis
2) Publishers, of both research publications and datasets, commercial and institutional
3) Data archivists, both linguistics-specific and more general
4) Researchers from other fields who use linguistic data, including, but not limited to, anthropologists, psychologists, sociologists and historians
5) Members of the DCWG
6) Other RDA members who have implemented the DCWG recommendations for other fields
7) RDA members from other domains interested in advancing their understanding of linguistics use cases, domain interests, etc.
Participants should prepare for the meeting by reading the draft version of the Austin Data Citation Principles in Linguistics, made available approximately two months prior to the RDA meeting.
Group chair serving as contact person: Lauren Gawne
Type of meeting: Working meeting
Session Room: Mansfield 9
Session Time: Tuesday 19 September, Breakout 3, 16:00 - 17:30
Collaborative session notes:
Session slides and materials:
Remote Access Instructions (GoToMeeting)
Access Code: 684-913-261
Australia: +61 2 9087 3604
Austria: +43 1 2530 22520
Belgium: +32 28 93 7018
Canada: +1 (647) 497-9353
Denmark: +45 32 72 03 82
Finland: +358 923 17 0568
France: +33 170 950 594
Germany: +49 692 5736 7317
Ireland: +353 15 360 728
Italy: +39 0 230 57 81 42
Netherlands: +31 207 941 377
New Zealand: +64 9 280 6302
Norway: +47 21 93 37 51
Spain: +34 932 75 2004
Sweden: +46 853 527 827
Switzerland: +41 225 4599 78
United Kingdom: +44 20 3713 5028
United States: +1 (224) 501-3216