{"id":299,"date":"2012-01-03T18:04:18","date_gmt":"2012-01-03T18:04:18","guid":{"rendered":"http:\/\/opencitations.wordpress.com\/?p=299"},"modified":"2022-08-04T13:10:42","modified_gmt":"2022-08-04T11:10:42","slug":"comments-on-is-data-publication-the-right-metaphor","status":"publish","type":"post","link":"https:\/\/opencitations.hypotheses.org\/299","title":{"rendered":"Comments on IS DATA PUBLICATION THE RIGHT METAPHOR?"},"content":{"rendered":"<p><em>Is Data Publication the Right Metaphor?<\/em> is an essay by Mark Parsons and Peter Fox to be published in the <a href=\"http:\/\/www.codata.org\/dsj\/\">Data Science Journal<\/a>, for which a preprint has been provided for open pre-publication community peer review at <a href=\"http:\/\/mp-datamatters.blogspot.com\/2011\/12\/seeking-open-review-of-provocative-data.html\">http:\/\/mp-datamatters.blogspot.com\/2011\/12\/seeking-open-review-of-provocative-data.html<\/a>.<\/p>\n<p>To supplement the discussion this excellent essay has already prompted, I would like to add some belated comments relating to DOIs for dataset versions, peer review of datasets, metadata as first class scientific objects, and linking the data publication and linked data metaphors.<\/p>\n<p>First, DOIs <em>can<\/em> easily be used to provide unique identifiers to datasets that undergo updating and versioning, as exemplified in the <a href=\"http:\/\/datadryad.org\/\">Dryad Data Repository<\/a>, a repository for the publication of small heterogeneous biological datasets linked to peer-reviewed journal articles.\u00a0 Dryad DOIs permit the version numbers of each data package <em>and<\/em> of each of the data package\u2019s constituent data files to be specified explicitly (see <a href=\"http:\/\/wiki.datadryad.org\/wiki\/DOI_Usage\">http:\/\/wiki.datadryad.org\/wiki\/DOI_Usage<\/a>).<\/p>\n<p>Second, relating to the problems of applying traditional models of peer review to datasets, it is important to realize a fundamental difference between a journal article and a dataset, namely that the journal article is an exercise in rhetorial persuasion, with the authors presenting selected data to convince the reader of the correctness of particular hypotheses \u2013 see, for example, papers on this topic by Anita de Waard <em>et al<\/em>. [1-3] \u2013 while datasets lack this rhetorial structure, and thus cannot be judged by the same standards of logical persuasiveness, fit between data and hypothesis, etc.\u00a0 Rather, datasets contain \u2018mere facts\u2019, albeit organized according to some underlying data model, and thus their review and quality control <em>have <\/em>to be \u201cmore like an audit that assures that a data set adheres to best practices of documentation, format, error characterization, etc.\u201d.\u00a0 For this reason, citation links between datasets and the journal articles upon which they are based, as routinely given in the Dryad Data Repository, are of particular value, since the latter provide the peer-reviewed contextual framework within which to undestand the data.<\/p>\n<p>Third, as mentioned by John Milner in the discussion of the essay at <a href=\"https:\/\/www.jiscmail.ac.uk\/cgi-bin\/webadmin?A1=ind1112&amp;L=RESEARCH-DATAMAN\">https:\/\/www.jiscmail.ac.uk\/cgi-bin\/webadmin?A1=ind1112&amp;L=RESEARCH-DATAMAN<\/a>, there is increasing recognition that the metadata describing a dataset should themselves also be recognised as independent first class scientific objects, with the possibility of publishing such a metadata file with its own DOI, either as a supplementary file within a data package, or as a \u2018metadata paper\u2019 within a \u2018metadata journal\u2019.<\/p>\n<p>Finally, one way of linking the \u2018data publication\u2019 metaphor with the \u2018linked data\u2019 metaphor is to ensure that such metadata describing datasets are made available not only in human-readable form but also in machine-readable form, by publishing them as RDF, either as a separate named graph with a unique URI or embedded as RDFa within the human-readable metadata document.<\/p>\n<p>Hope these comments help.<\/p>\n<p>[1]\u00a0 \u00a0\u00a0 de Waard, A., Breure, L., Kircz, J.G., Oostendorp, H. van (2006). Modeling Rhetoric in Scientific Publications. Current Res. in Inf. Sci. and Techn. pp. 352-356.<\/p>\n<p>[2]\u00a0 \u00a0\u00a0 de Waard, A., (2007). A Pragmatic Structure for the Research Article, in: Proceedings ICPW&#8217;07: 2nd International Conference on the Pragmatic Web, 22-23 Oct. 2007, Tilburg: NL. (Eds.) Buckingham Shum, S., Lind, M. and Weigand, H. Published in: ACM Digital Library &amp; Open University ePrint 9275.<\/p>\n<p>[3]\u00a0 \u00a0\u00a0 de Waard, A. and Kircz, J.G. (2008). Modeling scientific discourse &#8211; shifting perspectives and persistent issues, ELPUB2008. Open Scholarship: Authority, Community, and Sustainability in the Age of Web 2.0 \u2013 Proc. of the 12th Int. Conference on Electronic Publishing, June 2008, Eds. L. Chan and S. Mornati, pp. 234-245.<\/p>\n\n","protected":false},"excerpt":{"rendered":"<p>Is Data Publication the Right Metaphor? is an essay by Mark Parsons and Peter Fox to be published in the Data Science Journal, for which a preprint has been provided for open pre-publication community peer review at http:\/\/mp-datamatters.blogspot.com\/2011\/12\/seeking-open-review-of-provocative-data.html. To supplement the discussion this excellent essay has already prompted, I would like to add some belated &hellip; <a href=\"https:\/\/opencitations.hypotheses.org\/299\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Comments on IS DATA PUBLICATION THE RIGHT METAPHOR?<\/span><\/a><\/p>\n","protected":false},"author":49570,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_license":"","footnotes":""},"categories":[16,64],"tags":[316,320,356,376,432,440,660,716,748,756,824,948,1236],"ppma_author":[1602],"class_list":["post-299","post","type-post","status-publish","format-standard","hentry","category-data-publication","category-semantic-publishing","tag-data","tag-data-citation","tag-data-publication","tag-datacite","tag-doi","tag-dryad","tag-jiscmrd","tag-linked-data","tag-metadata","tag-metaphor","tag-open-citation-corpus","tag-peer-review","tag-versioning"],"authors":[{"term_id":1602,"user_id":49570,"is_guest":0,"slug":"davidshotton","display_name":"davidshotton","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/1b079c015f6061719f7eed5c29710158bd20975a827eeb585035ac580291f22e?s=96&d=blank&r=g","1":"","2":"","3":"","4":"","5":"","6":"","7":"","8":""}],"_links":{"self":[{"href":"https:\/\/opencitations.hypotheses.org\/wp-json\/wp\/v2\/posts\/299","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/opencitations.hypotheses.org\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/opencitations.hypotheses.org\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/opencitations.hypotheses.org\/wp-json\/wp\/v2\/users\/49570"}],"replies":[{"embeddable":true,"href":"https:\/\/opencitations.hypotheses.org\/wp-json\/wp\/v2\/comments?post=299"}],"version-history":[{"count":1,"href":"https:\/\/opencitations.hypotheses.org\/wp-json\/wp\/v2\/posts\/299\/revisions"}],"predecessor-version":[{"id":2413,"href":"https:\/\/opencitations.hypotheses.org\/wp-json\/wp\/v2\/posts\/299\/revisions\/2413"}],"wp:attachment":[{"href":"https:\/\/opencitations.hypotheses.org\/wp-json\/wp\/v2\/media?parent=299"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/opencitations.hypotheses.org\/wp-json\/wp\/v2\/categories?post=299"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/opencitations.hypotheses.org\/wp-json\/wp\/v2\/tags?post=299"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/opencitations.hypotheses.org\/wp-json\/wp\/v2\/ppma_author?post=299"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}