{"id":272,"date":"2017-02-24T05:33:20","date_gmt":"2017-02-24T05:33:20","guid":{"rendered":"http:\/\/blogs.harvard.edu\/copyrightosc\/?p=272"},"modified":"2017-02-24T05:36:06","modified_gmt":"2017-02-24T05:36:06","slug":"fair-use-week-2017-day-five-with-guest-expert-sara-r-benson","status":"publish","type":"post","link":"https:\/\/archive.blogs.harvard.edu\/copyrightosc\/2017\/02\/24\/fair-use-week-2017-day-five-with-guest-expert-sara-r-benson\/","title":{"rendered":"Fair Use Week 2017: Day Five With Guest Expert Sara R. Benson"},"content":{"rendered":"<h2>Make \u201cNon-Consumptive Use\u201d Part of Your Fair Use Vocabulary<\/h2>\n<p>by\u00a0Sara R. Benson<\/p>\n<p>The HathiTrust Digital Library continues to push the boundaries of open access.<\/p>\n<p>In late 2016, the Library\u2019s Research Center made the entire corpus available for non-consumptive use through its <a href=\"https:\/\/wiki.htrc.illinois.edu\/x\/GoA5Ag\">Extracted Features dataset<\/a>.\u00a0 Using this dataset, researchers can access the non-expressive content of public domain and copyright-protected works for the purpose of performing data analysis. The dataset opens the corpus to computational research techniques such as topic modeling or machine classification while limiting traditional forms of reading by virtue of its abstracted data structure.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-274\" src=\"http:\/\/blogs.harvard.edu\/copyrightosc\/files\/2017\/02\/DataSetsHathi.jpg\" alt=\"datasetshathi\" width=\"513\" height=\"384\" srcset=\"https:\/\/archive.blogs.harvard.edu\/copyrightosc\/files\/2017\/02\/DataSetsHathi.jpg 513w, https:\/\/archive.blogs.harvard.edu\/copyrightosc\/files\/2017\/02\/DataSetsHathi-300x225.jpg 300w\" sizes=\"auto, (max-width: 513px) 100vw, 513px\" \/><\/p>\n<p>The structured files, presented in JSON format, provide information about the text (the ideas) without revealing its original form (the expression). Although the term \u201cnon-consumptive\u201d was never specifically defined in the <em>HathiTrust<\/em> case,[1] the type of text mining at issue in that case serves as the building block for the transformative use asserted by the HathiTrust and the users of the Extracted Features dataset.<\/p>\n<p>Notably, in <em>Author\u2019s Guild v. HathiTrust<\/em>, the Second Circuit Court of Appeals stated that the \u201ccreation of a full-text searchable database is a quintessentially transformative use.\u201d[2]\u00a0 The HathiTrust uses the following definition for non-consumptive research:\u00a0 It is \u201cresearch in which computational analysis is performed on one or more volumes (textual or image objects) in the HTDL, but not research in which a researcher reads or displays substantial portions of an in-copyright or rights-restricted volume to understand the expressive content presented within that volume.\u201d[3]<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-275\" src=\"http:\/\/blogs.harvard.edu\/copyrightosc\/files\/2017\/02\/AGvHathi.jpg\" alt=\"agvhathi\" width=\"579\" height=\"158\" srcset=\"https:\/\/archive.blogs.harvard.edu\/copyrightosc\/files\/2017\/02\/AGvHathi.jpg 579w, https:\/\/archive.blogs.harvard.edu\/copyrightosc\/files\/2017\/02\/AGvHathi-300x82.jpg 300w\" sizes=\"auto, (max-width: 579px) 100vw, 579px\" \/><\/p>\n<p>In the case of the Extracted Features dataset, instead of reading or consuming the text, researchers are moving from the extracted content to perform statistical analyses, pull out derived data sets, and look at patterns across words to reach new research conclusions.\u00a0 This is a decidedly different use then for a work of fiction (say,\u00a0<em>Harry Potter)<\/em>\u00a0which is unequivocally for narrative entertainment.<\/p>\n<p>Here instead, researchers are engaged in another important fair use endeavor\u2014 to transform the transmission of and interaction with the work from readable text to minable data in order to better understand connections between literature and historical documents and society.<\/p>\n<p>Thus, non-consumptive use, when defined correctly, could never be construed as anything but a fair use. The concept can provide an important framework for other libraries and data providers who wish to open greater access to datasets without infringement. \u00a0It also can embolden researchers to incorporate computational techniques into their scholarship, much of which to date has been limited to pre-twentieth century inquires.<\/p>\n<p>And so, with this brief introduction, I issue a call to all fair use advocates: \u00a0please make \u201cnon-consumptive use\u201d a part of your fair use vocabulary, promote the use of the HathiTrust Extracted Features Dataset, and continue to promote the fair use rights.<\/p>\n<ol>\n<li>\n<h5>It was, however, defined in the amended settlement agreement, ultimately rejected by the court, in Authors Guild v. Google, available at <a href=\"https:\/\/www.authorsguild.org\/wp-content\/uploads\/2014\/10\/2009-Nov-13-AGvGoogle-Amended-Settlement-Agreement.pdf\">https:\/\/www.authorsguild.org\/wp-content\/uploads\/2014\/10\/2009-Nov-13-AGvGoogle-Amended-Settlement-Agreement.pdf<\/a>.<\/h5>\n<\/li>\n<li>\n<h5>755 F.3d 87, 97 (2d Cir 2014)<\/h5>\n<\/li>\n<li>\n<h5>HathiTrust Digital Library, HathiTrust Research Center, Non Consumptive Use Research Policy, available at <a href=\"https:\/\/urldefense.proofpoint.com\/v2\/url?u=https-3A__www.hathitrust.org_htrc-5Fncup&amp;d=CwMFAg&amp;c=WO-RGvefibhHBZq3fL85hQ&amp;r=sLjykPVK6rYnb5xQBJWWgzvTiqS5Ic0JMO5L6p0mJkw&amp;m=MpQohm8j1w69QSbC8kR3hfmkCC1F8pZYVvjVjS14IU8&amp;s=xi268vGqvVUDs-azrgEaPlq2RI9aUimfHvVAIAVeIKU&amp;e=\">https:\/\/www.hathitrust.org\/htrc_ncup<\/a><\/h5>\n<\/li>\n<\/ol>\n<p><em>Sara R. Benson is Copyright Librarian &amp; Assistant Professor at the University of Illinois Library<\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Make \u201cNon-Consumptive Use\u201d Part of Your Fair Use Vocabulary by\u00a0Sara R. Benson The HathiTrust Digital Library continues to push the boundaries of open access. In late 2016, the Library\u2019s Research Center made the entire corpus available for non-consumptive use through its Extracted Features dataset.\u00a0 Using this dataset, researchers can access the non-expressive content of public [&hellip;]<\/p>\n","protected":false},"author":6259,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":true,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_feature_clip_id":0,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2},"jetpack_post_was_ever_published":false},"categories":[257,690,138871],"tags":[],"class_list":["post-272","post","type-post","status-publish","format-standard","hentry","category-copyright","category-fair-use","category-fair-use-week"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p7gxeS-4o","_links":{"self":[{"href":"https:\/\/archive.blogs.harvard.edu\/copyrightosc\/wp-json\/wp\/v2\/posts\/272","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/archive.blogs.harvard.edu\/copyrightosc\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/archive.blogs.harvard.edu\/copyrightosc\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/copyrightosc\/wp-json\/wp\/v2\/users\/6259"}],"replies":[{"embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/copyrightosc\/wp-json\/wp\/v2\/comments?post=272"}],"version-history":[{"count":5,"href":"https:\/\/archive.blogs.harvard.edu\/copyrightosc\/wp-json\/wp\/v2\/posts\/272\/revisions"}],"predecessor-version":[{"id":280,"href":"https:\/\/archive.blogs.harvard.edu\/copyrightosc\/wp-json\/wp\/v2\/posts\/272\/revisions\/280"}],"wp:attachment":[{"href":"https:\/\/archive.blogs.harvard.edu\/copyrightosc\/wp-json\/wp\/v2\/media?parent=272"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/copyrightosc\/wp-json\/wp\/v2\/categories?post=272"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/copyrightosc\/wp-json\/wp\/v2\/tags?post=272"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}