{"id":4040,"date":"2012-04-24T14:08:07","date_gmt":"2012-04-24T14:08:07","guid":{"rendered":"http:\/\/blogs.law.harvard.edu\/dplaalpha\/?p=4040"},"modified":"2012-04-24T14:08:08","modified_gmt":"2012-04-24T14:08:08","slug":"press-harvard-releases-big-data-for-books","status":"publish","type":"post","link":"https:\/\/archive.blogs.harvard.edu\/dplaalpha\/2012\/04\/24\/press-harvard-releases-big-data-for-books\/","title":{"rendered":"Press: &#8220;Harvard Releases Big Data for Books&#8221;"},"content":{"rendered":"<p>&#8220;Harvard\u2019s library is making public the information on more than 12 million books, videos, audio recordings, images, manuscripts, maps, and more things inside Harvard\u2019s 73 libraries.<\/p>\n<p>&#8220;Harvard can\u2019t put the actual content of much of this material online, owing to intellectual property laws, but this so-called metadata of things like titles, publication or recording dates, book sizes, or descriptions of what is in the video is also considered highly valuable material. Frequently descriptors of things like audio recordings are more valuable for search engines than the material itself. Search engines frequently rely on metadata over content, particularly when it can\u2019t easily be scanned and understood.<\/p>\n<p>[&#8230;]<\/p>\n<p>&#8220;The release follows Harvard\u2019s decision, via its\u00a0<a href=\"http:\/\/osc.hul.harvard.edu\/\">Office of Scholarly Communications<\/a>\u00a0to release for free much of the published research from its faculty. The metadata will be available for bulk download both from Harvard and from the\u00a0Digital Public Library of America, which is an effort to create a national public library online.&#8221;<\/p>\n<p>From Quentin Hardy&#8217;s post on the <em>New York Times<\/em>&#8216; Bits blog, <a href=\"http:\/\/bits.blogs.nytimes.com\/2012\/04\/24\/harvard-releases-big-data-for-books\/\" target=\"_blank\">Harvard Releases Big Data for Books<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>&#8220;The metadata will be available for bulk download both from Harvard and from the Digital Public Library of America, which is an effort to create a national public library online.&#8221;<\/p>\n","protected":false},"author":4454,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1919],"tags":[],"class_list":["post-4040","post","type-post","status-publish","format-standard","hentry","category-press"],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/archive.blogs.harvard.edu\/dplaalpha\/wp-json\/wp\/v2\/posts\/4040","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/archive.blogs.harvard.edu\/dplaalpha\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/archive.blogs.harvard.edu\/dplaalpha\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/dplaalpha\/wp-json\/wp\/v2\/users\/4454"}],"replies":[{"embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/dplaalpha\/wp-json\/wp\/v2\/comments?post=4040"}],"version-history":[{"count":1,"href":"https:\/\/archive.blogs.harvard.edu\/dplaalpha\/wp-json\/wp\/v2\/posts\/4040\/revisions"}],"predecessor-version":[{"id":4041,"href":"https:\/\/archive.blogs.harvard.edu\/dplaalpha\/wp-json\/wp\/v2\/posts\/4040\/revisions\/4041"}],"wp:attachment":[{"href":"https:\/\/archive.blogs.harvard.edu\/dplaalpha\/wp-json\/wp\/v2\/media?parent=4040"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/dplaalpha\/wp-json\/wp\/v2\/categories?post=4040"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/dplaalpha\/wp-json\/wp\/v2\/tags?post=4040"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}