{"id":64,"date":"2005-02-27T15:15:09","date_gmt":"2005-02-27T19:15:09","guid":{"rendered":"http:\/\/blogs.law.harvard.edu\/metasj\/on-transcription-the-joys-and-sorrows-of\/"},"modified":"2005-02-27T15:15:09","modified_gmt":"2005-02-27T19:15:09","slug":"on-transcription-the-joys-and-sorrows-of","status":"publish","type":"page","link":"https:\/\/archive.blogs.harvard.edu\/sj\/on-transcription-the-joys-and-sorrows-of\/","title":{"rendered":"On Transcription [ the Joys and Sorrows of ]"},"content":{"rendered":"<p><a name='a813'><\/a><\/p>\n<p><font size=\"2\">First, here is a <span style=\"font-weight: bold;\">quick overview<\/span><br \/>\nof the professional <a href=\"http:\/\/blogs.law.harvard.edu\/sj\/2005\/01\/31#a765\">webcred<\/a><br \/>\ntranscripts, as initially received.&nbsp; (By the time you read this,<br \/>\nthey are probably already cleaned up, so this is a moot point.&nbsp;<br \/>\nThis is presented for transcription geeking value only.)&nbsp; More<br \/>\ndetailed<br \/>\ncommentary and <span style=\"font-weight: bold;\">examples<\/span> at the end.&nbsp;&nbsp; <br \/>\n<\/font><br \/>\n<font size=\"2\"><font size=\"1\">[Now I <span style=\"font-style: italic;\">really <\/span>want<br \/>\nto interview someone who runs a transcription business&#8230; what kind of<br \/>\nspecial effort do they take with celebrities&#8217; names and quotes?&nbsp;<br \/>\nWhat other VIP <span style=\"font-weight: bold;\">services <\/span>are available?&nbsp; What kind of insurance do they need, and what happens when you transcribe <span style=\"font-weight: bold;\">slander<\/span>?&nbsp;<br \/>\nHow hard is it to produce a perfect real-time transcript, and how close<br \/>\nto &#8220;real-time&#8221; can one get?&nbsp;&nbsp; My private hunch: as close as<br \/>\nneeded to exactly real time, including &#8216;post&#8217;-processing.]<\/font><\/p>\n<p><span style=\"font-style: italic;\"><br \/>\n<\/span><\/p>\n<p><\/font><\/p>\n<div style=\"margin-left: 40px;\"><font size=\"2\"><span style=\"font-style: italic;\">The good<\/span>: <br \/>\n<\/font>\n<\/div>\n<ol>\n<li><font size=\"2\">The transcripts are <span style=\"font-style: italic;\">beautiful<\/span>.&nbsp; In contrast to some gov<br \/>\nand court transcripts I have seen, they are well laid out and easy to scan. <br \/>\n    <\/font>\n<\/li>\n<li><font size=\"2\">The English is <span style=\"font-style: italic;\">clean<\/span>.<br \/>\nInterruptions and stuttering are cleanly handled, and most sessions<br \/>\nhave useful sentence and paragraph breaks (even when the speaker was<br \/>\nrambling). <\/font><\/li>\n<\/p>\n<li><font size=\"2\">They get most names, proper nouns, organizations, and technology<br \/>\nreferences right.&nbsp; More than I would get without a list of names<br \/>\nin front of me.<\/font><\/li>\n<li><font size=\"2\">Their attention to small connecting words and comments<br \/>\nunder-the-breath is generally excellent.&nbsp; In general, their<br \/>\naccuracy is <span style=\"font-weight: bold;\">fabulous <\/span>[which is, as I&#8217;m sure you all know by now, the Official Word of 2005], some 99.5% <br \/>\n    <\/font>\n<\/li>\n<li><font size=\"2\">This was done quickly : a ~1 week turnaround for 15 hours of dense audio.<br \/>\n    <\/font>\n  <\/li>\n<\/p>\n<\/ol>\n<p><font size=\"2\"><br \/>\n<\/font><\/p>\n<div style=\"margin-left: 40px;\"><font size=\"2\"><span style=\"font-style: italic;\">The bad<\/span>:<br \/>\n<\/font>\n<\/div>\n<ol>\n<li><font size=\"2\">They are inconsistent. In some places, [inaudible] is used and<br \/>\nevery<br \/>\nspeaker gets his\/her own line and paragraph.&nbsp; In others, just a<br \/>\nfew dashes or underscores &#8220;___&#8221; are used to indicate something<br \/>\ninaudible.&nbsp; Full names are used in some places, and not in others,<br \/>\nat times ambiguously.&nbsp; Some sessions have poor formatting,<br \/>\nparagraph breaks; the transcription of podcasting audio clips, for<br \/>\ninstance.<br \/>\n    <\/font>\n <\/li>\n<li><font size=\"2\">They get many names, proper nouns, organizations, and technology<br \/>\nreferences wrong.&nbsp; They should have lists of these terms in front<br \/>\nof them, and should ask for what they don&#8217;t have.&nbsp; This would help<br \/>\nthem spell podcasting without a space, write &#8220;blogosphere&#8221; without a<br \/>\ncap, and remember that yes, Jon Bonn<\/p>\n","protected":false},"excerpt":{"rendered":"<p>First, here is a quick overview of the professional webcred transcripts, as initially received.&nbsp; (By the time you read this, they are probably already cleaned up, so this is a moot point.&nbsp; This is presented for transcription geeking value only.)&nbsp; More detailed commentary and examples at the end.&nbsp;&nbsp; [Now I really want to interview someone [&hellip;]<\/p>\n","protected":false},"author":135,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"open","ping_status":"open","template":"","meta":{"footnotes":""},"class_list":["post-64","page","type-page","status-publish","hentry"],"jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/P7iVvB-12","jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/pages\/64","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/users\/135"}],"replies":[{"embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/comments?post=64"}],"version-history":[{"count":0,"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/pages\/64\/revisions"}],"wp:attachment":[{"href":"https:\/\/archive.blogs.harvard.edu\/sj\/wp-json\/wp\/v2\/media?parent=64"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}