{"id":8013,"date":"2018-02-07T15:25:22","date_gmt":"2018-02-07T20:25:22","guid":{"rendered":"https:\/\/scholarblogs.emory.edu\/woodruff\/?p=8013"},"modified":"2018-02-07T15:25:22","modified_gmt":"2018-02-07T20:25:22","slug":"hathitrust-workshop","status":"publish","type":"post","link":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/2018\/02\/07\/hathitrust-workshop\/","title":{"rendered":"HathiTrust Workshop"},"content":{"rendered":"<p>On November 3, 2017, Emory Libraries hosted the <a href=\"https:\/\/www.hathitrust.org\/htrc\">HathiTrust Research Center<\/a> Digging Deeper, Reaching Further workshop. Thirty-four librarians from across the southeast attended this train-the-trainer workshop on text mining. The workshop covered text analysis, distant reading, and non-consumptive research in five interactive modules taught over the course of six hours.<\/p>\n<p><a href=\"http:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/files\/2018\/01\/Hathi_elephant.png\"><img fetchpriority=\"high\" decoding=\"async\" class=\"alignleft size-full wp-image-7937\" src=\"http:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/files\/2018\/01\/Hathi_elephant.png\" alt=\"\" width=\"240\" height=\"240\" srcset=\"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/files\/2018\/01\/Hathi_elephant.png 240w, https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/files\/2018\/01\/Hathi_elephant-150x150.png 150w\" sizes=\"(max-width: 240px) 100vw, 240px\" \/><\/a>The HTRC workshop focused on ways to use computers to discover patterns in digitized texts. In addition to general text analysis, HTRC helps support non-consumptive research, where one can do computational analysis without having access to a reading version of the text (so that researchers can conduct text analysis more easily on in-copyright works). We covered finding and retrieving digital texts, manipulating textual data,\u00a0analyzing textual data, and finally visualizing analysis of textual data. 
During the workshop, we worked in the <a href=\"https:\/\/analytics.hathitrust.org\/\">HathiTrust Research Center platform<\/a> as well as with <a href=\"http:\/\/discovere.emory.edu\/primo_library\/libweb\/action\/search.do?vl(freeText0)=Python%20+(Computer+program+language)&amp;vl(4439115UI0)=sub&amp;vl(38512462UI1)=all_items&amp;fn=search&amp;tab=emory_catalog&amp;mode=Basic&amp;vid=discovere&amp;scp.scps=scope%3a(repo)%2cscope%3a(01EMORY_ALMA)%2cEmory_PrimoThirdNode&amp;ct=lateralLinking\">Python<\/a> using <a href=\"https:\/\/www.pythonanywhere.com\/\">PythonAnywhere<\/a>. The workshop attendees had an opportunity to work with datasets from the <a href=\"http:\/\/pid.emory.edu\/mp3vr\">HathiTrust digital library<\/a> that the instructors had prepared.<\/p>\n<p>This workshop was an excellent opportunity to bring together librarians from across the Atlanta area and nearby states. We had eight out-of-state participants, two from public libraries, and one from the Federal Reserve. Our three instructors came from the University of North Carolina at Chapel Hill and Indiana University. Participants had fun exploring the materials. 
Our subject librarians shared their views on the workshop:<\/p>\n<p><a href=\"http:\/\/web.library.emory.edu\/about\/staff-directory\/woodruff\/collins-kim.html\">Kim Collins<\/a>, Art History\/Classics Librarian and Research Engagement Services Leader \u2013 \u201cLearning how to use the tools and data from HathiTrust Research Center to text mine, i.e., find associations and patterns, was both daunting and rewarding.\u00a0 Step-by-step hands-on exercises gave me an appreciation of the power of Python and the context to make suggestions to other Emory researchers interested in this type of digital scholarship.\u201d<\/p>\n<p><a href=\"http:\/\/web.library.emory.edu\/about\/staff-directory\/woodruff\/ambrosone-ellen.html\">Ellen Ambrosone<\/a>, the South Asian Studies and Religion Librarian \u2013 &#8220;The HathiTrust workshop was helpful and inspiring! The hands-on portion of the day gave me a much better understanding of the mechanics of text mining and made me think about the skills that I could acquire to better assist users who are exploring computational methods of research.&#8221;<\/p>\n<p><a href=\"http:\/\/web.library.emory.edu\/about\/staff-directory\/woodruff\/bruchko-erica.html\">Erica Bruchko<\/a>, the African American Studies and U.S. History Librarian \u2013 \u201cOver the past several years, Emory graduate students and faculty have expressed interest in text mining, especially how to acquire the data that they need to complete their text-mining projects. The HathiTrust workshop provided a great primer on how to identify open data sources. I also appreciated the opportunity to get hands-on experience cleaning and prepping data.\u201d<\/p>\n<p>As trainers now, we plan to do a few things.\u00a0 Those of us who participated will be bringing these methods to our faculty and students. 
First, either in the spring or summer, we will be hosting a workshop for faculty and fellows at the Fox Center for Humanistic Inquiry on HathiTrust and the HathiTrust Research Center. Second, we will be initiating Word Lab, a group for people interested in computational text analysis. We had eighteen participants express their interest in joining the Word Lab. Third, we will be providing training for other subject librarians. Those of us who participated in the workshop as well as the other Emory librarians we train will, in turn, share these methods with the faculty and students in our subject areas. We may also host an open workshop for some of these techniques in the future; however, if you are interested in learning more in the meantime, you can contact <a href=\"javascript:secureDecryptAndNavigate('sk1JS3TniTDqNJZ8HcsKjYjMozmPddI+NzsDz11fTk0XGINZIfWw36QTXJTQq9+CBw3K6UaxIGmlhQfI0PSm9qE5winCqR9PMvGB8nM=', '10141fb37f5951040eaf871acc54837048e00740e443e7bada667c4c89e4ac77')\">Katie Rawson<\/a> or <a href=\"javascript:secureDecryptAndNavigate('g7zH25xM3oK7Erbs2+CByZAdKhjBKXz0WUvbcapeZUq2wxqY4+LVPjM845Vl3Hk\/tLEu2s3Tq\/dVE2E0VBI7y3AeE9\/sgCQt', '10141fb37f5951040eaf871acc54837048e00740e443e7bada667c4c89e4ac77')\">Chella Vaidyanathan<\/a> for more information.<\/p>\n<p>More information about the HathiTrust Research Center&#8217;s \u201cDigging Deeper, Reaching Further\u201d project is available at <a href=\"https:\/\/teach.htrc.illinois.edu\/about-the-project\/\">https:\/\/teach.htrc.illinois.edu\/about-the-project\/<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>On November 3, 2017, Emory Libraries hosted the HathiTrust Research Center Digging Deeper, Reaching Further workshop. 
Thirty-four librarians from across<\/p>\n","protected":false},"author":1979,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"colormag_page_container_layout":"default_layout","colormag_page_sidebar_layout":"default_layout","_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[3],"tags":[333,517,1106],"class_list":["post-8013","post","type-post","status-publish","format-standard","hentry","category-news","tag-digital-scholarship","tag-hathitrust","tag-text-mining"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack-related-posts":[{"id":7932,"url":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/2018\/01\/26\/hathitrust-reaches-16-million-volumes-emorys-most-popular-contributions\/","url_meta":{"origin":8013,"position":0},"title":"HathiTrust Reaches 16 Million Volumes &amp; Emory&#8217;s Most Popular Contributions","author":"caloveladmin","date":"January 26, 2018","format":false,"excerpt":"HathiTrust Digital Library deposited its\u00a016th million volume, Osborne's London and Birmingham Railway guide. Illustrated, on December 7, 2017. This marked a nine-year effort on behalf of research institutions and libraries including Emory who joined the partnership in 2010. Well done HathiTrust, Emory, and fellow HathiTrust partners! 
Most Popular HathiTrust Books\u2026","rel":"","context":"In &quot;News&quot;","block_context":{"text":"News","link":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/category\/news\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/scholarblogs.emory.edu\/woodruff-sandbox\/files\/2018\/01\/Hathi_elephant-150x150.png?resize=350%2C200","width":350,"height":200},"classes":[]},{"id":8931,"url":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/2019\/03\/12\/new-database-gales-digital-scholar-lab\/","url_meta":{"origin":8013,"position":1},"title":"New Database: Gale\u2019s Digital Scholar Lab","author":"caloveladmin","date":"March 12, 2019","format":false,"excerpt":"Are you interested in digital humanities tools but don\u2019t know how to get started? Would you like to engage your students with visualizations from primary source datasets? Were you looking for an opportunity to download and explore primary source data sets? If you answered yes to one or more of\u2026","rel":"","context":"In &quot;News&quot;","block_context":{"text":"News","link":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/category\/news\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/scholarblogs.emory.edu\/woodruff-sandbox\/files\/2019\/03\/2.-Side-by-side-view-whisky-house-300x147.png?resize=350%2C200","width":350,"height":200},"classes":[]},{"id":8206,"url":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/2018\/04\/17\/emory-libraries-2017-18-woodruff-fellows-presentations\/","url_meta":{"origin":8013,"position":2},"title":"Emory Libraries 2017-18 Woodruff Fellows Presentations","author":"caloveladmin","date":"April 17, 2018","format":false,"excerpt":"On Thursday, April 5 in the Jones Room, the Emory Libraries 2017-18 Woodruff Fellows presented on the work they have done in their fellowships. Following is a brief summary of their presentations. Jonathan Coulis is a Ph.D. candidate in Latin American History. 
For the Subject Librarian fellowship, he worked on\u2026","rel":"","context":"In &quot;News&quot;","block_context":{"text":"News","link":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/category\/news\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/scholarblogs.emory.edu\/woodruff-sandbox\/files\/2018\/04\/j_coulis.png?resize=350%2C200","width":350,"height":200},"classes":[]},{"id":9330,"url":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/2019\/08\/28\/register-for-the-open-humanities-graduate-student-workshop\/","url_meta":{"origin":8013,"position":3},"title":"Register for the Open Humanities Graduate Student Workshop!","author":"caloveladmin","date":"August 28, 2019","format":false,"excerpt":"Registration is still open for the Open Humanities Graduate Student Workshop (September 11-12, 2019) cosponsored by the Emory Libraries Scholarly Communications Office, the Emory Center for Digital Scholarship and the Bill and Carol Fox Center for Humanistic Inquiry. \u00a0 While open access continues to gain traction in the social and\u2026","rel":"","context":"In &quot;News&quot;","block_context":{"text":"News","link":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/category\/news\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":5537,"url":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/2014\/04\/08\/archivescopyrightworkshop\/","url_meta":{"origin":8013,"position":4},"title":"April 9th Archives and Copyright Workshop","author":"caloveladmin","date":"April 8, 2014","format":false,"excerpt":"Please join us at the Library for a Archives and Copyright Workshop to be held, Wednesday April 9th 9:30-11 a.m in Room 312 (across from the Jones Room, third floor Robert W. Woodruff Library. 
) This workshop is open to all and will include information and discussion about copyright, fair\u2026","rel":"","context":"In &quot;Rose Library&quot;","block_context":{"text":"Rose Library","link":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/category\/rose-library\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/scholarblogs.emory.edu\/woodruff-sandbox\/files\/2014\/04\/Screen-Shot-2014-04-08-at-12_opt.png?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":8741,"url":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/2019\/01\/22\/research-data-management-workshop-series-spring-2019\/","url_meta":{"origin":8013,"position":5},"title":"Research Data Management Workshop Series &#8211; Spring 2019","author":"caloveladmin","date":"January 22, 2019","format":false,"excerpt":"Are you collecting data for your research? Do you have questions about where to save your data, and how to wrangle all of your digital files? Are you curious about sharing data with other scholars? 
The Scholarly Communications Office is offering a 3-part series of Research Data Management workshops this\u2026","rel":"","context":"In &quot;News&quot;","block_context":{"text":"News","link":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/category\/news\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/scholarblogs.emory.edu\/woodruff-sandbox\/files\/2019\/01\/data-crack_flickr14411397343-1024x768.jpg?resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/scholarblogs.emory.edu\/woodruff-sandbox\/files\/2019\/01\/data-crack_flickr14411397343-1024x768.jpg?resize=350%2C200 1x, https:\/\/i0.wp.com\/scholarblogs.emory.edu\/woodruff-sandbox\/files\/2019\/01\/data-crack_flickr14411397343-1024x768.jpg?resize=525%2C300 1.5x, https:\/\/i0.wp.com\/scholarblogs.emory.edu\/woodruff-sandbox\/files\/2019\/01\/data-crack_flickr14411397343-1024x768.jpg?resize=700%2C400 2x"},"classes":[]}],"_links":{"self":[{"href":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/wp-json\/wp\/v2\/posts\/8013","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/wp-json\/wp\/v2\/users\/1979"}],"replies":[{"embeddable":true,"href":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/wp-json\/wp\/v2\/comments?post=8013"}],"version-history":[{"count":0,"href":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/wp-json\/wp\/v2\/posts\/8013\/revisions"}],"wp:attachment":[{"href":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/wp-json\/wp\/v2\/media?parent=8013"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/scholarblogs.emory.edu\/woodruff-sandbox\/wp-json\/wp\/v2\/categories?post=8013"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/scholarblogs.emory.edu\/woodr
uff-sandbox\/wp-json\/wp\/v2\/tags?post=8013"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}