{"id":6,"date":"2017-04-25T17:39:46","date_gmt":"2017-04-25T17:39:46","guid":{"rendered":"https:\/\/scholarblogs.emory.edu\/japanese-text-mining\/?page_id=6"},"modified":"2017-05-17T17:37:25","modified_gmt":"2017-05-17T17:37:25","slug":"day-one","status":"publish","type":"page","link":"https:\/\/scholarblogs.emory.edu\/japanese-text-mining\/day-one\/","title":{"rendered":"Day One"},"content":{"rendered":"<p class=\"title\" style=\"text-align: center\"><span style=\"text-decoration: underline;font-size: 18pt\">Tuesday May 30, 2017<span class=\"c0\">\u00a0<\/span><\/span><\/p>\n<p class=\"bodystyle_header\"><strong>Breakfast (8:45 &#8211; 9:30 AM) Jones Room<\/strong><\/p>\n<p class=\"bodystyle_header\"><strong>Session One (9:30 &#8211; 11:00 AM)\u00a0Woodruff Library 312<\/strong><\/p>\n<p class=\"c2\"><span class=\"c0\">Self-introductions: <\/span><span class=\"c0\">Participants and instructors introduce themselves and their research.\u00a0<\/span><span class=\"c0\">Instructors explain course structure<\/span><\/p>\n<p class=\"c2\"><strong><span class=\"bodystyle_header\">Session Two (11:15 AM-12:30 PM)\u00a0Woodruff Library 312<\/span><\/strong><\/p>\n<p class=\"c2\"><span class=\"c0\">Web-based tools: Philologic and the Aozora Bunko (Long)<\/span><\/p>\n<p class=\"c2\">This session will introduce the University of Chicago\u2019s project linking the <a href=\"http:\/\/www.aozora.gr.jp\/\" target=\"_new\">Aozora Bunko<\/a> and <a href=\"http:\/\/artflsrv02.uchicago.edu\/philologic4\/aozora\/\" target=\"_new\">PhiloLogic<\/a>, pioneered by Hoyt Long. The Aozora Bunko is an on-line collection of over 13,000 digitized Japanese texts, including fiction, non-fiction, and poetry. Because of copyright law, all texts are from before 1966, but the corpus includes a wide range of texts, such as Shimazaki Toson\u2019s <span class=\"c11\">Yo\u2019ake mae<\/span><span class=\"c0\">, the poetry of Hakush\u016b Kitahara, letters by Sakamoto Ry\u014dma, and children\u2019s fiction by Muruyama Kazuko.<\/span><\/p>\n<p class=\"c2\"><span class=\"c0\">PhiloLogic is a suite of software developed by the ARTFL Project at the University of Chicago. It is an easy to use, yet powerful, full-text search, retrieval, and reporting system, allowing searches based on multiple criteria. \u00a0One can, for example, either retrieve all text in the Aozora Bunko written by select authors, from 1935 through 1945, which also contain the phrases \u5e73\u548c and \u5815\u843d. The system will report results for word frequency, word context (KWIC \u2013 key words in context), and word collocation (which words occur together).<\/span><\/p>\n<p class=\"c2\"><span class=\"c0\">Key topics will include:<\/span><\/p>\n<ul>\n<li><span class=\"c0\">Word frequencies<\/span><\/li>\n<li><span class=\"c0\">Collocation<\/span><\/li>\n<li><span class=\"c0\">KWIC<\/span><\/li>\n<li><span class=\"c0\">Word occurrence in time series<\/span><\/li>\n<\/ul>\n<p class=\"c2\"><strong><span class=\"bodystyle_header\">Lunch Break (12:30 &#8211; 1:30 PM)<\/span><\/strong><span class=\"c0\">\u00a0<strong>Jones Room<\/strong><\/span><\/p>\n<p class=\"c2\"><strong><span class=\"bodystyle_header\">Session Three (1:30 PM &#8211; 3:00 PM)\u00a0Woodruff Library 312<\/span><\/strong><\/p>\n<p class=\"c2\"><span class=\"c0\">Web-based tools for user-selected texts (Des Jardin and Goss)<\/span><\/p>\n<p class=\"c2\"><span class=\"c0\">Building on the introduction of <a href=\"http:\/\/artflsrv02.uchicago.edu\/philologic4\/aozora\/\" target=\"_new\">PhiloLogic<\/a>, this session will demonstrate how researchers can move the Aozora Bunko to analyze other digitized corpora, such as the etexts at <a href=\"http:\/\/jti.lib.virginia.edu\/japanese\/\" target=\"_new\">UVA\u2019s Japanese Text Initiative<\/a>. Coverage will include:<\/span><\/p>\n<ul>\n<li style=\"list-style-type: none\">\n<ul>\n<li><span class=\"c0\"><a href=\"https:\/\/voyant-tools.org\/\" target=\"_new\">Voyant Tools<\/a>, for general analysis with an English-language interface<\/span><\/li>\n<li><span class=\"c0\"><a href=\"http:\/\/chamame.ninjal.ac.jp\/\" target=\"_new\">NINJAL\u2019s \u8336\u307e\u3081<\/a> series, which includes specialized tokenizers for modern \u73fe\u4ee3\u8a9e, early modern \u8fd1\u4ee3\u6587\u8a9e, and classical \u4e2d\u53e4\u548c\u6587<\/span><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p class=\"bodystyle_header\"><strong>Coffee Break:\u00a0<span class=\"s1\">Woodruff Library 303, Emory Center for Digital Scholarship<\/span><\/strong><\/p>\n<p class=\"bodystyle_header\"><strong>Session Four (3:15 PM &#8211; 4:00 PM)\u00a0Woodruff Library 312<\/strong><\/p>\n<p class=\"c1\"><span class=\"c0\">Small groups with hands-on support for basic text mining with web interfaces<\/span><\/p>\n<ul>\n<li style=\"list-style-type: none\">\n<ul>\n<li><span class=\"c0\"><a href=\"http:\/\/artflsrv02.uchicago.edu\/philologic4\/aozora\/\" target=\"_new\">PhiloLogic<\/a> and the Aozora Bunko<\/span><\/li>\n<li><span class=\"c0\">Voyant Tools<\/span><\/li>\n<li><span class=\"c0\">NINJAL\u2019s \u8336\u307e\u3081 series<\/span><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Tuesday May 30, 2017\u00a0 Breakfast (8:45 &#8211; 9:30 AM) Jones Room Session One (9:30 &#8211; 11:00 AM)\u00a0Woodruff Library 312 Self-introductions: Participants and instructors introduce themselves and their research.\u00a0Instructors explain course structure Session Two (11:15 AM-12:30 PM)\u00a0Woodruff Library 312 Web-based tools: Philologic and the Aozora Bunko (Long) This session will introduce the University of Chicago\u2019s project &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/scholarblogs.emory.edu\/japanese-text-mining\/day-one\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Day One&#8221;<\/span><\/a><\/p>\n","protected":false},"author":3354,"featured_media":0,"parent":0,"menu_order":1,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-6","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/scholarblogs.emory.edu\/japanese-text-mining\/wp-json\/wp\/v2\/pages\/6","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/scholarblogs.emory.edu\/japanese-text-mining\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/scholarblogs.emory.edu\/japanese-text-mining\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/scholarblogs.emory.edu\/japanese-text-mining\/wp-json\/wp\/v2\/users\/3354"}],"replies":[{"embeddable":true,"href":"https:\/\/scholarblogs.emory.edu\/japanese-text-mining\/wp-json\/wp\/v2\/comments?post=6"}],"version-history":[{"count":12,"href":"https:\/\/scholarblogs.emory.edu\/japanese-text-mining\/wp-json\/wp\/v2\/pages\/6\/revisions"}],"predecessor-version":[{"id":81,"href":"https:\/\/scholarblogs.emory.edu\/japanese-text-mining\/wp-json\/wp\/v2\/pages\/6\/revisions\/81"}],"wp:attachment":[{"href":"https:\/\/scholarblogs.emory.edu\/japanese-text-mining\/wp-json\/wp\/v2\/media?parent=6"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}