{"id":84,"date":"2017-09-22T03:16:46","date_gmt":"2017-09-22T03:16:46","guid":{"rendered":"http:\/\/lingconf.com\/turkicsymposium\/?page_id=84"},"modified":"2017-09-30T14:58:56","modified_gmt":"2017-09-30T14:58:56","slug":"freeopen-source-text-processing-technologies-for-turkic-languages","status":"publish","type":"page","link":"https:\/\/lingconf.com\/turkicsymposium\/program\/freeopen-source-text-processing-technologies-for-turkic-languages\/","title":{"rendered":"Free\/open-source text-processing technologies for Turkic languages"},"content":{"rendered":"<p>Jonathan N. Washington (Swarthmore College) and Francis Tyers (National Research University \u00abHigher School of Economics\u00bb)<\/p>\n<p>This talk describes the application of free\/open-source text-processing technologies to Turkic languages, including morphological analysis and generation, machine translation, and spell checking. The need for these technologies is motivated, along with the need for developing them under free\/open-source licenses.\u00a0 An overview of how some of the technologies are implemented is provided, with a focus on the successful strategies of computationally encoding linguistic information that comprise the Apertium framework. Which technologies currently exist for which Turkic languages is presented, along with some information on what state each one is in. Also outlined are some outstanding problems in Turkic text-processing, represented by linguistic patterns that are difficult to deal with computationally, either due to difficulty of linguistic analysis or shortcomings of the computational tools employed by these technologies.<\/p>\n<p>We additionally outline how linguistic communities and language experts (especially working together) can develop or improve text-processing tools for their languages.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Jonathan N. Washington (Swarthmore College) and Francis Tyers (National Research University \u00abHigher School of Economics\u00bb) This talk describes the application of free\/open-source text-processing technologies to Turkic languages, including morphological analysis and generation, machine translation, and spell checking. The need for these technologies is motivated, along with the need for developing them under free\/open-source licenses.\u00a0 An &hellip; <a href=\"https:\/\/lingconf.com\/turkicsymposium\/program\/freeopen-source-text-processing-technologies-for-turkic-languages\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Free\/open-source text-processing technologies for Turkic languages<\/span> <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":8,"featured_media":0,"parent":28,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"advanced_seo_description":"","jetpack_seo_html_title":"","jetpack_seo_noindex":false,"jetpack_post_was_ever_published":false,"footnotes":""},"class_list":["post-84","page","type-page","status-publish","hentry"],"jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/P8RRaX-1m","jetpack-related-posts":[{"id":28,"url":"https:\/\/lingconf.com\/turkicsymposium\/program\/","url_meta":{"origin":84,"position":0},"title":"Program","author":"turkicsymposium","date":"July 8, 2017","format":false,"excerpt":"Download a copy of the program here. Schedule of Events Thursday October 5, 2017 211 Denny Hall Opening Event 9:00 \u2013 9:15 \u00a0\u00a0\u00a0\u00a0 Welcome and Morning Coffee: Selim Kuru and Talant Mawkanuli (University of Washington) Invited Talk 9:15 \u2013 9: 55 \u00a0\u00a0\u00a0 Christopher Beckwith (Indiana University) The Archaic Turkic Prophecy\u2026","rel":"","context":"Similar post","block_context":{"text":"Similar post","link":""},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":78,"url":"https:\/\/lingconf.com\/turkicsymposium\/program\/three-notes-on-translation-from-modern-uygur-to-turkish\/","url_meta":{"origin":84,"position":1},"title":"Three Notes on Translation from Modern Uygur to Turkish","author":"turkicsymposium","date":"September 22, 2017","format":false,"excerpt":"Nurettin Demir (Hacettepe University) For centuries the Turkic speaking world has been using Ottoman and Chaghatay, which were written with the same alphabet, as the two dominant written languages. The first half of the 20th century saw the rise of the new literary languages. Literary and cultural contacts between these\u2026","rel":"","context":"Similar post","block_context":{"text":"Similar post","link":""},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":90,"url":"https:\/\/lingconf.com\/turkicsymposium\/program\/on-the-typology-of-copied-verbs-in-modern-turkic-languages\/","url_meta":{"origin":84,"position":2},"title":"On the Typology of Copied Verbs in Modern Turkic Languages","author":"turkicsymposium","date":"September 22, 2017","format":false,"excerpt":"\u0130brahim Ahmet Aydemir (Hacettepe University) The purpose of this presentation is to present the typology of the copied verbs in Modern Turkic languages. As a result of the fact that the Turks have been in contact with different communities and cultures throughout history, many lexical elements have been copied from\u2026","rel":"","context":"Similar post","block_context":{"text":"Similar post","link":""},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":65,"url":"https:\/\/lingconf.com\/turkicsymposium\/program\/the-archaic-turkic-prophecy-and-the-sons-of-attila-analysis-of-the-earliest-turkic-language-data\/","url_meta":{"origin":84,"position":3},"title":"The Archaic Turkic Prophecy and the Sons of Attila: Analysis of the Earliest Turkic Language Data","author":"turkicsymposium","date":"September 22, 2017","format":false,"excerpt":"Christopher I. Beckwith (Indiana University) There has long been thought to be a Turkic component in the Empire of Attila the Hun (d. 453), as attested by many Turkic-sounding ethnonyms. But the names of Attila\u2019s sons are not mere random names (or titles); they constitute a clearly Turkic set of\u2026","rel":"","context":"Similar post","block_context":{"text":"Similar post","link":""},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":87,"url":"https:\/\/lingconf.com\/turkicsymposium\/program\/turkmen-language-from-the-view-of-semantic-typology\/","url_meta":{"origin":84,"position":4},"title":"Turkmen Language from the View of Semantic Typology","author":"turkicsymposium","date":"September 22, 2017","format":false,"excerpt":"Sema Aslan Demir (Hacettepe University) Turkmen belongs to the Oghuz branch of Turkic languages and is mostly spoken in Turkmenistan. It also has speakers in Iran, Afghanistan, Uzbekistan, Tajikistan, and some parts of the north Caucasus. Al\u00adthough it is an Oghuz language, it also shares some common areal features with\u2026","rel":"","context":"Similar post","block_context":{"text":"Similar post","link":""},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":75,"url":"https:\/\/lingconf.com\/turkicsymposium\/program\/the-2017-latinization-project-for-kazakh-a-report-from-the-field\/","url_meta":{"origin":84,"position":5},"title":"The 2017 Latinization Project for Kazakh: A Report from the Field","author":"turkicsymposium","date":"September 22, 2017","format":false,"excerpt":"Uli Schamiloglu (Nazarbayev University) The project to introduce a Latin alphabet for the official language of the Republic of Kazakhstan has a long and complicated history. In the early 20th century Kazakhs used a version of the Arabic script based on the Chagatay Turkic literary language. Later, following the Bolshevik\u2026","rel":"","context":"Similar post","block_context":{"text":"Similar post","link":""},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/lingconf.com\/turkicsymposium\/wp-json\/wp\/v2\/pages\/84","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/lingconf.com\/turkicsymposium\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/lingconf.com\/turkicsymposium\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/lingconf.com\/turkicsymposium\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/lingconf.com\/turkicsymposium\/wp-json\/wp\/v2\/comments?post=84"}],"version-history":[{"count":3,"href":"https:\/\/lingconf.com\/turkicsymposium\/wp-json\/wp\/v2\/pages\/84\/revisions"}],"predecessor-version":[{"id":118,"href":"https:\/\/lingconf.com\/turkicsymposium\/wp-json\/wp\/v2\/pages\/84\/revisions\/118"}],"up":[{"embeddable":true,"href":"https:\/\/lingconf.com\/turkicsymposium\/wp-json\/wp\/v2\/pages\/28"}],"wp:attachment":[{"href":"https:\/\/lingconf.com\/turkicsymposium\/wp-json\/wp\/v2\/media?parent=84"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}