{"id":189,"date":"2021-05-14T08:24:22","date_gmt":"2021-05-14T06:24:22","guid":{"rendered":"https:\/\/miningtext.at\/?page_id=189"},"modified":"2021-05-14T10:17:44","modified_gmt":"2021-05-14T08:17:44","slug":"named-entity-recognition-linking-ner-nel","status":"publish","type":"page","link":"https:\/\/miningtext.at\/en\/named-entity-recognition-linking-ner-nel","title":{"rendered":"Named Entity Recognition &amp; Linking (NER\/NEL)"},"content":{"rendered":"<div data-colibri-id=\"189-c1\" class=\"style-470 style-local-189-c1 position-relative\">\n  <!---->\n  <div data-colibri-component=\"section\" data-colibri-id=\"189-c2\" id=\"overlappable\" class=\"h-section h-section-global-spacing d-flex align-items-lg-center align-items-md-center align-items-center style-507 style-local-189-c2 position-relative\">\n    <!---->\n    <!---->\n    <div class=\"h-section-grid-container h-section-boxed-container\">\n      <!---->\n      <div data-colibri-id=\"189-c3\" class=\"h-row-container gutters-row-lg-2 gutters-row-md-2 gutters-row-0 gutters-row-v-lg-2 gutters-row-v-md-2 gutters-row-v-2 style-521 style-local-189-c3 position-relative\">\n        <!---->\n        <div class=\"h-row justify-content-lg-center justify-content-md-center justify-content-center align-items-lg-stretch align-items-md-stretch align-items-stretch gutters-col-lg-2 gutters-col-md-2 gutters-col-0 gutters-col-v-lg-2 gutters-col-v-md-2 gutters-col-v-2\">\n          <!---->\n          <div class=\"h-column h-column-container d-flex h-col-lg-auto h-col-md-auto h-col-auto style-522-outer style-local-189-c4-outer\">\n            <div data-colibri-id=\"189-c4\" class=\"d-flex h-flex-basis h-column__inner h-px-lg-2 h-px-md-2 h-px-2 v-inner-lg-2 v-inner-md-2 v-inner-2 style-522 style-local-189-c4 position-relative\">\n              <!---->\n              <!---->\n              <div class=\"w-100 h-y-container h-column__content h-column__v-align flex-basis-100 align-self-lg-start align-self-md-start 
align-self-start\">\n                <!---->\n                <div data-colibri-id=\"189-c5\" class=\"style-524 style-local-189-c5 position-relative h-element\">\n                  <!---->\n                <\/div>\n              <\/div>\n            <\/div>\n          <\/div>\n        <\/div>\n      <\/div>\n      <div data-colibri-id=\"189-c6\" class=\"h-row-container gutters-row-lg-2 gutters-row-md-2 gutters-row-0 gutters-row-v-lg-2 gutters-row-v-md-2 gutters-row-v-2 style-519 style-local-189-c6 position-relative\">\n        <!---->\n        <div class=\"h-row justify-content-lg-center justify-content-md-center justify-content-center align-items-lg-stretch align-items-md-stretch align-items-stretch gutters-col-lg-2 gutters-col-md-2 gutters-col-0 gutters-col-v-lg-2 gutters-col-v-md-2 gutters-col-v-2\">\n          <!---->\n          <div class=\"h-column h-column-container d-flex h-col-lg-auto h-col-md-auto h-col-auto style-520-outer style-local-189-c7-outer\">\n            <div data-colibri-id=\"189-c7\" class=\"d-flex h-flex-basis h-column__inner h-px-lg-2 h-px-md-2 h-px-2 v-inner-lg-2 v-inner-md-2 v-inner-2 style-520 style-local-189-c7 position-relative\">\n              <!---->\n              <!---->\n              <div class=\"w-100 h-y-container h-column__content h-column__v-align flex-basis-100 align-self-lg-start align-self-md-start align-self-start\">\n                <!---->\n                <div data-colibri-id=\"189-c8\" class=\"h-text h-text-component style-518 style-local-189-c8 position-relative h-element\">\n                  <!---->\n                  <!---->\n                  <div class=\"\">\n                    <p>To develop a workflow for the automatic recognition of place and person names (Named Entity Recognition), T.M.M.M.T. 
combines two research approaches:<\/p>\n                  <\/div>\n                <\/div>\n              <\/div>\n            <\/div>\n          <\/div>\n        <\/div>\n      <\/div>\n    <\/div>\n  <\/div>\n  <div data-colibri-component=\"section\" data-colibri-id=\"189-c9\" id=\"overlappable-2\" class=\"h-section h-section-global-spacing d-flex align-items-lg-center align-items-md-center align-items-center style-526 style-local-189-c9 position-relative\">\n    <!---->\n    <!---->\n    <div class=\"h-section-grid-container h-section-boxed-container\">\n      <!---->\n      <div data-colibri-id=\"189-c10\" class=\"h-row-container gutters-row-lg-0 gutters-row-md-0 gutters-row-2 gutters-row-v-lg-0 gutters-row-v-md-0 gutters-row-v-2 style-527 style-local-189-c10 position-relative\">\n        <!---->\n        <div class=\"h-row justify-content-lg-center justify-content-md-center justify-content-center align-items-lg-stretch align-items-md-stretch align-items-stretch gutters-col-lg-0 gutters-col-md-0 gutters-col-2 gutters-col-v-lg-0 gutters-col-v-md-0 gutters-col-v-2\">\n          <!---->\n          <div class=\"h-column h-column-container d-flex h-col-lg-4 h-col-md-4 h-col-12 style-528-outer style-local-189-c11-outer\">\n            <div data-colibri-id=\"189-c11\" class=\"d-flex h-flex-basis h-column__inner h-px-lg-3 h-px-md-3 h-px-2 v-inner-lg-3 v-inner-md-3 v-inner-2 style-528 style-local-189-c11 position-relative\">\n              <!---->\n              <!---->\n              <div class=\"w-100 h-y-container h-column__content h-column__v-align flex-basis-100 align-self-lg-start align-self-md-start align-self-start\">\n                <!---->\n                <div data-colibri-id=\"189-c12\" class=\"h-icon style-529 style-local-189-c12 position-relative h-element\">\n                  <!----><span class=\"h-svg-icon h-icon__icon style-529-icon style-local-189-c12-icon\"><!--Icon by Font Awesome (https:\/\/fontawesome.com)--><svg version=\"1.1\" 
xmlns=\"http:\/\/www.w3.org\/2000\/svg\" xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" id=\"connectdevelop\" viewbox=\"0 0 2048 1896.0833\"><path d=\"M2048 895q0 21-13 36.5t-33 19.5l-205 356q3 9 3 18 0 20-12.5 35.5T1755 1380l-193 337q3 8 3 16 0 23-16.5 40t-40.5 17q-25 0-41-18h-400q-17 20-43 20t-43-20H582q-17 20-43 20-23 0-40-16.5t-17-40.5q0-8 4-20l-193-335q-20-4-32.5-19.5T248 1325q0-9 3-18L45 951q-20-5-32.5-20.5T0 895q0-21 13.5-36.5T47 839l199-344q0-1-.5-3t-.5-3q0-36 34-51L488 75q-4-10-4-18 0-24 17-40.5T541 0q26 0 44 21h396q16-21 43-21t43 21h398q18-21 44-21 23 0 40 16.5t17 40.5q0 6-4 18l207 358q23 1 39 17.5t16 38.5q0 13-7 27l187 324q19 4 31.5 19.5T2048 895zm-985 799h389l-342-354H967l-342 354h360q18-16 39-16t39 16zM112 882q1 4 1 13 0 10-2 15l208 360 15 6 188-199V730L335 536q-13 8-29 10zM986 98H598l190 200 554-200h-280q-16 16-38 16t-38-16zm703 1212q1-6 5-11l-64-68-17 79h76zm-106 0l22-105-252-266-296 307 63 64h463zm-88 368l16-28 65-310h-427l333 343q8-4 13-5zm-917 16h5l342-354H552v335l4 6q14 5 22 13zm-26-384h402l64-66-309-321-157 166v221zm-193 0h163v-189l-168 177q4 8 5 12zm-1-825q0 1 .5 2t.5 2q0 16-8 29l171 177V426zm194-70v311l153 157 297-314-223-236zm4-304l-4 8v264l205-74-191-201q-6 2-10 3zm891-13h-16L810 322l213 225zm-424 492L726 905l311 319 296-307zM688 902L552 761v284zm350 364l-42 44h85zm336-348l238 251 132-624-3-5-1-1zm344-400q-8-13-8-29v-2l-216-376q-5-1-13-5l-437 463 310 327zM522 394V171L359 453zm0 946H359l163 283v-283zm1085 0l-48 227 130-227h-82zm122-70l207-361q-2-10-2-14 0-1 3-16l-171-296-129 612 77 82q5-3 15-7z\"><\/path><\/svg><\/span><\/div>\n                <div\n                  data-colibri-id=\"189-c13\" class=\"h-global-transition-all h-heading style-530 style-local-189-c13 position-relative h-element\">\n                  <!---->\n                  <div class=\"h-heading__outer style-530 style-local-189-c13\">\n                    <!---->\n                    <!---->\n                    <h5 class=\"\">NER for historical texts<\/h5>\n               
   <\/div>\n              <\/div>\n              <div data-colibri-id=\"189-c14\" class=\"h-text h-text-component style-531 style-local-189-c14 position-relative h-element\">\n                <!---->\n                <!---->\n                <div class=\"\">\n                  <p>The pipeline performs Named Entity Recognition (NER) in two steps: (1) a specialised NER step based on, in simple terms, comparisons between tokens and pre-built lists of place and person names, and (2) NER as part of the part-of-speech tagging step.<\/p>\n                <\/div>\n              <\/div>\n            <\/div>\n          <\/div>\n        <\/div>\n        <div class=\"h-column h-column-container d-flex h-col-lg-4 h-col-md-4 h-col-12 style-528-outer style-local-189-c15-outer\">\n          <div data-colibri-id=\"189-c15\" class=\"d-flex h-flex-basis h-column__inner h-px-lg-3 h-px-md-3 h-px-2 v-inner-lg-3 v-inner-md-3 v-inner-2 style-528 style-local-189-c15 position-relative\">\n            <!---->\n            <!---->\n            <div class=\"w-100 h-y-container h-column__content h-column__v-align flex-basis-100 align-self-lg-start align-self-md-start align-self-start\">\n              <!---->\n              <div data-colibri-id=\"189-c16\" class=\"h-icon style-529 style-local-189-c16 position-relative h-element\">\n                <!----><span class=\"h-svg-icon h-icon__icon style-529-icon style-local-189-c16-icon\"><!--Icon by Font Awesome (https:\/\/fontawesome.com)--><svg version=\"1.1\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" id=\"file-text-o\" viewbox=\"0 0 1611.2499 1896.0833\"><path d=\"M1468 380q28 28 48 76t20 88v1152q0 40-28 68t-68 28H96q-40 0-68-28t-28-68V96q0-40 28-68T96 0h896q40 0 88 20t76 48zm-444-244v376h376q-10-29-22-41l-313-313q-12-12-41-22zm384 1528V640H992q-40 0-68-28t-28-68V128H128v1536h1280zM384 800q0-14 9-23t23-9h704q14 0 23 9t9 23v64q0 14-9 23t-23 9H416q-14 0-23-9t-9-23v-64zm736 224q14 0 23 9t9 23v64q0 14-9 
23t-23 9H416q-14 0-23-9t-9-23v-64q0-14 9-23t23-9h704zm0 256q14 0 23 9t9 23v64q0 14-9 23t-23 9H416q-14 0-23-9t-9-23v-64q0-14 9-23t23-9h704z\"><\/path><\/svg><\/span><\/div>\n              <div\n                data-colibri-id=\"189-c17\" class=\"h-global-transition-all h-heading style-530 style-local-189-c17 position-relative h-element\">\n                <!---->\n                <div class=\"h-heading__outer style-530 style-local-189-c17\">\n                  <!---->\n                  <!---->\n                  <h5 class=\"\">Information Extraction &amp; Gazetteers<\/h5>\n                <\/div>\n            <\/div>\n            <div data-colibri-id=\"189-c18\" class=\"h-text h-text-component style-531 style-local-189-c18 position-relative h-element\">\n              <!---->\n              <!---->\n              <div class=\"\">\n                <p>Extraction of names (mines, places, persons) using Postgres. The resulting gazetteers and registers were used to develop &amp; support an NER workflow.<\/p>\n              <\/div>\n            <\/div>\n          <\/div>\n        <\/div>\n      <\/div>\n    <\/div>\n  <\/div>\n<\/div>\n<\/div>\n<div data-colibri-component=\"section\" data-colibri-id=\"189-c19\" id=\"custom\" class=\"h-section h-section-global-spacing d-flex align-items-lg-center align-items-md-center align-items-center style-534 style-local-189-c19 position-relative\">\n  <!---->\n  <!---->\n  <div class=\"h-section-grid-container h-section-boxed-container\">\n    <!---->\n    <div data-colibri-id=\"189-c20\" class=\"h-row-container gutters-row-lg-2 gutters-row-md-2 gutters-row-0 gutters-row-v-lg-2 gutters-row-v-md-2 gutters-row-v-2 style-535 style-local-189-c20 position-relative\">\n      <!---->\n      <div class=\"h-row justify-content-lg-center justify-content-md-center justify-content-center align-items-lg-stretch align-items-md-stretch align-items-stretch gutters-col-lg-2 gutters-col-md-2 gutters-col-0 gutters-col-v-lg-2 gutters-col-v-md-2 
gutters-col-v-2\">\n        <!---->\n        <div class=\"h-column h-column-container d-flex h-col-lg-auto h-col-md-auto h-col-auto style-536-outer style-local-189-c21-outer\">\n          <div data-colibri-id=\"189-c21\" class=\"d-flex h-flex-basis h-column__inner h-px-lg-2 h-px-md-2 h-px-2 v-inner-lg-2 v-inner-md-2 v-inner-2 style-536 style-local-189-c21 position-relative\">\n            <!---->\n            <!---->\n            <div class=\"w-100 h-y-container h-column__content h-column__v-align flex-basis-100 align-self-lg-start align-self-md-start align-self-start\">\n              <!---->\n              <div data-colibri-id=\"189-c22\" class=\"h-text h-text-component style-538 style-local-189-c22 position-relative h-element\">\n                <!---->\n                <!---->\n                <div class=\"\">\n                  <p>References<\/p>\n                  <p>Schmid, Helmut. &#8220;Probabilistic part-of-speech tagging using decision trees.&#8221; International Conference on New Methods in Language Processing. 1994.<\/p>\n                  <p>Schmid, Helmut. &#8220;Deep learning-based morphological taggers and lemmatizers for annotating historical texts.&#8221; Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage. 2019.<\/p>\n                <\/div>\n              <\/div>\n            <\/div>\n          <\/div>\n        <\/div>\n      <\/div>\n    <\/div>\n  <\/div>\n<\/div>\n<\/div>","protected":false},"excerpt":{"rendered":"<p>To develop a workflow for the automatic recognition of place and person names (Named Entity Recognition), T.M.M.M.T. 
combines two research approaches NER for historical texts NER adapted for late Middle High German texts in two steps: (1) a specialised NER step based on comparisons between tokens and pre-built lists of place and person names (2) NER as part of the part-of-speech tagging step (deep learning approach) Information Extraction [&hellip;]<\/p>","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"page-templates\/full-width-page.php","meta":{"footnotes":""},"class_list":["post-189","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/miningtext.at\/en\/wp-json\/wp\/v2\/pages\/189","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/miningtext.at\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/miningtext.at\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/miningtext.at\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/miningtext.at\/en\/wp-json\/wp\/v2\/comments?post=189"}],"version-history":[{"count":3,"href":"https:\/\/miningtext.at\/en\/wp-json\/wp\/v2\/pages\/189\/revisions"}],"predecessor-version":[{"id":281,"href":"https:\/\/miningtext.at\/en\/wp-json\/wp\/v2\/pages\/189\/revisions\/281"}],"wp:attachment":[{"href":"https:\/\/miningtext.at\/en\/wp-json\/wp\/v2\/media?parent=189"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}