{"id":559,"date":"2023-02-19T18:19:24","date_gmt":"2023-02-19T18:19:24","guid":{"rendered":"http:\/\/stari.jerteh.rs\/?page_id=559"},"modified":"2023-02-19T18:19:25","modified_gmt":"2023-02-19T18:19:25","slug":"elg-resources-and-tools","status":"publish","type":"page","link":"https:\/\/jerteh.rs\/index.php\/en\/elg-resources-and-tools\/","title":{"rendered":"ELG resources and tools"},"content":{"rendered":"\n<p><\/p>\n\n\n\n<h4 class=\"wp-block-heading\">Corpora<\/h4>\n\n\n\n<ul class=\"wp-block-list\">\n<li><em><a rel=\"noreferrer noopener\" aria-label=\" (opens in a new tab)\" href=\"https:\/\/live.european-language-grid.eu\/catalogue\/corpus\/9485\" target=\"_blank\">SrpELTeC-gold<\/a> &#8211; Named Entity Recognition Training corpus for Serbian<\/em>  &#8211; The selection of 11 full novels and excerpts from 15 novels from Serbian literary corpus of novels written more than a century ago, have been automatically labelled with SrpNER system for Serbian&nbsp; in the first stage of the gold standard preparation.  Contains 330.119  tokens, 7 classes: person, organization, location, event, work, demonym, role. License <strong><em>CC-BY-NC-SA-4.0.<\/em><\/strong>  <\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a rel=\"noreferrer noopener\" aria-label=\"SrpKor4Tagging (opens in a new tab)\" href=\"https:\/\/live.european-language-grid.eu\/catalogue\/corpus\/9295\" target=\"_blank\"><em>SrpKor4Tagging<\/em><\/a> &#8211; Corpus is created via mix of literary (\u2153) and administrative (\u2154) texts in Serbian. It is tagged for POS for 2 tagsets: Universal POS tagset and SrpLemKor tagset (made according to traditional, descriptive Serbian grammar) and lemmatized. Consists of  342.803 tokens, license <em><strong>CC-BY-4.0<\/strong><\/em>.  <\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a rel=\"noreferrer noopener\" aria-label=\"RudKorP - Serbian Public Mining Corpus (opens in a new tab)\" href=\"https:\/\/live.european-language-grid.eu\/catalogue\/corpus\/9292\" target=\"_blank\"><em>RudKorP<\/em><\/a> &#8211; <em>Serbian Public Mining Corpus<\/em> &#8211;  Mining Corpus, specialized corpus in the field of mining and mineral resource exploitation, University of Belgrade, Faculty of Mining and Geology. Contains  2.34 million words, license <strong><em>CC-BY-4.0. <\/em><\/strong>  <\/li>\n\n\n\n<li><a href=\"https:\/\/live.european-language-grid.eu\/catalogue\/corpus\/657\">INTERA Corpus &#8211; the Serbian-English part<\/a> &#8211; bilingual corpus 1 million words per language, paired at sentence level, license <strong><em>CC-BY-4.0. 1.<\/em><\/strong><\/li>\n\n\n\n<li> <a href=\"https:\/\/live.european-language-grid.eu\/catalogue\/corpus\/685\">INTERA Corpus &#8211; the Serbian POS annotated part of the SR-EN pair <\/a>&#8211;  <br> million words, license <strong><em>CC BY 4.0.<\/em><\/strong><\/li>\n\n\n\n<li><a href=\"https:\/\/live.european-language-grid.eu\/catalogue\/corpus\/8185\">MULTEXT-East &#8220;1984&#8221; annotated corpus 4.0<\/a> &#8211; automatically tagged by grammar categories, part of speech and lemmas and manually corrected, license MULTEXT-East <a href=\"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/\"><strong><em>CC BY-NC-SA 4.0<\/em><\/strong><\/a><strong><em>. <\/em><\/strong><\/li>\n\n\n\n<li> <a href=\"https:\/\/live.european-language-grid.eu\/catalogue\/corpus\/13141\">Corpus 80 jours<\/a> parallel corpus consists of 3.700 paired segments, mainly sentences, license <strong><em>CC-BY-NC-SA-4.0.<\/em><\/strong>  <\/li>\n<\/ul>\n\n\n\n<h4 class=\"wp-block-heading\">Models<\/h4>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a rel=\"noreferrer noopener\" href=\"https:\/\/live.european-language-grid.eu\/catalogue\/ld\/9484\" target=\"_blank\"><em>SrpCNNER<\/em><\/a> &#8211; <em>Named Entity Recognizer for Serbian (7 classes)<\/em> &#8211;  <br>A Named Entity Recognizer (NER) trained to recognize 7 different named entity types, with a Convolutional Neural Network (CNN) architecture, having F1 score of approx 91% on the test dataset. <br>License <strong><em>CC-BY-NC-SA-4.0. <\/em><\/strong>  <\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a rel=\"noreferrer noopener\" aria-label=\"SrpKor4Tagging-TreeTagger (opens in a new tab)\" href=\"https:\/\/live.european-language-grid.eu\/catalogue\/ld\/9296\" target=\"_blank\"><em>SrpKor4Tagging-TreeTagger<\/em><\/a> &#8211;  TreeTagger models for tagging using Universal POS and SrpLemKor tagsets, trained using the SrpKor4Tagging annotated corpora and SrpMD4Tagging lexicons. License  <strong><em>CC-BY-4.0.<\/em><\/strong>  <\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a rel=\"noreferrer noopener\" aria-label=\"SrpKor4Tagging-spaCy (opens in a new tab)\" href=\"https:\/\/live.european-language-grid.eu\/catalogue\/ld\/9297\" target=\"_blank\">SrpKor4Tagging-spaCy<\/a> &#8211;  spaCy POS-tagging models for tagging using Universal POS and SrpLemKor tagsets, trained using the SrpKor4Tagging annotated corpora. License<strong><em>&nbsp;CC-BY-4.0.<\/em><\/strong>   <\/li>\n<\/ul>\n\n\n\n<h4 class=\"wp-block-heading\">Lexicons<\/h4>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a rel=\"noreferrer noopener\" aria-label=\"SrpMD (opens in a new tab)\" href=\"https:\/\/live.european-language-grid.eu\/catalogue\/lcr\/17355\" target=\"_blank\"><em>SrpMD<\/em><\/a><em> &#8211; Serbian Morphological Dictionaries<\/em> &#8211;  SrpMD follows the methodology and format (known as DELAS\/DELAF) that was developed in LADL (Laboratoire d&#8217;Automatique Documentaire et Linguistique), 10.288 multiword units, 88.753 simple words \u0438 3.753.750 word forms, license <strong><em>CC-BY-NC-SA-4.0.<\/em><\/strong>  <\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a rel=\"noreferrer noopener\" aria-label=\"SrpMD4Tagging (opens in a new tab)\" href=\"https:\/\/live.european-language-grid.eu\/catalogue\/lcr\/9294\" target=\"_blank\"><em>SrpMD4Tagging<\/em><\/a><em> &#8211; Serbian Morphological Dictionaries for Tagging <\/em>-SrpMD4Tagging &#8211; Serbian Morphological Dictionaries for Tagging derived from Serbian Morphological Dictionaries (Krstev &amp; Vitas) &nbsp;as lookup dictionary for assigning lemma for given inflected form and POS tag. Two files for two POS tagsets available: Univesal Dependencies and traditional Serbian POS tagset, <br>935.466 tagged word forms, license <strong><em>CC-BY-NC-SA-4.0.<\/em><\/strong><\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a rel=\"noreferrer noopener\" aria-label=\"GeolISSTerm (opens in a new tab)\" href=\"https:\/\/live.european-language-grid.eu\/catalogue\/lcr\/9285\" target=\"_blank\"><em>GeolISSTerm<\/em><\/a><em> &#8211; dictionary of geologic terms<\/em>  is the electronic dictionary as a special-purpose taxonomy of basic geologic concepts and terms. GeolISSTerm is part of the Geologic Information System of Serbia (GeolISS) used for validation, classification and specification of the observed and the interpreted geological attributes. Contains 2.631 bilingual terms with definitions and synonyms, license<br><strong><em>CC-BY-NC-SA-4.0.<\/em><\/strong>  <\/li>\n<\/ul>\n\n\n\n<h4 class=\"wp-block-heading\">Tools<\/h4>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a rel=\"noreferrer noopener\" aria-label=\"Bibli\u0161a (opens in a new tab)\" href=\"https:\/\/live.european-language-grid.eu\/catalogue\/tool-service\/17357\" target=\"_blank\"><em>Bibli\u0161a<\/em><\/a><em> &#8211; Multilingual digital library tool<\/em> &#8211;  Biblisha is publicly available multilingual digital library, developed for management, search and the browsing of aligned bilingual text collections. Based on MongoDB NoSQL-database, a web tool enable the use of rich information in the stored text collections. Two level search, with and without login.<\/li>\n<\/ul>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a rel=\"noreferrer noopener\" aria-label=\"Leximirka (opens in a new tab)\" href=\"https:\/\/live.european-language-grid.eu\/catalogue\/tool-service\/17356\" target=\"_blank\"><em>Leximirka<\/em><\/a><em> &#8211; lexical database <\/em> and a web application for developing, managing and exploring lexicographic data. It enables lexical entry control, automatic vocabulary enrichment, multiuser work, and establishment of relations among lexical entries. The rule-based system enables automatic linking between lexical entries. Login required for the search.<\/li>\n<\/ul>\n\n\n<p><!--EndFragment--><\/p>\n<p><\/p>\n\n\n<h4 class=\"wp-block-heading\" id=\"mce_19\">ELG services<\/h4>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a rel=\"noreferrer noopener\" href=\"https:\/\/live.european-language-grid.eu\/catalogue\/tool-service\/18077\" target=\"_blank\"><em>SrpCNNER service<\/em><\/a> &#8211; Web service that enables annotation of provided text using&nbsp;<a rel=\"noreferrer noopener\" href=\"https:\/\/live.european-language-grid.eu\/catalogue\/ld\/9484\/\" target=\"_blank\">SrpCNNER<\/a>&nbsp;having the following tagset: PERS, ROLE, LOC, DEMO, ORG, WORK &amp; EVENT.&nbsp;Available online without login.<\/li>\n<\/ul>\n\n\n<p><\/p>","protected":false},"excerpt":{"rendered":"<p>Corpora Models Lexicons Tools ELG services<\/p>\n<p class=\"continue-reading-button\"> <a class=\"continue-reading-link\" href=\"https:\/\/jerteh.rs\/index.php\/en\/elg-resources-and-tools\/\">\u041f\u0440\u043e\u0447\u0438\u0442\u0430\u0458\u0442\u0435 \u0432\u0438\u0448\u0435<i class=\"crycon-right-dir\"><\/i><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-559","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/jerteh.rs\/index.php\/wp-json\/wp\/v2\/pages\/559","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jerteh.rs\/index.php\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/jerteh.rs\/index.php\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/jerteh.rs\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jerteh.rs\/index.php\/wp-json\/wp\/v2\/comments?post=559"}],"version-history":[{"count":1,"href":"https:\/\/jerteh.rs\/index.php\/wp-json\/wp\/v2\/pages\/559\/revisions"}],"predecessor-version":[{"id":560,"href":"https:\/\/jerteh.rs\/index.php\/wp-json\/wp\/v2\/pages\/559\/revisions\/560"}],"wp:attachment":[{"href":"https:\/\/jerteh.rs\/index.php\/wp-json\/wp\/v2\/media?parent=559"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}