Journal article

From the National Corpus of Polish to the Polish Corpus Infrastructure

dc.abstract.en: The National Corpus of Polish emerged as a cumulative result of many years of work on large reference corpora by computer scientists and linguists in Poland. While its impact on research in linguistics, humanities and language technology is unquestionable and highly significant, the construction of the national corpus was halted in 2011. In the paper we call for activating the research community and funding institutions around the construction of a corpus infrastructure with the national corpus at its heart. It is claimed that on the verge of an artificial intelligence revolution the envisaged Polish Corpus Infrastructure would provide reliable language data, combine available resources and allow easy integration of new ones.
dc.affiliation: Uniwersytet Warszawski
dc.contributor.author: Ogrodniczuk, Maciej
dc.contributor.author: Pęzik, Piotr
dc.contributor.author: Górski, Rafał
dc.contributor.author: Łaziński, Marek
dc.date.accessioned: 2024-01-25T01:37:37Z
dc.date.available: 2024-01-25T01:37:37Z
dc.date.copyright: 2019-12-01
dc.date.issued: 2019
dc.description.accesstime: AT_PUBLICATION
dc.description.finance: Not applicable
dc.description.number: 2
dc.description.version: FINAL_PUBLISHED
dc.description.volume: 70
dc.identifier.doi: 10.2478/JAZCAS-2019-0061
dc.identifier.issn: 0021-5597
dc.identifier.uri: https://repozytorium.uw.edu.pl//handle/item/107576
dc.identifier.weblink: http://dx.doi.org/10.2478/jazcas-2019-0061
dc.language: eng
dc.pbn.affiliation: linguistics
dc.relation.ispartof: Jazykovedny Casopis
dc.relation.pages: 315-323
dc.rights: Other
dc.sciencecloud: nosend
dc.subject.en: corpus linguistics
dc.subject.en: corpus lexicography
dc.subject.en: dialect corpora
dc.title: From the National Corpus of Polish to the Polish Corpus Infrastructure
dc.type: JournalArticle
dspace.entity.type: Publication