User profiles for Bartlomiej Niton
Bartłomiej Nitoń, Institute of Computer Science, Polish Academy of Sciences. Verified email at mion.elka.pw.edu.pl. Cited by 359
Measuring Readability of Polish Texts: Baseline Experiments.
Measuring readability of a text is the first sensible step to its simplification. In this paper we
present an overview of the most common approaches to automatic measuring of readability. …
Introducing the CURLICAT corpora: seven-language domain specific annotated corpora from curated sources
This article presents the current outcomes of the CURLICAT CEF Telecom project, which
aims to collect and deeply annotate a set of large corpora from selected domains. The …
The MARCELL legislative corpus
This article presents the current outcomes of the MARCELL CEF Telecom project aiming to
collect and deeply annotate a large comparable corpus of legal documents. The MARCELL …
Keyword extraction from short texts with a text-to-text transfer transformer
The paper explores the relevance of the Text-To-Text Transfer Transformer language model
(T5) for Polish (plT5) to the task of intrinsic and extrinsic keyword extraction from short text …
Jasnopis–a program to compute readability of texts in Polish based on psycholinguistic research
Readability of a text is a measure how difficult the text is to understand on average. The aim
of the present paper is twofold. First, we have determined through a psychological …
New developments in the Polish parliamentary corpus
M Ogrodniczuk, B Nitoń - Proceedings of the Second …, 2020 - aclanthology.org
This short paper presents the current (as of February 2020) state of preparation of the Polish
Parliamentary Corpus (PPC)—an extensive collection of transcripts of Polish parliamentary …
Deep neural networks for coreference resolution for Polish
B Nitoń, P Morawiecki… - Proceedings of the …, 2018 - aclanthology.org
The paper presents several configurations of deep neural networks aimed at the task of
coreference resolution for Polish. Starting with the basic feature set and standard word …
Linguistically annotated multilingual comparable corpora of parliamentary debates in English ParlaMint-en.ana 3.0
ParlaMint-en 3.0 comprises linguistically annotated multilingual comparable corpora of
parliamentary debates ParlaMint.ana 3.0 (http://hdl.handle.net/11356/1488) which were machine …
Transferable keyword extraction and generation with text-to-text language models
P Pęzik, A Mikołajczyk, A Wawrzyński… - International Conference …, 2023 - Springer
This paper explores the performance of the T5 text-to-text transfer-transformer language
model together with some other generative models on the task of generating keywords from …
HerBERT Based Language Model Detects Quantifiers and Their Semantic Properties in Polish
The paper presents a tool for automatic marking up of quantifying expressions, their semantic
features, and scopes. We explore the idea of using a BERT based neural model for the …