User profiles for Bartlomiej Niton

Bartłomiej Nitoń

Institute of Computer Science, Polish Academy of Sciences
Verified email at mion.elka.pw.edu.pl
Cited by 359

[PDF][PDF] Measuring Readability of Polish Texts: Baseline Experiments.

B Broda, B Niton, W Gruszczynski, M Ogrodniczuk - LREC, 2014 - academia.edu
Measuring readability of a text is the first sensible step to its simplification. In this paper we
present an overview of the most common approaches to automatic measuring of readability. …

Introducing the CURLICAT corpora: seven-language domain specific annotated corpora from curated sources

T Váradi, B Nyéki, S Koeva, M Tadić… - Proceedings of the …, 2022 - aclanthology.org
This article presents the current outcomes of the CURLICAT CEF Telecom project, which
aims to collect and deeply annotate a set of large corpora from selected domains. The …

The MARCELL legislative corpus

T Váradi, S Koeva, M Yamalov, M Tadić… - Proceedings of the …, 2020 - aclanthology.org
This article presents the current outcomes of the MARCELL CEF Telecom project aiming to
collect and deeply annotate a large comparable corpus of legal documents. The MARCELL …

Keyword extraction from short texts with a text-to-text transfer transformer

P Pęzik, A Mikołajczyk, A Wawrzyński, B Nitoń… - Asian Conference on …, 2022 - Springer
The paper explores the relevance of the Text-To-Text Transfer Transformer language model
(T5) for Polish (plT5) to the task of intrinsic and extrinsic keyword extraction from short text …

[PDF][PDF] Jasnopis–a program to compute readability of texts in polish based on psycholinguistic research

Ł Dębowski, B Broda, B Nitoń… - … Processing and Cognitive …, 2015 - academia.edu
Readability of a text is a measure how difficult the text is to understand on average. The aim
of the present paper is twofold. First, we have determined through a psychological …

New developments in the Polish parliamentary corpus

M Ogrodniczuk, B Nitoń - Proceedings of the Second …, 2020 - aclanthology.org
This short paper presents the current (as of February 2020) state of preparation of the Polish
Parliamentary Corpus (PPC)—an extensive collection of transcripts of Polish parliamentary …

[PDF][PDF] Deep neural networks for coreference resolution for Polish

B Nitoń, P Morawiecki… - Proceedings of the …, 2018 - aclanthology.org
The paper presents several configurations of deep neural networks aimed at the task of
coreference resolution for Polish. Starting with the basic feature set and standard word …

Linguistically annotated multilingual comparable corpora of parliamentary debates in English ParlaMint-en. ana 3.0

T Kuzman, N Ljubešić, T Erjavec… - … debates in English …, 2023 - investigacion.usc.gal
ParlaMint-en 3.0 comprises linguistically annotated multilingual comparable corpora of
parliamentary debates ParlaMint.ana 3.0 (http://hdl.handle.net/11356/1488) which were machine …

Transferable keyword extraction and generation with text-to-text language models

P Pęzik, A Mikołajczyk, A Wawrzyński… - International Conference …, 2023 - Springer
This paper explores the performance of the T5 text-to-text transfer-transformer language
model together with some other generative models on the task of generating keywords from …

HerBERT Based Language Model Detects Quantifiers and Their Semantic Properties in Polish

M Woliński, B Nitoń, W Kieraś… - Proceedings of the …, 2022 - aclanthology.org
The paper presents a tool for automatic marking up of quantifying expressions, their semantic
features, and scopes. We explore the idea of using a BERT based neural model for the …