Vilém Zouhar
2020 – today
- 2024
- [c21]Vilém Zouhar, Shuoyang Ding, Anna Currey, Tatyana Badeka, Jenyuan Wang, Brian Thompson:
Fine-Tuned Machine Translation Metrics Struggle in Unseen Domains. ACL (Short Papers) 2024: 488-500
- [c20]Tom Kocmi, Vilém Zouhar, Christian Federmann, Matt Post:
Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies. ACL (1) 2024: 1999-2014
- [c19]Peng Cui, Vilém Zouhar, Xiaoyu Zhang, Mrinmaya Sachan:
How to Engage your Readers? Generating Guiding Questions to Promote Active Reading. ACL (1) 2024: 11749-11765
- [c18]Furui Cheng, Vilém Zouhar, Simran Arora, Mrinmaya Sachan, Hendrik Strobelt, Mennatallah El-Assady:
RELIC: Investigating Large Language Model Responses using Self-Consistency. CHI 2024: 647:1-647:18
- [c17]Vilém Zouhar, Kalvin Chang, Chenxuan Cui, Nate B. Carlson, Nathaniel Romney Robinson, Mrinmaya Sachan, David R. Mortensen:
PWESuite: Phonetic Word Embeddings and Tasks They Facilitate. LREC/COLING 2024: 13344-13355
- [c16]Marco Cognetta, Vilém Zouhar, Sangwhan Moon, Naoaki Okazaki:
Two Counterexamples to Tokenization and the Noiseless Channel. LREC/COLING 2024: 16897-16906
- [c15]Marco Cognetta, Vilém Zouhar, Naoaki Okazaki:
Distributional Properties of Subword Regularization. EMNLP 2024: 10753-10763
- [c14]Sankalan Pal Chowdhury, Vilém Zouhar, Mrinmaya Sachan:
AutoTutor meets Large Language Models: A Language Model Tutor with Rich Pedagogy and Guardrails. L@S 2024: 5-15
- [i34]Vilém Zouhar, Ondrej Bojar:
Quality and Quantity of Machine Translation References for Automated Metrics. CoRR abs/2401.01283 (2024)
- [i33]Tom Kocmi, Vilém Zouhar, Christian Federmann, Matt Post:
Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies. CoRR abs/2401.06760 (2024)
- [i32]Vilém Zouhar:
Stolen Subwords: Importance of Vocabularies for Machine Translation Model Stealing. CoRR abs/2401.16055 (2024)
- [i31]Sankalan Pal Chowdhury, Vilém Zouhar, Mrinmaya Sachan:
Scaling the Authoring of AutoTutors with Large Language Models. CoRR abs/2402.09216 (2024)
- [i30]Marco Cognetta, Vilém Zouhar, Sangwhan Moon, Naoaki Okazaki:
Two Counterexamples to Tokenization and the Noiseless Channel. CoRR abs/2402.14614 (2024)
- [i29]Vilém Zouhar, Shuoyang Ding, Anna Currey, Tatyana Badeka, Jenyuan Wang, Brian Thompson:
Fine-Tuned Machine Translation Metrics Struggle in Unseen Domains. CoRR abs/2402.18747 (2024)
- [i28]Furui Cheng, Vilém Zouhar, Robin Shing Moon Chan, Daniel Fürst, Hendrik Strobelt, Mennatallah El-Assady:
Interactive Analysis of LLMs using Meaningful Counterfactuals. CoRR abs/2405.00708 (2024)
- [i27]Tom Kocmi, Vilém Zouhar, Eleftherios Avramidis, Roman Grundkiewicz, Marzena Karpinska, Maja Popovic, Mrinmaya Sachan, Mariya Shmatova:
Error Span Annotation: A Balanced Approach for Human Evaluation of Machine Translation. CoRR abs/2406.11580 (2024)
- [i26]Vilém Zouhar, Tom Kocmi, Mrinmaya Sachan:
AI-Assisted Human Evaluation of Machine Translation. CoRR abs/2406.12419 (2024)
- [i25]Peng Cui, Vilém Zouhar, Xiaoyu Zhang, Mrinmaya Sachan:
How to Engage Your Readers? Generating Guiding Questions to Promote Active Reading. CoRR abs/2407.14309 (2024)
- [i24]Tom Kocmi, Eleftherios Avramidis, Rachel Bawden, Ondrej Bojar, Anton Dvorkovich, Christian Federmann, Mark Fishel, Markus Freitag, Thamme Gowda, Roman Grundkiewicz, Barry Haddow, Marzena Karpinska, Philipp Koehn, Benjamin Marie, Kenton Murray, Masaaki Nagata, Martin Popel, Maja Popovic, Mariya Shmatova, Steinþór Steingrímsson, Vilém Zouhar:
Preliminary WMT24 Ranking of General MT Systems and LLMs. CoRR abs/2407.19884 (2024)
- [i23]Marco Cognetta, Vilém Zouhar, Naoaki Okazaki:
Distributional Properties of Subword Regularization. CoRR abs/2408.11443 (2024)
- [i22]Vilém Zouhar, Pinzhen Chen, Tsz Kin Lam, Nikita Moghe, Barry Haddow:
Pitfalls and Outlooks in Using COMET. CoRR abs/2408.15366 (2024)
- 2023
- [c13]Vilém Zouhar, Clara Meister, Juan Luis Gastaldi, Li Du, Tim Vieira, Mrinmaya Sachan, Ryan Cotterell:
A Formal Perspective on Byte-Pair Encoding. ACL (Findings) 2023: 598-614
- [c12]Vilém Zouhar, Clara Meister, Juan Luis Gastaldi, Li Du, Mrinmaya Sachan, Ryan Cotterell:
Tokenization and the Noiseless Channel. ACL (1) 2023: 5184-5207
- [c11]Vilém Zouhar, Shehzaad Dhuliawala, Wangchunshu Zhou, Nico Daheim, Tom Kocmi, Yuchen Eleanor Jiang, Mrinmaya Sachan:
Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference. EACL 2023: 1303-1317
- [c10]Shehzaad Dhuliawala, Vilém Zouhar, Mennatallah El-Assady, Mrinmaya Sachan:
A Diachronic Perspective on User Trust in AI under Uncertainty. EMNLP 2023: 5567-5580
- [c9]Dominik Stammbach, Vilém Zouhar, Alexander Miserlis Hoyle, Mrinmaya Sachan, Elliott Ash:
Revisiting Automated Topic Model Evaluation with Large Language Models. EMNLP 2023: 9348-9357
- [c8]Janvijay Singh, Vilém Zouhar, Mrinmaya Sachan:
Enhancing Textbooks with Visuals from the Web for Improved Learning. EMNLP 2023: 11931-11944
- [c7]Kirill Semenov, Vilém Zouhar, Tom Kocmi, Dongdong Zhang, Wangchunshu Zhou, Yuchen Eleanor Jiang:
Findings of the WMT 2023 Shared Task on Machine Translation with Terminologies. WMT 2023: 663-671
- [i21]Vilém Zouhar, Shehzaad Dhuliawala, Wangchunshu Zhou, Nico Daheim, Tom Kocmi, Yuchen Eleanor Jiang, Mrinmaya Sachan:
Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference. CoRR abs/2301.09008 (2023)
- [i20]Vilém Zouhar, Sunit Bhattacharya, Ondrej Bojar:
Multimodal Shannon Game with Images. CoRR abs/2303.11192 (2023)
- [i19]Vilém Zouhar, Kalvin Chang, Chenxuan Cui, Nathaniel Carlson, Nathaniel R. Robinson, Mrinmaya Sachan, David R. Mortensen:
PWESuite: Phonetic Word Embeddings and Tasks They Facilitate. CoRR abs/2304.02541 (2023)
- [i18]Janvijay Singh, Vilém Zouhar, Mrinmaya Sachan:
Enhancing Textbooks with Visuals from the Web for Improved Learning. CoRR abs/2304.08931 (2023)
- [i17]Dominik Stammbach, Vilém Zouhar, Alexander Miserlis Hoyle, Mrinmaya Sachan, Elliott Ash:
Re-visiting Automated Topic Model Evaluation with Large Language Models. CoRR abs/2305.12152 (2023)
- [i16]Houcemeddine Turki, Abraham Toluwase Owodunni, Mohamed Ali Hadj Taieb, René Fabrice Bile, Mohamed Ben Aouicha, Vilém Zouhar:
A Decade of Scholarly Research on Open Knowledge Graphs. CoRR abs/2306.13186 (2023)
- [i15]Vilém Zouhar, Clara Meister, Juan Luis Gastaldi, Li Du, Tim Vieira, Mrinmaya Sachan, Ryan Cotterell:
A Formal Perspective on Byte-Pair Encoding. CoRR abs/2306.16837 (2023)
- [i14]Vilém Zouhar, Clara Meister, Juan Luis Gastaldi, Li Du, Mrinmaya Sachan, Ryan Cotterell:
Tokenization and the Noiseless Channel. CoRR abs/2306.16842 (2023)
- [i13]Shehzaad Dhuliawala, Vilém Zouhar, Mennatallah El-Assady, Mrinmaya Sachan:
A Diachronic Perspective on User Trust in AI under Uncertainty. CoRR abs/2310.13544 (2023)
- [i12]Vilém Zouhar, Vera Kloudová, Martin Popel, Ondrej Bojar:
Evaluating Optimal Reference Translations. CoRR abs/2311.16787 (2023)
- [i11]Furui Cheng, Vilém Zouhar, Simran Arora, Mrinmaya Sachan, Hendrik Strobelt, Mennatallah El-Assady:
RELIC: Investigating Large Language Model Responses using Self-Consistency. CoRR abs/2311.16842 (2023)
- 2022
- [c6]Sunit Bhattacharya, Vilém Zouhar, Ondrej Bojar:
Sentence Ambiguity, Grammaticality and Complexity Probes. BlackboxNLP@EMNLP 2022: 40-50
- [i10]Vilém Zouhar, Marius Mosbach, Debanjali Biswas, Dietrich Klakow:
Artefact Retrieval: Overview of NLP Models with Knowledge Base Access. CoRR abs/2201.09651 (2022)
- [i9]Sunit Bhattacharya, Vera Kloudová, Vilém Zouhar, Ondrej Bojar:
EMMT: A simultaneous eye-tracking, 4-electrode EEG and audio corpus for multi-modal reading and translation scenarios. CoRR abs/2204.02905 (2022)
- [i8]Vilém Zouhar, Marius Mosbach, Miaoran Zhang, Dietrich Klakow:
Knowledge Base Index Compression via Dimensionality and Precision Reduction. CoRR abs/2204.02906 (2022)
- [i7]Vilém Zouhar, Marius Mosbach, Dietrich Klakow:
Fusing Sentence Embeddings Into LSTM-based Autoregressive Language Models. CoRR abs/2208.02402 (2022)
- [i6]Sunit Bhattacharya, Vilém Zouhar, Ondrej Bojar:
Sentence Ambiguity, Grammaticality and Complexity Probes. CoRR abs/2210.06928 (2022)
- 2021
- [j2]Vilém Zouhar, Daria Pylypenko:
Leveraging Neural Machine Translation for Word Alignment. Prague Bull. Math. Linguistics 116: 43- (2021)
- [c5]Vilém Zouhar, Martin Popel, Ondrej Bojar, Ales Tamchyna:
Neural Machine Translation Quality and Post-Editing Performance. EMNLP (1) 2021: 10204-10214
- [c4]Vilém Zouhar:
Sampling and Filtering of Neural Machine Translation Distillation Data. NAACL-HLT (Student Research Workshop) 2021: 1-8
- [c3]Vilém Zouhar, Michal Novák, Matús Zilinec, Ondrej Bojar, Mateo Obregón, Robin L. Hill, Frédéric Blain, Marina Fomicheva, Lucia Specia, Lisa Yankovskaya:
Backtranslation Feedback Improves User Confidence in MT, Not Quality. NAACL-HLT 2021: 151-161
- [i5]Vilém Zouhar, Daria Pylypenko:
Leveraging Neural Machine Translation for Word Alignment. CoRR abs/2103.17250 (2021)
- [i4]Vilém Zouhar:
Sampling and Filtering of Neural Machine Translation Distillation Data. CoRR abs/2104.00664 (2021)
- [i3]Vilém Zouhar, Michal Novák, Matús Zilinec, Ondrej Bojar, Mateo Obregón, Robin L. Hill, Frédéric Blain, Marina Fomicheva, Lucia Specia, Lisa Yankovskaya:
Backtranslation Feedback Improves User Confidence in MT, Not Quality. CoRR abs/2104.05688 (2021)
- [i2]Vilém Zouhar, Ales Tamchyna, Martin Popel, Ondrej Bojar:
Neural Machine Translation Quality and Post-Editing Performance. CoRR abs/2109.05016 (2021)
- 2020
- [j1]Vilém Zouhar, Michal Novák:
Extending Ptakopět for Machine Translation User Interaction Experiments. Prague Bull. Math. Linguistics 115: 129-142 (2020)
- [c2]Vilém Zouhar, Ondrej Bojar:
Outbound Translation User Interface Ptakopet: A Pilot Study. LREC 2020: 6967-6975
- [c1]Vilém Zouhar, Tereza Vojtechová, Ondrej Bojar:
WMT20 Document-Level Markable Error Exploration. WMT@EMNLP 2020: 371-380
2010 – 2019
- 2019
- [i1]Vilém Zouhar, Ondrej Bojar:
Outbound Translation User Interface Ptakopet: A Pilot Study. CoRR abs/1911.10835 (2019)
last updated on 2024-11-15 20:39 CET by the dblp team
all metadata released as open data under CC0 1.0 license