Contextual biasing of named-entities with large language models

C Sun, Z Ahmed, Y Ma, Z Liu, L Kabela… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
We explore contextual biasing with Large Language Models (LLMs) to enhance Automatic
Speech Recognition (ASR) in second-pass rescoring. Our approach introduces the utilization …

Joint audio/text training for transformer rescorer of streaming speech recognition

S Kim, K Li, L Kabela, R Huang, J Zhu, O Kalinli… - arXiv preprint arXiv …, 2022 - arxiv.org
Recently, there has been an increasing interest in two-pass streaming end-to-end speech
recognition (ASR) that incorporates a 2nd-pass rescoring model on top of the conventional 1st-…

DISGO: Automatic end-to-end evaluation for scene text OCR

…, G Pang, P Krishnan, L Kabela… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper discusses the challenges of optical character recognition (OCR) on natural
scenes, which is harder than OCR on documents due to the wild content and various image …

NARCCAP model assessment and future projections for the Southeast United States

ED Kabela - 2012 - search.proquest.com
Global climate models (GCMs) provide most climate change projections, but their coarse
resolution must be downscaled to more local scales in order to conduct meaningful climate …

[PDF][PDF] Experience Replay Methods in Soft Actor-Critic

L Kabela - cs.utexas.edu
Soft Actor-Critic (SAC) is a state of the art offpolicy algorithm for deep reinforcement learning;
however, little exploration into a critical component of the algorithm, experience replay, has …

Contemporary NLP modeling in six comprehensive programming assignments

…, J Chen, S Desai, T Goyal, L Kabela… - Proceedings of the …, 2021 - aclanthology.org
We present a series of programming assignments, adaptable to a range of experience levels
from advanced undergraduate to PhD, to teach students design and implementation of …

Lumos: Empowering Multimodal LLMs with Scene Text Recognition

A Shenoy, Y Lu, S Jayakumar, D Chatterjee… - Proceedings of the 30th …, 2024 - dl.acm.org
We introduce Lumos, the first end-to-end multimodal question-answering system with text
understanding capabilities. At the core of Lumos is a Scene Text Recognition (STR) …

Analiza izvedivosti primjene tehnologije RFID u tvornici kabela

P Capparelli - 2016 - repozitorij.fsb.unizg.hr
Rad razmatra primjenu sustava RFID u proizvodnji kabela. Prvi dio rada opisuje električne
kabele i njihovo tržište s posebnim naglaskom na Istočnu Europu. Dan je opis tvornice ELKA …

Pomáhání z pohledu křesťanské morálky

A Kabela - 2010 - digilib.k.utb.cz
Bakalářská práce Pomáhání z pohledu křesťanské morálky se zabývá významem pomáhání
v křesťanské kultuře, která vychází z učení Ježíše Krista. Popisuje historii křesťanské charity …

[PDF][PDF] Proceedings of the Fifth Workshop on Teaching NLP

D Jurgens, V Kolhatkar, L Lucy, M Mieskes… - Proceedings of the …, 2021 - aclanthology.org
Welcome to the Fifth Workshop on Teaching Natural Language Processing (NLP). This online
workshop featured an exciting mix of papers, teaching material submissions, panels, talks, …