Contextual biasing of named-entities with large language models
We explore contextual biasing with Large Language Models (LLMs) to enhance Automatic
Speech Recognition (ASR) in second-pass rescoring. Our approach introduces the utilization …
Speech Recognition (ASR) in second-pass rescoring. Our approach introduces the utilization …
Joint audio/text training for transformer rescorer of streaming speech recognition
Recently, there has been an increasing interest in two-pass streaming end-to-end speech
recognition (ASR) that incorporates a 2nd-pass rescoring model on top of the conventional 1st-…
recognition (ASR) that incorporates a 2nd-pass rescoring model on top of the conventional 1st-…
DISGO: Automatic end-to-end evaluation for scene text OCR
…, G Pang, P Krishnan, L Kabela… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper discusses the challenges of optical character recognition (OCR) on natural
scenes, which is harder than OCR on documents due to the wild content and various image …
scenes, which is harder than OCR on documents due to the wild content and various image …
NARCCAP model assessment and future projections for the Southeast United States
ED Kabela - 2012 - search.proquest.com
Global climate models (GCMs) provide most climate change projections, but their coarse
resolution must be downscaled to more local scales in order to conduct meaningful climate …
resolution must be downscaled to more local scales in order to conduct meaningful climate …
[PDF][PDF] Experience Replay Methods in Soft Actor-Critic
L Kabela - cs.utexas.edu
Soft Actor-Critic (SAC) is a state of the art offpolicy algorithm for deep reinforcement learning;
however, little exploration into a critical component of the algorithm, experience replay, has …
however, little exploration into a critical component of the algorithm, experience replay, has …
Contemporary NLP modeling in six comprehensive programming assignments
We present a series of programming assignments, adaptable to a range of experience levels
from advanced undergraduate to PhD, to teach students design and implementation of …
from advanced undergraduate to PhD, to teach students design and implementation of …
Lumos: Empowering Multimodal LLMs with Scene Text Recognition
We introduce Lumos, the first end-to-end multimodal question-answering system with text
understanding capabilities. At the core of Lumos is a Scene Text Recognition (STR) …
understanding capabilities. At the core of Lumos is a Scene Text Recognition (STR) …
Analiza izvedivosti primjene tehnologije RFID u tvornici kabela
P Capparelli - 2016 - repozitorij.fsb.unizg.hr
Rad razmatra primjenu sustava RFID u proizvodnji kabela. Prvi dio rada opisuje električne
kabele i njihovo tržište s posebnim naglaskom na Istočnu Europu. Dan je opis tvornice ELKA …
kabele i njihovo tržište s posebnim naglaskom na Istočnu Europu. Dan je opis tvornice ELKA …
Pomáhání z pohledu křesťanské morálky
A Kabela - 2010 - digilib.k.utb.cz
Bakalářská práce Pomáhání z pohledu křesťanské morálky se zabývá významem pomáhání
v křesťanské kultuře, která vychází z učení Ježíše Krista. Popisuje historii křesťanské charity …
v křesťanské kultuře, která vychází z učení Ježíše Krista. Popisuje historii křesťanské charity …
[PDF][PDF] Proceedings of the Fifth Workshop on Teaching NLP
Welcome to the Fifth Workshop on Teaching Natural Language Processing (NLP). This online
workshop featured an exciting mix of papers, teaching material submissions, panels, talks, …
workshop featured an exciting mix of papers, teaching material submissions, panels, talks, …