Extending Phrase Grounding with Pronouns in Visual Dialogues.

Extending Phrase Grounding with Pronouns in Visual Dialogues

aclanthology.org › 2022.emnlp-main.518

Here we extend the task by considering pronouns as well. First, we construct a dataset of phrase grounding with both noun phrases and pronouns to image regions.

Extending Phrase Grounding with Pronouns in Visual Dialogues - arXiv

arxiv.org › cs

Oct 23, 2022 · Abstract: Conventional phrase grounding aims to localize noun phrases mentioned in a given caption to their corresponding image regions, ...

[PDF] Extending Phrase Grounding with Pronouns in Visual Dialogues

aclanthology.org › 2022.emnlp-ma...

We extend the task of phrase grounding by taking account of pronouns, and correspondingly establish a new dataset manually, named VD-Ref, which is the first ...

[PDF] Extending Phrase Grounding with Pronouns in Visual Dialogues

www.semanticscholar.org › paper

Experiments show that pronouns are easier to ground than noun phrases, where the possible reason might be that these pronouns are much less ambiguous, ...

izhx/Phrase-Grounding-with-Pronoun - GitHub

github.com › izhx › Phrase-Grounding-...

Phrase-Grounding-with-Pronoun-baseline ... Code and data for [EMNLP 22] Extending Phrase Grounding with Pronouns in Visual Dialogues.

Extending Phrase Grounding with Pronouns in Visual Dialogues

www.researchgate.net › publication › 37...

Sep 20, 2024 · This task has seen significant progress since the introduction of the Flickr30k Entities dataset [48] and has played a crucial role in learning ...

Phrase Grounding - Papers With Code

paperswithcode.com › task › latest

Given an image and a corresponding caption, the **Phrase Grounding** task aims to ground each entity mentioned by a noun phrase in the caption to a region ...

VD-Ref Dataset - Papers With Code

paperswithcode.com › dataset › vd-ref

Oct 22, 2022 · VD-Ref is a dataset with ground-truth mappings from both noun phrases and pronouns to image regions. This dataset contains a set of 10k complete sets.

[PDF] Improved Visual Grounding through Self-Consistent Explanations ...

openaccess.thecvf.com › CVPR2024

We present details about our two-level LLM prompts to obtain high-quality paraphrases for region-centric phrases, provide examples of such extracted ...

[PDF] Neural Sequential Phrase Grounding (SeqGROUND)

studios.disneyresearch.com › uploads

Consider image grounding noun phrases from a given sentence: “A lady sitting on a colorful decoration with a bouquet of flowers, that match her hair, in her hand.”.
