From captions to visual concepts and back.

[1411.4952] From Captions to Visual Concepts and Back - arXiv

Nov 18, 2014 · This paper presents a novel approach for automatically generating image descriptions: visual detectors, language models, and multimodal similarity models

Scholarly articles for From captions to visual concepts and back.

scholar.google.com › citations

From captions to visual concepts and back
Fang · Cited by 1674

… eye: A recurrent visual representation for image caption …
Chen · Cited by 668

… a recurrent visual representation for image caption …
Chen · Cited by 239

[PDF] From Captions to Visual Concepts and Back - CVF Open Access

openaccess.thecvf.com › papers › F...

This paper presents a novel approach for automatically generating image descriptions: visual detectors, language models, and multimodal similarity models ...

From captions to visual concepts and back - IEEE Xplore

ieeexplore.ieee.org › iel7

This paper presents a novel approach for automatically generating image descriptions: visual detectors, language models, and multimodal similarity models ...

[1411.4952v3] From Captions to Visual Concepts and Back - arXiv

arxiv.org › cs

Abstract: This paper presents a novel approach for automatically generating image descriptions: visual detectors, language models, and multimodal similarity ...

[PDF] From Captions to Visual Concepts and Back

saurabhg.web.illinois.edu › pdfs › c...

When does a machine “understand” an image? One definition is when it can generate a novel caption that summarizes the salient content within an image.

(PDF) From captions to visual concepts and back - ResearchGate

www.researchgate.net › publication › 30...

From Captions to Visual Concepts and Back ... This paper presents a novel approach for automatically generating image descriptions: visual detectors and language ...

From captions to visual concepts and back - IEEE Computer Society

www.computer.org › csdl › cvpr

This paper presents a novel approach for automatically generating image descriptions: visual detectors, language models, and multimodal similarity models ...

[PDF] From Captions to Visual Concepts and Back - Saurabh Gupta

saurabhg.web.illinois.edu › pdfs › c...

• From Captions to Visual Concepts and Back, Hao Fang*, Saurabh Gupta*, Forrest Iandola*, Rupesh Srivastava*, Li Deng, Piotr Dollár, Jianfeng Gao, Xiaodong ...

[PDF] From captions to visual concepts and back | Semantic Scholar

www.semanticscholar.org › paper › Fro...

This paper uses multiple instance learning to train visual detectors for words that commonly occur in captions, including many different parts of speech ...

From Captions to Visual Concepts and Back - Microsoft Research

www.microsoft.com › Home › Projects

Apr 9, 2015 · We introduce a novel approach for automatically generating image descriptions. Visual detectors, language models, and deep multimodal similarity models are ...