Nothing Special   »   [go: up one dir, main page]

Faiyaz Khan et al., 2021 - Google Patents

Improved bengali image captioning via deep convolutional neural network based encoder-decoder model

Faiyaz Khan et al., 2021

View PDF
Document ID
6644988305381464848
Author
Faiyaz Khan M
Sadiq-Ur-Rahman S
Saiful Islam M
Publication year
Publication venue
Proceedings of International Joint Conference on Advances in Computational Intelligence: IJCACI 2020

External Links

Snippet

Image Captioning is an arduous task of producing syntactically and semantically correct textual descriptions of an image in natural language with context related to the image. Existing notable pieces of research in Bengali Image Captioning (BIC) are based on …
Continue reading at arxiv.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/28Processing or translating of natural language
    • G06F17/2809Data driven translation
    • G06F17/2827Example based machine translation; Alignment
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F17/30634Querying
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/28Processing or translating of natural language
    • G06F17/289Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • G06F17/27Automatic analysis, e.g. parsing
    • G06F17/2765Recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30861Retrieval from the Internet, e.g. browsers
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6217Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46Extraction of features or characteristics of the image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models

Similar Documents

Publication Publication Date Title
Andreas et al. Learning with latent language
Faiyaz Khan et al. Improved bengali image captioning via deep convolutional neural network based encoder-decoder model
Prakash et al. Neural paraphrase generation with stacked residual LSTM networks
Gupta et al. Integration of textual cues for fine-grained image captioning using deep CNN and LSTM
Sharma et al. Image captioning improved visual question answering
Zhao et al. ZYJ123@ DravidianLangTech-EACL2021: Offensive language identification based on XLM-RoBERTa with DPCNN
Nagaraj et al. Kannada to English Machine Translation Using Deep Neural Network.
Nie et al. Attention-based encoder-decoder model for answer selection in question answering
Singh et al. An encoder-decoder based framework for hindi image caption generation
Xian et al. Self-guiding multimodal LSTM—when we do not have a perfect training dataset for image captioning
Choi et al. Analyzing zero-shot cross-lingual transfer in supervised NLP tasks
Rathi Deep learning apporach for image captioning in Hindi language
Palash et al. Bangla image caption generation through cnn-transformer based encoder-decoder network
Chaudhary et al. Signnet ii: A transformer-based two-way sign language translation model
An et al. Resource mention extraction for MOOC discussion forums
Deroy et al. Question generation: Past, present & future
Singh et al. Generation and evaluation of hindi image captions of visual genome
CN115129807A (en) Fine-grained classification method and system for social media topic comments based on self-attention
Zhang et al. Modeling the relationship between user comments and edits in document revision
Uttarwar et al. Artificial intelligence based system for preliminary rounds of recruitment process
Parshakova et al. Latent question interpretation through variational adaptation
Reddy et al. Multilingual image captioning: multimodal framework for bridging visual and linguistic realms in Tamil and Telugu through transformers
Wang et al. RSRNeT: a novel multi-modal network framework for named entity recognition and relation extraction
Mundu et al. ETransCap: efficient transformer for image captioning
Alnami et al. Story Generation from Images Using Deep Learning