Faiyaz Khan et al., 2021 - Google Patents

Improved bengali image captioning via deep convolutional neural network based encoder-decoder model

Faiyaz Khan et al., 2021

Document ID: 6644988305381464848
Author: Faiyaz Khan M; Sadiq-Ur-Rahman S; Saiful Islam M
Publication year: 2021
Publication venue: Proceedings of International Joint Conference on Advances in Computational Intelligence: IJCACI 2020

External Links

Cited by

Snippet

Image Captioning is an arduous task of producing syntactically and semantically correct textual descriptions of an image in natural language with context related to the image. Existing notable pieces of research in Bengali Image Captioning (BIC) are based on …

Continue reading at arxiv.org (PDF) (other versions)

230000001537 neural 0 title abstract description 4

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G06F17/2827—Example based machine translation; Alignment
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/289—Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models

Similar Documents

Publication	Publication Date	Title
Andreas et al.	2017	Learning with latent language
Faiyaz Khan et al.	2021	Improved bengali image captioning via deep convolutional neural network based encoder-decoder model
Prakash et al.	2016	Neural paraphrase generation with stacked residual LSTM networks
Gupta et al.	2020	Integration of textual cues for fine-grained image captioning using deep CNN and LSTM
Sharma et al.	2022	Image captioning improved visual question answering
Zhao et al.	2021	ZYJ123@ DravidianLangTech-EACL2021: Offensive language identification based on XLM-RoBERTa with DPCNN
Nagaraj et al.	2021	Kannada to English Machine Translation Using Deep Neural Network.
Nie et al.	2017	Attention-based encoder-decoder model for answer selection in question answering
Singh et al.	2021	An encoder-decoder based framework for hindi image caption generation
Xian et al.	2019	Self-guiding multimodal LSTM—when we do not have a perfect training dataset for image captioning
Choi et al.	2021	Analyzing zero-shot cross-lingual transfer in supervised NLP tasks
Rathi	2020	Deep learning apporach for image captioning in Hindi language
Palash et al.	2022	Bangla image caption generation through cnn-transformer based encoder-decoder network
Chaudhary et al.	2022	Signnet ii: A transformer-based two-way sign language translation model
An et al.	2019	Resource mention extraction for MOOC discussion forums
Deroy et al.	2024	Question generation: Past, present & future
Singh et al.	2021	Generation and evaluation of hindi image captions of visual genome
CN115129807A (en)	2022-09-30	Fine-grained classification method and system for social media topic comments based on self-attention
Zhang et al.	2019	Modeling the relationship between user comments and edits in document revision
Uttarwar et al.	2020	Artificial intelligence based system for preliminary rounds of recruitment process
Parshakova et al.	2019	Latent question interpretation through variational adaptation
Reddy et al.	2023	Multilingual image captioning: multimodal framework for bridging visual and linguistic realms in Tamil and Telugu through transformers
Wang et al.	2024	RSRNeT: a novel multi-modal network framework for named entity recognition and relation extraction
Mundu et al.	2024	ETransCap: efficient transformer for image captioning
Alnami et al.	2021	Story Generation from Images Using Deep Learning