Improved Bengali image captioning via deep convolutional neural network based encoder-decoder model
Faiyaz Khan et al., 2021
- Document ID
- 6644988305381464848
- Author
- Faiyaz Khan M
- Sadiq-Ur-Rahman S
- Saiful Islam M
- Publication year
- 2021
- Publication venue
- Proceedings of International Joint Conference on Advances in Computational Intelligence: IJCACI 2020
Snippet
Image Captioning is an arduous task of producing syntactically and semantically correct textual descriptions of an image in natural language with context related to the image. Existing notable pieces of research in Bengali Image Captioning (BIC) are based on …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G06F17/2827—Example based machine translation; Alignment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/289—Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Andreas et al. | Learning with latent language | |
Faiyaz Khan et al. | Improved bengali image captioning via deep convolutional neural network based encoder-decoder model | |
Prakash et al. | Neural paraphrase generation with stacked residual LSTM networks | |
Gupta et al. | Integration of textual cues for fine-grained image captioning using deep CNN and LSTM | |
Sharma et al. | Image captioning improved visual question answering | |
Zhao et al. | ZYJ123@ DravidianLangTech-EACL2021: Offensive language identification based on XLM-RoBERTa with DPCNN | |
Nagaraj et al. | Kannada to English Machine Translation Using Deep Neural Network. | |
Nie et al. | Attention-based encoder-decoder model for answer selection in question answering | |
Singh et al. | An encoder-decoder based framework for hindi image caption generation | |
Xian et al. | Self-guiding multimodal LSTM—when we do not have a perfect training dataset for image captioning | |
Choi et al. | Analyzing zero-shot cross-lingual transfer in supervised NLP tasks | |
Rathi | Deep learning apporach for image captioning in Hindi language | |
Palash et al. | Bangla image caption generation through cnn-transformer based encoder-decoder network | |
Chaudhary et al. | Signnet ii: A transformer-based two-way sign language translation model | |
An et al. | Resource mention extraction for MOOC discussion forums | |
Deroy et al. | Question generation: Past, present & future | |
Singh et al. | Generation and evaluation of hindi image captions of visual genome | |
CN115129807A (en) | Fine-grained classification method and system for social media topic comments based on self-attention | |
Zhang et al. | Modeling the relationship between user comments and edits in document revision | |
Uttarwar et al. | Artificial intelligence based system for preliminary rounds of recruitment process | |
Parshakova et al. | Latent question interpretation through variational adaptation | |
Reddy et al. | Multilingual image captioning: multimodal framework for bridging visual and linguistic realms in Tamil and Telugu through transformers | |
Wang et al. | RSRNeT: a novel multi-modal network framework for named entity recognition and relation extraction | |
Mundu et al. | ETransCap: efficient transformer for image captioning | |
Alnami et al. | Story Generation from Images Using Deep Learning |