Shu et al., 2024

Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing

Document ID: 14931141098695963851
Author: Shu Y; Zeng W; Li Z; Zhao F; Zhou Y
Publication year: 2024
Publication venue: arXiv preprint arXiv:2402.03082

Snippet

Visual text, a pivotal element in both document and scene images, speaks volumes and attracts significant attention in the computer vision domain. Beyond visual text detection and recognition, the field of visual text processing has experienced a surge in research, driven …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G06T11/60—Editing figures and text; Combining figures or text
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K2209/00—Indexing scheme relating to methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image

Similar Documents

Publication	Publication Date	Title
Wu et al.	2019	Editing text in the wild
Chai et al.	2016	Autohair: Fully automatic hair modeling from a single image
Li et al.	2020	Layoutgan: Synthesizing graphic layouts with vector-wireframe adversarial networks
CN105493078B (en)	2019-07-23	Colored sketches picture search
Mirzaei et al.	2022	Laterf: Label and text driven object radiance fields
Saxena et al.	2021	Comparison and analysis of image-to-image generative adversarial networks: a survey
Lopez et al.	2010	Modeling complex unfoliaged trees from a sparse set of images
Liu et al.	2022	Don’t forget me: accurate background recovery for text removal via modeling local-global context
Zhang et al.	2024	Brush your text: Synthesize any scene text on images via diffusion model
Shu et al.	2024	Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing
Zhang et al.	2011	EXCOL: An EXtract-and-COmplete layering approach to cartoon animation reusing
Yang et al.	2023	Ai-generated images as data source: The dawn of synthetic era
Hu et al.	2023	Face reenactment via generative landmark guidance
Wu et al.	2023	DeepPortraitDrawing: Generating human body images from freehand sketches
Gal et al.	2024	Breathing Life Into Sketches Using Text-to-Video Priors
Wang et al.	2024	Language-Driven Interactive Shadow Detection
Li et al.	2024	A review of advances in image inpainting research
CN113537187A (en)	2021-10-22	Text recognition method and device, electronic equipment and readable storage medium
Li et al.	2022	SPN2D-GAN: semantic prior based night-to-day image-to-image translation
Dai et al.	2024	One-shot diffusion mimicker for handwritten text generation
Du et al.	2020	Mhgan: Multi-hierarchies generative adversarial network for high-quality face sketch synthesis
Kalantari et al.	2014	Improving patch-based synthesis by learning patch masks
Wang et al.	2024	Review of GAN-Based Research on Chinese Character Font Generation
Mu	2022	Pose Estimation‐Assisted Dance Tracking System Based on Convolutional Neural Network
Zhao et al.	2022	From 2D images to 3D model: weakly supervised multi-view face reconstruction with deep fusion