Liao et al., 2020 - Google Patents

SynthText3D: synthesizing scene text images from 3D virtual worlds

Document ID: 16912918480929954774
Author: Liao M; Song B; Long S; He M; Yao C; Bai X
Publication year: 2020
Publication venue: Science China Information Sciences

Snippet

With the development of deep neural networks, the demand for large amounts of annotated training data has become a performance bottleneck in many fields of research and application. Image synthesis can generate annotated images automatically and freely …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
- G06K9/34—Segmentation of touching or overlapping patterns in the image field
- G06K9/342—Cutting or merging image elements, e.g. region growing, watershed, clustering-based techniques
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
- G06T15/20—Perspective computation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G06T11/60—Editing figures and text; Combining figures or text
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K2209/00—Indexing scheme relating to methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints

Similar Documents

Publication	Publication Date	Title
Liao et al.	2020	SynthText3D: synthesizing scene text images from 3D virtual worlds
Remez et al.	2018	Learning to segment via cut-and-paste
CN103810744B (en)	2018-09-21	Backfilling points in a point cloud
CN103729885B (en)	2016-08-24	Three-dimensional modeling method for freehand-drawn scenes combining multi-view projection with three-dimensional registration
Zhao et al.	2021	Image stitching via deep homography estimation
Isola et al.	2013	Scene collaging: Analysis and synthesis of natural images with semantic layers
CN104952083B (en)	2018-01-23	A saliency detection method based on salient-target background modeling
CN105989604A (en)	2016-10-05	Target object three-dimensional color point cloud generation method based on KINECT
US20150138193A1 (en)	2015-05-21	Method and device for panorama-based inter-viewpoint walkthrough, and machine readable medium
Wang et al.	2022	Instance shadow detection with a single-stage detector
Dwibedi et al.	2016	Deep cuboid detection: Beyond 2d bounding boxes
Wang et al.	2019	Deep learning‐based vehicle detection with synthetic image data
Wang et al.	2024	Perf: Panoramic neural radiance field from a single panorama
Karakottas et al.	2019	360 surface regression with a hyper-sphere loss
Zeng et al.	2020	Deep recognition of vanishing-point-constrained building planes in urban street views
Qiu et al.	2024	Feature Splatting: Language-Driven Physics-Based Scene Synthesis and Editing
Kaneva et al.	2010	Infinite images: Creating and exploring a large photorealistic virtual space
Nguyen et al.	2024	Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review
CN113673567B (en)	2023-07-21	Panorama emotion recognition method and system based on multi-angle sub-region self-adaption
Feng et al.	2021	[Retracted] Research and Application of Multifeature Gesture Recognition in Human‐Computer Interaction Based on Virtual Reality Technology
Chiciudean et al.	2024	Data augmentation for environment perception with unmanned aerial vehicles
Yang et al.	2018	Learning 3D scene semantics and structure from a single depth image
Wang et al.	2019	A multi-task learning convolutional neural network for object pose estimation
Nakabayashi et al.	2021	Mixed Reality Landscape Visualization Method with Automatic Discrimination Process for Dynamic Occlusion Handling Using Instance Segmentation
Qiu et al.	2025	Language-driven physics-based scene synthesis and editing via feature splatting