WO2018112930A1 - Method and device for identifying commodities - Google Patents
Method and device for identifying commodities Download PDFInfo
- Publication number
- WO2018112930A1 WO2018112930A1 PCT/CN2016/111831 CN2016111831W WO2018112930A1 WO 2018112930 A1 WO2018112930 A1 WO 2018112930A1 CN 2016111831 W CN2016111831 W CN 2016111831W WO 2018112930 A1 WO2018112930 A1 WO 2018112930A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- commodity
- product
- information
- image
- image information
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 60
- 238000012015 optical character recognition Methods 0.000 claims description 12
- 238000004590 computer program Methods 0.000 claims description 6
- 238000000605 extraction Methods 0.000 claims description 6
- 238000005516 engineering process Methods 0.000 abstract description 4
- 238000012545 processing Methods 0.000 description 8
- 238000010586 diagram Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 238000004806 packaging method and process Methods 0.000 description 5
- 238000004891 communication Methods 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000013135 deep learning Methods 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000012706 support-vector machine Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 201000004569 Blindness Diseases 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000005180 public health Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06K—GRAPHICAL DATA READING; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K17/00—Methods or arrangements for effecting co-operative working between equipments covered by two or more of main groups G06K1/00 - G06K15/00, e.g. automatic card files incorporating conveying and reading operations
- G06K17/0022—Methods or arrangements for effecting co-operative working between equipments covered by two or more of main groups G06K1/00 - G06K15/00, e.g. automatic card files incorporating conveying and reading operations arrangements or provisions for transferring data to distant stations, e.g. from a sensing device
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2411—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Definitions
- the present invention relates to the field of commodity identification technology, and in particular, to a commodity identification method and apparatus.
- Blindness is one of the serious social and public health problems in the world. More than 70% of human information is obtained through vision, and vision problems largely limit access to information for blind people.
- blind people Due to vision problems, there are many inconveniences in the lives of blind people. Especially when shopping in shopping malls, in the face of a wide variety of goods in the mall, blind people cannot independently obtain information on the types, models, prices, etc., so for blind people, It is very difficult to select the products that you need from a large space and a large number of products, and complete the shopping independently. In a case, the blind person obtains the product information through the shopping guide when shopping, but this not only brings great inconvenience to the blind shopping, but also greatly increases the workload of the shopping guide, and consumes a lot of manpower and time. cost. Therefore, how to assist blind people to obtain product information is an urgent problem to be solved.
- Embodiments of the present invention provide a product identification method and apparatus for assisting a blind person to obtain product information.
- a method for identifying a product for assisting a blind person to obtain product information comprising:
- the text information of the commodity is converted into voice information, and the voice information is output.
- a merchandise identification device for assisting a blind person to obtain merchandise information, and the apparatus includes:
- An image acquisition module configured to perform image collection on the commodity from at least one viewing angle, and acquire image information of the commodity
- An image recognition module configured to acquire text information of the commodity according to image information of the commodity
- a voice output module configured to convert text information of the commodity into voice information, and output the voice information.
- a merchandise identification apparatus comprising: a processor and a memory, the memory for storing computer execution code, the computer execution code for controlling the processor to perform the first aspect Product identification method.
- a computer storage medium for storing computer software instructions for use in the article identification device of the third aspect, comprising program code designed to perform the article identification method of the first aspect.
- a computer program product can be directly loaded into an internal memory of a computer and contains software code, and the computer program can be implemented by the computer and can implement the product identification method described in the first aspect.
- the product identification method provided by the embodiment of the present invention firstly collects images from at least one perspective, acquires image information of the commodity, and then acquires text information of the commodity according to the image information of the commodity, and finally converts the text information of the commodity into voice information. And outputting the voice information, because the product identification device provided by the embodiment of the present invention can acquire the image information of the product and obtain the text information of the product according to the image information of the product, and finally convert the text information of the product into the voice information output, so
- the blind person can obtain the commodity information by means of voice.
- FIG. 1 is a flow chart of steps of a product identification method according to an embodiment of the present invention.
- FIG. 2 is a second flowchart of steps of a product identification method according to an embodiment of the present invention.
- FIG. 3 is a third flowchart of steps of a product identification method according to an embodiment of the present invention.
- FIG. 4 is a fourth flowchart of steps of a product identification method according to an embodiment of the present invention.
- FIG. 5 is a fifth flowchart of steps of a product identification method according to an embodiment of the present invention.
- FIG. 6 is a schematic structural diagram of a product identification device according to an embodiment of the present invention.
- Figure 7 is a second schematic structural diagram of a product identification device according to an embodiment of the present invention.
- FIG. 8 is a third schematic structural diagram of a product identification device according to an embodiment of the present invention.
- the term “and/or” is merely an association relationship describing an association object, indicating that there may be three relationships, for example, A and/or B, which may indicate that A exists separately, and A and B, there are three cases of B alone.
- the character "/" in this article generally indicates that the contextual object is an "or” relationship. Unless otherwise stated, "multiple" in this document refers to two or more.
- the words “exemplary” or “such as” are used to mean an example, illustration, or illustration. Any embodiment or design described as “exemplary” or “for example” in the embodiments of the invention should not be construed as preferred or advantageous over other embodiments or designs. Rather, the use of the words “exemplary” or “such as” is intended to present the concepts in a particular manner.
- the basic principle of the technical solution provided by the embodiment of the present invention is: the problem that the blind person cannot obtain the product information due to the vision problem, and the blind person cannot purchase the product independently.
- the technical solution provided by the embodiment of the present invention obtains the image of the product. The information then obtains the text information of the product according to the image information of the product, and finally converts the text information of the product into the voice information output, thereby enabling the blind person to obtain the information of the product by receiving the voice information, thereby solving the problem that the blind person cannot obtain the product information.
- an embodiment of the present invention provides a product identification method for assisting a blind person to obtain product information.
- the executive body of the product identification method provided by the embodiment of the present invention may A terminal device for identifying a product line by using the product identification method provided by the embodiment of the present invention.
- the terminal device may be a head-mounted blind guide device, a guide blind glasses, a mobile phone, a portable computer, a pocket computer, a handheld computer, a digital photo frame, a palmtop computer, a navigator, etc., or the terminal device may be installed and implemented by the present invention.
- the product identification method provided by the example is a software client for identifying a product line or a personal computer for a software system or software application (English name: personal computer, PC for short), a server, etc., and the specific hardware implementation environment may be a general computer form, or
- the specially designed integrated circuit (English name: Application Specific Integrated Circuit, ASIC for short) can also be (English full name: Field Programmable Gate Array, referred to as: FPGA), or some programmable expansion platform such as embedded ( English name: Tensilica) configurable processor platform, etc.
- a product identification method provided by an embodiment of the present invention includes:
- the image acquisition in the above step S11 can be implemented by one or more of an image sensor such as a monocular camera, a binocular camera, or the like.
- the image collection of the product from at least one perspective may be an image collection of the product from one perspective, or an image collection of the product from multiple perspectives.
- the image collection of the commodity from multiple perspectives can be specifically implemented by the following method:
- the user places the product within the collection range of the image capture device, and then rotates and/or moves the product, and the image capture device images the product from multiple perspectives during the user's rotation and/or movement of the product. collection.
- the image acquisition device includes a plurality of images and is respectively disposed at different positions.
- a plurality of image collection devices disposed at different positions respectively perform image collection on the products from one perspective, thereby realizing the goods from multiple perspectives. Perform image acquisition.
- the image information of the product is obtained by image capturing the product from a plurality of viewing angles, and thus the image information of the product includes an image of a plurality of viewing angle products.
- the image information of the products can be more comprehensively obtained, thereby facilitating the subsequent recognition of the image information of the products.
- the process of acquiring the text information of the product according to the image information of the commodity in the above embodiment may be completed inside the image recognition device or may be assisted by the remote service device.
- step S12 can be specifically implemented by the following steps: a. Identifying the image information of the product. b. Obtain text information of the product according to the recognition result.
- the above steps can be performed by an image processing device equipped with image recognition software.
- step S12 can be specifically implemented by: c. transmitting the product image information to the remote service device, so that the remote service device can be used for the product.
- the image information is identified, and the text information of the product is obtained according to the recognition result.
- d. Receive text information of the goods sent by the remote service device.
- the remote service device can be a cloud server or the like.
- the image information of the product can be identified, and the image image information can be processed, analyzed, and understood using any image recognition algorithm.
- the image recognition algorithm used in the process of identifying the image information of the commodity is not limited.
- the text information of the commodity can be converted into voice information by using a voice synthesis technology, and then the synthesized voice information is output through an audio output device such as a speaker, a power amplifier, a speaker, or a headphone.
- an audio output device such as a speaker, a power amplifier, a speaker, or a headphone.
- the text information of the commodity is converted into the voice information, and the voice information is outputted, which may be: directly converting the text information of the commodity into the voice information output through the voice synthesis technology, or performing the text information of the commodity.
- Keyword extraction for example, comparing the text information of the product with the keywords of the product stored in the database, and then outputting the keyword comparison result by voice.
- the product identification method provided by the embodiment of the present invention firstly collects images from at least one perspective, acquires image information of the commodity, and then acquires text information of the commodity according to the image information of the commodity, and finally converts the text information of the commodity into voice information. And outputting the voice information, because the product identification device provided by the embodiment of the present invention can acquire the image information of the product and obtain the text information of the product according to the image information of the product, and finally convert the text information of the product into the voice information output, so
- the blind person can obtain the product information by means of voice, thereby enabling the blind person to independently purchase the product.
- the embodiment of the present invention provides the following specific implementation manners to implement the product identification method shown in FIG. 1:
- the above product identification method specifically includes the following steps:
- S21 Perform image collection on the commodity from at least one perspective to obtain image information of the commodity.
- the commodity barcode in the above embodiment may be a one-dimensional barcode and/or a two-dimensional barcode.
- the one-dimensional barcode of the commodity is composed of a set of regularly arranged bars, spaces and corresponding codes.
- the two-dimensional bar code of the product is a black and white graphic that is distributed in a two-dimensional direction by a certain geometric pattern.
- the bar code symbols that form the bar code of the merchandise may include code and bar code identification of the retail merchandise, the storage and packaging package merchandise, the logistics unit, the location of the party, and the like.
- the bar code of the merchandise is printed on the merchandise package or attached to the merchandise as a bar code label. Therefore, by performing bar code recognition on the image information of the product, the product can be obtained. Bar code.
- the identification of the one-dimensional barcode of the commodity may be: firstly performing one-dimensional barcode detection and positioning, determining a region of the one-dimensional barcode in the image, and then performing one-dimensional barcode recognition on the image of the region, and reading one-dimensional
- the information such as the product number in the barcode is finally obtained by querying the information in the one-dimensional barcode.
- the identification of the two-dimensional barcode of the product may be: first performing two-dimensional barcode detection and positioning, and then performing two-dimensional barcode recognition on the image of the area, and reading the information in the two-dimensional barcode.
- the difference from the one-dimensional barcode is that the two-dimensional barcode can include more information, so the text information of the commodity can be directly obtained through the content of the two-dimensional barcode or the obtained information can be further processed to obtain the text of the commodity.
- the product information of the product manufacturer, name, price, color, etc. can be immediately recognized, and the product information is displayed in the form of text, so the product can be obtained according to the bar code of the product. Text information.
- the barcode of the product also follows the principle of uniqueness, that is, a commodity item can only have one code, or one code can only identify one commodity item. Barcodes of different products will be used for different specifications, different packaging, different varieties, different prices, and different colors. Therefore, the text information of the commodity obtained by the barcode of the commodity is relatively accurate, and the error of the commodity information provided to the user can be avoided.
- the above product identification method specifically includes the following steps:
- S31 Perform image collection on the commodity from at least one perspective to obtain image information of the commodity.
- optical character recognition refers to an optical character recognition device (such as a scanner, a digital camera, a printer, etc.) inspecting paper documents and characters on an image, determining the shape by detecting a dark and bright mode, and then using a character recognition method. The process of translating shapes into computer text.
- an optical character recognition device such as a scanner, a digital camera, a printer, etc.
- the image information of the product in the embodiment of the present invention includes at least two images, and each of the images may be optically recognized in the process of performing optical character recognition on the image information of the product. Then, the recognized characters are sorted to obtain the text information of the product, and at least two images included in the image information of the product may be first spliced and fused, and then the optical character recognition is performed to obtain the text information of the product.
- the outer packaging of the product is accompanied by a packaging label
- the packaging label will indicate the name and address of the manufacturer or seller of the product, the product name, the trademark, the composition, the quality characteristics, the number of products in the package, and the method of use. The amount, number, storage note, quality inspection number, production date and expiration date, etc., so the text information of the product can be obtained according to the characters in the product image information.
- the above product identification method specifically includes the following steps:
- S42 Feature extraction of image information of the product, and acquiring features of the product.
- the feature of the commodity in the above embodiment may be a manually designed image.
- Features acquired by the feature extractor may also be automatically learned by a machine learning method, such as by deep learning methods.
- classifying the products according to the characteristics of the products and obtaining the classification of the products may be to classify the features of the products into a classifier to obtain the classification of the products.
- the classifier may be a traditional support vector machine classifier (English name: Support Vector Machine, SVM for short) or a traditional iterative (English name: adaboost) classifier, or a classifier based on a deep learning network.
- the text information of the merchandise of the merchandise can be obtained according to the classification of the merchandise.
- the above product identification method specifically includes the following steps:
- S51 Perform image collection on the commodity from at least two perspectives to obtain image information of the commodity.
- step S52 When the barcode identification of the image information of the product is successful in step S52, steps S53 and S57 are performed. When the barcode identification of the image information of the product fails in step S52, step S54 is performed.
- step S54 When the optical character recognition of the image information of the product is successful in step S54, steps S55, S57 are performed, and the optical character of the image information of the product is performed in step S52. When the recognition fails, steps S56 and S57 are performed.
- S56 Feature extraction of image information of the product, classifying the product according to the characteristics of the product, acquiring the classification of the product, and acquiring the text information of the product according to the classification of the product.
- the step of acquiring the text information of the product in the product identification method provided in the above embodiment may also be applied to the remote server, for example, when the remote service device receives the image information of the product, the product may also be The image information is barcoded, the barcode of the commodity is obtained, and the text information of the commodity is obtained according to the barcode of the commodity, and then the text information of the commodity is sent; for example, when the remote service device receives the image information of the commodity, it may also The image information of the product is optically recognized, the characters in the image information of the product are obtained; the text information of the product is obtained according to the characters in the product image information, and then the text information of the product is re-issued; that is, the remote server may also adopt any of the above The text information acquisition method of the commodity in the implementation manner obtains the text information of the commodity.
- FIG. 6 is a schematic diagram showing a possible structure of the product identification device for assisting the blind person to acquire the product information in the above embodiment.
- the item identification device 600 includes:
- the image acquisition module 61 is configured to perform image collection on the commodity from at least one viewing angle to acquire image information of the commodity.
- the image recognition module 62 is configured to acquire text information of the product according to the image information of the product.
- the voice output module 63 is configured to convert text information of the commodity into voice information, and output the voice information.
- the product identification device includes: an image acquisition module, an image recognition module, and a voice output module, wherein the image acquisition module is configured to perform image collection on the commodity from at least one perspective, and acquire image information of the commodity, and the image recognition module The image information of the product is identified, and the text information of the product is obtained according to the recognition result.
- the voice output module is configured to convert the text information of the product into voice information, and output the voice information, because the product provided by the embodiment of the present invention
- the identification device can acquire the image information of the product and obtain the product text information according to the image information of the product, and finally convert the text information of the product into the voice information output. Therefore, the blind person can obtain the product information by means of the voice according to the embodiment of the present invention.
- the image recognition module 62 includes: a sending unit 621 and a receiving unit 622;
- the sending unit 621 is configured to send the product image information to the remote service device, so that the remote service device identifies the image information of the product, and acquires the text information of the product according to the recognition result;
- the receiving unit 622 is configured to receive text information of an item sent by the remote service device.
- the image recognition module 62 is specifically configured to perform barcode identification on the product image information, obtain a barcode of the commodity, and obtain text information of the commodity according to the barcode of the commodity.
- the image recognition module 62 is specifically configured to perform optical character recognition on the image information of the product, acquire characters in the image information of the product, and obtain text information of the product according to the characters in the image information of the product.
- the image recognition module 62 is specifically configured to perform feature extraction on the image information of the product, acquire the feature of the product, classify the product according to the feature of the product, obtain a classification of the product, and obtain text information of the product according to the classification of the product.
- the image acquisition module 61 is configured to implement step S11 shown in FIG. 1, step S21 shown in FIG. 2, step S31 shown in FIG. 3, step S41 shown in FIG. Step S51 shown in FIG. 5;
- image recognition module 62 is used to implement step S12 shown in FIG. 1, steps S22 and S23 shown in FIG. 2, steps S32 and S33 shown in FIG. 3, and step S42 shown in FIG. S43, S44 and steps S52, S53, S54, S55, S56 shown in FIG. 5;
- the voice output module 63 is used to implement step S13 shown in FIG. 1, step S24 shown in FIG. 2, and step S34 shown in FIG. Step S45 shown in FIG. 4 and step S57 shown in FIG.
- the image capturing module 61 may be one or more of an image sensor such as a monocular camera or a binocular camera.
- the image recognition module 62 may be a processor; the voice output module 63 may be an audio output device such as a speaker, a power amplifier, a speaker, a headphone, or the like.
- the programs corresponding to the operations performed by the product identification device described above may be stored in the memory of the article identification device in software, so that the processor invokes the operations corresponding to the above respective modules.
- FIG. 8 shows a possible structural diagram of the article identification device involved in the above embodiment.
- the product identification device 800 includes a processor 81, a memory 82, a system bus 83, a communication interface 84, an image acquisition device 85, and a voice output device 86.
- the processor 81 may be a processor or a collective name of a plurality of processing elements.
- the processor 81 can be a central processing unit (CPU).
- the processor 81 can also be another general purpose processor, a digital signal processing (DSP), an application specific integrated circuit (ASIC), a field-programmable gate array (FPGA), or Other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components, and the like, can implement or perform the various illustrative logical blocks, modules, and circuits described in connection with the present disclosure.
- the general purpose processor may be a microprocessor or the processor or any conventional processor or the like.
- the processor 81 may also be a dedicated processor, which may include at least one of a baseband processing chip, a radio frequency processing chip, and the like.
- the processor can also be a combination of computing functions, such as one or more microprocessor combinations, DSP and A combination of microprocessors and so on.
- the dedicated processor may also include a chip having other specialized processing functions of the device.
- the memory 82 is used to store computer execution code, and the processor 81 is connected to the memory 82 through the system bus 83.
- the processor 81 is configured to execute the computer execution code stored in the memory 82 to execute any of the embodiments provided by the embodiments of the present invention.
- a product identification method for example, the processor 81 is configured to support the mobile terminal to perform step S12 shown in FIG. 1 , steps S22 and S23 shown in FIG. 2 , steps S32 and S33 shown in FIG. 3 , and FIG. Steps S42, S43, S44 and steps S52, S53, S54, S55, S56 shown in Fig. 5, and/or other processes for the techniques described herein, the specific article identification method can be referred to above and in the drawings The related description is not repeated here.
- the system bus 83 can include a data bus, a power bus, a control bus, and a signal status bus. For the sake of clarity in the present embodiment, various buses are illustrated as the system bus 83 in FIG.
- Communication interface 84 may specifically be a transceiver on the device.
- the transceiver can be a wireless transceiver.
- the wireless transceiver can be an antenna or the like of the device.
- the processor 81 communicates with other devices via the communication interface 84, for example, if the device is a module or component of the terminal device, the device is for data interaction with other modules in the terminal device.
- the steps of the method described in connection with the present disclosure may be implemented in a hardware manner, or may be implemented by a processor executing software instructions.
- the embodiment of the present invention further provides a storage medium for storing computer software instructions used by the mobile terminal shown in FIG. 8 , which is designed to execute the product identification method shown in FIG. 1 , 2 , 3 , 4 , and 5 . code.
- the software instructions may be composed of corresponding software modules, and the software modules may be stored in a random access memory (English: random access memory, abbreviation: RAM), flash memory, read only memory (English: read only memory, abbreviation: ROM) , erasable programmable read-only memory (English: erasable programmable ROM, abbreviation: EPROM), electrically erasable programmable read-only memory (English: electrical EPROM, abbreviation: EEPROM), registers, hard disk, mobile hard disk, CD-ROM (CD-ROM) or any other form of storage medium known in the art.
- An exemplary storage medium is coupled to the processor to enable the processor to read information from the storage medium and Write information to the storage medium.
- the storage medium can also be an integral part of the processor.
- the processor and the storage medium can be located in an ASIC.
- the ASIC can be located in a core network interface device.
- the processor and the storage medium may also exist as discrete components in the core network interface device.
- the embodiment of the invention further provides a computer program product, which can be directly loaded into the internal memory of the computer and contains software code, and the computer program can be loaded and executed by the computer to implement the figures 1, 2, 3, and 4.
- the functions described herein can be implemented in hardware, software, firmware, or any combination thereof.
- the functions may be stored in a computer readable medium or transmitted as one or more instructions or code on a computer readable medium.
- Computer readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one location to another.
- a storage medium may be any available media that can be accessed by a general purpose or special purpose computer.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
A method and a device for identifying commodities, which relate to the field of commodity identification technology. The present invention is used for assisting the blind in acquiring commodity information. The method comprises: collecting images of a commodity from at least one viewing angle to acquire image information of the commodity (S11); acquiring, according to the image information of the commodity, text information of the commodity (S12); and converting the text information of the commodity into voice information, and outputting the voice information (S13). The method is used for assisting the blind in identifying commodities.
Description
本发明涉及商品识别技术领域,尤其涉及一种商品识别方法和装置。The present invention relates to the field of commodity identification technology, and in particular, to a commodity identification method and apparatus.
盲是世界上严重的社会和公共卫生问题之一。人类70%以上的信息都是通过视觉来获取的,视力问题很大程度上限制了盲人进行信息获取。Blindness is one of the serious social and public health problems in the world. More than 70% of human information is obtained through vision, and vision problems largely limit access to information for blind people.
由于视力问题,盲人的生活中存在诸多不便,尤其是在商场进行购物时,面对商场中品类繁多的商品,盲人无法独立获取种类、型号、价格等商品的信息,因此对于盲人而言,在较大的空间范围和繁多的商品中挑选出自己需要的商品,独立完成购物,是非常困难的。一情况下盲人在购物时是通过导购员讲解来获取商品信息的,但这不仅给盲人购物带来了很大的不便,也大大增大了导购员的工作量,耗费了大量的人力和时间成本。因此,如何辅助盲人获取商品信息是亟待解决的一个问题。Due to vision problems, there are many inconveniences in the lives of blind people. Especially when shopping in shopping malls, in the face of a wide variety of goods in the mall, blind people cannot independently obtain information on the types, models, prices, etc., so for blind people, It is very difficult to select the products that you need from a large space and a large number of products, and complete the shopping independently. In a case, the blind person obtains the product information through the shopping guide when shopping, but this not only brings great inconvenience to the blind shopping, but also greatly increases the workload of the shopping guide, and consumes a lot of manpower and time. cost. Therefore, how to assist blind people to obtain product information is an urgent problem to be solved.
发明内容Summary of the invention
本发明的实施例提供一种商品识别方法和装置,用于辅助盲人获取商品信息。Embodiments of the present invention provide a product identification method and apparatus for assisting a blind person to obtain product information.
为达到上述目的,本发明的实施例采用如下技术方案:In order to achieve the above object, embodiments of the present invention adopt the following technical solutions:
第一方面,提供一种商品识别方法,用于辅助盲人获取商品信息,所述方法包括:In a first aspect, a method for identifying a product for assisting a blind person to obtain product information is provided, the method comprising:
从至少一个视角对商品进行图像采集,获取所述商品的图像信息;
Performing image collection on the commodity from at least one perspective to obtain image information of the commodity;
根据所述商品的图像信息获取所述商品的文本信息;Obtaining text information of the commodity according to image information of the commodity;
将所述商品的文本信息转换为语音信息,并对所述语音信息进行输出。The text information of the commodity is converted into voice information, and the voice information is output.
第二方面,提供一种商品识别装置,用于辅助盲人获取商品信息,所述装置包括:In a second aspect, a merchandise identification device is provided for assisting a blind person to obtain merchandise information, and the apparatus includes:
图像采集模块,用于从至少一个视角对商品进行图像采集,获取所述商品的图像信息;An image acquisition module, configured to perform image collection on the commodity from at least one viewing angle, and acquire image information of the commodity;
图像识别模块,用于根据所述商品的图像信息获取所述商品的文本信息;An image recognition module, configured to acquire text information of the commodity according to image information of the commodity;
语音输出模块,用于将所述商品的文本信息转换为语音信息,并对所述语音信息进行输出。a voice output module, configured to convert text information of the commodity into voice information, and output the voice information.
第三方面,提供一种商品识别装置,所述装置包括:处理器和存储器,所述存储器用于存储计算机执行代码,所述计算机执行代码用于控制所述处理器执行第一方面所述的商品识别方法。In a third aspect, a merchandise identification apparatus is provided, the apparatus comprising: a processor and a memory, the memory for storing computer execution code, the computer execution code for controlling the processor to perform the first aspect Product identification method.
第四方面,提供一种计算机存储介质,用于储存为第三方面所述的商品识别装置所用的计算机软件指令,其包含执行第一方面所述的商品识别方法所设计的程序代码。According to a fourth aspect, there is provided a computer storage medium for storing computer software instructions for use in the article identification device of the third aspect, comprising program code designed to perform the article identification method of the first aspect.
第五方面,提供一种计算机程序产品,可直接加载到计算机的内部存储器中,并含有软件代码,所述计算机程序经由计算机载入并执行后能够实现第一方面所述的商品识别方法。In a fifth aspect, a computer program product is provided that can be directly loaded into an internal memory of a computer and contains software code, and the computer program can be implemented by the computer and can implement the product identification method described in the first aspect.
本发明的实施例提供的商品识别方法首先从至少一个视角对商品进行图像采集,获取商品的图像信息,然后根据商品的图像信息获取商品的文本信息,最后将商品的文本信息转换为语音信息,并对语音信息进行输出,由于本发明实施例提供的商品识别装置可以获取商品的图像信息并根据商品的图像信息获取商品的文本信息,最后将商品的文本信息转换为语音信息输出,所以通过本发明实施例盲人可以通过语音的方式获取商品信息。
The product identification method provided by the embodiment of the present invention firstly collects images from at least one perspective, acquires image information of the commodity, and then acquires text information of the commodity according to the image information of the commodity, and finally converts the text information of the commodity into voice information. And outputting the voice information, because the product identification device provided by the embodiment of the present invention can acquire the image information of the product and obtain the text information of the product according to the image information of the product, and finally convert the text information of the product into the voice information output, so In the embodiment of the invention, the blind person can obtain the commodity information by means of voice.
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the embodiments or the description of the prior art will be briefly described below. Obviously, the drawings in the following description are only It is a certain embodiment of the present invention, and other drawings can be obtained from those skilled in the art without any creative work.
图1为本发明的实施例提供的商品识别方法的步骤流程图之一;1 is a flow chart of steps of a product identification method according to an embodiment of the present invention;
图2为本发明的实施例提供的商品识别方法的步骤流程图之二;2 is a second flowchart of steps of a product identification method according to an embodiment of the present invention;
图3为本发明的实施例提供的商品识别方法的步骤流程图之三;3 is a third flowchart of steps of a product identification method according to an embodiment of the present invention;
图4为本发明的实施例提供的商品识别方法的步骤流程图之四;4 is a fourth flowchart of steps of a product identification method according to an embodiment of the present invention;
图5为本发明的实施例提供的商品识别方法的步骤流程图之五;FIG. 5 is a fifth flowchart of steps of a product identification method according to an embodiment of the present invention; FIG.
图6为本发明的实施例提供的商品识别装置的示意性结构图之一;FIG. 6 is a schematic structural diagram of a product identification device according to an embodiment of the present invention; FIG.
图7为本发明的实施例提供的商品识别装置的示意性结构图之二;Figure 7 is a second schematic structural diagram of a product identification device according to an embodiment of the present invention;
图8为本发明的实施例提供的商品识别装置的示意性结构图之三。FIG. 8 is a third schematic structural diagram of a product identification device according to an embodiment of the present invention.
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。需要说明的是,下文所提供
的任意多个技术方案中的部分或全部技术特征在不冲突的情况下,可以结合使用,形成新的技术方案。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the accompanying drawings in the embodiments of the present invention. It is obvious that the described embodiments are only a part of the embodiments of the present invention, but not all embodiments. It should be noted that the following provides
Some or all of the technical features of any of the various technical solutions may be combined to form a new technical solution without conflict. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
在本发明实施例中术语“和/或”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,本文中字符“/”,一般表示前后关联对象是一种“或”的关系。如果不加说明,本文中的“多个”是指两个或两个以上。In the embodiment of the present invention, the term “and/or” is merely an association relationship describing an association object, indicating that there may be three relationships, for example, A and/or B, which may indicate that A exists separately, and A and B, there are three cases of B alone. In addition, the character "/" in this article generally indicates that the contextual object is an "or" relationship. Unless otherwise stated, "multiple" in this document refers to two or more.
在本发明实施例中,“示例性的”或者“例如”等词用于表示作例子、例证或说明。本发明实施例中被描述为“示例性的”或者“例如”的任何实施例或设计方案不应被解释为比其它实施例或设计方案更优选或更具优势。确切而言,使用“示例性的”或者“例如”等词旨在以具体方式呈现相关概念。In the embodiments of the present invention, the words "exemplary" or "such as" are used to mean an example, illustration, or illustration. Any embodiment or design described as "exemplary" or "for example" in the embodiments of the invention should not be construed as preferred or advantageous over other embodiments or designs. Rather, the use of the words "exemplary" or "such as" is intended to present the concepts in a particular manner.
需要说明的是,本发明实施例中,除非另有说明,“多个”的含义是指两个或两个以上。It should be noted that, in the embodiments of the present invention, the meaning of "a plurality" means two or more unless otherwise stated.
还需要说明的是,本发明实施例中,“的(英文:of)”,“相应的(英文:corresponding,relevant)”和“对应的(英文:corresponding)”有时可以混用,应当指出的是,在不强调其区别时,其所要表达的含义是一致的。It should be noted that, in the embodiment of the present invention, "(English: of)", "corresponding (relevant)" and "corresponding" may sometimes be mixed, and it should be noted that When the difference is not emphasized, the meaning to be expressed is the same.
本发明实施例所提供的技术方案的基本原理为:针对盲人因视力问题而无法获取商品信息,进而导致盲人不能独立的进行购物的问题,本发明实施例所提供的技术方案通过获取商品的图像信息,然后根据商品的图像信息获取商品的文本信息,最后将商品的文本信息转换为语音信息输出,进而使盲人通过接收语音信息获取商品的信息,从而解决上述盲人无法获取商品信息的问题。The basic principle of the technical solution provided by the embodiment of the present invention is: the problem that the blind person cannot obtain the product information due to the vision problem, and the blind person cannot purchase the product independently. The technical solution provided by the embodiment of the present invention obtains the image of the product. The information then obtains the text information of the product according to the image information of the product, and finally converts the text information of the product into the voice information output, thereby enabling the blind person to obtain the information of the product by receiving the voice information, thereby solving the problem that the blind person cannot obtain the product information.
基于上述内容,本发明实施例提供一种用于辅助盲人获取商品信息的商品识别方法。Based on the above content, an embodiment of the present invention provides a product identification method for assisting a blind person to obtain product information.
示例性的,本发明实施例提供的商品识别方法的执行主体可以
为采用本发明实施例提供的商品识别方法对商品行识别的终端设备。终端设备可以为头戴式导盲装置、导盲眼镜、手机、便携式计算机、袖珍式计算机、手持式计算机、数码相框、掌上电脑、导航仪等,或者终端设备可以为安装有可以采用本发明实施例提供的商品识别方法对商品行识别的软件客户端或软件系统或软件应用的个人计算机(英文全称:personal computer,简称:PC)、服务器等,具体的硬件实现环境可以通用计算机形式,或者是专门设计的集成电路(英文全称:Application Specific Integrated Circuit,简称:ASIC)的方式,也可以是(英文全称:Field Programmable Gate Array,简称:FPGA),或者是一些可编程的扩展平台例如嵌入式(英文名称:Tensilica)的可配置处理器平台等。Exemplarily, the executive body of the product identification method provided by the embodiment of the present invention may
A terminal device for identifying a product line by using the product identification method provided by the embodiment of the present invention. The terminal device may be a head-mounted blind guide device, a guide blind glasses, a mobile phone, a portable computer, a pocket computer, a handheld computer, a digital photo frame, a palmtop computer, a navigator, etc., or the terminal device may be installed and implemented by the present invention. The product identification method provided by the example is a software client for identifying a product line or a personal computer for a software system or software application (English name: personal computer, PC for short), a server, etc., and the specific hardware implementation environment may be a general computer form, or The specially designed integrated circuit (English name: Application Specific Integrated Circuit, ASIC for short) can also be (English full name: Field Programmable Gate Array, referred to as: FPGA), or some programmable expansion platform such as embedded ( English name: Tensilica) configurable processor platform, etc.
参照图1所示,本发明实施例提供的商品识别方法包括:Referring to FIG. 1 , a product identification method provided by an embodiment of the present invention includes:
S11、从至少一个视角对商品进行图像采集,获取商品的图像信息。S11. Perform image collection on the commodity from at least one perspective to obtain image information of the commodity.
示例性的,可以通过单目摄像头、双目摄像头等图像传感器中的一种或多种来实现上述步骤S11中的图像采集。Exemplarily, the image acquisition in the above step S11 can be implemented by one or more of an image sensor such as a monocular camera, a binocular camera, or the like.
可选的,上述S11中从至少一个视角对商品进行图像采集具体可以为从一个视角对商品进行图像采集,也可以为从多个视角对商品进行图像采集。示例性的,本发明实施例中具体可以通过如下方法实现从多个视角对商品进行图像采集:Optionally, in the foregoing S11, the image collection of the product from at least one perspective may be an image collection of the product from one perspective, or an image collection of the product from multiple perspectives. Exemplarily, in the embodiment of the present invention, the image collection of the commodity from multiple perspectives can be specifically implemented by the following method:
一、用户将商品放置于图像采集装置的采集范围之内,然后对商品进行转动和/或移动,图像采集装置在用户对商品进行转动和/或移动的过程中从多个视角对商品进行图像采集。1. The user places the product within the collection range of the image capture device, and then rotates and/or moves the product, and the image capture device images the product from multiple perspectives during the user's rotation and/or movement of the product. collection.
二、图像采集装置包括多个且分别设置于不同的位置,当进行图像采集时,多个设置于不同位置的图像采集装置分别从一个视角对商品进行图像采集,从而实现从多个视角对商品进行图像采集。Second, the image acquisition device includes a plurality of images and is respectively disposed at different positions. When performing image acquisition, a plurality of image collection devices disposed at different positions respectively perform image collection on the products from one perspective, thereby realizing the goods from multiple perspectives. Perform image acquisition.
还需要说明的是,当从多个视角对商品进行图像采集时,由于
商品的图像信息是通过从多个视角对商品进行图像采集获取的,因此商品的图像信息包括:多个视角商品的图像。通过从多个视角对商品进行图像采集可以更加全面的获取商品的图像信息,进而有利后继对商品的图像信息进行识别。It should also be noted that when images are collected from multiple perspectives,
The image information of the product is obtained by image capturing the product from a plurality of viewing angles, and thus the image information of the product includes an image of a plurality of viewing angle products. By collecting images from a plurality of perspectives, the image information of the products can be more comprehensively obtained, thereby facilitating the subsequent recognition of the image information of the products.
S12、根据商品的图像信息获取商品的文本信息。S12. Acquire text information of the product according to image information of the product.
上述实施例中的根据商品的图像信息获取商品的文本信息的过程可以在图像识别装置内部完成,也可以通过远程服务设备协助完成。The process of acquiring the text information of the product according to the image information of the commodity in the above embodiment may be completed inside the image recognition device or may be assisted by the remote service device.
当根据商品的图像信息获取商品的文本信息的过程在图像识别装置内部完成时,步骤S12具体可以通过如下步骤实现:a、对商品的图像信息进行识别。b、根据识别结果获取商品的文本信息。示例性的,可以通过安装有图像识别软件的图像处理设备来执行上述步骤。When the process of acquiring the text information of the product according to the image information of the product is completed inside the image recognition device, step S12 can be specifically implemented by the following steps: a. Identifying the image information of the product. b. Obtain text information of the product according to the recognition result. Illustratively, the above steps can be performed by an image processing device equipped with image recognition software.
当根据商品的图像信息获取商品的文本信息的过程通过远程服务设备协助完成时,步骤S12具体可以通过如下步骤实现:c、将商品图像信息发送至远端服务设备,以便远端服务设备对商品的图像信息进行识别,并根据识别结果获取商品的文本信息。d、接收远端服务设备发送的商品的文本信息。示例性的,远端服务设备可以为云端服务器等。When the process of acquiring the text information of the product according to the image information of the product is completed by the remote service device, step S12 can be specifically implemented by: c. transmitting the product image information to the remote service device, so that the remote service device can be used for the product. The image information is identified, and the text information of the product is obtained according to the recognition result. d. Receive text information of the goods sent by the remote service device. Exemplarily, the remote service device can be a cloud server or the like.
此外,对商品的图像信息进行识别可以采用任一种图像识别算法对商品图像信息进行处理、分析和理解。本发明实施例中对商品的图像信息识别过程中采用的图像识别算法不做限定。In addition, the image information of the product can be identified, and the image image information can be processed, analyzed, and understood using any image recognition algorithm. In the embodiment of the present invention, the image recognition algorithm used in the process of identifying the image information of the commodity is not limited.
S13、将商品的文本信息转换为语音信息,并对语音信息进行输出。S13. Convert the text information of the commodity into voice information, and output the voice information.
具体的,可以通过语音合成技术将商品的文本信息转换为语音信息,然后通过扬声器、功放机、音箱、耳机等音频输出设备将合成的语音信息输出。
Specifically, the text information of the commodity can be converted into voice information by using a voice synthesis technology, and then the synthesized voice information is output through an audio output device such as a speaker, a power amplifier, a speaker, or a headphone.
需要说明的是,将商品的文本信息转换为语音信息,并对语音信息进行输出具体可以为:直接通过语音合成技术将商品的文本信息转换为语音信息输出,也可以为将商品的文本信息进行关键字提取,比如将商品的文本信息与数据库中存储的商品的关键字进行比对,然后语音输出关键字比对结果。It should be noted that the text information of the commodity is converted into the voice information, and the voice information is outputted, which may be: directly converting the text information of the commodity into the voice information output through the voice synthesis technology, or performing the text information of the commodity. Keyword extraction, for example, comparing the text information of the product with the keywords of the product stored in the database, and then outputting the keyword comparison result by voice.
本发明的实施例提供的商品识别方法首先从至少一个视角对商品进行图像采集,获取商品的图像信息,然后根据商品的图像信息获取商品的文本信息,最后将商品的文本信息转换为语音信息,并对语音信息进行输出,由于本发明实施例提供的商品识别装置可以获取商品的图像信息并根据商品的图像信息获取商品的文本信息,最后将商品的文本信息转换为语音信息输出,所以通过本发明实施例盲人可以通过语音的方式获取商品信息,进而可以使盲人可以独立的进行购物。The product identification method provided by the embodiment of the present invention firstly collects images from at least one perspective, acquires image information of the commodity, and then acquires text information of the commodity according to the image information of the commodity, and finally converts the text information of the commodity into voice information. And outputting the voice information, because the product identification device provided by the embodiment of the present invention can acquire the image information of the product and obtain the text information of the product according to the image information of the product, and finally convert the text information of the product into the voice information output, so In the embodiment of the invention, the blind person can obtain the product information by means of voice, thereby enabling the blind person to independently purchase the product.
进一步的,本发明实施例提供如下几种具体实现方式来实现图1所示的商品识别方法:Further, the embodiment of the present invention provides the following specific implementation manners to implement the product identification method shown in FIG. 1:
一、One,
参照图2所示,上述商品识别方法具体包括如下步骤:Referring to FIG. 2, the above product identification method specifically includes the following steps:
S21、从至少一个视角对商品进行图像采集,获取商品的图像信息。S21: Perform image collection on the commodity from at least one perspective to obtain image information of the commodity.
S22、对商品的图像信息进行条码识别,获取商品的条码。S22. Perform bar code recognition on the image information of the product, and obtain a barcode of the product.
具体的,上述实施例中商品条码可以为一维条码和/或二维条码。其中,商品的一维条码由一组规则排列的条、空及其对应代码组成。商品的二维条码是用特定的几何图形按一定规律在二维方向上分布的黑白相间的图形。其形成商品条码的条码符号可以包括零售商品、储运包装商品、物流单元、参与方位置等等的代码与条码标识。通常商品的条码印在商品包装上,或将其制成条码标签附在商品上。因此通过对商品的图像信息进行条码识别可以获取商品的
条码。Specifically, the commodity barcode in the above embodiment may be a one-dimensional barcode and/or a two-dimensional barcode. The one-dimensional barcode of the commodity is composed of a set of regularly arranged bars, spaces and corresponding codes. The two-dimensional bar code of the product is a black and white graphic that is distributed in a two-dimensional direction by a certain geometric pattern. The bar code symbols that form the bar code of the merchandise may include code and bar code identification of the retail merchandise, the storage and packaging package merchandise, the logistics unit, the location of the party, and the like. Usually the bar code of the merchandise is printed on the merchandise package or attached to the merchandise as a bar code label. Therefore, by performing bar code recognition on the image information of the product, the product can be obtained.
Bar code.
对商品的一维条码别进行识别具体可以为:先进行一维条码检测和定位,确定一维条码的在图像中的区域,然后在对该区域的图像进行一维条码识别,读取一维条码中的商品编号等的信息,最后根据一维条码中的信息查询获取商品文本信息。同样对商品的二维条码进行识别具体可以为:先进行二维条码检测和定位然后在对该区域的图像进行二维条码识别,读取二维条码中的信息。与一维条码不同之处在于,二维条码中可以包括更多信息,因此可以通过二维条码的内容直接获取商品的文本信息或者对获得信息进行进一步处理获得商品的文本。The identification of the one-dimensional barcode of the commodity may be: firstly performing one-dimensional barcode detection and positioning, determining a region of the one-dimensional barcode in the image, and then performing one-dimensional barcode recognition on the image of the region, and reading one-dimensional The information such as the product number in the barcode is finally obtained by querying the information in the one-dimensional barcode. Similarly, the identification of the two-dimensional barcode of the product may be: first performing two-dimensional barcode detection and positioning, and then performing two-dimensional barcode recognition on the image of the area, and reading the information in the two-dimensional barcode. The difference from the one-dimensional barcode is that the two-dimensional barcode can include more information, so the text information of the commodity can be directly obtained through the content of the two-dimensional barcode or the obtained information can be further processed to obtain the text of the commodity.
S23、根据商品的条码获取商品的文本信息。S23. Obtain text information of the product according to the barcode of the product.
对商品的条码进行查询和数据处理,可立即识别出商品制造厂商、名称、价格、颜色等商品信息,并且这些商品信息都是通过文本的形式显示出来的,所以可以根据商品的条码获取商品的文本信息。By querying and processing the bar code of the product, the product information of the product manufacturer, name, price, color, etc. can be immediately recognized, and the product information is displayed in the form of text, so the product can be obtained according to the bar code of the product. Text information.
此外,商品的条码还遵循唯一性原则,即一个商品项目只能有一个代码,或者说一个代码只能标识一种商品项目。不同规格、不同包装、不同品种、不同价格、不同颜色的商品均会使用不同的商品的条码。因此通过商品的条码获取的商品的文本信息相对准确,可以避免向用户提供的商品信息错误。In addition, the barcode of the product also follows the principle of uniqueness, that is, a commodity item can only have one code, or one code can only identify one commodity item. Barcodes of different products will be used for different specifications, different packaging, different varieties, different prices, and different colors. Therefore, the text information of the commodity obtained by the barcode of the commodity is relatively accurate, and the error of the commodity information provided to the user can be avoided.
S24、将商品的文本信息转换为语音信息,并对语音信息进行输出。S24. Convert the text information of the commodity into voice information, and output the voice information.
二、two,
参照图3所示,上述商品识别方法具体包括如下步骤:Referring to FIG. 3, the above product identification method specifically includes the following steps:
S31、从至少一个视角对商品进行图像采集,获取商品的图像信息。S31. Perform image collection on the commodity from at least one perspective to obtain image information of the commodity.
S32、对商品的图像信息进行光学字符识别(英文名称:Optical
Character Recognition,英文简称:OCR),获取商品的图像信息中的字符。S32. Perform optical character recognition on image information of the product (English name: Optical
Character Recognition, English abbreviation: OCR), obtain the characters in the image information of the product.
具体的,光学字符识别是指光学字符识别设备(例如扫描仪、数码相机、打印机等)检查纸质文档、图像上的字符,通过检测暗、亮的模式确定其形状,然后用字符识别方法将形状翻译成计算机文字的过程。Specifically, optical character recognition refers to an optical character recognition device (such as a scanner, a digital camera, a printer, etc.) inspecting paper documents and characters on an image, determining the shape by detecting a dark and bright mode, and then using a character recognition method. The process of translating shapes into computer text.
可选的,如上所述,本发明实施例中的商品的图像信息包括至少两张图像,在对商品的图像信息进行光学字符识别的过程中可以先分别对每一张图像进行光学字符识别,然后再将识别的字符整理获取商品的文本信息,也可以先对商品的图像信息中包括的至少两张图像进行拼接、融合处理,然后再进行光学字符识别获取商品的文本信息。Optionally, as described above, the image information of the product in the embodiment of the present invention includes at least two images, and each of the images may be optically recognized in the process of performing optical character recognition on the image information of the product. Then, the recognized characters are sorted to obtain the text information of the product, and at least two images included in the image information of the product may be first spliced and fused, and then the optical character recognition is performed to obtain the text information of the product.
S33、根据商品图像信息中的字符获取商品的文本信息。S33. Acquire text information of the product according to characters in the product image information.
通常,商品的外包装都会附有包装标签,而包装标签会通过文字方式表明商品的制造者或销售者的名称和地址、产品名称、商标、成分、品质特点、包装内产品数量、使用方法及用量、编号、贮藏应注意的事项、质检号、生产日期和有效期等内容,所以可以根据商品图像信息中的字符获取商品的文本信息。Usually, the outer packaging of the product is accompanied by a packaging label, and the packaging label will indicate the name and address of the manufacturer or seller of the product, the product name, the trademark, the composition, the quality characteristics, the number of products in the package, and the method of use. The amount, number, storage note, quality inspection number, production date and expiration date, etc., so the text information of the product can be obtained according to the characters in the product image information.
S34、将商品的文本信息转换为语音信息,并对语音信息进行输出。S34. Convert the text information of the commodity into voice information, and output the voice information.
三、three,
参照图4所示,上述商品识别方法具体包括如下步骤:Referring to FIG. 4, the above product identification method specifically includes the following steps:
S41、从至少一个视角对商品进行图像采集,获取商品的图像信息。S41. Perform image collection on the commodity from at least one perspective to obtain image information of the commodity.
S42、对商品的图像信息进行特征提取,获取商品的特征。S42: Feature extraction of image information of the product, and acquiring features of the product.
可选的,上述实施例中的商品的特征可以是通过人工设计的图
像特征提取器获取的特征,也可以是通过机器学习方法,比如通过深度学习方法自动学习获取的图像特征。Optionally, the feature of the commodity in the above embodiment may be a manually designed image.
Features acquired by the feature extractor may also be automatically learned by a machine learning method, such as by deep learning methods.
S43、根据商品的特征对商品进行分类,获取商品的分类。S43. Sort the products according to the characteristics of the products, and obtain the classification of the products.
示例性的,根据商品的特征对商品进行分类并获取商品的分类可以为将商品的特征输入分类器进行分类,获取商品的分类。其中,分类器可以是传统的支持向量机分类器(英文名称:Support Vector Machine,简称:SVM)或者传统的迭代(英文名称:adaboost)分类器,也可以是基于深度学习网络的分类器。Illustratively, classifying the products according to the characteristics of the products and obtaining the classification of the products may be to classify the features of the products into a classifier to obtain the classification of the products. The classifier may be a traditional support vector machine classifier (English name: Support Vector Machine, SVM for short) or a traditional iterative (English name: adaboost) classifier, or a classifier based on a deep learning network.
S44、根据商品的分类获取商品的文本信息。S44. Obtain text information of the product according to the classification of the product.
在上述步骤S43中获取商品的分类后,可以根据商品的分类获取该类别的商品的文本信息。After the classification of the merchandise is obtained in the above step S43, the text information of the merchandise of the merchandise can be obtained according to the classification of the merchandise.
S45、将商品的文本信息转换为语音信息,并对语音信息进行输出。S45. Convert the text information of the commodity into voice information, and output the voice information.
四、four,
参照图5所示,上述商品识别方法具体包括如下步骤:Referring to FIG. 5, the above product identification method specifically includes the following steps:
S51、从至少两个视角对商品进行图像采集,获取商品的图像信息。S51. Perform image collection on the commodity from at least two perspectives to obtain image information of the commodity.
S52、对商品的图像信息进行条码识别。S52. Perform barcode identification on image information of the product.
当步骤S52中对商品的图像信息进行条码识别成功时,执行步骤S53、S57,当步骤S52中对商品的图像信息进行条码识别失败时,执行步骤S54。When the barcode identification of the image information of the product is successful in step S52, steps S53 and S57 are performed. When the barcode identification of the image information of the product fails in step S52, step S54 is performed.
S53、获取商品的条码,根据商品的条码获取商品的文本信息。S53. Obtain a barcode of the product, and obtain text information of the commodity according to the barcode of the commodity.
S54、对商品的图像信息进行光学字符识别。S54. Perform optical character recognition on image information of the product.
当步骤S54中对商品的图像信息进行光学字符识别成功时,执行步骤S55、S57,当步骤S52中对商品的图像信息进行光学字符
识别失败时,执行步骤S56、S57。When the optical character recognition of the image information of the product is successful in step S54, steps S55, S57 are performed, and the optical character of the image information of the product is performed in step S52.
When the recognition fails, steps S56 and S57 are performed.
S55、获取商品的图像信息中的字符,根据商品图像信息中的字符获取商品的文本信息。S55. Acquire characters in the image information of the product, and obtain text information of the product according to the characters in the product image information.
S56、对商品的图像信息进行特征提取,根据商品的特征对商品进行分类,获取商品的分类,根据商品的分类获取商品的文本信息。S56: Feature extraction of image information of the product, classifying the product according to the characteristics of the product, acquiring the classification of the product, and acquiring the text information of the product according to the classification of the product.
S57、将商品的文本信息转换为语音信息,并对语音信息进行输出。S57. Convert the text information of the commodity into voice information, and output the voice information.
需要说明的是,上述实施例中提供的商品识别方法中获取商品的文本信息的步骤还可以应用于远端服务器中,例如:当远端服务设备接收到商品的图像信息时,也可以对商品的图像信息进行条码识别,获取商品的条码,并根据商品的条码获取商品的文本信息,然后再发出商品的文本信息;再例如:当远端服务设备接收到商品的图像信息时,也可以对商品的图像信息进行光学字符识别,获取商品的图像信息中的字符;根据商品图像信息中的字符获取商品的文本信息,然后用再发出商品的文本信息;即远端服务器也可以采用上述任一种实现方式中的商品的文本信息获取方法来获取商品的文本信息。It should be noted that the step of acquiring the text information of the product in the product identification method provided in the above embodiment may also be applied to the remote server, for example, when the remote service device receives the image information of the product, the product may also be The image information is barcoded, the barcode of the commodity is obtained, and the text information of the commodity is obtained according to the barcode of the commodity, and then the text information of the commodity is sent; for example, when the remote service device receives the image information of the commodity, it may also The image information of the product is optically recognized, the characters in the image information of the product are obtained; the text information of the product is obtained according to the characters in the product image information, and then the text information of the product is re-issued; that is, the remote server may also adopt any of the above The text information acquisition method of the commodity in the implementation manner obtains the text information of the commodity.
下面说明本发明实施例提供的与上文所提供的方法实施例相对应的装置实施例。需要说明的是,下述装置实施例中相关内容的解释,均可以参考上述方法实施例。The device embodiments corresponding to the method embodiments provided above are provided in the following description of the embodiments of the present invention. It should be noted that the explanation of the related content in the following device embodiments can refer to the foregoing method embodiments.
在采用对应各个功能划分各个功能模块的情况下,图6示出了上述实施例中所涉及的用于辅助盲人获取商品信息的商品识别装置的一种可能的结构示意图。商品识别装置600包括:In the case where the respective functional modules are divided by the corresponding functions, FIG. 6 is a schematic diagram showing a possible structure of the product identification device for assisting the blind person to acquire the product information in the above embodiment. The item identification device 600 includes:
图像采集模块61,用于从至少一个视角对商品进行图像采集,获取商品的图像信息。The image acquisition module 61 is configured to perform image collection on the commodity from at least one viewing angle to acquire image information of the commodity.
图像识别模块62,用于根据商品的图像信息获取商品的文本信息。
The image recognition module 62 is configured to acquire text information of the product according to the image information of the product.
语音输出模块63,用于将商品的文本信息转换为语音信息,并对语音信息进行输出。The voice output module 63 is configured to convert text information of the commodity into voice information, and output the voice information.
本发明的实施例提供的商品识别装置包括:图像采集模块、图像识别模块以及语音输出模块,其中,图像采集模块用于从至少一个视角对商品进行图像采集,获取商品的图像信息,图像识别模块用于对商品的图像信息进行识别,并根据识别结果获取商品的文本信息,语音输出模块用于将商品的文本信息转换为语音信息,并对语音信息进行输出,由于本发明实施例提供的商品识别装置可以获取商品的图像信息并根据商品的图像信息获取商品文本信息,最后将商品的文本信息转换为语音信息输出,所以通过本发明实施例盲人可以通过语音的方式获取商品信息。The product identification device provided by the embodiment of the present invention includes: an image acquisition module, an image recognition module, and a voice output module, wherein the image acquisition module is configured to perform image collection on the commodity from at least one perspective, and acquire image information of the commodity, and the image recognition module The image information of the product is identified, and the text information of the product is obtained according to the recognition result. The voice output module is configured to convert the text information of the product into voice information, and output the voice information, because the product provided by the embodiment of the present invention The identification device can acquire the image information of the product and obtain the product text information according to the image information of the product, and finally convert the text information of the product into the voice information output. Therefore, the blind person can obtain the product information by means of the voice according to the embodiment of the present invention.
可选的,参照图7所示,图像识别模块62包括:发送单元621和接收单元622;Optionally, referring to FIG. 7, the image recognition module 62 includes: a sending unit 621 and a receiving unit 622;
发送单元621用于将商品图像信息发送至远端服务设备,以便远端服务设备对商品的图像信息进行识别,并根据识别结果获取商品的文本信息;The sending unit 621 is configured to send the product image information to the remote service device, so that the remote service device identifies the image information of the product, and acquires the text information of the product according to the recognition result;
接收单元622用于接收远端服务设备发送的商品的文本信息。The receiving unit 622 is configured to receive text information of an item sent by the remote service device.
可选的,图像识别模块62具体用于对商品图像信息进行条码识别,获取商品的条码,根据商品的条码获取商品的文本信息。Optionally, the image recognition module 62 is specifically configured to perform barcode identification on the product image information, obtain a barcode of the commodity, and obtain text information of the commodity according to the barcode of the commodity.
可选的,图像识别模块62具体用于对商品的图像信息进行光学字符识别,获取商品的图像信息中的字符,根据商品的图像信息中的字符获取商品的文本信息。Optionally, the image recognition module 62 is specifically configured to perform optical character recognition on the image information of the product, acquire characters in the image information of the product, and obtain text information of the product according to the characters in the image information of the product.
可选的,图像识别模块62具体用于对商品的图像信息进行特征提取,获取商品的特征;根据商品的特征对商品进行分类,获取商品的分类;根据商品的分类获取商品的文本信息。Optionally, the image recognition module 62 is specifically configured to perform feature extraction on the image information of the product, acquire the feature of the product, classify the product according to the feature of the product, obtain a classification of the product, and obtain text information of the product according to the classification of the product.
即,图像采集模块61用于实现图1所示的步骤S11、图2所示的步骤S21、图3所示的步骤S31、图4所示的步骤S41以及图
5所示的步骤S51;图像识别模块62用于实现图1所示的步骤S12、图2所示的步骤S22、S23、图3所示的步骤S32、S33、图4所示的步骤S42、S43、S44以及图5所示的步骤S52、S53、S54、S55、S56;语音输出模块63用于实现图1所示的步骤S13、图2所示的步骤S24、图3所示的步骤S34、图4所示的步骤S45以及图5所示的步骤S57。That is, the image acquisition module 61 is configured to implement step S11 shown in FIG. 1, step S21 shown in FIG. 2, step S31 shown in FIG. 3, step S41 shown in FIG.
Step S51 shown in FIG. 5; image recognition module 62 is used to implement step S12 shown in FIG. 1, steps S22 and S23 shown in FIG. 2, steps S32 and S33 shown in FIG. 3, and step S42 shown in FIG. S43, S44 and steps S52, S53, S54, S55, S56 shown in FIG. 5; the voice output module 63 is used to implement step S13 shown in FIG. 1, step S24 shown in FIG. 2, and step S34 shown in FIG. Step S45 shown in FIG. 4 and step S57 shown in FIG.
还需说明的是,上述方法实施例涉及的各步骤的所有相关内容均可以援引到对应功能模块的功能描述,在此不再赘述。It should be noted that all the related content of the steps involved in the foregoing method embodiments may be referred to the functional description of the corresponding functional modules, and details are not described herein again.
在硬件实现上,上述的图像采集模块61可以是单目摄像头、双目摄像头等图像传感器中的一种或多种。图像识别模块62可以是处理器;语音输出模块63可以是扬声器、功放机、音箱、耳机等音频输出设备。上述商品识别装置所执行的动作所对应的程序均可以以软件形式存储于商品识别装置的存储器中,以便于处理器调用执行以上各个模块对应的操作。In hardware implementation, the image capturing module 61 may be one or more of an image sensor such as a monocular camera or a binocular camera. The image recognition module 62 may be a processor; the voice output module 63 may be an audio output device such as a speaker, a power amplifier, a speaker, a headphone, or the like. The programs corresponding to the operations performed by the product identification device described above may be stored in the memory of the article identification device in software, so that the processor invokes the operations corresponding to the above respective modules.
在采用集成的单元的情况下,图8示出了上述实施例中所涉及的商品识别装置的一种可能的结构示意图。商品识别装置800包括:处理器81、存储器82、系统总线83、通信接口84、图像采集装置85、语音输出装置86。In the case of employing an integrated unit, FIG. 8 shows a possible structural diagram of the article identification device involved in the above embodiment. The product identification device 800 includes a processor 81, a memory 82, a system bus 83, a communication interface 84, an image acquisition device 85, and a voice output device 86.
上述处理器81可以是一个处理器,也可以是多个处理元件的统称。例如,处理器81可以为中央处理器(central processing unit,CPU)。处理器81也可以为其他通用处理器、数字信号处理器(digital signal processing,DSP)、专用集成电路(application specific integrated circuit,ASIC)、现场可编程门阵列(field-programmable gate array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等,其可以实现或执行结合本发明公开内容所描述的各种示例性的逻辑方框,模块和电路。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。处理器81还可以为专用处理器,该专用处理器可以包括基带处理芯片、射频处理芯片等中的至少一个。处理器也可以是实现计算功能的组合,例如包含一个或多个微处理器组合,DSP和
微处理器的组合等等。进一步地,该专用处理器还可以包括具有该装置其他专用处理功能的芯片。The processor 81 may be a processor or a collective name of a plurality of processing elements. For example, the processor 81 can be a central processing unit (CPU). The processor 81 can also be another general purpose processor, a digital signal processing (DSP), an application specific integrated circuit (ASIC), a field-programmable gate array (FPGA), or Other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components, and the like, can implement or perform the various illustrative logical blocks, modules, and circuits described in connection with the present disclosure. The general purpose processor may be a microprocessor or the processor or any conventional processor or the like. The processor 81 may also be a dedicated processor, which may include at least one of a baseband processing chip, a radio frequency processing chip, and the like. The processor can also be a combination of computing functions, such as one or more microprocessor combinations, DSP and
A combination of microprocessors and so on. Further, the dedicated processor may also include a chip having other specialized processing functions of the device.
存储器82用于存储计算机执行代码,处理器81与存储器82通过系统总线83连接,当移动终端运行时,处理器81用于执行存储器82存储的计算机执行代码,以执行本发明实施例提供的任意一种商品识别方法,如,处理器81用于支持移动终端执行图1所示的步骤S12、图2所示的步骤S22、S23、图3所示的步骤S32、S33、图4所示的步骤S42、S43、S44以及图5所示的步骤S52、S53、S54、S55、S56,和/或用于本文所描述的技术的其它过程,具体的商品识别方法可参考上文及附图中的相关描述,此处不再赘述。The memory 82 is used to store computer execution code, and the processor 81 is connected to the memory 82 through the system bus 83. When the mobile terminal is running, the processor 81 is configured to execute the computer execution code stored in the memory 82 to execute any of the embodiments provided by the embodiments of the present invention. A product identification method, for example, the processor 81 is configured to support the mobile terminal to perform step S12 shown in FIG. 1 , steps S22 and S23 shown in FIG. 2 , steps S32 and S33 shown in FIG. 3 , and FIG. Steps S42, S43, S44 and steps S52, S53, S54, S55, S56 shown in Fig. 5, and/or other processes for the techniques described herein, the specific article identification method can be referred to above and in the drawings The related description is not repeated here.
系统总线83可以包括数据总线、电源总线、控制总线和信号状态总线等。本实施例中为了清楚说明,在图8中将各种总线都示意为系统总线83。The system bus 83 can include a data bus, a power bus, a control bus, and a signal status bus. For the sake of clarity in the present embodiment, various buses are illustrated as the system bus 83 in FIG.
通信接口84具体可以是该装置上的收发器。该收发器可以为无线收发器。例如,无线收发器可以是该装置的天线等。处理器81通过通信接口84与其他设备,例如,若该装置为该终端设备中的一个模块或组件时,该装置用于与该终端设备中的其他模块之间进行数据交互。 Communication interface 84 may specifically be a transceiver on the device. The transceiver can be a wireless transceiver. For example, the wireless transceiver can be an antenna or the like of the device. The processor 81 communicates with other devices via the communication interface 84, for example, if the device is a module or component of the terminal device, the device is for data interaction with other modules in the terminal device.
结合本发明公开内容所描述的方法的步骤可以硬件的方式来实现,也可以是由处理器执行软件指令的方式来实现。本发明实施例还提供一种存储介质,用于储存为图8所示的移动终端所用的计算机软件指令,其包含执行图1、2、3、4、5所示的商品识别方法所设计的程序代码。其中,软件指令可以由相应的软件模块组成,软件模块可以被存放于随机存取存储器(英文:random access memory,缩写:RAM)、闪存、只读存储器(英文:read only memory,缩写:ROM)、可擦除可编程只读存储器(英文:erasable programmable ROM,缩写:EPROM)、电可擦可编程只读存储器(英文:electrically EPROM,缩写:EEPROM)、寄存器、硬盘、移动硬盘、只读光盘(CD-ROM)或者本领域熟知的任何其它形式的存储介质中。一种示例性的存储介质耦合至处理器,从而使处理器能够从该存储介质读取信息,且可
向该存储介质写入信息。当然,存储介质也可以是处理器的组成部分。处理器和存储介质可以位于ASIC中。另外,该ASIC可以位于核心网接口设备中。当然,处理器和存储介质也可以作为分立组件存在于核心网接口设备中。The steps of the method described in connection with the present disclosure may be implemented in a hardware manner, or may be implemented by a processor executing software instructions. The embodiment of the present invention further provides a storage medium for storing computer software instructions used by the mobile terminal shown in FIG. 8 , which is designed to execute the product identification method shown in FIG. 1 , 2 , 3 , 4 , and 5 . code. The software instructions may be composed of corresponding software modules, and the software modules may be stored in a random access memory (English: random access memory, abbreviation: RAM), flash memory, read only memory (English: read only memory, abbreviation: ROM) , erasable programmable read-only memory (English: erasable programmable ROM, abbreviation: EPROM), electrically erasable programmable read-only memory (English: electrical EPROM, abbreviation: EEPROM), registers, hard disk, mobile hard disk, CD-ROM (CD-ROM) or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor to enable the processor to read information from the storage medium and
Write information to the storage medium. Of course, the storage medium can also be an integral part of the processor. The processor and the storage medium can be located in an ASIC. Additionally, the ASIC can be located in a core network interface device. Of course, the processor and the storage medium may also exist as discrete components in the core network interface device.
本发明实施例还提供一种计算机程序产品,该计算机程序可直接加载到计算机的内部存储器中,并含有软件代码,计算机程序经由计算机载入并执行后能够实现图1、2、3、4、5所示的商品识别方法。The embodiment of the invention further provides a computer program product, which can be directly loaded into the internal memory of the computer and contains software code, and the computer program can be loaded and executed by the computer to implement the figures 1, 2, 3, and 4. The product identification method shown in 5.
本领域技术人员应该可以意识到,在上述一个或多个示例中,本发明所描述的功能可以用硬件、软件、固件或它们的任意组合来实现。当使用软件实现时,可以将这些功能存储在计算机可读介质中或者作为计算机可读介质上的一个或多个指令或代码进行传输。计算机可读介质包括计算机存储介质和通信介质,其中通信介质包括便于从一个地方向另一个地方传送计算机程序的任何介质。存储介质可以是通用或专用计算机能够存取的任何可用介质。Those skilled in the art will appreciate that in one or more examples described above, the functions described herein can be implemented in hardware, software, firmware, or any combination thereof. When implemented in software, the functions may be stored in a computer readable medium or transmitted as one or more instructions or code on a computer readable medium. Computer readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one location to another. A storage medium may be any available media that can be accessed by a general purpose or special purpose computer.
以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到的变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应以权利要求的保护范围为准。
The above is only a specific embodiment of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily think of changes or substitutions within the technical scope of the present invention. All should be covered by the scope of the present invention. Therefore, the scope of protection of the present invention should be determined by the scope of the claims.
Claims (13)
- 一种商品识别方法,用于辅助盲人获取商品信息,其特征在于,所述方法包括:A commodity identification method for assisting a blind person to obtain commodity information, wherein the method comprises:从至少一个视角对商品进行图像采集,获取所述商品的图像信息;Performing image collection on the commodity from at least one perspective to obtain image information of the commodity;根据所述商品的图像信息获取所述商品的文本信息;Obtaining text information of the commodity according to image information of the commodity;将所述商品的文本信息转换为语音信息,并对所述语音信息进行输出。The text information of the commodity is converted into voice information, and the voice information is output.
- 根据权利要求1所述的方法,其特征在于,所述根据所述商品的图像信息获取所述商品的文本信息,包括:The method according to claim 1, wherein the obtaining the text information of the product according to the image information of the product comprises:将所述商品的图像信息发送至远端服务设备,以便所述远端服务设备根据所述商品的图像信息获取所述商品的文本信息;Sending image information of the commodity to a remote service device, so that the remote service device acquires text information of the commodity according to image information of the commodity;接收所述远端服务设备发送的所述商品的文本信息。Receiving text information of the commodity sent by the remote service device.
- 根据权利要求1所述的方法,其特征在于,所述根据所述商品的图像信息获取所述商品的文本信息包括:The method according to claim 1, wherein the obtaining the text information of the commodity according to the image information of the commodity comprises:对所述商品的图像信息进行条码识别,获取所述商品的条码;Performing barcode identification on the image information of the commodity to obtain a barcode of the commodity;根据所述商品的条码获取所述商品的文本信息。Obtaining text information of the commodity according to the barcode of the commodity.
- 根据权利要求1所述的方法,其特征在于,所述根据所述商品的图像信息获取所述商品的文本信息包括:The method according to claim 1, wherein the obtaining the text information of the commodity according to the image information of the commodity comprises:对所述商品的图像信息进行光学字符识别,获取所述商品的图像信息中的字符;Performing optical character recognition on the image information of the commodity, and acquiring characters in the image information of the commodity;根据所述商品图像信息中的字符获取商品的文本信息。The text information of the item is acquired based on the characters in the product image information.
- 根据权利要求1所述的方法,其特征在于,所述根据所述商品的图像信息获取所述商品的文本信息包括: The method according to claim 1, wherein the obtaining the text information of the commodity according to the image information of the commodity comprises:对所述商品的图像信息进行特征提取,获取商品的特征;Feature extraction of image information of the commodity to acquire features of the commodity;根据所述商品的特征对所述商品进行分类,获取商品的分类;Sorting the goods according to the characteristics of the goods, and obtaining the classification of the goods;根据所述商品的分类获取所述商品的文本信息。The text information of the item is obtained according to the classification of the item.
- 一种商品识别装置,用于辅助盲人获取商品信息,其特征在于,所述装置包括:A commodity identification device for assisting a blind person to obtain commodity information, characterized in that the device comprises:图像采集模块,用于从至少一个视角对商品进行图像采集,获取所述商品的图像信息;An image acquisition module, configured to perform image collection on the commodity from at least one viewing angle, and acquire image information of the commodity;图像识别模块,用于根据所述商品的图像信息获取所述商品的文本信息;An image recognition module, configured to acquire text information of the commodity according to image information of the commodity;语音输出模块,用于将所述商品的文本信息转换为语音信息,并对所述语音信息进行输出。a voice output module, configured to convert text information of the commodity into voice information, and output the voice information.
- 根据权利要求6所述的装置,其特征在于,所述图像识别模块包括:发送单元和接收单元;The apparatus according to claim 6, wherein the image recognition module comprises: a transmitting unit and a receiving unit;所述发送单元用于将所述商品图像信息发送至远端服务设备,以便所述远端服务设备对所述商品的图像信息进行识别,并根据识别结果获取商品的文本信息;The sending unit is configured to send the commodity image information to a remote service device, so that the remote service device identifies image information of the commodity, and acquires text information of the commodity according to the recognition result;所述接收单元用于接收所述远端服务设备发送的商品的文本信息。The receiving unit is configured to receive text information of an item sent by the remote service device.
- 根据权利要求6所述的装置,其特征在于,The device of claim 6 wherein:所述图像识别模块具体用于对所述商品图像信息进行条码识别,获取所述商品的条码,根据所述商品的条码获取所述商品的文本信息。The image recognition module is specifically configured to perform barcode identification on the product image information, acquire a barcode of the commodity, and acquire text information of the commodity according to the barcode of the commodity.
- 根据权利要求6所述的装置,其特征在于,The device of claim 6 wherein:所述图像识别模块具体用于对所述商品的图像信息进行光学字符识别,获取所述商品的图像信息中的字符,根据所述商品的图像 信息中的字符获取商品的文本信息。The image recognition module is specifically configured to perform optical character recognition on the image information of the product, acquire characters in the image information of the product, and according to the image of the product The characters in the message get the text information of the item.
- 根据权利要求6所述的装置,其特征在于,The device of claim 6 wherein:所述图像识别模块具体用于对所述商品的图像信息进行特征提取,获取商品的特征;根据所述商品的特征对所述商品进行分类,获取商品的分类;根据所述商品的分类获取所述商品的文本信息。The image recognition module is specifically configured to perform feature extraction on the image information of the product, acquire characteristics of the product, classify the product according to the feature of the product, obtain a classification of the product, and acquire the classification according to the classification of the product. The textual information of the product.
- 一种商品识别装置,其特征在于,所述装置包括:处理器和存储器,所述存储器用于存储计算机执行代码,所述计算机执行代码用于控制所述处理器执行权利要求1-5任一项所述的商品识别方法。A commodity identification device, comprising: a processor and a memory, the memory for storing computer execution code, the computer execution code for controlling the processor to perform any of claims 1-5 The item identification method described in the item.
- 一种计算机存储介质,其特征在于,用于储存为权利要求11所述的商品识别装置所用的计算机软件指令,其包含执行权利要求1-5任一项所述的商品识别方法所设计的程序代码。A computer storage medium for storing the computer software instructions for use in the article identification device according to claim 11, comprising a program designed to execute the article identification method according to any one of claims 1 to 5. Code.
- 一种计算机程序产品,其特征在于,可直接加载到计算机的内部存储器中,并含有软件代码,所述计算机程序经由计算机载入并执行后能够实现权利要求1-5任一项所述的商品识别方法。 A computer program product, which can be directly loaded into an internal memory of a computer and containing software code, which can be loaded and executed by a computer to implement the product of any one of claims 1-5 recognition methods.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2016/111831 WO2018112930A1 (en) | 2016-12-23 | 2016-12-23 | Method and device for identifying commodities |
CN201680006925.4A CN107454964A (en) | 2016-12-23 | 2016-12-23 | A kind of commodity recognition method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2016/111831 WO2018112930A1 (en) | 2016-12-23 | 2016-12-23 | Method and device for identifying commodities |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2018112930A1 true WO2018112930A1 (en) | 2018-06-28 |
Family
ID=60484713
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2016/111831 WO2018112930A1 (en) | 2016-12-23 | 2016-12-23 | Method and device for identifying commodities |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN107454964A (en) |
WO (1) | WO2018112930A1 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110008782A (en) * | 2019-06-06 | 2019-07-12 | 江苏东大集成电路系统工程技术有限公司 | The acquisition methods and device of bar code information |
CN117077083A (en) * | 2023-10-10 | 2023-11-17 | 上海英内物联网科技股份有限公司 | Automatic identification and statistics method for packaged articles |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110009071B (en) * | 2018-01-04 | 2022-07-01 | 青岛海尔洗衣机有限公司 | Control method of intelligent wardrobe and intelligent wardrobe |
CN109033985B (en) | 2018-06-29 | 2020-10-09 | 百度在线网络技术(北京)有限公司 | Commodity identification processing method, device, equipment, system and storage medium |
CN109353398B (en) * | 2018-09-20 | 2020-11-10 | 北京旷视科技有限公司 | Commodity identification method, device and system, storage medium and shopping cart |
CN109359706A (en) * | 2018-09-25 | 2019-02-19 | 上海合阔信息技术有限公司 | Merchandise news intelligent identifying system and method |
CN111242712B (en) * | 2018-11-29 | 2023-04-28 | 阿里巴巴集团控股有限公司 | Commodity display method and device |
CN110222550B (en) * | 2019-07-11 | 2024-10-29 | 上海肇观电子科技有限公司 | Information broadcasting method, circuit, broadcasting equipment, storage medium and intelligent glasses |
CN111080399A (en) * | 2019-11-22 | 2020-04-28 | 汉口北进出口服务有限公司 | Commodity information processing method and device |
CN110942035A (en) * | 2019-11-28 | 2020-03-31 | 浙江由由科技有限公司 | Method, system, device and storage medium for acquiring commodity information |
CN111738031B (en) * | 2020-08-06 | 2021-03-02 | 江苏东大集成电路系统工程技术有限公司 | One-dimensional bar code identification method |
CN112698848B (en) * | 2020-12-31 | 2024-07-26 | Oppo广东移动通信有限公司 | Downloading method, device, terminal and storage medium of machine learning model |
CN113393289A (en) * | 2021-05-27 | 2021-09-14 | 阿里巴巴新加坡控股有限公司 | Method and device for processing commodity object information and modifying title |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101474020A (en) * | 2009-01-08 | 2009-07-08 | 上海交通大学 | Shopping guide method and apparatus for blind in supermarket based on bar code identification |
CN101524220A (en) * | 2009-03-25 | 2009-09-09 | 胡志超 | Method for blind persons self-help shopping and device thereof |
CN102495995A (en) * | 2011-12-09 | 2012-06-13 | 上海力远计算机科技有限公司 | Optical character recognition system |
CN102798388A (en) * | 2012-06-20 | 2012-11-28 | 上海交通大学 | Blind person supermarket shopping auxiliary device based on DSP (Digital Signal Processor) |
CN103632588A (en) * | 2013-11-28 | 2014-03-12 | 苏州罗马冯环保科技有限公司 | Commodity reader special for blind person |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8620083B2 (en) * | 2004-12-03 | 2013-12-31 | Google Inc. | Method and system for character recognition |
CN102063616A (en) * | 2010-12-30 | 2011-05-18 | 上海电机学院 | Automatic identification system and method for commodities based on image feature matching |
CN102354366A (en) * | 2011-09-23 | 2012-02-15 | 上海合合信息科技发展有限公司 | Network based image recognition method and system |
CN104463659A (en) * | 2014-12-23 | 2015-03-25 | 江苏天使电子科技有限公司 | Blind person shopping system |
-
2016
- 2016-12-23 CN CN201680006925.4A patent/CN107454964A/en active Pending
- 2016-12-23 WO PCT/CN2016/111831 patent/WO2018112930A1/en active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101474020A (en) * | 2009-01-08 | 2009-07-08 | 上海交通大学 | Shopping guide method and apparatus for blind in supermarket based on bar code identification |
CN101524220A (en) * | 2009-03-25 | 2009-09-09 | 胡志超 | Method for blind persons self-help shopping and device thereof |
CN102495995A (en) * | 2011-12-09 | 2012-06-13 | 上海力远计算机科技有限公司 | Optical character recognition system |
CN102798388A (en) * | 2012-06-20 | 2012-11-28 | 上海交通大学 | Blind person supermarket shopping auxiliary device based on DSP (Digital Signal Processor) |
CN103632588A (en) * | 2013-11-28 | 2014-03-12 | 苏州罗马冯环保科技有限公司 | Commodity reader special for blind person |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110008782A (en) * | 2019-06-06 | 2019-07-12 | 江苏东大集成电路系统工程技术有限公司 | The acquisition methods and device of bar code information |
CN117077083A (en) * | 2023-10-10 | 2023-11-17 | 上海英内物联网科技股份有限公司 | Automatic identification and statistics method for packaged articles |
CN117077083B (en) * | 2023-10-10 | 2024-01-05 | 上海英内物联网科技股份有限公司 | Automatic identification and statistics method for packaged articles |
Also Published As
Publication number | Publication date |
---|---|
CN107454964A (en) | 2017-12-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2018112930A1 (en) | Method and device for identifying commodities | |
US10956964B2 (en) | Methods and arrangements including data migration among computing platforms, e.g. through use of audio encoding | |
US20210217129A1 (en) | Detection of encoded signals and icons | |
US10956775B2 (en) | Identification of items depicted in images | |
CN109416731B (en) | Document optical character recognition | |
WO2020077877A1 (en) | Platform commodity stationing method and apparatus, and computer device and storage medium | |
WO2021018241A1 (en) | Information processing | |
US9311531B2 (en) | Systems and methods for classifying objects in digital images captured using mobile devices | |
US11257198B1 (en) | Detection of encoded signals and icons | |
US10380237B2 (en) | Smart optical input/output (I/O) extension for context-dependent workflows | |
US20150026074A1 (en) | Consumer-centric product warranty management system | |
US20160328762A1 (en) | Application independent dex/ucs interface | |
US10803272B1 (en) | Detection of encoded signals and icons | |
CN116071308A (en) | Evaluating image values | |
US11430242B2 (en) | Systems and methods for obtaining product information in real-time | |
WO2019096222A1 (en) | Unmanned vending method and device based on identity identification and product identification | |
CN111784372A (en) | Store commodity recommendation method and device | |
CN107729791A (en) | Information processing method and electronic equipment | |
CN111753608A (en) | Information processing method and device, electronic device and storage medium | |
US10963687B1 (en) | Automatic correlation of items and adaptation of item attributes using object recognition | |
US20210357883A1 (en) | Payment method capable of automatically recognizing payment amount | |
US20140330619A1 (en) | Methods and systems for automatic, pre-sale price matching | |
Guimarães et al. | A review of recent advances and challenges in grocery label detection and recognition | |
WO2019215966A1 (en) | Registration system, registration method, and program | |
WO2019096201A1 (en) | Self-service method and device based on active tags and passive tags |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 16924223 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 15/10/2019) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 16924223 Country of ref document: EP Kind code of ref document: A1 |