Nothing Special   »   [go: up one dir, main page]

CN108921198A - commodity image classification method, server and system based on deep learning - Google Patents

commodity image classification method, server and system based on deep learning Download PDF

Info

Publication number
CN108921198A
CN108921198A CN201810588841.7A CN201810588841A CN108921198A CN 108921198 A CN108921198 A CN 108921198A CN 201810588841 A CN201810588841 A CN 201810588841A CN 108921198 A CN108921198 A CN 108921198A
Authority
CN
China
Prior art keywords
commodity image
training
classification
commodity
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810588841.7A
Other languages
Chinese (zh)
Inventor
宋连嵩
王晶晶
孙昂
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Normal University
Original Assignee
Shandong Normal University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Normal University filed Critical Shandong Normal University
Priority to CN201810588841.7A priority Critical patent/CN108921198A/en
Publication of CN108921198A publication Critical patent/CN108921198A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • G06V10/267Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion by performing operations on regions, e.g. growing, shrinking or watersheds

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses commodity image classification method, server and systems based on deep learning.Wherein this method, including obtain commodity image and be labeled classification, form commodity image classification based training data set and commodity image class test data set;Background segment is carried out to the commodity image that training data is concentrated using SVM algorithm;Training dataset after background segment is input in the convolutional neural networks of the image classification of pre-training completion, and carries out parameter adjustment using convolutional neural networks of the method for transfer learning to the image classification that pre-training is completed, constructs commodity image disaggregated model;Commodity image class test data set is input in commodity image disaggregated model and carries out prediction classification, until the prediction classification of commodity image disaggregated model output reaches default precision, commodity image disaggregated model is completed in final training;Commodity image to be sorted is input in the commodity image disaggregated model of training completion, output prediction classification results.

Description

Commodity image classification method, server and system based on deep learning
Technical field
The invention belongs to image identification technical field more particularly to a kind of commodity image classification sides based on deep learning Method, server and system.
Background technique
After the internet of things era arrives, e-commerce rapidly develops, and more and more people get used to using shopping at network.Net Network shopping brings great convenience, and time whole world can be bought by staying indoors, nowadays the shopping at network platform of mainstream, such as Taobao, Jingdone district, Amazon etc., most of is the product search system based on keyword.There is very big drawback in this mode:One side Face, the product search system based on keyword require businessman to classify first to commodity image, then add corresponding text Mark, still, these marks are difficult to reflect product features comprehensively, simultaneously because commodity image quantity sharply increases, it is artificial to mark Note takes time and effort, and greatly reduces the working efficiency of businessman.On the other hand, user inputs crucial according to the demand for commodity of oneself Subjectivity is had when word scans for, there is different user demands and but has input identical keyword in this meeting maximum probability, or It is that identical user demand has input different keywords, both of which will lead to the commodity image of retrieved web appearance As a result inconsistent with user's expectation, it also will be greatly reduced the efficiency that user buys expectation commodity.How shopping website is improved Commodity recall precision the problem of having become people's extensive concern, being scanned on website using image data can be with Retrieval is reduced for the dependence of text, is a solution for being worth research.
With the development of electronics technology, these defects can greatly be made up by electronic information technology, be based on commodity image The image procossing mode of content is come into being, and in this way, can intuitively show the information of commodity, without artificial It is labeled, simple and efficient, application is gradually extensive.But meanwhile traditional image classification method based on feature extracts spy Sign is complicated, and single features can not obtain preferable classification accuracy.
In conclusion how to solve the problems, such as the Accurate classification of shiploads of merchandise image, still lack effective solution scheme.
Summary of the invention
In order to solve the deficiencies in the prior art, the first object of the present invention is to provide a kind of commodity figure based on deep learning As classification method, can rapidly and accurately classify to shiploads of merchandise image.
A kind of commodity image classification method based on deep learning of the invention, including:
It obtains commodity image and is labeled classification, form commodity image classification based training data set and commodity image classification is surveyed Try data set;
Background segment is carried out to the commodity image in commodity image classification based training data set using SVM algorithm;
Commodity image classification based training data set after background segment is input to the convolution of the image classification of pre-training completion In neural network, and parameter is carried out using convolutional neural networks of the method for transfer learning to the image classification that pre-training is completed Adjustment, constructs commodity image disaggregated model;
Commodity image class test data set is input in commodity image disaggregated model and carries out prediction classification, until commodity The prediction classification of image classification model output reaches default precision, and commodity image disaggregated model is completed in final training;
Commodity image to be sorted is input in the commodity image disaggregated model of training completion, output prediction classification knot Fruit.
Further, background segment is carried out to the commodity image in commodity image classification based training data set using SVM algorithm Process, including:
Commodity image to be split is done into prospect label and background label;
Several pixels and extract individual features as training sample from prospect label and background label respectively;
The parameter and kernel function for determining SVM classifier, using the individual features training SVM classifier of extraction;
Classified using the SVM classifier that training is completed to the pixel of commodity image to be split, realizes commodity image Background segment.
Further, the Selection of kernel function Polynomial kernel function of SVM classifier.
Further, the convolutional neural networks for the image classification completed in the method using transfer learning to pre-training carry out During parameter adjusts, carried out according to convolutional neural networks of the convolution kernel capacity principle to the image classification that pre-training is completed It improves.
The second object of the present invention is a kind of commodity image classified service device based on deep learning.
A kind of commodity image classified service device based on deep learning of the invention, including:
Data set acquisition module is configured as:It obtains commodity image and is labeled classification, form commodity image classification Training dataset and commodity image class test data set;
Background segment module, is configured as:Using SVM algorithm to the commodity figure in commodity image classification based training data set As carrying out background segment;
Commodity image disaggregated model constructs module, is configured as:By the commodity image classification based training number after background segment It is input to according to collection in the convolutional neural networks of the image classification of pre-training completion, and using the method for transfer learning to pre-training The convolutional neural networks of the image classification of completion carry out parameter adjustment, construct commodity image disaggregated model;
Commodity image disaggregated model test module, is configured as:Commodity image class test data set is input to quotient Prediction classification is carried out in product image classification model, until the prediction classification of commodity image disaggregated model output reaches default precision, Commodity image disaggregated model is completed in final training;
It predicts categorization module, is configured as:Commodity image to be sorted is input to the commodity image point of training completion In class model, output prediction classification results.
Further, the background segment module, including:
Label setup module, is configured as:Commodity image to be split is done into prospect label and background label;
Characteristic extracting module is configured as:Several pixels are used as instruction from prospect label and background label respectively Practice sample, and extracts individual features;
SVM classifier training module, is configured as:The parameter and kernel function for determining SVM classifier, using the phase of extraction Answer feature training SVM classifier;
SVM classifier categorization module, is configured as:The SVM classifier completed using training is to commodity image to be split Pixel classify, realize the background segment of commodity image.
Further, in the SVM classifier training module, the Selection of kernel function Polynomial kernel function of SVM classifier.
Further, in commodity image disaggregated model building module, according to convolution kernel capacity principle to pre-training The convolutional neural networks of the image classification of completion improve.
The third object of the present invention is to provide a kind of commodity image categorizing system based on deep learning.
A kind of commodity image categorizing system based on deep learning of the invention, including described above based on deep learning Commodity image classified service device.
Compared with prior art, the beneficial effects of the invention are as follows:
(1) present invention employs SVM classifiers has carried out background segment to commodity image, classifies compared to traditional images, After being split to commodity image background, the accuracy rate of classification can be improved.
(2) present invention has also carried out pre-training to network structure, and the parameter after pre-training moves to conduct in new network Commodity image class test data set is input in commodity image disaggregated model and carries out prediction classification by initiation parameter, until The prediction classification of commodity image disaggregated model output reaches default precision, and commodity image disaggregated model is completed in final training, improves The accuracy rate of commodity image classification.
Detailed description of the invention
The accompanying drawings constituting a part of this application is used to provide further understanding of the present application, and the application's shows Meaning property embodiment and its explanation are not constituted an undue limitation on the present application for explaining the application.
Fig. 1 is a kind of commodity image classification method flow chart based on deep learning of the invention.
Fig. 2 is space reflection schematic diagram.
Fig. 3 (a) is the commodity image before segmentation.
Fig. 3 (b) is the image after SVM background segment.
Fig. 4 is the convolutional neural networks structural schematic diagram for the image classification that the pre-training that the present invention uses is completed.
Fig. 5 is convolutional layer schematic diagram.
Fig. 6 (a) is Sigmoid function.
Fig. 6 (b) is ReLU activation primitive.
Fig. 7 is AlexNet structural schematic diagram.
Fig. 8 is a kind of commodity image classified service device structural schematic diagram based on deep learning of the invention.
Specific embodiment
It is noted that following detailed description is all illustrative, it is intended to provide further instruction to the application.Unless another It indicates, all technical and scientific terms used herein has logical with the application person of an ordinary skill in the technical field The identical meanings understood.
It should be noted that term used herein above is merely to describe specific embodiment, and be not intended to restricted root According to the illustrative embodiments of the application.As used herein, unless the context clearly indicates otherwise, otherwise singular shape Formula be also intended to include plural form, additionally, it should be understood that, when in the present specification use term "comprising" and/or When " comprising ", existing characteristics, step, operation, device, component and/or their combination are indicated.
Fig. 1 is a kind of commodity image classification method flow chart based on deep learning of the invention.
As shown in Figure 1, a kind of commodity image classification method based on deep learning of the invention, including:
Step 1:It obtains commodity image and is labeled classification, form commodity image classification based training data set and commodity image Class test data set.
Specifically, the data set of the present embodiment uses more extensive Taobao day cat, Jingdone district, Amazon etc. from current Major shopping website includes 10 class commodity altogether, and part of image has background, and image size is 256*256, picture format For .jpg.
Step 2:Background segment is carried out to the commodity image in commodity image classification based training data set using SVM algorithm.
Using SVM algorithm, experiment porch is based on matlab for the segmentation of the commodity image background of the present embodiment Libsvm under 2016a.
SVM algorithm has excellent generalization ability and recognition capability, in terms of image background segmentation, has good effect Fruit.SVM is that input variable is mapped to a high-dimensional feature space by the Nonlinear Mapping selected in advance by certain, at this Optimal separating hyper plane is constructed in space.The process that data set classification work is carried out using SVM is with previously selected one first The input space is mapped to high-dimensional feature space as shown in Fig. 2, by being mapped to high-dimensional feature space by a little Nonlinear Mappings, The nonlinear data for being not easy to be partitioned into itself in plane, which has been divided, to come.
This Nonlinear Mapping is realized by kernel function, and common kernel function includes Polynomial kernel function, Gauss Core (RBF core) function and linear kernel function.
Although the value of kernel function is that it is the conversion carried out feature from low-dimensional to higher-dimension, kernel function is in advance low Calculated in dimension, and substantial classifying quality shown on higher-dimension, also avoid directly in higher dimensional space into The complicated calculating of row.
Wherein, the mistake of background segment is carried out to the commodity image in commodity image classification based training data set using SVM algorithm Journey, including:
Step 2.1:Commodity image to be split is done into prospect label and background label;
Step 2.2:Several pixels and extract corresponding as training sample from prospect label and background label respectively Feature;
Step 2.3:The parameter and kernel function for determining SVM classifier, using the individual features training SVM classifier of extraction;
Step 2.4:Classified using the SVM classifier that training is completed to the pixel of commodity image to be split, is realized The background segment of commodity image.
Wherein, the Selection of kernel function Polynomial kernel function of SVM classifier.
When input picture is the commodity image that band has powerful connections in data set, in SVM algorithm setting, the back of image to be split Scape is labeled as 0, and prospect is labeled as 1, and foreground and background respectively has chosen 20 points as training sample, makes when pixel is chosen It is taken a little with the help of ginput function.Selection of kernel function Polynomial kernel function, degree are set as -1.Experimental result such as Fig. 3 (a) and shown in Fig. 3 (b), Fig. 3 (a) is the commodity image before segmentation, and Fig. 3 (b) is the image after SVM background segment.From experiment As a result as can be seen that SVM algorithm is well divided the background of image in, and profile information saves well.
Step 3:Commodity image classification based training data set after background segment is input to the image classification of pre-training completion Convolutional neural networks in, and the convolutional neural networks for image classification pre-training completed using the method for transfer learning into The adjustment of row parameter, constructs commodity image disaggregated model.
Present invention employs the transfer learnings based on model, while occurring negative transfer in order to prevent, used in this method The data set and commodity image of pre-training have enough relevances, therefore can carry out relevant transfer learning.
Specifically, the convolutional neural networks for the image classification that the pre-training that the present invention uses is completed, as shown in figure 4, by rolling up Lamination, pond layer, full articulamentum composition.
Convolutional layer:In the application of image procossing, the effect of convolutional layer is to extract characteristics of image.Convolutional layer contains many volumes Product core, each convolution kernel can obtain a kind of feature to image convolution.
As shown in figure 5, black box is convolution kernel, it is assumed herein that step-length is set as 1, convolution operation is expressed as moving right every time A pixel is moved, then sequentially moves down a pixel again.Number is marked in cell to indicate the power of each convolution kernel Weight, such as red have 4 weights.Picture pixels and the corresponding power of convolution kernel in convolution process, that is, convolution kernel moving process Multiplied by weight is carried out again, is finally added and is obtained an output.Fortune in this way, passes through the continuous convolution of convolutional layer, image institute The feature for including is extracted.
Pond layer:Pond layer i.e. down-sampling layer, main function is that a dimensionality reduction operation is carried out to characteristic pattern, to have Effect reduces the parameter that subsequent neural net layer needs, while greatly reducing calculation amount.Chi Huayou is averaged pondization and maximum pond Deng key property has:Invariance, including invariable rotary, Scale invariant etc., focus can be placed on characteristics of image by this In itself and on position present in non-image features;Export fixed length, i.e., by pond layer after, the size of characteristic pattern can subtract The small half for former characteristic pattern;Network parameter can be reduced while retaining main feature, reduced calculation amount and prevented simultaneously Over-fitting greatly improves the generalization ability of network.
Full articulamentum:The effect of full articulamentum be the two dimensional character of convolutional layer final output is converted to finally it is one-dimensional Vector.The core operation connected entirely is exactly linear change of the matrix-vector product by a feature space to another feature space It changes, each tie point is connected with all tie points of preceding layer, and effect is the feature that comprehensive prior process is extracted.
The input of neuron in full articulamentum is indicated using x, a indicates the output of neuron, and b is biasing, wherein under each Mark respectively indicates i-th of relevant parameter (i=1,2,3).
wijIndicate between i-th of neuron and j-th of neuron connection weight (i=1,2,3;J=1,2,3).
The matrix representation forms of articulamentum are entirely:
Activation primitive:In neural network, signal received by each neuron is the line of previous neuron Property function, therefore, in order to avoid linear coincidence, it is that each neuron introduces non-linear factor that activation primitive, which can be used,.This It is that complete expressiveness is not had due to linear model, so introducing activation primitive.Convolutional neural networks commonly activate Function has ReLU function and Sigmoid function.Wherein the expression formula of Sigmoid function is:
Wherein, x indicates the input of neuron in full articulamentum.
As shown in Fig. 6 (a), Sigmoid function has a big signal gain in central area, and two sides have smaller signal Gain, it has good effect to the feature space figure of signal, but there is also certain limitations simultaneously:One, when far from origin Functional gradient levels off to 0, will lead to training process and gradient disappearance occurs;Two, function output reduces not centered on origin Weight updates efficiency;Three, a large amount of exponent arithmetic is introduced, calculation amount is increased, reduces arithmetic speed.
The expression formula of ReLU function is:
F (x)=max (0, x)
Wherein, x indicates the input of neuron in full articulamentum.
ReLU has been proved to it effectively can inhibit gradient to disappear compared to traditional activation primitive, while also have more Fast convergence rate, therefore the application of ReLU activation primitive is comparatively more extensive.Shown in its image such as Fig. 6 (b).
Dropout:Dropout is proposed for the over-fitting occurred for network training process kind, and Dropout can It is not understood as in the training process of neural network, random abandons some units to simplify the process of network.By The link information of neural network random drop a part after Dropout operation.When the numerical value of Dropout is set as 0.5 Have the effect of it is best, the reason is that most network structures can be generated in Dropout when numerical value is set as 0.5.Dropout is random Hidden node is ignored in selection, is different from due to ignoring at random every time, this results in the network trained every time all different, Therefore each training process can be regarded as to the model for establishing one " new " completely.
Caffe frame:With the rise of deep learning upsurge, deep learning frame is continued to bring out, and is deep learning network Realization provide a great convenience, for designer built basis function realize, save the plenty of time.Wherein Caffe Deep learning frame be one efficiently, clearly deep learning frame, it has the features such as modularization, Test coverage, and simultaneously Two kinds of interfaces of Python and Matlab are provided, and can realize the switching between CPU and GPU by function call, in order to just True committed memory.Caffe follow neural network it is assumed that i.e. all calculating all indicates in the form of layer, each layer Function is exactly to obtain some data, the result after then output calculates.Wherein each layer does two calculating:Propagated forward That output is calculated from input, backpropagation be from above to gradient calculate the gradient relative to input, the two calculate real It after now, allows for plurality of layers connection and forms a network, in this way, whole network is exactly input data (image), then come Calculate the output (label) needed.When training, loss function and gradient can be calculated according to existing label, so Afterwards with gradient come backpropagation, the parameter of corrective networks, here it is the basic procedures of Caffe.
SIFT and HOG feature:SIFT feature, that is, Scale invariant features transform is to detect key point in the picture, is one Kind local feature description, has scale invariability;The local grain that histograms of oriented gradients (HOG) is used to describe image is special Sign.The advantage of HOG feature extraction is that it covers texture information abundant, while introducing this physical quantity of amplitude, therefore It can make rotation and translation that there is invariance.But it is more sensitive in the feature extraction to noise spot, and be easy by outer The influence of portion's environment.
In order to facilitate the migration of parameter, network structure of the invention is modified from AlexNet, as shown in Figure 7.
Wherein, AlexNet is Hinton group neural network model used in ISVRC2012.Network structure is not It is that the label layer of the last layer is modified by 1000 classes for 10 classes with place.The input of network is the colored RGB of 256*256 The unified size of image can be cut to 227*227 and facilitate convolution operation by image, data Layer.Convolution kernel size point in network Not Wei 11*11,5*5 and 3*3, pond mode is maximum pond, and Dropout is set as 0.5.It can be generated after convolution operation big The size of small different characteristic pattern, each layer characteristic pattern is 55*55,27*27,27*27,13*13,13*13,13*13 and 6* respectively 6, the number of characteristic pattern is respectively 96,96,256,256,384,384,256 and 256.The parameter of full articulamentum is set as 4096, The classification prediction of the last layer uses softmax classifier.
Specifically, the convolutional neural networks for the image classification that pre-training is completed are joined using the method for transfer learning During number adjustment, the convolutional neural networks for the image classification that pre-training is completed are changed according to convolution kernel capacity principle Into.
The effect of convolution kernel is to extract characteristics of image to find out picture structure, and convolution kernel capacity is one and is used to measure convolution Core finds out the scale of the ability of picture structure.If convolution kernel capacity is small, it is meant that can only have local feature mapping in image To next layer, but if convolution kernel capacity is big, it is meant that can have more Feature Mappings to next layer network, work as convolution When the value of core capacity is greater than 1/6, convolution kernel has good detectivity.
The calculation formula of convolution kernel capacity is:
If the size of convolution kernel is n × n, operated by the maximum pondization of a 3*3, then this convolution kernel is true Convolution kernel is sized for 3n × 3n, if by the maximum pond of m 3*3, then the actual size of convolution kernel is just 3mn× 3mn。
Receptive field is an important concept in convolutional neural networks, it indicates a specific convolution in the input space The range areas of neural network feature.The size of a general receptive field can use feature sizes and center in region It is described.In the operation of the convolution of neural network:The size of current receptive field is r, and the distance between adjacent feature is j, Convolution kernel is filled with p, step-length s having a size of p.
So export the size n of featureoutWith input feature vector size ninRelationship be represented by:
Export the distance j between characteristic patternoutDistance j between input feature vector figureinRelationship be:
jout=jin*s
The receptive field r of outputoutWith the impression r of inputinWild relationship is:
rout=rin+(k-1)*jin
According to the receptive field size and convolution kernel capacity of each convolutional layer of network structure that above-mentioned calculated relationship acquires the design, The convolution kernel capacity of improved network is generally higher than AlexNet, that is to say, that the convolution kernel of network has stronger spy after improvement Extractability is levied, this is also the major reason that network obtains preferable classifying quality after improving.
Step 4:Commodity image class test data set is input in commodity image disaggregated model and carries out prediction classification, directly Reach default precision to the prediction classification that commodity image disaggregated model exports, commodity image disaggregated model is completed in final training.
Step 5:Commodity image to be sorted is input in the commodity image disaggregated model of training completion, output prediction point Class result.
Experimental situation:
The related hardware for testing computer used is configured to:Intel(R)Xeon(R)CPU E5-2620@2.10GHz× 15, physical memory is:16.00GB.GPU be two pieces of NAVIDIA TITAN X, 1.53GHz dominant frequency, 12GB GDDR5X video memory, 12000000000 number of transistors, 3584 CUDA cores.Experimental implementation system is Ubuntu 14.04, and distribution experiment operation platform is base In the deep learning frame caffe of C++, configuration interface is python interface.
Fig. 8 is a kind of commodity image classified service device structural schematic diagram based on deep learning of the invention.
As shown in figure 8, a kind of commodity image classified service device based on deep learning of the invention, including:
(1) data set acquisition module is configured as:It obtains commodity image and is labeled classification, form commodity image Classification based training data set and commodity image class test data set;
(2) background segment module is configured as:Using SVM algorithm to the quotient in commodity image classification based training data set Product image carries out background segment;
Specifically, the background segment module, including:
Label setup module, is configured as:Commodity image to be split is done into prospect label and background label;
Characteristic extracting module is configured as:Several pixels are used as instruction from prospect label and background label respectively Practice sample, and extracts individual features;
SVM classifier training module, is configured as:The parameter and kernel function for determining SVM classifier, using the phase of extraction Answer feature training SVM classifier;
SVM classifier categorization module, is configured as:The SVM classifier completed using training is to commodity image to be split Pixel classify, realize the background segment of commodity image.
Specifically, in the SVM classifier training module, the Selection of kernel function Polynomial kernel function of SVM classifier.
(3) commodity image disaggregated model constructs module, is configured as:By the commodity image classification based training after background segment Data set is input in the convolutional neural networks of the image classification of pre-training completion, and using the method for transfer learning to pre- instruction The convolutional neural networks for practicing the image classification completed carry out parameter adjustment, construct commodity image disaggregated model;
Specifically, complete to pre-training according to convolution kernel capacity principle in commodity image disaggregated model building module At the convolutional neural networks of image classification improve.
(4) commodity image disaggregated model test module, is configured as:Commodity image class test data set is input to Prediction classification is carried out in commodity image disaggregated model, until the prediction classification of commodity image disaggregated model output reaches default essence Commodity image disaggregated model is completed in degree, final training;
(5) it predicts categorization module, is configured as:Commodity image to be sorted is input to the commodity image of training completion In disaggregated model, output prediction classification results.
The commodity image categorizing system based on deep learning that the present invention also provides a kind of.
A kind of commodity image categorizing system based on deep learning of the invention, including as shown in Figure 8 based on depth The commodity image classified service device of habit.
Present invention employs SVM classifiers to have carried out background segment to commodity image, classifies compared to traditional images, right After commodity image background is split, the accuracy rate of classification can be improved.
The present invention has also carried out pre-training to network structure, and the parameter after pre-training moves in new network as initial Change parameter, commodity image class test data set is input in commodity image disaggregated model and carries out prediction classification, until commodity The prediction classification of image classification model output reaches default precision, and commodity image disaggregated model is completed in final training, improves quotient The accuracy rate of product image classification.
It should be understood by those skilled in the art that, the embodiment of the present invention can provide as method, system or computer program Product.Therefore, the shape of hardware embodiment, software implementation or embodiment combining software and hardware aspects can be used in the present invention Formula.Moreover, the present invention, which can be used, can use storage in the computer that one or more wherein includes computer usable program code The form for the computer program product implemented on medium (including but not limited to magnetic disk storage and optical memory etc.).
The present invention be referring to according to the method for the embodiment of the present invention, the process of equipment (system) and computer program product Figure and/or block diagram describe.It should be understood that can be realized by computer program instructions each in flowchart and/or the block diagram The combination of process and/or box in process and/or box and flowchart and/or the block diagram.It can provide these computers Processor of the program instruction to general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices To generate a machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute For realizing the function of being specified in one or more flows of the flowchart and/or one or more blocks of the block diagram Device.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works, so that instruction stored in the computer readable memory generation includes The manufacture of command device, the command device are realized in one box of one or more flows of the flowchart and/or block diagram Or the function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer Or the instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or box The step of function of being specified in figure one box or multiple boxes.
Those of ordinary skill in the art will appreciate that realizing all or part of the process in above-described embodiment method, being can be with Relevant hardware is instructed to complete by computer program, the program can be stored in a computer-readable storage and be situated between In matter, the program is when being executed, it may include such as the process of the embodiment of above-mentioned each method.Wherein, the storage medium can be Magnetic disk, CD, read-only memory (Read-Only Memory, ROM) or random access memory (Random AccessMemory, RAM) etc..
Above-mentioned, although the foregoing specific embodiments of the present invention is described with reference to the accompanying drawings, not protects model to the present invention The limitation enclosed, those skilled in the art should understand that, based on the technical solutions of the present invention, those skilled in the art The various modifications or changes that can be made are not needed to make the creative labor still within protection scope of the present invention.

Claims (9)

1. a kind of commodity image classification method based on deep learning, which is characterized in that including:
It obtains commodity image and is labeled classification, form commodity image classification based training data set and commodity image class test number According to collection;
Background segment is carried out to the commodity image in commodity image classification based training data set using SVM algorithm;
Commodity image classification based training data set after background segment is input to the convolutional Neural of the image classification of pre-training completion In network, and parameter adjustment is carried out using convolutional neural networks of the method for transfer learning to the image classification that pre-training is completed, Construct commodity image disaggregated model;
Commodity image class test data set is input in commodity image disaggregated model and carries out prediction classification, until commodity image The prediction classification of disaggregated model output reaches default precision, and commodity image disaggregated model is completed in final training;
Commodity image to be sorted is input in the commodity image disaggregated model of training completion, output prediction classification results.
2. a kind of commodity image classification method based on deep learning as described in claim 1, which is characterized in that use SVM Algorithm carries out the process of background segment to the commodity image in commodity image classification based training data set, including:
Commodity image to be split is done into prospect label and background label;
Several pixels and extract individual features as training sample from prospect label and background label respectively;
The parameter and kernel function for determining SVM classifier, using the individual features training SVM classifier of extraction;
Classified using the SVM classifier that training is completed to the pixel of commodity image to be split, realizes the back of commodity image Scape segmentation.
3. a kind of commodity image classification method based on deep learning as claimed in claim 2, which is characterized in that svm classifier The Selection of kernel function Polynomial kernel function of device.
4. a kind of commodity image classification method based on deep learning as described in claim 1, which is characterized in that moved using During the method for moving study carries out parameter adjustment to the convolutional neural networks for the image classification that pre-training is completed, according to convolution Core capacity principle improves the convolutional neural networks for the image classification that pre-training is completed.
5. a kind of commodity image classified service device based on deep learning, which is characterized in that including:
Data set acquisition module is configured as:It obtains commodity image and is labeled classification, form commodity image classification based training Data set and commodity image class test data set;
Background segment module, is configured as:Using SVM algorithm to the commodity image in commodity image classification based training data set into Row background segment;
Commodity image disaggregated model constructs module, is configured as:By the commodity image classification based training data set after background segment It is input in the convolutional neural networks of the image classification of pre-training completion, and pre-training is completed using the method for transfer learning The convolutional neural networks of image classification carry out parameter adjustment, construct commodity image disaggregated model;
Commodity image disaggregated model test module, is configured as:Commodity image class test data set is input to commodity figure As carrying out prediction classification in disaggregated model, until the prediction classification of commodity image disaggregated model output reaches default precision, finally Commodity image disaggregated model is completed in training;
It predicts categorization module, is configured as:Commodity image to be sorted is input to the commodity image classification mould of training completion In type, output prediction classification results.
6. a kind of commodity image classified service device based on deep learning as claimed in claim 5, which is characterized in that the back Scape divides module, including:
Label setup module, is configured as:Commodity image to be split is done into prospect label and background label;
Characteristic extracting module is configured as:Several pixels are used as training sample from prospect label and background label respectively This, and extract individual features;
SVM classifier training module, is configured as:The parameter and kernel function for determining SVM classifier, using the corresponding spy of extraction Levy training SVM classifier;
SVM classifier categorization module, is configured as:Using the trained SVM classifier completed to the picture of commodity image to be split Vegetarian refreshments is classified, and realizes the background segment of commodity image.
7. a kind of commodity image classified service device based on deep learning as claimed in claim 6, which is characterized in that described In SVM classifier training module, the Selection of kernel function Polynomial kernel function of SVM classifier.
8. a kind of commodity image classified service device based on deep learning as claimed in claim 5, which is characterized in that described Commodity image disaggregated model constructs in module, the convolutional Neural according to the image classification that convolution kernel capacity principle completes pre-training Network improves.
9. a kind of commodity image categorizing system based on deep learning, which is characterized in that including any one of such as claim 5-8 The commodity image classified service device based on deep learning.
CN201810588841.7A 2018-06-08 2018-06-08 commodity image classification method, server and system based on deep learning Pending CN108921198A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810588841.7A CN108921198A (en) 2018-06-08 2018-06-08 commodity image classification method, server and system based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810588841.7A CN108921198A (en) 2018-06-08 2018-06-08 commodity image classification method, server and system based on deep learning

Publications (1)

Publication Number Publication Date
CN108921198A true CN108921198A (en) 2018-11-30

Family

ID=64420032

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810588841.7A Pending CN108921198A (en) 2018-06-08 2018-06-08 commodity image classification method, server and system based on deep learning

Country Status (1)

Country Link
CN (1) CN108921198A (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110059658A (en) * 2019-04-26 2019-07-26 北京理工大学 A kind of satellite-remote-sensing image multidate change detecting method based on Three dimensional convolution neural network
CN110348293A (en) * 2019-05-29 2019-10-18 广州中科凯泽科技有限公司 A kind of commodity recognition method and system
CN110458096A (en) * 2019-08-09 2019-11-15 广州众聚智能科技有限公司 A kind of extensive commodity recognition method based on deep learning
CN111082992A (en) * 2019-12-23 2020-04-28 超讯通信股份有限公司 SDN network data packet identification method based on deep learning
CN111191723A (en) * 2019-12-30 2020-05-22 创新奇智(北京)科技有限公司 Few-sample commodity classification system and method based on cascade classifier
CN111325115A (en) * 2020-02-05 2020-06-23 山东师范大学 Countermeasures cross-modal pedestrian re-identification method and system with triple constraint loss
CN111750965A (en) * 2019-03-27 2020-10-09 杭州海康威视数字技术股份有限公司 Commodity self-service charging method, device and system
CN112164076A (en) * 2020-09-24 2021-01-01 济南大学 Hardened cement water cement ratio prediction method and system based on cement microstructure image
CN112257646A (en) * 2020-11-02 2021-01-22 创新奇智(南京)科技有限公司 Commodity detection method and device, electronic equipment and storage medium
CN112801578A (en) * 2021-01-18 2021-05-14 哈尔滨工业大学 Commodity warehousing and ex-warehouse management system and method applied to individual vendor
CN113191378A (en) * 2020-01-14 2021-07-30 阿里巴巴集团控股有限公司 Prediction method, device and system for image search
CN113220970A (en) * 2021-03-09 2021-08-06 十堰时风达工贸有限公司 E-commerce big data platform based on block chain
CN113220918A (en) * 2021-03-09 2021-08-06 十堰时风达工贸有限公司 E-commerce big data classification method and system based on block technology
CN113361584A (en) * 2021-06-01 2021-09-07 推想医疗科技股份有限公司 Model training method and device, and pulmonary arterial hypertension measurement method and device
CN116579722A (en) * 2023-07-14 2023-08-11 四川集鲜数智供应链科技有限公司 Commodity distribution warehouse-in and warehouse-out management method based on deep learning

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103186901A (en) * 2013-03-29 2013-07-03 中国人民解放军第三军医大学 Full-automatic image segmentation method
CN106530283A (en) * 2016-10-20 2017-03-22 北京工业大学 SVM (support vector machine)-based medical image blood vessel recognition method
CN107239802A (en) * 2017-06-28 2017-10-10 广东工业大学 A kind of image classification method and device

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103186901A (en) * 2013-03-29 2013-07-03 中国人民解放军第三军医大学 Full-automatic image segmentation method
CN106530283A (en) * 2016-10-20 2017-03-22 北京工业大学 SVM (support vector machine)-based medical image blood vessel recognition method
CN107239802A (en) * 2017-06-28 2017-10-10 广东工业大学 A kind of image classification method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
孙昂 等: "《基于改进的卷积神经网络多类商品精细分类》", 《山东师范大学学报(自然科学版)》 *

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111750965A (en) * 2019-03-27 2020-10-09 杭州海康威视数字技术股份有限公司 Commodity self-service charging method, device and system
CN110059658B (en) * 2019-04-26 2020-11-24 北京理工大学 Remote sensing satellite image multi-temporal change detection method based on three-dimensional convolutional neural network
CN110059658A (en) * 2019-04-26 2019-07-26 北京理工大学 A kind of satellite-remote-sensing image multidate change detecting method based on Three dimensional convolution neural network
CN110348293A (en) * 2019-05-29 2019-10-18 广州中科凯泽科技有限公司 A kind of commodity recognition method and system
CN110348293B (en) * 2019-05-29 2022-11-18 广州中科凯泽科技有限公司 Commodity identification method and system
CN110458096A (en) * 2019-08-09 2019-11-15 广州众聚智能科技有限公司 A kind of extensive commodity recognition method based on deep learning
CN111082992A (en) * 2019-12-23 2020-04-28 超讯通信股份有限公司 SDN network data packet identification method based on deep learning
CN111191723A (en) * 2019-12-30 2020-05-22 创新奇智(北京)科技有限公司 Few-sample commodity classification system and method based on cascade classifier
CN111191723B (en) * 2019-12-30 2023-06-20 创新奇智(北京)科技有限公司 Cascade classifier-based few-sample commodity classification system and classification method
CN113191378A (en) * 2020-01-14 2021-07-30 阿里巴巴集团控股有限公司 Prediction method, device and system for image search
CN111325115B (en) * 2020-02-05 2022-06-21 山东师范大学 Cross-modal countervailing pedestrian re-identification method and system with triple constraint loss
CN111325115A (en) * 2020-02-05 2020-06-23 山东师范大学 Countermeasures cross-modal pedestrian re-identification method and system with triple constraint loss
CN112164076B (en) * 2020-09-24 2022-10-25 济南大学 Hardened cement water cement ratio prediction method and system based on cement microstructure image
CN112164076A (en) * 2020-09-24 2021-01-01 济南大学 Hardened cement water cement ratio prediction method and system based on cement microstructure image
CN112257646A (en) * 2020-11-02 2021-01-22 创新奇智(南京)科技有限公司 Commodity detection method and device, electronic equipment and storage medium
CN112257646B (en) * 2020-11-02 2023-09-12 创新奇智(南京)科技有限公司 Commodity detection method and device, electronic equipment and storage medium
CN112801578A (en) * 2021-01-18 2021-05-14 哈尔滨工业大学 Commodity warehousing and ex-warehouse management system and method applied to individual vendor
CN113220970A (en) * 2021-03-09 2021-08-06 十堰时风达工贸有限公司 E-commerce big data platform based on block chain
CN113220918A (en) * 2021-03-09 2021-08-06 十堰时风达工贸有限公司 E-commerce big data classification method and system based on block technology
CN113361584A (en) * 2021-06-01 2021-09-07 推想医疗科技股份有限公司 Model training method and device, and pulmonary arterial hypertension measurement method and device
CN116579722A (en) * 2023-07-14 2023-08-11 四川集鲜数智供应链科技有限公司 Commodity distribution warehouse-in and warehouse-out management method based on deep learning
CN116579722B (en) * 2023-07-14 2023-09-19 四川集鲜数智供应链科技有限公司 Commodity distribution warehouse-in and warehouse-out management method based on deep learning

Similar Documents

Publication Publication Date Title
CN108921198A (en) commodity image classification method, server and system based on deep learning
CN110866140B (en) Image feature extraction model training method, image searching method and computer equipment
CN112733749B (en) Real-time pedestrian detection method integrating attention mechanism
CN112396002B (en) SE-YOLOv 3-based lightweight remote sensing target detection method
Tang et al. Deepchart: Combining deep convolutional networks and deep belief networks in chart classification
Liu et al. ARC-Net: An efficient network for building extraction from high-resolution aerial images
Quoc Bao et al. Plant species identification from leaf patterns using histogram of oriented gradients feature space and convolution neural networks
CN108875076B (en) Rapid trademark image retrieval method based on Attention mechanism and convolutional neural network
Sun et al. Fast object detection based on binary deep convolution neural networks
CN109063719B (en) Image classification method combining structure similarity and class information
CN109117781A (en) Method for building up, device and the more attribute recognition approaches of more attribute Recognition Models
CN110633708A (en) Deep network significance detection method based on global model and local optimization
Ye et al. A new re-ranking method based on convolutional neural network and two image-to-class distances for remote sensing image retrieval
CN113807399A (en) Neural network training method, neural network detection method and neural network detection device
Xiong et al. MSN: Modality separation networks for RGB-D scene recognition
Cui et al. Lightweight spectral–spatial attention network for hyperspectral image classification
Hashemi Hosseinabad et al. Multiple answers to a question: a new approach for visual question answering
Li et al. Page segmentation using convolutional neural network and graphical model
Wei et al. Food image classification and image retrieval based on visual features and machine learning
Xin et al. Hybrid dilated multilayer faster RCNN for object detection
Chen et al. Research on object detection algorithm based on multilayer information fusion
Yang et al. An effective and lightweight hybrid network for object detection in remote sensing images
Ansari et al. A survey of recent trends in two-stage object detection methods
Tian et al. Object detection in optical remote sensing images by integrating object-to-object relationships
CN118247645A (en) Novel DDCE-YOLOv s model underwater image target detection method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20181130

RJ01 Rejection of invention patent application after publication