
CN110245334A - Method and apparatus for output information - Google Patents

Method and apparatus for output information

Info

Publication number
CN110245334A
Authority
CN
China
Prior art keywords
word
probability
article
sentence
sample
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910552619.6A
Other languages
Chinese (zh)
Other versions
CN110245334B (en)
Inventor
蒋帅
陈思姣
梁海金
罗雨
卞东海
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201910552619.6A
Publication of CN110245334A
Application granted
Publication of CN110245334B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50 Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/5866 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, manually generated location and time information
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G06F40/137 Hierarchical processing, e.g. outlines
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/189 Automatic justification
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/205 Parsing
    • G06F40/216 Parsing using statistical methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/289 Phrasal analysis, e.g. finite state techniques or chunking
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Library & Information Science (AREA)
  • Databases & Information Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)

Abstract

Embodiments of the present disclosure disclose a method and apparatus for outputting information. One specific embodiment of the method includes: obtaining audio information to be converted; converting the audio information into text information; segmenting the text information into words to obtain a word sequence; for each word in the word sequence, querying a word connection probability table, obtained from a pre-trained word connection probability model, for the connection probability between the word and the next word and the connection probabilities between the word and each type of punctuation mark, and determining the link target of the word based on the queried connection probabilities; and connecting each word in the word sequence with its corresponding link target to generate and output an article with punctuation. This embodiment can automatically convert audio into a punctuated article.

Description

Method and apparatus for output information
Technical field
Embodiments of the present disclosure relate to the field of computer technology, and in particular to a method and apparatus for outputting information.
Background
In the field of automatic article generation, few articles are generated automatically from multimedia transcription; most are generated from structured text data. As a result, the data source is limited, and the generated articles lack richness and breadth. Manually editing multimedia articles, meanwhile, is time-consuming and tedious and incurs unnecessary labor and financial cost. The conventional approach relies mainly on manual editing: a person converts the relevant audio into text, searches the web for pictures related to the audio's theme, and finally lays out the converted text and pictures by hand.
The main problems with the manual approach are: (1) audio conversion: manual conversion is time-consuming and laborious, and its accuracy is not necessarily high; (2) image selection: selecting relevant pictures according to the theme by manual search consumes a great deal of labor; (3) article organization and rendering: the relevant text and pictures must finally be organized into a highly readable article.
Summary of the invention
Embodiments of the present disclosure propose a method and apparatus for outputting information.
In a first aspect, embodiments of the present disclosure provide a method for outputting information, including: obtaining audio information to be converted; converting the audio information into text information; segmenting the text information into words to obtain a word sequence; for each word in the word sequence, querying a word connection probability table, obtained from a pre-trained word connection probability model, for the connection probability between the word and the next word and the connection probabilities between the word and each type of punctuation mark, and determining the link target of the word based on the queried connection probabilities; and connecting each word in the word sequence with its corresponding link target to generate and output an article with punctuation.
In some embodiments, the word connection probability table is obtained through the following steps: obtaining a training sample set, where each training sample includes a sentence containing punctuation; training an LSTM model with the sentences of the training samples in the training sample set as input to obtain the word connection probability model; and generating the word connection probability table from the word-to-word probabilities and the word-to-punctuation probabilities obtained in the intermediate process of training the word connection probability model.
In some embodiments, obtaining the training sample set includes: obtaining a sample article and splitting it at the granularity of one full sentence to obtain a set of sample sentences, where a full sentence is a sentence ending with a period, question mark, or exclamation mark; and, for each sample sentence in the set, segmenting the sentence into words and generating word vectors as a training sample.
In some embodiments, the method further includes: dividing the article into at least one paragraph.
In some embodiments, the method further includes: determining the theme and entities of the article; obtaining images that match the theme and entities of the article; and generating image-and-text information from the images and the article.
In some embodiments, the method further includes: optimizing the layout of the image-and-text information.
In a second aspect, embodiments of the present disclosure provide an apparatus for outputting information, including: an acquiring unit configured to obtain audio information to be converted; a converting unit configured to convert the audio information into text information; a word segmentation unit configured to segment the text information into words to obtain a word sequence; a judging unit configured to, for each word in the word sequence, query a word connection probability table obtained from a pre-trained word connection probability model for the connection probability between the word and the next word and the connection probabilities between the word and each type of punctuation mark, and to determine the link target of the word based on the queried connection probabilities; and a connecting unit configured to connect each word in the word sequence with its corresponding link target to generate and output an article with punctuation.
In some embodiments, the apparatus further includes a training unit configured to: obtain a training sample set, where each training sample includes a sentence containing punctuation; train an LSTM model with the sentences of the training samples in the training sample set as input to obtain the word connection probability model; and generate the word connection probability table from the word-to-word probabilities and the word-to-punctuation probabilities obtained in the intermediate process of training the word connection probability model.
In some embodiments, the training unit is further configured to: obtain a sample article and split it at the granularity of one full sentence to obtain a set of sample sentences, where a full sentence is a sentence ending with a period, question mark, or exclamation mark; and, for each sample sentence in the set, segment the sentence into words and generate word vectors as a training sample.
In some embodiments, the apparatus further includes a paragraph segmentation unit configured to: divide the article into at least one paragraph.
In some embodiments, the apparatus further includes an image matching unit configured to: determine the theme and entities of the article; obtain images that match the theme and entities of the article; and generate image-and-text information from the images and the article.
In some embodiments, the apparatus further includes a layout unit configured to: optimize the layout of the image-and-text information.
In a third aspect, embodiments of the present disclosure provide an electronic device, including: one or more processors; and a storage device storing one or more programs which, when executed by the one or more processors, cause the one or more processors to implement the method of any embodiment of the first aspect.
In a fourth aspect, embodiments of the present disclosure provide a computer-readable medium storing a computer program which, when executed by a processor, implements the method of any embodiment of the first aspect.
The method and apparatus for outputting information provided by embodiments of the present disclosure can link sentences and divide paragraphs based on the text content parsed from audio, then match images according to the subject matter of the text, and finally optimize the layout of the text and pictures to generate an article. Compared with traditional article generation systems, this system draws on richer and more varied data from a wider range of sources. Compared with traditional articles written by hand by editors, it offers better timeliness and coverage while also saving labor and time costs.
Brief description of the drawings
Other features, objects, and advantages of the present disclosure will become more apparent from the following detailed description of non-limiting embodiments, read with reference to the accompanying drawings:
Fig. 1 is an exemplary system architecture diagram to which an embodiment of the present disclosure may be applied;
Fig. 2 is a flowchart of one embodiment of the method for outputting information according to the present disclosure;
Fig. 3 is a schematic diagram of an application scenario of the method for outputting information according to the present disclosure;
Fig. 4 is a flowchart of another embodiment of the method for outputting information according to the present disclosure;
Figs. 5a and 5b are schematic diagrams of the network structure of the LSTM model used in the method for outputting information according to the present disclosure;
Fig. 6 is a structural schematic diagram of one embodiment of the apparatus for outputting information according to the present disclosure;
Fig. 7 is a structural schematic diagram of a computer system suitable for implementing the electronic device of embodiments of the present disclosure.
Detailed description
The present disclosure is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here serve only to explain the relevant invention and do not limit it. It should also be noted that, for ease of description, the drawings show only the parts relevant to the invention.
It should be noted that, where no conflict arises, the embodiments of the present disclosure and the features in those embodiments may be combined with one another. The present disclosure is described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
Fig. 1 shows an exemplary system architecture 100 to which embodiments of the method for outputting information or the apparatus for outputting information of the present disclosure may be applied.
As shown in Fig. 1, the system architecture 100 may include terminal devices 101, 102, 103, a network 104, and a server 105. The network 104 serves as the medium that provides communication links between the terminal devices 101, 102, 103 and the server 105. The network 104 may include various connection types, such as wired or wireless communication links, or fiber optic cables.
A user may use the terminal devices 101, 102, 103 to interact with the server 105 through the network 104 to receive or send messages. Various communication client applications may be installed on the terminal devices 101, 102, 103, such as audio-to-text applications, web browser applications, shopping applications, search applications, instant messaging tools, email clients, and social platform software.
The terminal devices 101, 102, 103 may be hardware or software. When they are hardware, they may be various electronic devices that have a microphone and a display screen and support audio-to-text conversion, including but not limited to smartphones, tablet computers, e-book readers, MP3 players (Moving Picture Experts Group Audio Layer III), MP4 players (Moving Picture Experts Group Audio Layer IV), laptop computers, and desktop computers. When they are software, they may be installed in the electronic devices listed above, and may be implemented as multiple pieces of software or software modules (for example, to provide distributed services) or as a single piece of software or software module. No specific limitation is made here.
The server 105 may be a server that provides various services, for example a background editing server that supports the text displayed on the terminal devices 101, 102, 103. The background editing server may analyze and otherwise process received data such as audio, and feed the processing result (for example, an article generated from the audio) back to the terminal device.
It should be noted that the server may be hardware or software. When the server is hardware, it may be implemented as a distributed server cluster composed of multiple servers or as a single server. When the server is software, it may be implemented as multiple pieces of software or software modules (for example, to provide distributed services) or as a single piece of software or software module. No specific limitation is made here.
It should be noted that the method for outputting information provided by embodiments of the present disclosure may be executed by the terminal devices 101, 102, 103 or by the server 105. Correspondingly, the apparatus for outputting information may be provided in the terminal devices 101, 102, 103 or in the server 105. No specific limitation is made here.
It should be understood that the numbers of terminal devices, networks, and servers in Fig. 1 are merely illustrative. Any number of terminal devices, networks, and servers may be provided according to implementation needs.
With continued reference to Fig. 2, a process 200 of one embodiment of the method for outputting information according to the present disclosure is shown. The method for outputting information includes the following steps:
Step 201: obtain audio information to be converted.
In this embodiment, the execution body of the method for outputting information (for example, the server shown in Fig. 1) may receive audio information, over a wired or wireless connection, from the terminal that the user uses for voice writing. The audio information may be an audio file in any of various formats and contains a large number of sentences. The file name of the audio file may include the title of the audio segment.
Step 202: convert the audio information into text information.
In this embodiment, the audio information may be converted into a single block of text by existing automatic speech recognition (ASR) technology. The text produced by ASR is one block with no sentence breaks or punctuation, so it still needs to be split and linked according to its semantics and given punctuation marks.
Step 203: segment the text information into words to obtain a word sequence.
In this embodiment, the whole block of text is segmented into words based on the lexical structure of Chinese or English, yielding the word sequence of the whole audio segment. Word segmentation methods may include common approaches such as reverse maximum matching. The language of the audio, for example Chinese, English, or another language, may be identified first, and word segmentation then performed according to the lexical structure of that language.
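Reverse maximum matching, which the text names as one possible segmentation method, can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the function name `reverse_max_match`, the `max_len` limit, and the toy dictionary are all hypothetical, and the Chinese words are simply a plausible rendering of the "I love China because" example used later in the description.

```python
def reverse_max_match(text, vocab, max_len=4):
    """Segment text right-to-left, preferring the longest dictionary word."""
    words = []
    end = len(text)
    while end > 0:
        # Try the longest candidate ending at `end` first, then shrink it.
        for size in range(min(max_len, end), 0, -1):
            candidate = text[end - size:end]
            # A single character is always accepted as a fallback.
            if size == 1 or candidate in vocab:
                words.append(candidate)
                end -= size
                break
    words.reverse()  # words were collected from the end of the text
    return words


# Hypothetical toy dictionary, for illustration only.
toy_vocab = {"我", "爱", "中国", "因为"}
print(reverse_max_match("我爱中国因为", toy_vocab))
```

A production system would more likely rely on an established segmentation library and a full dictionary; the sketch only shows the matching order.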
Step 204: for each word in the word sequence, query the word connection probability table, obtained from the pre-trained word connection probability model, for the connection probability between the word and the next word and the connection probabilities between the word and each type of punctuation mark, and determine the link target of the word based on the queried connection probabilities.
In this embodiment, the word connection probability table is generated from the word connection probability model. The table characterizes the probability of a word connecting to another word or to each type of punctuation mark. Each word is treated as independent; that is, any word may be followed by a punctuation mark (, . ? ! ; etc.). For the current word, the probabilities of the word connecting to each punctuation mark and to the next word are obtained separately, and the candidate with the highest probability is linked. If the next word has the highest probability, no punctuation is needed at that position and the next word is appended directly. If a punctuation mark has the highest probability, that punctuation mark is appended after the word. Applying these steps to every word finally yields sentences joined with punctuation. For example, for the word sequence "I" "love" "China" "because", the connection probability between "I" and "love" and the connection probabilities between "I" and punctuation marks such as the period and the comma are queried in turn. The connection probability between "I" and "love" is the highest, so no punctuation is placed between "I" and "love". The connection probability between "China" and the period, however, is much greater than that between "China" and "because", and is also greater than those between "China" and other punctuation marks, so a period is added after "China". The word connection probability model is a key part of this subsystem: the model must be trained to obtain the probabilities between words, from which the word connection probability table is generated; the candidate with the highest probability is then taken as the link target of the word. The generation of the word connection probability table is described in steps 401-403.
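The lookup described in this step can be sketched as follows. The table layout (a nested dictionary), the probability values, the `NEXT_WORD` marker, and the function names are assumptions made for illustration; the patent does not specify a storage format or concrete numbers.

```python
NEXT_WORD = "<next>"  # hypothetical marker: join directly to the next word
PUNCTUATION = [",", ".", "?", "!", ";"]

# Hypothetical word connection probability table: table[word][target] = probability.
toy_table = {
    "I": {"love": 0.90, ",": 0.05, ".": 0.02},
    "love": {"China": 0.85, ",": 0.04, ".": 0.03},
    "China": {"because": 0.10, ",": 0.25, ".": 0.60},
    "because": {",": 0.10, ".": 0.30},
}


def link_target(word, next_word, table):
    """Return NEXT_WORD or the punctuation mark with the highest probability."""
    row = table.get(word, {})
    candidates = {}
    if next_word is not None:
        # Listed first so that ties fall back to "no punctuation".
        candidates[NEXT_WORD] = row.get(next_word, 0.0)
    for mark in PUNCTUATION:
        candidates[mark] = row.get(mark, 0.0)
    return max(candidates, key=candidates.get)


def punctuate(words, table):
    """Interleave each word with its link target when that target is punctuation."""
    pieces = []
    for i, word in enumerate(words):
        next_word = words[i + 1] if i + 1 < len(words) else None
        pieces.append(word)
        target = link_target(word, next_word, table)
        if target != NEXT_WORD:
            pieces.append(target)
    return pieces


print(punctuate(["I", "love", "China", "because"], toy_table))
```

With the toy values, "I" and "love" are joined directly while "China" is followed by a period, mirroring the example in the text. The final word has no successor in this short sequence, so only punctuation marks compete for it.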
Step 205: connect each word in the word sequence with its corresponding link target to generate and output an article with punctuation.
In this embodiment, according to the result of step 204, each word in the word sequence is connected with its corresponding link target to generate an article with punctuation, which is then output.
In some optional implementations of this embodiment, the method further includes: dividing the article into at least one paragraph. Semantic analysis may be performed on the article, and the article then divided into paragraphs according to its semantics, with text sharing the same semantics grouped into one paragraph.
In some optional implementations of this embodiment, the method further includes: determining the theme and entities of the article; obtaining images that match the theme and entities of the article; and generating image-and-text information from the images and the article. From the text data produced by the audio conversion module, the entities in the text (entities here are fine-grained, including people such as celebrities and things such as banks) and the theme (categories such as finance, entertainment, and sports) are mined. Related entity images are then retrieved from an entity image library according to the entities, and related theme images are retrieved from a theme image library according to the theme. These pictures are related to the text and can be used directly as illustrations for the article.
In some optional implementations of this embodiment, the method further includes: optimizing the layout of the image-and-text information. Pictures are automatically inserted at suitable positions in the article and resized so that the ratio of the area of the text content to the area of the pictures reaches a predetermined value.
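The resizing step can be illustrated with a short calculation. This sketch assumes a single picture that is scaled uniformly and a known text area; the function name `fit_image`, its parameters, and the sample numbers are hypothetical and are not taken from the patent.

```python
import math


def fit_image(width, height, text_area, target_ratio):
    """Scale an image uniformly so that text_area / image_area == target_ratio.

    All arguments are assumptions for illustration: sizes are in pixels and
    text_area is the area occupied by the rendered text.
    """
    # Solve text_area / (s*s * width * height) = target_ratio for the scale s.
    scale = math.sqrt(text_area / (target_ratio * width * height))
    return round(width * scale), round(height * scale)


# A 400x300 picture next to 120000 px^2 of text, aiming for a 4:1 text-to-picture ratio.
print(fit_image(400, 300, text_area=120000, target_ratio=4.0))
```

Scaling both dimensions by the same factor keeps the aspect ratio, which is why the scale is the square root of the area ratio.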
With continued reference to Fig. 3, Fig. 3 is a schematic diagram of an application scenario of the method for outputting information according to this embodiment. In the application scenario of Fig. 3, the server receives the audio file "Dali Scenery" sent by the terminal, in which the user describes the local conditions and customs of Dali by voice. The audio file is parsed into a single block of text by ASR technology. After the block of text is segmented into words, the connection probabilities between words and between each word and each punctuation mark are queried, and the word or punctuation mark with the highest probability is taken as the link target. Connecting the words produces an article with punctuation. Pictures may also be retrieved according to the text content to find suitable illustrations. The article is then divided into paragraphs according to the semantics of the text. Finally, the retrieved pictures are inserted into the article and the result is polished.
The method provided by the above embodiment of the present disclosure can link sentences and divide paragraphs based on the text content parsed from audio, then match images according to the subject matter of the text, and finally optimize the layout of the text and pictures to generate an article. Compared with traditional article generation systems, this system draws on richer and more varied data from a wider range of sources; compared with traditional articles written by hand by editors, it offers better timeliness and coverage while also saving labor and time costs.
With further reference to Fig. 4, a process 400 of another embodiment of the method for outputting information is shown. The process 400 of the method for outputting information includes the following steps:
Step 401: obtain a training sample set.
In this embodiment, the execution body of the method for outputting information (for example, the server shown in Fig. 1) may obtain the training sample set over a wired or wireless connection, where each training sample includes a sentence containing punctuation. The execution body of process 400 may be the same as that of process 200, or a different one. A third-party server may execute process 400 to generate the word connection probability table and then send it to the execution body of process 200 for use.
Ordinary news texts or articles are used as training data.
First, the article is split into sentences at the granularity of one full sentence, where a full sentence is a sentence ending with a period, question mark, or exclamation mark. Each full sentence serves as one data item;
Then, each sentence is segmented into words according to the lexical structure of English or Chinese;
Finally, the words are encoded: each word is embedded to obtain the embedding representation of each sentence, which gives the training samples. Here, words include punctuation marks.
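The three preparation steps above can be sketched as follows. This is a simplified illustration under several assumptions: the text is English (Chinese would need a real word segmenter), tokens are mapped to integer ids rather than to learned embedding vectors, and all function names are hypothetical.

```python
import re

SENTENCE_END = "。？！.?!"  # marks that close a full sentence


def split_full_sentences(article):
    """Split an article into sentences ending with a period, question mark, or exclamation mark."""
    pattern = "[^{0}]+[{0}]".format(re.escape(SENTENCE_END))
    return [s.strip() for s in re.findall(pattern, article)]


def tokenize(sentence):
    """Split an English sentence into words and punctuation marks (punctuation counts as a word)."""
    return re.findall(r"\w+|[^\w\s]", sentence)


def build_samples(article):
    """Encode each sentence as a list of token ids; id 0 is left free for padding."""
    vocab = {}
    samples = []
    for sentence in split_full_sentences(article):
        ids = []
        for token in tokenize(sentence):
            if token not in vocab:
                vocab[token] = len(vocab) + 1
            ids.append(vocab[token])
        samples.append(ids)
    return samples, vocab


samples, vocab = build_samples("I love China. Do you love it too? Yes!")
print(samples)
```

In a full pipeline the integer ids would index an embedding matrix, so that each sentence becomes a sequence of vectors before it is fed to the LSTM.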
Step 402: train an LSTM model with the sentences of the training samples in the training sample set as input to obtain the word connection probability model.
In this embodiment, LSTM (Long Short-Term Memory) is a long short-term memory network, a type of recurrent neural network over time that is suited to processing and predicting important events separated by relatively long intervals and delays in a time series. The hidden layer of the original RNN has only one state (Fig. 5a), which is very sensitive to short-term input. A second state is therefore added (Fig. 5b) so that the network can retain long-term state.
An LSTM has the same chain-like structure, but its repeating module is built differently: instead of a single neural network layer, it has four, which interact in a particular way.
At time t, the LSTM has three inputs: the network input value at the current time, the LSTM output value from the previous time step, and the cell state from the previous time step. It has two outputs: the LSTM output value at the current time and the cell state at the current time.
The key to LSTM is how to control the long-term state. The LSTM approach is to use three control switches. The first switch controls whether the long-term state continues to be retained; the second switch controls whether the immediate state is written into the long-term state; the third switch controls whether the long-term state is used as the output of the current LSTM.
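The three switches correspond to the forget, input, and output gates of the standard LSTM cell. The following sketch shows one time step for a cell with a single scalar unit; it follows the textbook LSTM formulation rather than anything specific to the patent, and the parameter names in `p` are hypothetical.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def lstm_step(x_t, h_prev, c_prev, p):
    """One LSTM step: inputs are x_t, previous output h_prev, previous cell state c_prev."""
    # First switch (forget gate): how much of the long-term state to keep.
    f = sigmoid(p["wf"] * x_t + p["uf"] * h_prev + p["bf"])
    # Second switch (input gate): how much of the immediate state to write.
    i = sigmoid(p["wi"] * x_t + p["ui"] * h_prev + p["bi"])
    # Third switch (output gate): how much of the long-term state to expose.
    o = sigmoid(p["wo"] * x_t + p["uo"] * h_prev + p["bo"])
    # Candidate (immediate) state computed from the current input.
    g = math.tanh(p["wg"] * x_t + p["ug"] * h_prev + p["bg"])
    c_t = f * c_prev + i * g      # new long-term (cell) state
    h_t = o * math.tanh(c_t)      # new output
    return h_t, c_t


# Hypothetical parameters: all zeros, so every gate is half open.
zero_params = {k: 0.0 for k in
               ["wf", "uf", "bf", "wi", "ui", "bi", "wo", "uo", "bo", "wg", "ug", "bg"]}
print(lstm_step(1.0, 0.0, 2.0, zero_params))
```

The four weighted sums (three gates plus the candidate state) are the four interacting layers mentioned above. Real models use vector-valued states and weight matrices, typically through a deep learning framework.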
Step 403: generate the word connection probability table from the word-to-word probabilities and the word-to-punctuation probabilities obtained in the intermediate process of training the word connection probability model.
In this embodiment, the embedded sentences are used as input to the LSTM model to train the model. The intermediate results of the model are extracted to obtain the connection probabilities between words. These connection probabilities are then statistically analyzed to obtain the word connection probability table. Looking up the table gives the connection probability between one word and another.
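The text does not say which statistic is used to turn the extracted probabilities into a table. As one possible reading, the sketch below averages every probability observed for the same (word, target) pair; the averaging choice, the observation format, and the name `build_table` are assumptions for illustration only.

```python
from collections import defaultdict


def build_table(observations):
    """Average the probabilities observed for each (word, target) pair.

    `observations` is a hypothetical list of (word, target, probability)
    triples, e.g. collected from a model's outputs during training.
    """
    sums = defaultdict(lambda: defaultdict(float))
    counts = defaultdict(lambda: defaultdict(int))
    for word, target, prob in observations:
        sums[word][target] += prob
        counts[word][target] += 1
    return {
        word: {target: sums[word][target] / counts[word][target] for target in sums[word]}
        for word in sums
    }


# Made-up observations, for illustration only.
toy_observations = [
    ("China", ".", 0.6),
    ("China", ".", 0.8),
    ("China", "because", 0.1),
]
connection_table = build_table(toy_observations)
print(connection_table)
```

The resulting nested dictionary can be queried as `connection_table[word][target]`, which matches the lookup performed in step 204.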
With further reference to Fig. 6, as the realization to method shown in above-mentioned each figure, present disclose provides one kind for exporting letter One embodiment of the device of breath, the Installation practice is corresponding with embodiment of the method shown in Fig. 2, which can specifically answer For in various electronic equipments.
As shown in fig. 6, the device 600 for output information of the present embodiment includes: acquiring unit 601, converting unit 602, word cutting unit 603, judging unit 604 and connection unit 605.Wherein, acquiring unit 601 are configured to obtain to be converted Audio-frequency information;Converting unit 602 is configured to audio-frequency information being converted into text information;Word cutting unit 603, is configured to Text information is subjected to word cutting, obtains word sequence;Judging unit 604 is configured to for the word in word sequence, by instructing in advance The word connection probability tables that experienced word connection probabilistic model obtains inquire the word and connected between next word of the word probability and The connection probability of the word and all kinds of punctuates, and the linking objective based on the connection determine the probability word inquired;Connection unit 605, it is configured to connect word each in word sequence and corresponding linking objective and generates the article with punctuate and export.
In the present embodiment, for the acquiring unit 601 of the device of output information 600, converting unit 602, word cutting unit 603, the specific processing of judging unit 604 and connection unit 605 can be with reference to step 201, the step in Fig. 2 corresponding embodiment 202, step 203, step 204 and step 205.
In some optional implementations of the present embodiment, device 600 further includes training unit (attached to be not shown in the figure), It is configured to: obtaining training sample set, wherein training sample includes the sentence containing punctuate;It will be in training sample set Input of the sentence of training sample as LSTM model, training obtain word connection probabilistic model;Probabilistic model instruction is connected according to word The probability between the probability and each word and each punctuate between each word and word is obtained in experienced pilot process generates word connection generally Rate table.
In some optional implementations of this embodiment, the training unit is further configured to: obtain a sample article and split it at the granularity of a big sentence to obtain a sample sentence set, where a big sentence is a sentence ending with a period, question mark, or exclamation mark; and, for each sample sentence in the sample sentence set, perform word cutting on the sample sentence and generate word vectors as a training sample.
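The big-sentence splitting step can be sketched with a regular expression. This is an illustrative assumption rather than the disclosed method: the function name `split_big_sentences` and the choice to accept both Chinese and ASCII sentence terminators are hypothetical, and word cutting and word-vector generation are not covered here.

```python
import re
from typing import List

# A "big sentence" ends with a period, question mark, or exclamation mark.
# Both full-width (Chinese) and ASCII terminators are accepted in this sketch.
_BIG_SENTENCE = re.compile(r"[^。？！.?!]+[。？！.?!]+")


def split_big_sentences(article: str) -> List[str]:
    """Split an article into sentences that end with a terminating punctuation mark.

    Commas and other inner punctuation stay inside the sentence. Trailing text
    without a terminator is dropped, and an ASCII "." inside a number such as
    "3.5" would be treated as a sentence end; both are known limitations.
    """
    return [match.group().strip() for match in _BIG_SENTENCE.finditer(article)]
```

Because each sample sentence keeps its inner commas and its final terminator, the sentences still carry the word-to-punctuation transitions that the training step relies on.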
In some optional implementations of this embodiment, the device 600 further includes a segmenting unit (not shown in the figures) configured to divide the article into at least one paragraph.
In some optional implementations of this embodiment, the device 600 further includes an image matching unit (not shown in the figures) configured to: determine the theme and entities of the article; obtain images that match the theme and entities of the article; and generate image-text information from the images and the article.
In some optional implementations of this embodiment, the device 600 further includes a typesetting unit (not shown in the figures) configured to perform typesetting optimization on the image-text information.
Referring now to Fig. 7, a schematic structural diagram is shown of an electronic device 700 (for example, the server in Fig. 1) suitable for implementing embodiments of the present disclosure. The server shown in Fig. 7 is only an example and should not impose any limitation on the functions or scope of use of the embodiments of the present disclosure.
As shown in Fig. 7, the electronic device 700 may include a processing unit 701 (for example, a central processing unit or a graphics processor), which can perform various appropriate actions and processing according to a program stored in read-only memory (ROM) 702 or a program loaded from a storage device 708 into random access memory (RAM) 703. The RAM 703 also stores various programs and data required for the operation of the electronic device 700. The processing unit 701, the ROM 702, and the RAM 703 are connected to one another via a bus 704. An input/output (I/O) interface 705 is also connected to the bus 704.
In general, the following devices may be connected to the I/O interface 705: an input device 706 including, for example, a touch screen, touch pad, keyboard, mouse, camera, microphone, accelerometer, or gyroscope; an output device 707 including, for example, a liquid crystal display (LCD), speaker, or vibrator; a storage device 708 including, for example, a magnetic tape or hard disk; and a communication device 709. The communication device 709 may allow the electronic device 700 to communicate wirelessly or by wire with other devices to exchange data. Although Fig. 7 shows an electronic device 700 having various devices, it should be understood that not all of the devices shown are required to be implemented or provided. More or fewer devices may alternatively be implemented or provided. Each block shown in Fig. 7 may represent one device or, as needed, multiple devices.
In particular, according to embodiments of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program carried on a computer-readable medium, the computer program containing program code for executing the method shown in the flowchart. In such an embodiment, the computer program may be downloaded and installed from a network via the communication device 709, installed from the storage device 708, or installed from the ROM 702. When the computer program is executed by the processing unit 701, the above-described functions defined in the method of the embodiments of the present disclosure are performed. It should be noted that the computer-readable medium described in the embodiments of the present disclosure may be a computer-readable signal medium, a computer-readable storage medium, or any combination of the two. A computer-readable storage medium may be, for example, but is not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the above. More specific examples of computer-readable storage media may include, but are not limited to: an electrical connection with one or more wires, a portable computer diskette, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), an optical fiber, portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above. In the embodiments of the present disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in connection with an instruction execution system, apparatus, or device. In the embodiments of the present disclosure, a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, in which computer-readable program code is carried. Such a propagated data signal may take many forms, including but not limited to an electromagnetic signal, an optical signal, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device. Program code contained on a computer-readable medium may be transmitted using any suitable medium, including but not limited to: electrical wire, optical cable, RF (radio frequency), or any suitable combination of the above.
The above computer-readable medium may be included in the above electronic device, or it may exist separately without being assembled into the electronic device. The above computer-readable medium carries one or more programs which, when executed by the electronic device, cause the electronic device to: obtain audio information to be converted; convert the audio information into text information; perform word cutting on the text information to obtain a word sequence; for each word in the word sequence, query a word connection probability table obtained from a pre-trained word connection probability model for the connection probability between the word and the next word and the connection probabilities between the word and each type of punctuation mark, and determine the connection target of the word based on the queried connection probabilities; and connect each word in the word sequence with its corresponding connection target to generate a punctuated article and output it.
Computer program code for performing the operations of the embodiments of the present disclosure may be written in one or more programming languages or a combination thereof, including object-oriented programming languages such as Java, Smalltalk, and C++, as well as conventional procedural programming languages such as the "C" language or similar programming languages. The program code may be executed entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on a remote computer or server. Where a remote computer is involved, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider).
The flowcharts and block diagrams in the drawings illustrate the possible architecture, functions, and operations of systems, methods, and computer program products according to various embodiments of the present disclosure. In this regard, each block in a flowchart or block diagram may represent a module, program segment, or portion of code that contains one or more executable instructions for implementing the specified logical function. It should also be noted that, in some alternative implementations, the functions noted in the blocks may occur in an order different from that noted in the drawings. For example, two blocks shown in succession may in fact be executed substantially in parallel, or they may sometimes be executed in the reverse order, depending on the functions involved. It should also be noted that each block in the block diagrams and/or flowcharts, and combinations of blocks in the block diagrams and/or flowcharts, may be implemented by a dedicated hardware-based system that performs the specified functions or operations, or by a combination of dedicated hardware and computer instructions.
The units described in the embodiments of the present disclosure may be implemented in software or in hardware. The described units may also be provided in a processor; for example, a processor may be described as including an acquiring unit, a converting unit, a word cutting unit, a judging unit, and a connection unit. In some cases the names of these units do not limit the units themselves; for example, the acquiring unit may also be described as "a unit that obtains audio information to be converted".
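As one way to picture a software realization of these units, the sketch below wires five interchangeable callables into a single pipeline object. It is only an illustration under stated assumptions: the class name `OutputPipeline`, the field names, and the callable signatures are hypothetical, and the acquisition and speech-to-text steps are left to whatever components are supplied.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class OutputPipeline:
    """Hypothetical software grouping of the five units; every field is a plug-in callable."""
    acquire: Callable[[], bytes]                     # acquiring unit: returns audio data
    convert: Callable[[bytes], str]                  # converting unit: audio -> text
    cut_words: Callable[[str], List[str]]            # word cutting unit: text -> word sequence
    judge: Callable[[List[str]], List[str]]          # judging unit: one punctuation mark ("" for none) per word
    connect: Callable[[List[str], List[str]], str]   # connection unit: words + marks -> article

    def run(self) -> str:
        """Run the units in the order used by the method and return the punctuated article."""
        audio = self.acquire()
        text = self.convert(audio)
        words = self.cut_words(text)
        marks = self.judge(words)
        return self.connect(words, marks)
```

Keeping each unit behind a plain callable means a unit can be replaced (for instance, by a hardware-backed implementation) without changing the order of the steps.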
The above description covers only the preferred embodiments of the present disclosure and an explanation of the technical principles applied. Those skilled in the art should understand that the scope of the invention involved in the present disclosure is not limited to technical solutions formed by the specific combination of the above technical features, and should also cover other technical solutions formed by any combination of the above technical features or their equivalent features without departing from the inventive concept, for example, technical solutions formed by replacing the above features with (but not limited to) technical features having similar functions disclosed in the present disclosure.

Claims (14)

1. A method for outputting information, comprising:
obtaining audio information to be converted;
converting the audio information into text information;
performing word cutting on the text information to obtain a word sequence;
for each word in the word sequence, querying a word connection probability table obtained from a pre-trained word connection probability model for the connection probability between the word and the next word and the connection probabilities between the word and each type of punctuation mark, and determining the connection target of the word based on the queried connection probabilities; and
connecting each word in the word sequence with its corresponding connection target to generate a punctuated article, and outputting the article.
2. The method according to claim 1, wherein the word connection probability table is obtained through the following steps:
obtaining a training sample set, wherein each training sample includes a sentence containing punctuation;
training an LSTM model using the sentences of the training samples in the training sample set as input to obtain the word connection probability model; and
generating the word connection probability table from the probabilities between words and the probabilities between each word and each punctuation mark obtained in the intermediate process of training the word connection probability model.
3. The method according to claim 2, wherein obtaining the training sample set comprises:
obtaining a sample article and splitting the sample article at the granularity of a big sentence to obtain a sample sentence set, wherein a big sentence is a sentence ending with a period, question mark, or exclamation mark; and
for each sample sentence in the sample sentence set, performing word cutting on the sample sentence and generating word vectors as a training sample.
4. The method according to claim 1, wherein the method further comprises:
dividing the article into at least one paragraph.
5. The method according to any one of claims 1-4, wherein the method further comprises:
determining the theme and entities of the article;
obtaining images that match the theme and entities of the article; and
generating image-text information from the images and the article.
6. The method according to claim 5, wherein the method further comprises:
performing typesetting optimization on the image-text information.
7. A device for outputting information, comprising:
an acquiring unit configured to obtain audio information to be converted;
a converting unit configured to convert the audio information into text information;
a word cutting unit configured to perform word cutting on the text information to obtain a word sequence;
a judging unit configured to, for each word in the word sequence, query a word connection probability table obtained from a pre-trained word connection probability model for the connection probability between the word and the next word and the connection probabilities between the word and each type of punctuation mark, and determine the connection target of the word based on the queried connection probabilities; and
a connection unit configured to connect each word in the word sequence with its corresponding connection target to generate a punctuated article and output the article.
8. The device according to claim 7, wherein the device further comprises a training unit configured to:
obtain a training sample set, wherein each training sample includes a sentence containing punctuation;
train an LSTM model using the sentences of the training samples in the training sample set as input to obtain the word connection probability model; and
generate the word connection probability table from the probabilities between words and the probabilities between each word and each punctuation mark obtained in the intermediate process of training the word connection probability model.
9. The device according to claim 8, wherein the training unit is further configured to:
obtain a sample article and split the sample article at the granularity of a big sentence to obtain a sample sentence set, wherein a big sentence is a sentence ending with a period, question mark, or exclamation mark; and
for each sample sentence in the sample sentence set, perform word cutting on the sample sentence and generate word vectors as a training sample.
10. The device according to claim 7, wherein the device further comprises a segmenting unit configured to:
divide the article into at least one paragraph.
11. The device according to any one of claims 7-10, wherein the device further comprises an image matching unit configured to:
determine the theme and entities of the article;
obtain images that match the theme and entities of the article; and
generate image-text information from the images and the article.
12. The device according to claim 11, wherein the device further comprises a typesetting unit configured to:
perform typesetting optimization on the image-text information.
13. An electronic device, comprising:
one or more processors; and
a storage device on which one or more programs are stored,
wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the method according to any one of claims 1-6.
14. A computer-readable medium on which a computer program is stored, wherein the program, when executed by a processor, implements the method according to any one of claims 1-6.
CN201910552619.6A 2019-06-25 2019-06-25 Method and device for outputting information Active CN110245334B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910552619.6A CN110245334B (en) 2019-06-25 2019-06-25 Method and device for outputting information

Publications (2)

Publication Number Publication Date
CN110245334A true CN110245334A (en) 2019-09-17
CN110245334B CN110245334B (en) 2023-06-16

Family

ID=67889231

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910552619.6A Active CN110245334B (en) 2019-06-25 2019-06-25 Method and device for outputting information

Country Status (1)

Country Link
CN (1) CN110245334B (en)

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070203887A1 (en) * 2004-06-30 2007-08-30 Misha Dynin Methods and systems for endorsing search results
US20130132401A1 (en) * 2011-11-17 2013-05-23 Yahoo! Inc. Related news articles
CN106951413A (en) * 2017-03-24 2017-07-14 北京百度网讯科技有限公司 Segmenting method and device based on artificial intelligence
CN107221330A (en) * 2017-05-26 2017-09-29 北京搜狗科技发展有限公司 Punctuate adding method and device, the device added for punctuate
CN107679039A (en) * 2017-10-17 2018-02-09 北京百度网讯科技有限公司 The method and apparatus being intended to for determining sentence
CN107767870A (en) * 2017-09-29 2018-03-06 百度在线网络技术(北京)有限公司 Adding method, device and the computer equipment of punctuation mark
CN107944032A (en) * 2017-12-13 2018-04-20 北京百度网讯科技有限公司 Method and apparatus for generating information
CN108920611A (en) * 2018-06-28 2018-11-30 北京百度网讯科技有限公司 article generation method, device, equipment and storage medium
CN108932220A (en) * 2018-06-29 2018-12-04 北京百度网讯科技有限公司 article generation method and device
CN108959556A (en) * 2018-06-29 2018-12-07 北京百度网讯科技有限公司 Entity answering method, device and terminal neural network based
CN109213997A (en) * 2018-08-16 2019-01-15 昆明理工大学 A kind of Chinese word cutting method based on two-way length memory network model in short-term
CN109241330A (en) * 2018-08-20 2019-01-18 北京百度网讯科技有限公司 The method, apparatus, equipment and medium of key phrase in audio for identification
CN109657041A (en) * 2018-12-04 2019-04-19 南京理工大学 The problem of based on deep learning automatic generation method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
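Hou Jiaying (侯加英): "Research on Chinese-Vietnamese Bilingual News Topic Discovery" (汉越双语新闻话题发现研究), China Excellent Master's Theses Full-text Database, Information Science and Technology Series, pages 138 - 1937 *
Jiang Yuehua (江跃华) et al.: "An Abstractive Summarization Model Incorporating Lexical Features" (融合词汇特征的生成式摘要模型), Journal of Hebei University of Science and Technology, pages 152 - 158 *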

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111078831A (en) * 2019-11-06 2020-04-28 广州荔支网络技术有限公司 Optimization method for converting audio content into text in text reading
CN111078831B (en) * 2019-11-06 2023-05-30 广州荔支网络技术有限公司 Optimizing method for converting text from text reading audio content
WO2022143058A1 (en) * 2021-01-04 2022-07-07 北京有竹居网络技术有限公司 Voice recognition method and apparatus, storage medium, and electronic device
CN113297824A (en) * 2021-05-11 2021-08-24 北京字跳网络技术有限公司 Text display method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN110245334B (en) 2023-06-16

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant