CN113297387B

CN113297387B - News detection method for image-text mismatching based on NKD-GNN

Info

Publication number: CN113297387B
Application number: CN202110424490.8A
Authority: CN
Inventors: 云静; 高硕�; 赵禹萌; 许志伟; 刘利民
Original assignee: Inner Mongolia University of Technology
Current assignee: Inner Mongolia University of Technology
Priority date: 2021-04-20
Filing date: 2021-04-20
Publication date: 2022-04-29
Anticipated expiration: 2041-04-20
Also published as: CN113297387A

Abstract

An NKD-GNN-based image-text mismatching news detection method. The method generates a news image description with placeholders for the news image (the picture accompanying a news item); constructs the named entities into a news knowledge graph according to the connection rule; uses a graph neural network driven by the news knowledge graph to select the named entities related to the news image and inserts them into the news image description, thereby generating a news image description with named entities; and calculates the matching between the news text and the news image description with named entities to judge whether the image and text of the news match. By comprehensively analyzing all associations between the named entities in the news knowledge graph, the method calculates the importance of each named entity and identifies the core named entities of the related news, which improves the image-text matching judgment for news.

Description

News detection method for image-text mismatching based on NKD-GNN

Technical Field

The invention belongs to the technical field of artificial intelligence, relates to false information detection, and in particular relates to an NKD-GNN-based image-text mismatching news detection method.

Background

With the rapid development of internet technology, browsing online news has become a main channel through which people follow current affairs. To attract readers and obtain very high click volumes, some unscrupulous media pair their news with pictures that are eye-catching but unrelated to the news content. If news with mismatched images and text is not handled in time, the public is easily misled about the facts, the online news ecosystem is damaged, and the credibility of the media is lost. In general, a news text contains the place named entities of an event and the person and organization named entities involved in it, while the news image (the picture accompanying the news) shows the key named entities of the news event visually and vividly. Whether the named entities in the news text are consistent with those in the news image greatly influences the result of image-text matching detection. Because the news text contains a large number of named entities, while image feature extraction algorithms cannot extract named entities directly from the news image, a large semantic gap exists between the news text and the news image. Existing image-text matching detection methods therefore cannot be used directly to judge whether a news text matches its news image, and a news image description with named entities needs to be generated.

Disclosure of Invention

In order to overcome the defects of the prior art, the invention aims to provide an NKD-GNN-based image-text mismatching news detection method.

In order to achieve the purpose, the invention adopts the technical scheme that:

an NKD-GNN-based image-text mismatching news detection method comprises the following steps:

step 1, generating a news image description with placeholders for the news image;

step 2, constructing the named entities into a news knowledge graph according to the connection rule;

step 3, selecting the named entities related to the news image by means of a graph neural network driven by the news knowledge graph, and inserting them into the news image description, so as to generate a news image description with named entities;

step 4, calculating the matching between the news text and the news image description with named entities, and judging whether the image and text of a news item match.

Compared with the prior art, the invention calculates the importance of the named entities in the news knowledge graph on the basis of a comprehensive analysis of all associations among them, and identifies the core named entities of the related news, so that the image-text matching judgment for news is improved.

Drawings

FIG. 1 is a schematic flow diagram of the present invention.

FIG. 2 shows two news images, each accompanied by three related articles, together with the detection process and its conclusion according to an embodiment of the present invention, wherein (a) is news whose image and text match, and (b) is news whose image and text do not match.

Detailed Description

The embodiments of the present invention will be described in detail below with reference to the drawings and examples.

As shown in FIG. 1, the invention relates to an NKD-GNN-based image-text mismatching news detection method, comprising the following steps:

Step 1: generate a news image description with placeholders for the news image.

A news text contains a large number of named entities, but existing image description generation methods cannot directly generate an image description with named entities, so a semantic gap exists between the news text and the news image, and it is difficult to calculate the matching between the two directly. The method therefore generates a news image description with placeholders for the news image, which brings the news text and the news image into the same modality; in the subsequent steps, the named entities related to the news image are selected and inserted into this description.

The news image description with placeholders is generated as follows:

step 1.1: generate a news image description using an open-source pre-trained image captioning model; the model follows the Encoder-Decoder design, using a CNN to extract image features in the encoding stage and an RNN to generate the news image description in the decoding stage;

step 1.2: process the generated news image description with the WordNet tool: replace the words that lie in the same semantic tree as 'Person' with a <Person> placeholder, replace the words that lie in the same semantic tree as 'Place' with a <Place> placeholder, replace 'a group of sight' with an <Organization> placeholder, and replace building words with a <Building> placeholder; this yields a news image description with four categories of placeholders: <Person>, <Place>, <Organization> and <Building>.
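The placeholder substitution of step 1.2 can be sketched as follows. This is a minimal illustration only: the small `PLACEHOLDER_LEXICON` dictionary is a hypothetical stand-in for the WordNet semantic-tree lookup, and the function name and example caption are likewise assumptions, not part of the method as described.

```python
import re

# Hypothetical stand-in for the WordNet lookup: each word maps to the
# placeholder of the semantic tree it is assumed to belong to.
PLACEHOLDER_LEXICON = {
    "man": "<Person>", "woman": "<Person>", "player": "<Person>",
    "street": "<Place>", "city": "<Place>", "field": "<Place>",
    "tower": "<Building>", "church": "<Building>",
}

def add_placeholders(caption, lexicon=PLACEHOLDER_LEXICON):
    """Replace every word found in the lexicon by its placeholder."""
    def swap(match):
        word = match.group(0)
        # Words that are not in the lexicon are kept unchanged.
        return lexicon.get(word.lower(), word)
    return re.sub(r"[A-Za-z]+", swap, caption)

print(add_placeholders("A man is playing soccer on a field"))
```

In a fuller implementation the dictionary lookup would be replaced by a check of whether the word shares a semantic tree with 'Person', 'Place' and so on.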

Step 2: construct the news knowledge graph.

When named entities are selected for insertion into the news image description with placeholders, the statistical associations between the named entities in the news articles need to be analyzed so as to accurately reflect the associations between named entities in the news scene. The invention therefore constructs the named entities into a news knowledge graph according to the connection rule, laying the foundation for the subsequent analysis of associations between entities.

The news knowledge graph is constructed as follows:

step 2.1: extract the named entities from the articles related to the news using spaCy's named entity recognizer, and retain four kinds of named entities: Person, Organization, Location and Building;

step 2.2: the retained named entities form the entity set V = {v_1, v_2, ..., v_m}; named entities appearing in the same sentence are connected by edges, and all edges form the edge set E = {e_1, e_2, ..., e_m}. The weight H_e of an edge e ∈ E, i.e., the co-occurrence degree of the two named entities v_h and v_t connected by e, is calculated by formula (1) from the number of times v_h and v_t co-occur and the number of times v_h and v_t each occur individually. The graph G = (V, E) composed of all named entities and all edges is the news knowledge graph.
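The graph construction of step 2.2 can be sketched in a few lines. The text only states that the edge weight depends on the co-occurrence count and the two individual occurrence counts; the exact formula is not reproduced here, so the ratio used below (co-occurrences divided by the number of sentences containing either entity) is an assumption chosen for illustration, as are the function name and the sample entities.

```python
from collections import Counter
from itertools import combinations

def build_news_graph(sentences):
    """sentences: list of lists of named entities, one list per sentence.

    Returns (nodes, weights), where weights maps a sorted entity pair to
    its edge weight. ASSUMPTION: weight = co-occurrences divided by the
    number of sentences that contain at least one of the two entities.
    """
    occurrences = Counter()     # sentences in which each entity appears
    co_occurrences = Counter()  # sentences in which a pair appears together
    for entities in sentences:
        unique = sorted(set(entities))
        occurrences.update(unique)
        # Entities in the same sentence are connected by an edge.
        co_occurrences.update(combinations(unique, 2))
    weights = {
        (h, t): c / (occurrences[h] + occurrences[t] - c)
        for (h, t), c in co_occurrences.items()
    }
    return sorted(occurrences), weights

nodes, weights = build_news_graph([
    ["Timo Werner", "Cologne"],
    ["Timo Werner", "Cologne", "Org X"],   # "Org X" is a made-up entity
    ["Org X"],
])
print(nodes)
print(weights)
```

Any other weighting built from the same three counts can be swapped in by changing the single expression inside the dictionary comprehension.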

Step 3: generate the news image description with named entities.

The news knowledge graph constructed in step 2 contains all the named entities in the news articles, some of which are unrelated to the news image. To fully analyze the associations between entities in the news knowledge graph and to eliminate the noise in it, the named entities relevant to the news image must be selected. The invention proposes a News Knowledge Driven Graph Neural Network (NKD-GNN), which, on the basis of a complete analysis of the associations between the named entities in the news knowledge graph, selects the named entities related to the news image and inserts them into the news image description, thereby generating a news image description with named entities.

The news image description with named entities is generated as follows:

step 3.1: use the graph neural network to aggregate the information of all edges and all nodes in the news knowledge graph, so as to obtain each node vector v.

Specifically, formulas (2) to (6) describe how a node v_i of the news knowledge graph is input to the graph neural network and how the network updates the entity vector v_i. The quantities involved are: the input corresponding to the i-th node of the news knowledge graph at time t, together with its weight matrix; the set of node vectors at time t-1; the adjacency matrix of the news knowledge graph and the block of that matrix corresponding to the i-th entity; the reset gate r_i^t and the update gate z_i^t; the sigmoid function σ(·); the element-wise (point-by-point) product operator; the vector v_i^{t-1} of entity v_i at time t-1; the weight matrices W_z, W_r and W_o, applied at time t to z_i^t, r_i^t and the activation function input, respectively; the weight matrices U_z, U_r and U_o, applied at time t-1 to z_i^t, r_i^t and the activation function input, respectively; the candidate vector of entity v_i; and the s-th reset gate. n is a sequence number.

Formula (2) describes how node v_i in the news knowledge graph aggregates the information of its neighboring nodes to obtain its input at time t. Formulas (3) and (4) determine, respectively, which neighbor information is retained and which is discarded. Formula (5) uses the vector of node v_i at time t-1 and the input of node v_i at time t to obtain the candidate vector of node v_i. Formula (6) uses the entity vector at time t-1 and the node candidate vector to calculate the vector representation of node v_i at time t. The update is repeated until the information of all nodes has been learned, which gives the final vector representation of each node.
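The symbol definitions above match the gated update commonly used in gated graph neural networks. As a hedged reconstruction only, assuming that standard form, formulas (2) to (6) could read as below. The names a_i^t (aggregated input of node v_i), H (its weight matrix), b (a bias term), A_{i:} (the block of the adjacency matrix A for the i-th entity) and the tanh activation are assumptions that do not appear in the text; z_i^t, r_i^t, W_z, W_r, W_o, U_z, U_r, U_o, σ and the element-wise product follow the definitions given.

```latex
\begin{aligned}
a_i^{t} &= A_{i:}\,\bigl[v_1^{t-1},\dots,v_n^{t-1}\bigr]^{\top} H + b && (2)\\
z_i^{t} &= \sigma\bigl(W_z a_i^{t} + U_z v_i^{t-1}\bigr) && (3)\\
r_i^{t} &= \sigma\bigl(W_r a_i^{t} + U_r v_i^{t-1}\bigr) && (4)\\
\tilde{v}_i^{t} &= \tanh\bigl(W_o a_i^{t} + U_o\,(r_i^{t} \odot v_i^{t-1})\bigr) && (5)\\
v_i^{t} &= (1 - z_i^{t}) \odot v_i^{t-1} + z_i^{t} \odot \tilde{v}_i^{t} && (6)
\end{aligned}
```

Under this reading, the update gate z_i^t controls how much of the previous vector is retained, and the reset gate r_i^t controls how much of it is discarded when the candidate vector is formed.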

Step 3.2: the nodes with the most connections in the news knowledge graph are the core entities of the related texts and reflect the key information of the news associated with the news image. The invention sets the named entity with the most edges in the news knowledge graph as the important node v_b; when several named entities tie for the most edges, the one that occurs most frequently is taken as the important node v_b. The global vector N_g of the news knowledge graph and the important node vector N_b are combined to obtain the representation vector N_r of the news knowledge graph.

Specifically, because each node influences the global vector of the news knowledge graph differently, the nodes have different priorities. First, each node vector is weighted by an attention mechanism and the weighted vectors are summed to obtain the global knowledge vector N_g of the news knowledge graph; second, the vector of the important node v_b is defined as N_b; finally, the global knowledge vector N_g and the important entity vector N_b are linearly concatenated to obtain the representation vector N_r of the news knowledge graph. The attention coefficient is computed as follows:

α_i = q^T σ(W_1 v_b + W_2 v_i)    (7)

where α_i is the coefficient of node v_i in the news knowledge graph; the parameter q is transposed so that the product of the two matrices yields a single numerical value; W_1 and W_2 are weight matrices of the node vectors in the news knowledge graph; a further matrix maps the concatenation of N_g and N_b into the target vector space; v_b is the important entity vector; and n is a sequence number.
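Step 3.2 can be illustrated with a short sketch: picking the important node by edge count with a frequency tie-break, computing the attention coefficient of formula (7), and summing the weighted node vectors into N_g. The function names, the plain-list vector representation and the toy inputs are assumptions for illustration; the final linear mapping of the concatenated [N_g; N_b] is omitted.

```python
import math

def pick_important_node(edges, frequency):
    """Return the entity with the most edges; ties go to the most frequent."""
    degree = {}
    for head, tail in edges:
        degree[head] = degree.get(head, 0) + 1
        degree[tail] = degree.get(tail, 0) + 1
    return max(degree, key=lambda e: (degree[e], frequency.get(e, 0)))

def matvec(W, v):
    """Multiply matrix W (list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]

def attention_coefficient(q, W1, W2, v_b, v_i):
    """alpha_i = q^T sigma(W1 v_b + W2 v_i), as in formula (7)."""
    hidden = [1.0 / (1.0 + math.exp(-(a + b)))
              for a, b in zip(matvec(W1, v_b), matvec(W2, v_i))]
    return sum(qk * hk for qk, hk in zip(q, hidden))

def global_vector(q, W1, W2, v_b, node_vectors):
    """N_g: attention-weighted sum of all node vectors."""
    N_g = [0.0] * len(v_b)
    for v_i in node_vectors:
        alpha = attention_coefficient(q, W1, W2, v_b, v_i)
        for k, value in enumerate(v_i):
            N_g[k] += alpha * value
    return N_g

# Toy usage with made-up entities and frequencies.
edges = [("A", "B"), ("A", "C"), ("B", "C")]
print(pick_important_node(edges, {"A": 1, "B": 3, "C": 2}))
```

In a trained model q, W_1 and W_2 would be learned parameters; here they are passed in explicitly so the arithmetic can be followed by hand.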

step 3.3: node v_iVector v of_iWith the expression vector N of the news knowledge graph_rPerforming product operation to obtain the fraction of each node

And outputting the fraction by using a Soft max function to obtain the probability of the node, which is shown as the following formula:

wherein

Is the score of the node or nodes,

is a node v_iThe score of (a) is calculated,

inserting the named entity into the news matching description, namely the probability of the entity being inserted into a placeholder in the news matching description;

and then training an NKD-GNN model by using a cross entropy loss function and using a time-based back propagation algorithm to perform core entity prediction, wherein the cross entropy loss function is shown as the following formula:

wherein y is_iCoding the core entity one-hot marked in the news knowledge graph,

the probability of inserting a placeholder for the ith entity into the newswizzle description.
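The scoring and loss of step 3.3 can be sketched as follows. The score is taken as the inner product of N_r and each node vector, followed by a Softmax; the loss is written in the plain categorical form, minus the sum of y_i times the log of the predicted probability, which is an assumption because the exact loss formula is not reproduced in the text. All names and toy numbers are illustrative.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def node_probabilities(N_r, node_vectors):
    """Score each node by the inner product N_r . v_i, then apply softmax."""
    scores = [sum(a * b for a, b in zip(N_r, v)) for v in node_vectors]
    return softmax(scores)

def cross_entropy(y, y_hat, eps=1e-12):
    """ASSUMED categorical form: -sum_i y_i * log(y_hat_i)."""
    return -sum(t * math.log(p + eps) for t, p in zip(y, y_hat))

# Toy usage: two nodes with identical scores get probability 0.5 each.
probs = node_probabilities([1.0, 1.0], [[1.0, 0.0], [0.0, 1.0]])
print(probs, cross_entropy([1, 0], probs))
```

The loss is smallest when the probability mass concentrates on the labelled core entity, which is what training with backpropagation would drive the model towards.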

Step 3.4: insert the core entities predicted by NKD-GNN into the news image description with placeholders.

Specifically, for each type of named entity, the entity with the highest probability is inserted into the corresponding placeholder according to its entity type, giving the news image description with named entities. When a placeholder in the description has no corresponding named entity to insert, it is replaced by the type named in the placeholder; for example, the word "PERSON" replaces the slot <PERSON>.
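The insertion rule of step 3.4 can be sketched as follows: keep the most probable entity per type, fill the placeholder of the same type, and fall back to the type word when no entity is available. The function name, the tuple layout of `candidates` and the probabilities in the usage line are assumptions made for illustration.

```python
import re

def fill_placeholders(description, candidates):
    """candidates: list of (entity, entity_type, probability) tuples."""
    best = {}
    for entity, entity_type, prob in candidates:
        key = entity_type.lower()
        # Keep only the highest-probability entity of each type.
        if key not in best or prob > best[key][1]:
            best[key] = (entity, prob)

    def swap(match):
        entity_type = match.group(1)
        if entity_type.lower() in best:
            return best[entity_type.lower()][0]
        # No entity of this type: use the type word itself, e.g. PERSON.
        return entity_type.upper()

    # Accept both "<Person>" and "< Person >" spellings of a placeholder.
    return re.sub(r"<\s*(\w+)\s*>", swap, description)

print(fill_placeholders(
    "<Person> is playing soccer in <Place> near <Building>",
    [("Timo Werner", "Person", 0.6), ("Other Person", "Person", 0.2),
     ("Cologne", "Place", 0.7)],
))
```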

Step 4: calculate the matching between the news text and the news image description with named entities.

By calculating the matching between the news text and the news image description with named entities, the method can finally judge whether the image and text of the news match, having bridged the semantic gap between the news text and the news image. Differences in sentence pattern and structure still remain between the news image description with named entities and the news text, so calculating their matching requires analyzing both the similarity of their sentence structures and the similarity of their keywords. The invention uses the Hybrid Co-Attention Network (HCAN) method to calculate this matching, analyzing sentence-structure similarity and keyword similarity together. The news text is first divided into single sentences; if any one sentence matches the news image description with named entities, the news is considered to be news whose image and text match.
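The sentence-level decision rule can be sketched as below: split the news text into single sentences and declare a match as soon as one sentence matches the description. The `toy_match` function is a hypothetical word-overlap stand-in for the HCAN matcher, the sentence splitter is deliberately naive, and the sample texts are made up.

```python
import re

def split_sentences(text):
    """Naive split after '.', '!' or '?' (an assumption; any splitter works)."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def image_text_matched(news_text, description, sentences_match):
    """True if at least one single sentence matches the description."""
    return any(sentences_match(sentence, description)
               for sentence in split_sentences(news_text))

def toy_match(sentence, description, threshold=0.5):
    """Hypothetical stand-in for HCAN: share of description words found."""
    words = set(description.lower().split())
    found = words & set(re.findall(r"\w+", sentence.lower()))
    return len(found) / len(words) >= threshold

text = "The match was held on Saturday. Timo Werner played in Cologne."
print(image_text_matched(text, "Timo Werner is playing in Cologne", toy_match))
```

Passing the matcher in as a parameter keeps the decision rule independent of how the sentence-pair score is produced.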

The matching is calculated as follows:

step 4.1: generate the word vectors of the two sentences to be compared using the Word2vec tool; each sentence consists of several word vectors, and the two sentences to be compared are represented as U_q and U_c;

step 4.2: multiply the word vectors of the two sentences with each other to obtain the similarity matrix S ∈ R^{n×m}, where m is a sequence number between 0 and n;

step 4.3: normalize the matrix S ∈ R^{n×m} for each word vector, that is, apply mean-pooling and max-pooling to the scores of each word, and output the result for U_q and U_c; here Max(S) is the set of all max-pooling results, each element being the max-pooling operation for one word, and Mean(S) is the set of all average-pooling results, each element being the average-pooling operation for one word;

step 4.4: calculate the TF-IDF weight wgt(q) of each word in U_q and U_c, so that the importance of each word in the two sentences is fully considered in the normalization, obtain the relevance matching output O_RM of U_q and U_c, and use Softmax to classify whether U_q and U_c match. If the two sentences match, the image and text of the news are judged to match; if the news image description with named entities matches none of the single sentences of the news text, the image and text of the news are considered not to match. The classification with the Softmax function is:

o = softmax(O_RM)    (16)
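Steps 4.2 to 4.4 can be sketched as follows: build the similarity matrix from word-vector inner products, pool each row with max and mean, and weight the pooled values by a per-word TF-IDF weight. How O_RM is assembled from these parts is not reproduced in the text, so concatenating the two weighted pooling vectors is an assumption, as are the function names and the toy vectors and weights; the learned Softmax classifier of formula (16) is not included.

```python
def similarity_matrix(U_q, U_c):
    """S[i][j] = inner product of word vector i of U_q and word vector j of U_c."""
    return [[sum(a * b for a, b in zip(q, c)) for c in U_c] for q in U_q]

def relevance_features(U_q, U_c, wgt_q):
    """ASSUMED form of O_RM: TF-IDF-weighted max- and mean-pooling of S."""
    S = similarity_matrix(U_q, U_c)
    max_s = [max(row) for row in S]               # Max(S): one value per word
    mean_s = [sum(row) / len(row) for row in S]   # Mean(S): one value per word
    weighted_max = [w * x for w, x in zip(wgt_q, max_s)]
    weighted_mean = [w * x for w, x in zip(wgt_q, mean_s)]
    return weighted_max + weighted_mean

# Toy usage: two words in U_q, three in U_c, made-up TF-IDF weights.
U_q = [[1.0, 0.0], [0.0, 1.0]]
U_c = [[1.0, 0.0], [1.0, 1.0], [0.0, 0.0]]
features = relevance_features(U_q, U_c, wgt_q=[2.0, 0.5])
print(features)
```

In the full method these features would be fed to a trained classification layer whose Softmax output decides whether the two sentences match.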

In summary, the input of the detection method is the news image, the news text and the articles related to the news image, and the output is whether the news text and the news image match. The overall implementation of the method is illustrated below by a specific example.

The embodiment is built on a cloud computing platform consisting of 15 servers, with VMware ESXi 5, a 20T disk array and a 1000M network switch, on which a Hadoop cluster is deployed. Two news images are provided, as shown in FIG. 2, each with three related articles; (a) is news whose image and text match, and (b) is news whose image and text do not match. News (a) reports on a sporting event of the European Champions League, and its image shows Timo Werner playing football; the related articles contain 8 named entities of the person, place and organization classes. These entities are constructed into a news knowledge graph, in which the core named entity is Timo Werner, and are scored by NKD-GNN. The highest-scoring Person entity is Timo Werner and the highest-scoring Place entity is Cologne. The two entities are therefore inserted into the news image description with placeholders, giving the description <Timo Werner is playing the socker in Cologne>. News (b) discusses changes in consumer behavior during an economic downturn; its related articles contain 10 named entities of the person, place and building classes. The named entities of each of the two news cases are constructed into a news knowledge graph, and each entity is then scored. For (b), the highest-scoring Organization entity is Federal agent and the highest-scoring Place entity is Washington, giving the description <Federal agent binding in the Washington> for image (b).

For (a), the entities described in the news text are consistent with the entities in the news image description with named entities, so the detection result is "matched", which is correct. For (b), the news text describes Norton Western University professor Pittr Dworkzak discussing changes in consumer behavior, whereas the news image shows federal police maintaining social order. The news image description with named entities generated from the related articles, <local agent marking in the Washington after the fire>, is completely unrelated to the news text; the calculation judges the image and text of the news to be mismatched, which is correct.

This example demonstrates the news knowledge-driven image-text mismatching news detection method: by generating a news image description with named entities, the semantic gap between the news text and the news image is reduced, so that the matching between the news text and the news image can be calculated accurately.

Claims

1. An NKD-GNN-based image-text mismatching news detection method is characterized by comprising the following steps:

step 3, selecting the named entities related to the news image by means of a graph neural network driven by the news knowledge graph, and inserting them into the news image description, thereby generating a news image description with named entities, comprising:

step 3.1: aggregating the information of all edges and all nodes in the news knowledge graph using the graph neural network, so as to obtain each node vector v;

step 3.2: setting the named entity with the most edges in the news knowledge graph as the important node v_b, and, when several named entities tie for the most edges, taking the one that occurs most frequently as the important node v_b; combining the global vector N_g of the news knowledge graph with the important node vector N_b to obtain the representation vector N_r of the news knowledge graph;

wherein, first, each node vector is weighted by an attention mechanism and the weighted vectors are summed to obtain the global knowledge vector N_g of the news knowledge graph; second, the vector of the important node v_b is defined as N_b; finally, the global knowledge vector N_g and the important entity vector N_b are linearly concatenated to obtain the representation vector N_r of the news knowledge graph, the attention coefficient being computed as follows:

α_i = q^T σ(W_1 v_b + W_2 v_i)

wherein α_i is the coefficient of node v_i in the news knowledge graph; the parameter q is transposed so that the product of the two matrices yields a single numerical value; W_1 and W_2 are weight matrices of the node vectors in the news knowledge graph; a further matrix maps the concatenation of N_g and N_b into the target vector space; v_b is the important entity vector; and n is a sequence number;

outputting the scores through a Softmax function to obtain the probability of each node; then training the NKD-GNN model to predict core entities using a cross-entropy loss function and the backpropagation through time algorithm;

step 3.4: inserting the core entities predicted by the NKD-GNN into the news image description with placeholders;

step 4, calculating the matching between the news text and the news image description with named entities using the Hybrid Co-Attention Network (HCAN) method, in which both the sentence-structure similarity and the keyword similarity of the two are analyzed; the news text is first divided into single sentences, and if any one sentence matches the news image description with named entities, the news is considered to be news whose image and text match.

2. The NKD-GNN-based image-text mismatching news detection method according to claim 1, wherein the news image description with placeholders in step 1 is generated as follows:

3. The NKD-GNN-based image-text mismatching news detection method according to claim 2, wherein the news knowledge graph in step 2 is constructed as follows:

wherein e ∈ E; H_e is the weight of edge e, i.e., the co-occurrence degree of the two entities; v_h and v_t are the two named entities connected by edge e; and the weight is computed from the number of times v_h and v_t co-occur and the number of times each of them occurs individually.

4. The NKD-GNN-based image-text mismatching news detection method according to claim 1, wherein in step 3.1 the process by which a node v_i of the news knowledge graph is input to the graph neural network and the graph neural network updates the entity vector v_i involves the following quantities: the set of node vectors at time t-1; the adjacency matrix of the news knowledge graph; the reset gate r_i^t and the update gate z_i^t; the sigmoid function σ(·); the element-wise (point-by-point) product operator; the vector v_i^{t-1} of entity v_i at time t-1; the weight matrices W_z, W_r and W_o, applied at time t to z_i^t, r_i^t and the activation function input, respectively; the weight matrices U_z, U_r and U_o, applied at time t-1 to z_i^t, r_i^t and the activation function input, respectively; the candidate vector of entity v_i; and the s-th reset gate; n is a sequence number.

5. The NKD-GNN-based image-text mismatching news detection method according to claim 1, wherein in step 3.3 the calculation uses the scores of all nodes, including the score of node v_i, and the model is trained with the cross-entropy loss function.

6. The NKD-GNN-based image-text mismatching news detection method according to claim 1, wherein in step 3.4, for each type of named entity, the entity with the highest probability is inserted into the corresponding placeholder according to its entity type to obtain the news image description with named entities; when a placeholder in the news image description with placeholders has no corresponding named entity to insert, the placeholder is replaced by the type named in the placeholder.

7. The NKD-GNN-based image-text mismatching news detection method according to claim 1, wherein the matching is calculated as follows:

S ∈ R^{n×m}, where m is a sequence number between 0 and n;

Max(S) is the set of all max-pooling results, each element being the max-pooling operation for one word; Mean(S) is the set of all average-pooling results, each element being the average-pooling operation for one word;

step 4.4: calculating the TF-IDF weight wgt(q) of each word in U_q and U_c, so that the importance of each word in the two sentences is fully considered in the normalization, obtaining the relevance matching output O_RM of U_q and U_c, and using Softmax to classify whether U_q and U_c match; if the two sentences match, the image and text of the news are judged to match; if the news image description with named entities matches none of the single sentences of the news text, the image and text of the news are considered not to match; wherein the classification with the Softmax function is:

o = softmax(O_RM).