Nothing Special   »   [go: up one dir, main page]

WO2014098561A1 - A semantic query system and method thereof - Google Patents

A semantic query system and method thereof Download PDF

Info

Publication number
WO2014098561A1
WO2014098561A1 PCT/MY2013/000254 MY2013000254W WO2014098561A1 WO 2014098561 A1 WO2014098561 A1 WO 2014098561A1 MY 2013000254 W MY2013000254 W MY 2013000254W WO 2014098561 A1 WO2014098561 A1 WO 2014098561A1
Authority
WO
WIPO (PCT)
Prior art keywords
query
question
candidate question
semantic
candidate
Prior art date
Application number
PCT/MY2013/000254
Other languages
French (fr)
Inventor
Arun Anand Sadanandan
Tan Sieow YEEK
Lim Ying SEAN
Dickson Lukose
Original Assignee
Mimos Berhad
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mimos Berhad filed Critical Mimos Berhad
Publication of WO2014098561A1 publication Critical patent/WO2014098561A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation
    • G06F16/3338Query expansion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2457Query processing with adaptation to user needs
    • G06F16/24575Query processing with adaptation to user needs using context

Definitions

  • the present invention relates to an information retrieval system and more particularly to a semantic query system.
  • Semantic query system is used to retrieve relevant information based on a query by understanding the contextual meaning of the query. Moreover, the semantic query system ranks relevancy of the retrieved information based on semantics. Thus, the semantic query system improves search accuracy compared to a typical search engine which retrieves information that matches the keywords in the query.
  • the semantic query system is evaluated based on precision and recall, wherein precision is defined as the number of relevant results retrieved by a search divided by the total number of results retrieved by that search, while recall is defined as the number of relevant results retrieved by a search divided by the total number of possible relevant results. High precision means that a semantic query system retrieves more relevant results than irrelevant results, while high recall means that a semantic query system retrieves most of the relevant results.
  • the semantic query system needs to balance out between precision and recall.
  • Most of the existing semantic query systems conduct a search by mapping a query into internal query structures such as SPARQL and thereon, retrieving the relevant results.
  • Such semantic query systems increase precision at the expense of recall.
  • such semantic query systems pose a mapping problem whereby the correct mapping of the query into the internal query may be difficult to find due to the complexity in natural language query.
  • the semantic query systems use template based approach.
  • such systems require a query to rigidly suit the structure of the templates in its database and thus, the systems may not be able to process any types of query unless it suits the templates.
  • the systems need to include more templates in its database. Therefore, there is a need to provide a semantic query system that addresses the above-mentioned problems.
  • the present invention provides a semantic query system (100).
  • the semantic query system (100) comprises a query manager (110) configured to control activities of the system (100); a context identifier (120) configured to identify at least one context related to a query, a question generator (130) configured to generate at least one candidate question based on the at least one context; a query mapper (140) configured to perform social network analysis on the at least one candidate question generated by the question generator (130) with respect to the query; a ranking component (150) configured to produce a list of ranked candidate question generated by the question generator (130); and a query processor (160) configured for retrieving relevant information based on the list of ranked candidate question generated by said question generator (130).
  • the context identifier (120), the question generator (130), the query mapper (140), the ranking component (150) and the query processor (160) are connected to said query manager (110).
  • the present invention also provides a method for processing a query. The method is characterised by the steps of receiving a query; identifying at least one context related to the query; generating at least one candidate question based on the at least one context; mapping the query to the at least one candidate question to produce a list of ranked candidate question; and retrieving and ranking relevant information based on the list of ranked candidate question.
  • mapping the query to the at least one candidate question includes the steps of performing a syntactic analysis between the query and the at least one candidate question; performing a social network analysis on the at least one candidate question with respect to the query; determining a semantic score, similarity score, and confidence score for each candidate question; normalizing the semantic score and similarity score for each candidate question; and producing a list of ranked candidate question by ranking the at least one candidate question based on the normalized semantic and similarity scores.
  • the social network analysis is preferably performed by constructing a social network based on co-occurrence of concepts or terms in the at least one candidate question; determining relevance value between the co-occurred terms for each term; and computing centrality value for each term.
  • FIG. 1 shows a block diagram of a semantic query system (100) according to an embodiment of the present invention.
  • FIG. 2 shows a flowchart of a method for processing a query according to an embodiment of the present invention.
  • FIG. 3 shows a flowchart of a method mapping a query to questions generated by a question generator (130) according to an embodiment of the present invention.
  • the system (100) retrieves closest and most relevant information or results based on a natural language query provided by a user.
  • the system (100) is connected to at least one user device (10) via a network.
  • the user device (10) may include but not limited to laptop, mobile phone, personal computer, handheld communication device, and handheld computing device.
  • the system (100) comprises of a query manager (110), a context identifier (120), a question generator (130), a query mapper (140), a ranking component (150) and a query processor (160).
  • the query manager (110) is configured to control the flow of activities for the overall system (100).
  • the query manager (110) is connected to the context identifier (120), the question generator (130), the query mapper (140), the ranking component (150) and the query processor (160).
  • the context identifier (120) is configured to identify one or more contexts related to the query by using a classification algorithm such as k-nearest neighbour classification algorithm, Naive-Bayes classification algorithm or any other machine learning algorithms suitable for classifying a query.
  • a classification algorithm such as k-nearest neighbour classification algorithm, Naive-Bayes classification algorithm or any other machine learning algorithms suitable for classifying a query.
  • the question generator (130) is configured to automatically generate one or more candidate questions based on the context identified by the context identifier (120).
  • the question generator (130) generates the candidate questions by identifying concepts and relationships of each identified context and incorporating the concepts with one or more question stems.
  • the question generator (130) is connected to a knowledge base server (20) to identify and analyse the concepts and relationships based on the identified context during the process of generating the questions.
  • the query mapper (140) is configured to perform social network analysis on the candidate questions generated by the question generator (130) with respect to the query.
  • the query mapper (140) performs the social network analysis by constructing a social network based on co-occurrence of concepts or terms in the candidate questions, and performing co-occurrence analysis, centrality analysis, and relevance analysis.
  • the query mapper (140) identifies multiple sub-graphs from the social network analysis, wherein each sub-graph shows how the query terms appear together and the closeness of query terms shows the similarity between the query terms.
  • the similar query terms can be identified based on the sub-graphs.
  • the relevance value and importance value of each subgraph is determined, wherein the relevance value is computed using the relevance analysis and the importance value is computed using the centrality analysis.
  • the relevance analysis and centrality analysis are based on graph theory and network analysis to determine the relative importance of a node within each sub-graph.
  • the query mapper (140) is also configured to perform syntactic analysis between the query and the candidate questions.
  • the ranking component (150) is configured to rank the sub-graphs which lead to ranking the candidate question as the sub-graphs composed of concepts in the candidate questions. This is to identify the candidate questions that are semantically closest to the query. The ranking is based on normalized semantic and similarity scores.
  • the query processor (160) is configured to retrieve and rank relevant results based on the candidate questions from the ranking component (150). The relevant information is displayed as the query results.
  • the query processor (160) is also connected to a knowledge base server (30) to retrieve relevant information based on the candidate questions.
  • the query manager (110) of the semantic query system (100) receives a query from a user device (10).
  • the query manager (110) sends the query to the context identifier (120).
  • the context identifier (120) identifies all contexts related to the query by using a classification algorithm.
  • the question generator (130) automatically generates candidate questions based on the context identified by the context identifier (120) as in step 202.
  • step 203 the query is mapped to the candidate questions.
  • a list of ranked candidate questions is produced.
  • the list of ranked candidate questions is then transmitted to the query processor (160).
  • the query processor (160) searches and retrieves relevant results based on the list of ranked candidate questions as in step 204.
  • the results retrieved are answers to the candidate questions in the list.
  • step 205 the results retrieved is transmitted and displayed as the query results to the user device (10).
  • the query results are displayed according to the ranking of the candidate questions.
  • a syntactic analysis is performed between the query and the candidate questions generated by the question generator (130).
  • the syntactic analysis includes extracting the terms in the query (step 311), finding terms having similar meaning to the terms extracted from the query (step 312), performing morphological analysis on all the terms (step 313), and term matching or matching the terms in the query with the terms in the candidate questions (step 314).
  • the query mapper (140) performs a social network analysis on the candidate questions generated by the question generator (130) with respect to the query.
  • the social network analysis is initiated by constructing a social network based on co-occurrence of concepts or terms which indicates terms that appear together in the candidate questions (step 321). For instance, assuming that terml and term2 appear in a first candidate question while term3 and term4 appear in a second candidate question, the query mapper (140) constructs a social network of terml -term2-term3-term4 based on co-occurrence of the terms in the candidate questions.
  • the query mapper (140) determines the relevance value between the co-occurred terms (step 322) and thereon, computes the centrality value for each term (step 323).
  • each candidate question is scored based on the syntactic analysis and social network analysis to determine a semantic score, similarity score and confidence score for each candidate question (steps 331 to 333).
  • the semantic score is based on the relevance and centrality values for each candidate question.
  • the similarity score is based on the number of matched terms based on the syntactic analysis performed.
  • the confidence score is a relative measure of certainty of matched terms based on the syntactic analysis performed.
  • the semantic and similarity scores for each candidate question are normalized as in step 340.
  • step 350 the candidate questions are ranked according to the normalized scores and/or confidence scores by the ranking component (150).
  • the ranking component 150.
  • a user device (10) sends a query of "cancer and its effect on today's youth" to the semantic query system (100). Thereon, the query manager (110) sends the query to the context identifier (120).
  • the context identifier (120) identifies all contexts related to the query by using a classification algorithm.
  • the contexts that have been identified are cancer, disease, and malignant neoplasm.
  • the identified contexts are then sent to the question generator (130) for generating candidate questions based on the identified context.
  • the candidate questions generated are listed below:
  • the query is mapped to the candidate questions based on syntactic analysis and social network analysis.
  • the semantic and similarity scores for each candidate question are normalized to produce a list of ranked candidate questions.
  • the list of ranked candidate questions is then transmitted to the query processor (160) to search and retrieve relevant results based on the ranked candidate questions.
  • the query processor (160) retrieves the answer or result for Q6 as "smoking, hereditary traits, working with chemicals, etc. are common factors contributing to cancer"
  • the relevant results are transmitted and displayed as the query results to the user device (10). While embodiments of the invention have been illustrated and described, it is not intended that these embodiments illustrate and describe all possible forms of the invention. Rather, the words used in the specifications are words of description rather than limitation and various changes may be made without departing from the scope of the invention.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention relates to a semantic query system (100) and a method for processing a query. The semantic query system (100) comprises of a query manager (110), a context identifier (120), a question generator (130), a query mapper (140), a ranking component (150) and a query processor (160). When the semantic query system (100) receives a query, the context identifier (120) identifies all contexts related to the query. Thereon, the question generator (130) generates candidate questions based on the contexts. The query mapper (140) maps the query to the candidate questions by performing a syntactic analysis and a social network analysis on the candidate questions with respect to the query. The ranking component (150) produces a list of ranked candidate questions. Based on the list, the query processor (160) retrieves and ranks relevant information which is displayed as the query results.

Description

A SEMANTIC QUERY SYSTEM AND METHOD THEREOF
FIELD OF INVENTION
The present invention relates to an information retrieval system and more particularly to a semantic query system.
BACKGROUND OF THE INVENTION
Semantic query system is used to retrieve relevant information based on a query by understanding the contextual meaning of the query. Moreover, the semantic query system ranks relevancy of the retrieved information based on semantics. Thus, the semantic query system improves search accuracy compared to a typical search engine which retrieves information that matches the keywords in the query. In order to provide an effective semantic query system, the semantic query system is evaluated based on precision and recall, wherein precision is defined as the number of relevant results retrieved by a search divided by the total number of results retrieved by that search, while recall is defined as the number of relevant results retrieved by a search divided by the total number of possible relevant results. High precision means that a semantic query system retrieves more relevant results than irrelevant results, while high recall means that a semantic query system retrieves most of the relevant results. Thus, the semantic query system needs to balance out between precision and recall. Most of the existing semantic query systems conduct a search by mapping a query into internal query structures such as SPARQL and thereon, retrieving the relevant results. Such semantic query systems increase precision at the expense of recall. Moreover, such semantic query systems pose a mapping problem whereby the correct mapping of the query into the internal query may be difficult to find due to the complexity in natural language query. In order to resolve this, the semantic query systems use template based approach. However, such systems require a query to rigidly suit the structure of the templates in its database and thus, the systems may not be able to process any types of query unless it suits the templates. In order to expand the types of the query to be processed, the systems need to include more templates in its database. Therefore, there is a need to provide a semantic query system that addresses the above-mentioned problems. SUMMARY OF INVENTION
The present invention provides a semantic query system (100). The semantic query system (100) comprises a query manager (110) configured to control activities of the system (100); a context identifier (120) configured to identify at least one context related to a query, a question generator (130) configured to generate at least one candidate question based on the at least one context; a query mapper (140) configured to perform social network analysis on the at least one candidate question generated by the question generator (130) with respect to the query; a ranking component (150) configured to produce a list of ranked candidate question generated by the question generator (130); and a query processor (160) configured for retrieving relevant information based on the list of ranked candidate question generated by said question generator (130). The context identifier (120), the question generator (130), the query mapper (140), the ranking component (150) and the query processor (160) are connected to said query manager (110). The present invention also provides a method for processing a query. The method is characterised by the steps of receiving a query; identifying at least one context related to the query; generating at least one candidate question based on the at least one context; mapping the query to the at least one candidate question to produce a list of ranked candidate question; and retrieving and ranking relevant information based on the list of ranked candidate question.
Preferably, mapping the query to the at least one candidate question includes the steps of performing a syntactic analysis between the query and the at least one candidate question; performing a social network analysis on the at least one candidate question with respect to the query; determining a semantic score, similarity score, and confidence score for each candidate question; normalizing the semantic score and similarity score for each candidate question; and producing a list of ranked candidate question by ranking the at least one candidate question based on the normalized semantic and similarity scores. The social network analysis is preferably performed by constructing a social network based on co-occurrence of concepts or terms in the at least one candidate question; determining relevance value between the co-occurred terms for each term; and computing centrality value for each term.
BRIEF DESCRIPTION OF THE DRAWINGS
The accompanying drawings, which are incorporated in and constitute a part of the specification, illustrate embodiments of the invention and, together with the description, serve to explain the principles of the invention.
FIG. 1 shows a block diagram of a semantic query system (100) according to an embodiment of the present invention.
FIG. 2 shows a flowchart of a method for processing a query according to an embodiment of the present invention. FIG. 3 shows a flowchart of a method mapping a query to questions generated by a question generator (130) according to an embodiment of the present invention.
DESCRIPTION OF THE PREFFERED EMBODIMENT
A preferred embodiment of the present invention will be described herein below with reference to the accompanying drawings. In the following description, well known functions or constructions are not described in detail since they would obscure the description with unnecessary detail.
Referring to FIG. 1, there is shown a block diagram of a semantic query system (100) according to an embodiment of the present invention. The system (100) retrieves closest and most relevant information or results based on a natural language query provided by a user. The system (100) is connected to at least one user device (10) via a network. The user device (10) may include but not limited to laptop, mobile phone, personal computer, handheld communication device, and handheld computing device. The system (100) comprises of a query manager (110), a context identifier (120), a question generator (130), a query mapper (140), a ranking component (150) and a query processor (160).
The query manager (110) is configured to control the flow of activities for the overall system (100). The query manager (110) is connected to the context identifier (120), the question generator (130), the query mapper (140), the ranking component (150) and the query processor (160).
The context identifier (120) is configured to identify one or more contexts related to the query by using a classification algorithm such as k-nearest neighbour classification algorithm, Naive-Bayes classification algorithm or any other machine learning algorithms suitable for classifying a query.
The question generator (130) is configured to automatically generate one or more candidate questions based on the context identified by the context identifier (120). The question generator (130) generates the candidate questions by identifying concepts and relationships of each identified context and incorporating the concepts with one or more question stems. The question generator (130) is connected to a knowledge base server (20) to identify and analyse the concepts and relationships based on the identified context during the process of generating the questions.
The query mapper (140) is configured to perform social network analysis on the candidate questions generated by the question generator (130) with respect to the query. The query mapper (140) performs the social network analysis by constructing a social network based on co-occurrence of concepts or terms in the candidate questions, and performing co-occurrence analysis, centrality analysis, and relevance analysis. As a result, the query mapper (140) identifies multiple sub-graphs from the social network analysis, wherein each sub-graph shows how the query terms appear together and the closeness of query terms shows the similarity between the query terms. Thus, the similar query terms can be identified based on the sub-graphs. Moreover, the relevance value and importance value of each subgraph is determined, wherein the relevance value is computed using the relevance analysis and the importance value is computed using the centrality analysis. The relevance analysis and centrality analysis are based on graph theory and network analysis to determine the relative importance of a node within each sub-graph. The query mapper (140) is also configured to perform syntactic analysis between the query and the candidate questions.
The ranking component (150) is configured to rank the sub-graphs which lead to ranking the candidate question as the sub-graphs composed of concepts in the candidate questions. This is to identify the candidate questions that are semantically closest to the query. The ranking is based on normalized semantic and similarity scores. The query processor (160) is configured to retrieve and rank relevant results based on the candidate questions from the ranking component (150). The relevant information is displayed as the query results. The query processor (160) is also connected to a knowledge base server (30) to retrieve relevant information based on the candidate questions.
Referring to FIG. 2, there is shown a method for processing a query by using the semantic query system (100) of FIG. 1. Initially, as in step 201, the query manager (110) of the semantic query system (100) receives a query from a user device (10). The query manager (110) sends the query to the context identifier (120). Based on the query, the context identifier (120) identifies all contexts related to the query by using a classification algorithm.
Thereon, the identified contexts are sent to the question generator (130). The question generator (130) automatically generates candidate questions based on the context identified by the context identifier (120) as in step 202.
In step 203, the query is mapped to the candidate questions. As a result, a list of ranked candidate questions is produced. The list of ranked candidate questions is then transmitted to the query processor (160). The query processor (160) searches and retrieves relevant results based on the list of ranked candidate questions as in step 204. The results retrieved are answers to the candidate questions in the list. In step 205, the results retrieved is transmitted and displayed as the query results to the user device (10). The query results are displayed according to the ranking of the candidate questions.
Referring now to FIG. 3, there is shown a flowchart of a method mapping a query to candidate questions generated by the question generator (130). Initially, as in step 310, a syntactic analysis is performed between the query and the candidate questions generated by the question generator (130). The syntactic analysis includes extracting the terms in the query (step 311), finding terms having similar meaning to the terms extracted from the query (step 312), performing morphological analysis on all the terms (step 313), and term matching or matching the terms in the query with the terms in the candidate questions (step 314).
In step 320, the query mapper (140) performs a social network analysis on the candidate questions generated by the question generator (130) with respect to the query. The social network analysis is initiated by constructing a social network based on co-occurrence of concepts or terms which indicates terms that appear together in the candidate questions (step 321). For instance, assuming that terml and term2 appear in a first candidate question while term3 and term4 appear in a second candidate question, the query mapper (140) constructs a social network of terml -term2-term3-term4 based on co-occurrence of the terms in the candidate questions. Based on the social network, the query mapper (140) determines the relevance value between the co-occurred terms (step 322) and thereon, computes the centrality value for each term (step 323). In step 330, each candidate question is scored based on the syntactic analysis and social network analysis to determine a semantic score, similarity score and confidence score for each candidate question (steps 331 to 333). The semantic score is based on the relevance and centrality values for each candidate question. The similarity score is based on the number of matched terms based on the syntactic analysis performed. The confidence score is a relative measure of certainty of matched terms based on the syntactic analysis performed.
Thereon, the semantic and similarity scores for each candidate question are normalized as in step 340.
In step 350, the candidate questions are ranked according to the normalized scores and/or confidence scores by the ranking component (150). Thus, this produces a list of the ranked candidate questions. For a further understanding of this invention, a specific example is provided herein below for illustration purpose only and is not intended to be limiting unless otherwise specified. A user device (10) sends a query of "cancer and its effect on today's youth" to the semantic query system (100). Thereon, the query manager (110) sends the query to the context identifier (120).
Based on the query, the context identifier (120) identifies all contexts related to the query by using a classification algorithm. The contexts that have been identified are cancer, disease, and malignant neoplasm.
The identified contexts are then sent to the question generator (130) for generating candidate questions based on the identified context. The candidate questions generated are listed below:
Q1: What are the symptoms of cancer?
Q2: Is cancer treatable?
Q3: Are youngsters taking to smoking more than before?
Q4: Studies on young people falling victims to cancer
Q5: Are youngsters more prone to diseases?
Q6: What are the factors that cause cancer to young people?
Q7: What are the dangerous hereditary diseases in today's world?
Thereon, the query is mapped to the candidate questions based on syntactic analysis and social network analysis. Based on the syntactic analysis, the similarity score for each candidate question is obtained as follows: Q6 = 0.8, Q4 = 0.8, Q2 = 0.73, Q1 = 0.75, Q3 = 0.4, Q5 = 0.35, and Q7 = 0.2. On the other hand, the semantic score for each candidate question based on the social network analysis is obtained as follows: Q6 = 78.54, Q7 = 70.23, Q4 = 65.23, Q1 = 55.26 Q2 = 52.31, Q3 = 45.11, and Q5 = 40.37. The semantic and similarity scores for each candidate question are normalized to produce a list of ranked candidate questions.
The list of ranked candidate questions is then transmitted to the query processor (160) to search and retrieve relevant results based on the ranked candidate questions. For instance, the query processor (160) retrieves the answer or result for Q6 as "smoking, hereditary traits, working with chemicals, etc. are common factors contributing to cancer" The relevant results are transmitted and displayed as the query results to the user device (10). While embodiments of the invention have been illustrated and described, it is not intended that these embodiments illustrate and describe all possible forms of the invention. Rather, the words used in the specifications are words of description rather than limitation and various changes may be made without departing from the scope of the invention.

Claims

1. A semantic query system (100) comprising:
a query manager (110) configured to control activities of the system
(100),
a context identifier (120) configured to identify at least one context related to a query, and
a query processor (160) configured for retrieving relevant information; wherein said semantic query system (100) is characterised in that:
said semantic query system (100) further includes:
a question generator (130) configured to generate at least one candidate question based on the at least one context, a query mapper (140) configured to perform social network analysis on the at least one candidate question generated by said question generator (130) with respect to the query, and a ranking component (150) configured to produce a list of ranked candidate question generated by said question generator (130); and wherein said query processor (160) is configured for retrieving relevant information based on the list of ranked candidate question generated by said question generator (130); and wherein said context identifier (120), said question generator (130), said query mapper (140), said ranking component (150) and said query processor (160) are connected to said query manager (110)
2. A method for processing a query is characterised by the steps of:
a) receiving a query;
b) identifying at least one context related to the query;
c) generating at least one candidate question based on the at least one context;
d) mapping the query to the at least one candidate question to produce a list of ranked candidate question; and e) retrieving and ranking relevant information based on the list of ranked candidate question.
3. The method as claimed in claim 2, wherein mapping the query to the at least one candidate question includes the steps of:
a) performing a syntactic analysis between the query and the at least one candidate question;
b) performing a social network analysis on the at least one candidate question with respect to the query;
c) determining a semantic score, similarity score, and confidence score for each candidate question;
d) normalizing the semantic score and similarity score for each candidate question; and
e) producing a list of ranked candidate question by ranking the at least one candidate question based on the normalized semantic and similarity scores.
4. The method as claimed in claim 3, wherein performing the social network analysis on the at least one candidate question with respect to the query includes the steps of:
a) constructing a social network based on co-occurrence of concepts or terms in the at least one candidate question;
b) determining relevance value between the co-occurred terms for each term; and
c) computing centrality value for each term.
PCT/MY2013/000254 2012-12-18 2013-12-12 A semantic query system and method thereof WO2014098561A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
MYPI2012701213 2012-12-18
MYPI2012701213A MY185315A (en) 2012-12-18 2012-12-18 A semantic query system and method thereof

Publications (1)

Publication Number Publication Date
WO2014098561A1 true WO2014098561A1 (en) 2014-06-26

Family

ID=50184970

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/MY2013/000254 WO2014098561A1 (en) 2012-12-18 2013-12-12 A semantic query system and method thereof

Country Status (2)

Country Link
MY (1) MY185315A (en)
WO (1) WO2014098561A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10621507B2 (en) 2016-03-12 2020-04-14 Wipro Limited System and method for generating an optimized result set using vector based relative importance measure
WO2021229328A1 (en) * 2020-05-12 2021-11-18 Bayestree Intelligence Pvt Ltd Method of automatically assigning a classification
CN115203598A (en) * 2022-07-20 2022-10-18 贝壳找房(北京)科技有限公司 Information sorting method, electronic device and storage medium in real estate field

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090171929A1 (en) * 2007-12-26 2009-07-02 Microsoft Corporation Toward optimized query suggeston: user interfaces and algorithms

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090171929A1 (en) * 2007-12-26 2009-07-02 Microsoft Corporation Toward optimized query suggeston: user interfaces and algorithms

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
APOSTOL (PAUL) NATSEV ET AL: "Semantic concept-based query expansion and re-ranking for multimedia retrieval", PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON MULTIMEDIA , MULTIMEDIA '07, 23 September 2007 (2007-09-23), New York, New York, USA, pages 991, XP055035435, ISBN: 978-1-59-593702-5, DOI: 10.1145/1291233.1291448 *
HUANHUAN CAO ET AL: "Context-aware query suggestion by mining click-through and session data", PROCEEDING OF THE 14TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 08, 24 August 2008 (2008-08-24), New York, New York, USA, pages 875, XP055112020, ISBN: 978-1-60-558193-4, DOI: 10.1145/1401890.1401995 *
YUNBO CAO ET AL: "Recommending questions using the mdl-based tree cut model", PROCEEDING OF THE 17TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB , WWW '08, 21 April 2008 (2008-04-21), New York, New York, USA, pages 81, XP055112023, ISBN: 978-1-60-558085-2, DOI: 10.1145/1367497.1367509 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10621507B2 (en) 2016-03-12 2020-04-14 Wipro Limited System and method for generating an optimized result set using vector based relative importance measure
WO2021229328A1 (en) * 2020-05-12 2021-11-18 Bayestree Intelligence Pvt Ltd Method of automatically assigning a classification
CN115203598A (en) * 2022-07-20 2022-10-18 贝壳找房(北京)科技有限公司 Information sorting method, electronic device and storage medium in real estate field
CN115203598B (en) * 2022-07-20 2023-09-19 贝壳找房(北京)科技有限公司 Information ordering method in real estate field, electronic equipment and storage medium

Also Published As

Publication number Publication date
MY185315A (en) 2021-05-03

Similar Documents

Publication Publication Date Title
CN107515877B (en) Sensitive subject word set generation method and device
US10565533B2 (en) Systems and methods for similarity and context measures for trademark and service mark analysis and repository searches
US7548910B1 (en) System and method for retrieving scenario-specific documents
US10545956B2 (en) Natural language search with semantic mapping and classification
US8326861B1 (en) Personalized term importance evaluation in queries
US9280535B2 (en) Natural language querying with cascaded conditional random fields
US8903794B2 (en) Generating and presenting lateral concepts
US10095784B2 (en) Synonym generation
US8332426B2 (en) Indentifying referring expressions for concepts
CN109522465A (en) The semantic searching method and device of knowledge based map
JP2017508214A (en) Provide search recommendations
KR102411778B1 (en) Server, method and computer program for infering comparative advantage of multiple knowledge
CN110704743A (en) Semantic search method and device based on knowledge graph
JP2013506189A (en) Retrieving information based on general query attributes
WO2012142553A2 (en) Identifying query formulation suggestions for low-match queries
KR101220557B1 (en) Method and system for searching mobile application using human activity knowledge database
JP2013077333A (en) System, method and software for information retrieval accompanying concept-based retrieval and ranking
JP2013531282A (en) Guided search based on query model
WO2014015176A1 (en) Query expansion
US20030212663A1 (en) Neural network feedback for enhancing text search
CN110637316A (en) System and method for intelligent prospective object recognition using online resources and neural network processing to classify tissue based on published material
KR20210130976A (en) Device, method and computer program for deriving response based on knowledge graph
US20120130972A1 (en) Concept disambiguation via search engine search results
Sasikumar et al. A survey of natural language question answering system
WO2014098561A1 (en) A semantic query system and method thereof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13831981

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 13831981

Country of ref document: EP

Kind code of ref document: A1