CN100378724C - 基于移动配置概念的句子结构分析方法及使用其的自然语言搜索方法 - Google Patents
基于移动配置概念的句子结构分析方法及使用其的自然语言搜索方法 Download PDFInfo
- Publication number
- CN100378724C CN100378724C CNB2004800110557A CN200480011055A CN100378724C CN 100378724 C CN100378724 C CN 100378724C CN B2004800110557 A CNB2004800110557 A CN B2004800110557A CN 200480011055 A CN200480011055 A CN 200480011055A CN 100378724 C CN100378724 C CN 100378724C
- Authority
- CN
- China
- Prior art keywords
- sentence
- morpheme
- database
- analysis
- word
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000004458 analytical method Methods 0.000 title claims abstract description 139
- 238000000034 method Methods 0.000 title claims abstract description 71
- 230000014509 gene expression Effects 0.000 claims abstract description 11
- 238000010276 construction Methods 0.000 claims description 31
- 230000006870 function Effects 0.000 claims description 25
- 239000000203 mixture Substances 0.000 claims description 13
- 239000003607 modifier Substances 0.000 claims description 13
- 238000003860 storage Methods 0.000 claims description 10
- 230000008859 change Effects 0.000 claims description 9
- 230000008878 coupling Effects 0.000 claims description 8
- 238000010168 coupling process Methods 0.000 claims description 8
- 238000005859 coupling reaction Methods 0.000 claims description 8
- 230000015572 biosynthetic process Effects 0.000 claims description 6
- 239000000284 extract Substances 0.000 claims description 4
- 150000001875 compounds Chemical class 0.000 claims description 3
- 238000000605 extraction Methods 0.000 claims description 3
- 238000004364 calculation method Methods 0.000 claims description 2
- 238000012545 processing Methods 0.000 claims description 2
- 241000282414 Homo sapiens Species 0.000 abstract description 6
- 239000003550 marker Substances 0.000 abstract 1
- 238000007781 pre-processing Methods 0.000 abstract 1
- 238000013459 approach Methods 0.000 description 7
- NHUHCSRWZMLRLA-UHFFFAOYSA-N Sulfisoxazole Chemical compound CC1=NOC(NS(=O)(=O)C=2C=CC(N)=CC=2)=C1C NHUHCSRWZMLRLA-UHFFFAOYSA-N 0.000 description 6
- 238000010586 diagram Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 230000005055 memory storage Effects 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 235000013305 food Nutrition 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 238000002203 pretreatment Methods 0.000 description 3
- 239000000470 constituent Substances 0.000 description 2
- 235000013399 edible fruits Nutrition 0.000 description 2
- 230000008676 import Effects 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 229920000057 Mannan Polymers 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 230000001133 acceleration Effects 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 235000012054 meals Nutrition 0.000 description 1
- 230000036651 mood Effects 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/268—Morphological analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/211—Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/253—Grammatical analysis; Style critique
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/53—Processing of non-Latin text
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Machine Translation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (6)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020030025995 | 2003-04-24 | ||
KR10-2003-0025995A KR100515641B1 (ko) | 2003-04-24 | 2003-04-24 | 모빌적 형상 개념을 기초로 한 구문 분석방법 및 이를이용한 자연어 검색 방법 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1777888A CN1777888A (zh) | 2006-05-24 |
CN100378724C true CN100378724C (zh) | 2008-04-02 |
Family
ID=36766677
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2004800110557A Expired - Fee Related CN100378724C (zh) | 2003-04-24 | 2004-04-22 | 基于移动配置概念的句子结构分析方法及使用其的自然语言搜索方法 |
Country Status (9)
Country | Link |
---|---|
US (1) | US20070010990A1 (zh) |
EP (1) | EP1616270A4 (zh) |
JP (2) | JP2006524372A (zh) |
KR (1) | KR100515641B1 (zh) |
CN (1) | CN100378724C (zh) |
AU (1) | AU2004232276B2 (zh) |
CA (1) | CA2523140A1 (zh) |
HK (1) | HK1092242A1 (zh) |
WO (1) | WO2004095310A1 (zh) |
Families Citing this family (54)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8706747B2 (en) | 2000-07-06 | 2014-04-22 | Google Inc. | Systems and methods for searching using queries written in a different character-set and/or language from the target pages |
GB0316806D0 (en) * | 2003-07-17 | 2003-08-20 | Ivis Group Ltd | Improved search engine |
KR100590553B1 (ko) * | 2004-05-21 | 2006-06-19 | 삼성전자주식회사 | 대화체 운율구조 생성방법 및 장치와 이를 적용한음성합성시스템 |
US8442828B2 (en) * | 2005-12-02 | 2013-05-14 | Microsoft Corporation | Conditional model for natural language understanding |
KR100717998B1 (ko) * | 2005-12-26 | 2007-05-15 | 고려대학교 산학협력단 | 문서의 표절 검사 방법 |
US7668791B2 (en) * | 2006-07-31 | 2010-02-23 | Microsoft Corporation | Distinguishing facts from opinions using a multi-stage approach |
US8548795B2 (en) * | 2006-10-10 | 2013-10-01 | Abbyy Software Ltd. | Method for translating documents from one language into another using a database of translations, a terminology dictionary, a translation dictionary, and a machine translation system |
US8145473B2 (en) | 2006-10-10 | 2012-03-27 | Abbyy Software Ltd. | Deep model statistics method for machine translation |
US9235573B2 (en) | 2006-10-10 | 2016-01-12 | Abbyy Infopoisk Llc | Universal difference measure |
US9047275B2 (en) | 2006-10-10 | 2015-06-02 | Abbyy Infopoisk Llc | Methods and systems for alignment of parallel text corpora |
US8195447B2 (en) | 2006-10-10 | 2012-06-05 | Abbyy Software Ltd. | Translating sentences between languages using language-independent semantic structures and ratings of syntactic constructions |
US9645993B2 (en) | 2006-10-10 | 2017-05-09 | Abbyy Infopoisk Llc | Method and system for semantic searching |
US8214199B2 (en) * | 2006-10-10 | 2012-07-03 | Abbyy Software, Ltd. | Systems for translating sentences between languages using language-independent semantic structures and ratings of syntactic constructions |
US9633005B2 (en) | 2006-10-10 | 2017-04-25 | Abbyy Infopoisk Llc | Exhaustive automatic processing of textual information |
US20080086298A1 (en) * | 2006-10-10 | 2008-04-10 | Anisimovich Konstantin | Method and system for translating sentences between langauges |
US9984071B2 (en) | 2006-10-10 | 2018-05-29 | Abbyy Production Llc | Language ambiguity detection of text |
CN101013421B (zh) * | 2007-02-02 | 2012-06-27 | 清华大学 | 基于规则的汉语基本块自动分析方法 |
US8959011B2 (en) | 2007-03-22 | 2015-02-17 | Abbyy Infopoisk Llc | Indicating and correcting errors in machine translation systems |
US8812296B2 (en) | 2007-06-27 | 2014-08-19 | Abbyy Infopoisk Llc | Method and system for natural language dictionary generation |
US9015194B2 (en) * | 2007-07-02 | 2015-04-21 | Verint Systems Inc. | Root cause analysis using interactive data categorization |
US8374914B2 (en) * | 2008-08-06 | 2013-02-12 | Obschestvo S Ogranichennoi Otvetstvennostiu “Kuznetch” | Advertising using image comparison |
US9262409B2 (en) | 2008-08-06 | 2016-02-16 | Abbyy Infopoisk Llc | Translation of a selected text fragment of a screen |
KR101117427B1 (ko) * | 2009-02-26 | 2012-03-13 | 고려대학교 산학협력단 | 형태소 합성 장치 및 방법 |
KR101309839B1 (ko) * | 2009-12-02 | 2013-09-23 | 한국전자통신연구원 | 통계정보를 이용한 규칙 기반 구문분석 장치 및 방법 |
US20130158986A1 (en) * | 2010-07-15 | 2013-06-20 | The University Of Queensland | Communications analysis system and process |
JP2012027722A (ja) * | 2010-07-23 | 2012-02-09 | Sony Corp | 情報処理装置、情報処理方法及び情報処理プログラム |
KR101850886B1 (ko) * | 2010-12-23 | 2018-04-23 | 네이버 주식회사 | 감소 질의를 추천하는 검색 시스템 및 방법 |
CN102054047B (zh) * | 2011-01-07 | 2013-03-27 | 焦点科技股份有限公司 | 一种服务可配置业务规则的提取方法 |
US10123053B2 (en) | 2011-05-23 | 2018-11-06 | Texas Instruments Incorporated | Acceleration of bypass binary symbol processing in video coding |
KR20130014106A (ko) * | 2011-07-29 | 2013-02-07 | 한국전자통신연구원 | 다중 번역 엔진을 사용한 번역 장치 및 방법 |
US9495352B1 (en) | 2011-09-24 | 2016-11-15 | Athena Ann Smyros | Natural language determiner to identify functions of a device equal to a user manual |
CN103164426B (zh) * | 2011-12-13 | 2015-10-28 | 北大方正集团有限公司 | 一种命名实体识别的方法及装置 |
CN103310343A (zh) * | 2012-03-15 | 2013-09-18 | 阿里巴巴集团控股有限公司 | 商品信息发布方法和装置 |
US8989485B2 (en) | 2012-04-27 | 2015-03-24 | Abbyy Development Llc | Detecting a junction in a text line of CJK characters |
US8971630B2 (en) | 2012-04-27 | 2015-03-03 | Abbyy Development Llc | Fast CJK character recognition |
JP5526209B2 (ja) * | 2012-10-09 | 2014-06-18 | 株式会社Ubic | フォレンジックシステムおよびフォレンジック方法並びにフォレンジックプログラム |
KR20140119841A (ko) * | 2013-03-27 | 2014-10-13 | 한국전자통신연구원 | 애니메이션을 이용한 번역 검증 방법 및 그 장치 |
US9495357B1 (en) * | 2013-05-02 | 2016-11-15 | Athena Ann Smyros | Text extraction |
RU2592395C2 (ru) | 2013-12-19 | 2016-07-20 | Общество с ограниченной ответственностью "Аби ИнфоПоиск" | Разрешение семантической неоднозначности при помощи статистического анализа |
RU2586577C2 (ru) | 2014-01-15 | 2016-06-10 | Общество с ограниченной ответственностью "Аби ИнфоПоиск" | Фильтрация дуг в синтаксическом графе |
CN103927298B (zh) * | 2014-04-25 | 2016-09-21 | 秦一男 | 一种基于计算机的自然语言句法结构解析方法和装置 |
RU2596600C2 (ru) | 2014-09-02 | 2016-09-10 | Общество с ограниченной ответственностью "Аби Девелопмент" | Способы и системы обработки изображений математических выражений |
US9626358B2 (en) | 2014-11-26 | 2017-04-18 | Abbyy Infopoisk Llc | Creating ontologies by analyzing natural language texts |
CN108885617B (zh) * | 2016-03-23 | 2022-05-31 | 株式会社野村综合研究所 | 语句解析系统以及程序 |
US11449744B2 (en) | 2016-06-23 | 2022-09-20 | Microsoft Technology Licensing, Llc | End-to-end memory networks for contextual language understanding |
JP6784084B2 (ja) * | 2016-07-27 | 2020-11-11 | 富士通株式会社 | 符号化プログラム、符号化装置、符号化方法、及び検索方法 |
US10366163B2 (en) * | 2016-09-07 | 2019-07-30 | Microsoft Technology Licensing, Llc | Knowledge-guided structural attention processing |
CN109086285B (zh) * | 2017-06-14 | 2021-10-15 | 佛山辞荟源信息科技有限公司 | 基于语素的汉语智能处理方法和系统及装置 |
KR102209786B1 (ko) * | 2018-06-29 | 2021-01-29 | 김태정 | 자연어 처리 기반의 청크 구성 방법 및 장치 |
CN109388801B (zh) * | 2018-09-30 | 2023-07-14 | 创新先进技术有限公司 | 相似词集合的确定方法、装置和电子设备 |
US11416556B2 (en) * | 2019-12-19 | 2022-08-16 | Accenture Global Solutions Limited | Natural language dialogue system perturbation testing |
CN113139183B (zh) * | 2020-01-17 | 2023-12-29 | 深信服科技股份有限公司 | 一种检测方法、装置、设备及存储介质 |
CN111897914B (zh) * | 2020-07-20 | 2023-09-19 | 杭州叙简科技股份有限公司 | 用于综合管廊领域的实体信息抽取及知识图谱构建方法 |
CN113407739B (zh) * | 2021-07-14 | 2023-01-06 | 海信视像科技股份有限公司 | 信息标题中概念的确定方法、装置和存储介质 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0635958A (ja) * | 1992-07-14 | 1994-02-10 | Hitachi Ltd | 語句検索方法 |
CN1302412A (zh) * | 1997-07-22 | 2001-07-04 | 微软公司 | 应用搜索结果的自然语言处理以改进整体精度的信息检索系统的设备和方法 |
US6278967B1 (en) * | 1992-08-31 | 2001-08-21 | Logovista Corporation | Automated system for generating natural language translations that are domain-specific, grammar rule-based, and/or based on part-of-speech analysis |
JP2003030184A (ja) * | 2001-07-18 | 2003-01-31 | Sony Corp | 自然言語処理装置および自然言語処理方法、並びにプログラムおよび記録媒体 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4931936A (en) * | 1987-10-26 | 1990-06-05 | Sharp Kabushiki Kaisha | Language translation system with means to distinguish between phrases and sentence and number discrminating means |
JPH02281372A (ja) * | 1989-04-24 | 1990-11-19 | Sharp Corp | 機械翻訳装置における挿入副詞句処理方法 |
KR100331029B1 (ko) * | 1998-11-24 | 2002-09-04 | 한국전자통신연구원 | 한국어개념분류체계구축방법과,수정방법및구축장치 |
KR20000039749A (ko) * | 1998-12-15 | 2000-07-05 | 정선종 | 기계 번역을 위한 변환 장치 및 이를 이용한 변환 방법 |
-
2003
- 2003-04-24 KR KR10-2003-0025995A patent/KR100515641B1/ko not_active IP Right Cessation
-
2004
- 2004-04-22 JP JP2006500677A patent/JP2006524372A/ja active Pending
- 2004-04-22 US US10/553,856 patent/US20070010990A1/en not_active Abandoned
- 2004-04-22 CN CNB2004800110557A patent/CN100378724C/zh not_active Expired - Fee Related
- 2004-04-22 AU AU2004232276A patent/AU2004232276B2/en not_active Ceased
- 2004-04-22 CA CA002523140A patent/CA2523140A1/en not_active Abandoned
- 2004-04-22 WO PCT/KR2004/000927 patent/WO2004095310A1/en active IP Right Grant
- 2004-04-22 EP EP04728982A patent/EP1616270A4/en not_active Withdrawn
-
2006
- 2006-11-21 HK HK06112777A patent/HK1092242A1/xx not_active IP Right Cessation
-
2007
- 2007-07-02 JP JP2007174038A patent/JP2007317211A/ja active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0635958A (ja) * | 1992-07-14 | 1994-02-10 | Hitachi Ltd | 語句検索方法 |
US6278967B1 (en) * | 1992-08-31 | 2001-08-21 | Logovista Corporation | Automated system for generating natural language translations that are domain-specific, grammar rule-based, and/or based on part-of-speech analysis |
CN1302412A (zh) * | 1997-07-22 | 2001-07-04 | 微软公司 | 应用搜索结果的自然语言处理以改进整体精度的信息检索系统的设备和方法 |
JP2003030184A (ja) * | 2001-07-18 | 2003-01-31 | Sony Corp | 自然言語処理装置および自然言語処理方法、並びにプログラムおよび記録媒体 |
Also Published As
Publication number | Publication date |
---|---|
AU2004232276B2 (en) | 2007-08-02 |
KR100515641B1 (ko) | 2005-09-22 |
HK1092242A1 (en) | 2007-02-02 |
US20070010990A1 (en) | 2007-01-11 |
KR20030044949A (ko) | 2003-06-09 |
EP1616270A4 (en) | 2010-05-05 |
CA2523140A1 (en) | 2004-11-04 |
AU2004232276A1 (en) | 2004-11-04 |
JP2007317211A (ja) | 2007-12-06 |
CN1777888A (zh) | 2006-05-24 |
WO2004095310A1 (en) | 2004-11-04 |
EP1616270A1 (en) | 2006-01-18 |
JP2006524372A (ja) | 2006-10-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN100378724C (zh) | 基于移动配置概念的句子结构分析方法及使用其的自然语言搜索方法 | |
Deng et al. | Turl: Table understanding through representation learning | |
US8751218B2 (en) | Indexing content at semantic level | |
CN109783806B (zh) | 一种利用语义解析结构的文本匹配方法 | |
US20240086436A1 (en) | Systems and methods for training language models to reason over tables | |
WO2021041722A1 (en) | System and method to extract customized information in natural language text | |
WO2024015323A1 (en) | Methods and systems for improved document processing and information retrieval | |
Sbattella et al. | A novel semantic information retrieval system based on a three-level domain model | |
Xu | Bootstrapping relation extraction from semantic seeds | |
Zhang et al. | Sciencebenchmark: A complex real-world benchmark for evaluating natural language to sql systems | |
Lee | Natural Language Processing: A Textbook with Python Implementation | |
Zinoviev | Data Science Essentials in Python: Collect-Organize-Explore-Predict-Value | |
Sati et al. | Arabic text question answering from an answer retrieval point of view: A survey | |
Nikolić et al. | Modelling the System of Receiving Quick Answers for e-Government Services: Study for the Crime Domain in the Republic of Serbia | |
OsmanHamoud | A Question Answering System Design about the Holy Quran | |
Asadi | Comparison of social tags and controlled vocabulary terms assigned to images: A feasibility study of computer-assisted image indexing | |
Oo et al. | Myanmar News Retrieval Using Kernelized Neural Ranking Model | |
Nandhini | Legal document summarization using hybrid model | |
Karki et al. | Question answering via web extracted tables | |
Hjelm | Cross-language ontology learning: Incorporating and exploiting cross-language data in the ontology learning process | |
Hauschild | Examining the Effect of Word Embeddings and Preprocessing Methods on Fake News Detection | |
Othman | Learning to Retrieve Relevant Passages and Questions in Open Domain and Community Question Answering | |
Taye et al. | An Ontology Learning Framework for unstructured Arabic Text | |
Nawaz | Sentiment Analysis of News Headlines for Comparing Performance of Governments of Pakistan | |
Diomedi Pinto | Question answering over Wikidata using entity linking and neural semantic parsing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1092242 Country of ref document: HK |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: GR Ref document number: 1092242 Country of ref document: HK |
|
C56 | Change in the name or address of the patentee | ||
CP03 | Change of name, title or address |
Address after: Gyeonggi Do, South Korea Patentee after: Yu Shunchao Address before: Gyeonggi Do, South Korea Patentee before: Yu Shengchao |
|
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20080402 Termination date: 20160422 |
|
CF01 | Termination of patent right due to non-payment of annual fee |