Nothing Special   »   [go: up one dir, main page]

CN107748803B - Method for designing spatial situation characteristic event database - Google Patents

Method for designing spatial situation characteristic event database Download PDF

Info

Publication number
CN107748803B
CN107748803B CN201711157764.1A CN201711157764A CN107748803B CN 107748803 B CN107748803 B CN 107748803B CN 201711157764 A CN201711157764 A CN 201711157764A CN 107748803 B CN107748803 B CN 107748803B
Authority
CN
China
Prior art keywords
data
database
keywords
event
space
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201711157764.1A
Other languages
Chinese (zh)
Other versions
CN107748803A (en
Inventor
黄辉
李洪波
王涛
魏向旺
李宇飞
王欣
张琦
李元元
李航
席福彪
胡超
王立强
岳志勇
张帆
阎晶红
吕淮北
毛羽
秦芬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Academy of Launch Vehicle Technology CALT
Original Assignee
China Academy of Launch Vehicle Technology CALT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Academy of Launch Vehicle Technology CALT filed Critical China Academy of Launch Vehicle Technology CALT
Priority to CN201711157764.1A priority Critical patent/CN107748803B/en
Publication of CN107748803A publication Critical patent/CN107748803A/en
Application granted granted Critical
Publication of CN107748803B publication Critical patent/CN107748803B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2282Tablespace storage structures; Management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/211Schema design and management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A design method of a spatial situation characteristic event database comprises the following steps of (1) dividing all current spatial situation characteristic event data into normalized data and descriptive data; (2) establishing a corresponding empty table in a database according to the keywords of the normalized data, and importing the content of the normalized data into the empty table to form a normalized database table; the following processing is performed on the descriptive data: (3) extracting titles, countries, time, keywords, contents and sources from the data; (4) establishing a database form; (5) establishing a label library; (6) according to the frequency of the keywords in the database form in the step (4), calculating the weight of the keywords according to the frequency of the keywords, selecting the keywords with the weight exceeding a preset value as tags, and classifying and filling the tags into the corresponding headers in the step (5); (7) and comparing the key words corresponding to each spatial situation characteristic event in the database form with the labels in the label library, and filling the matched labels into the label part of the event form header.

Description

Method for designing spatial situation characteristic event database
Technical Field
The invention belongs to the technical field of space situation perception, space situation evaluation and the like. And aiming at the characteristics of multisource isomerism and distribution of the spatial situation data, a spatial data model is constructed on the basis of analyzing spatial situation constituent elements, and finally the construction of the spatial situation integrated data model is completed.
Background
The spatial situation event information mainly relates to news reports of space activities carried out by countries in the world, and the event database is used for classifying, storing, maintaining and managing the data information so as to be used for subsequent spatial situation evaluation and analysis.
"construction and application of national group event database" (public safety, 03 2017) "a sentence is based on a comet news database, 48 news reports are sampled, and ROST CM6.0 is adopted to perform word segmentation and high-frequency word analysis on the sampled news reports, so that 30 high-frequency words related to national group events are extracted. Then 5708 news reports about the national group event are screened from more than 600 ten thousand related news based on high-frequency words, and the news reports are encoded and checked for credibility, so that a national group event database is constructed (1998-. The database decomposes each event information into 17 fields for coding, but the method can not adopt a uniform coding format to code the event information with different sources and different data structures. The feature-based spatial situation integrated data model (survey and drawing project, volume 24, No. 8 in 2015) adopts a feature-based modeling method to carry out conceptual modeling, combines an object-oriented method to carry out logic model design, and constructs a spatial situation data physical model based on XML, thereby realizing the construction of the spatial situation integrated data model. The method has the advantages that the attention point is the comprehensive expression of the space situation information components, and the storage management of the event information is not involved.
The information of the space event mainly comes from nine fields of information sources: the method comprises the following steps of space law and policy, space facility and equipment, organization and personnel, application strategy of space equipment, balance strategy of military and civil business, cross strategy of international interaction, natural space resource and environment, social concept and scientific and technological environment and space situation perception and comprehensive evaluation. The description difference of various spatial events is large, different information contains different elements, the focus of attention is different, and a uniform event description format does not exist. Therefore, the current database design technology is not sufficient to directly support the design implementation of the spatial situation characteristic event database.
Disclosure of Invention
The technical problem to be solved by the invention is as follows: the defects of the prior art are overcome, and the method for preventing errors in the whole process of aerospace servo valve development is provided.
The technical solution of the invention is as follows: a method for designing a spatial situation characteristic event database comprises the following steps:
(1) dividing all current spatial situation characteristic event data into normalized data and descriptive data; the normalized data is a spatial situation related data table issued by a third party, and the descriptive data is a spatial situation characteristic event reported by a news media;
(2) establishing a corresponding empty table in a database according to the keywords of the normalized data, and importing the content of the normalized data into the empty table to form a normalized database table;
the following processing is performed on the descriptive data:
(3) preprocessing the descriptive data, and extracting titles, countries, time, keywords, contents and sources from the data;
(4) establishing a database form, taking the title, country, time, keywords, content, source, tags and accessories as the form header, and finishing the initial filling of the form by using the information extracted in the step (3); wherein the content of the label part is empty, and the content part fills all the character information in the news media report; the attachment contains complete news media report information;
(5) establishing a tag library, wherein a header in the tag library comprises space laws and policies, space facilities and equipment, organizations and personnel, application strategies of the space equipment, balance strategies of military and civil commerce, longitudinal and transverse strategies of international communication, natural space resources and environment, social concepts and scientific and technological environments, and spatial situation perception and comprehensive evaluation;
(6) according to the frequency of the keywords in the database form in the step (4), calculating the weight of the keywords according to the frequency of the keywords, selecting the keywords with the weight exceeding a preset value as tags, and classifying and filling the tags into the corresponding headers in the step (5);
(7) and comparing the key words corresponding to each spatial situation characteristic event in the database form with the labels in the label library, and filling the matched labels into the label part of the event form header.
Further, a visual interaction page is designed for the database, the visual page comprises two visual pages of normalized data and descriptive data, and each visual page comprises a data import function.
Furthermore, each visual interactive page also comprises two parts besides the data import function, wherein one part displays the titles as a list, and the other part displays the complete database list information associated with the titles.
Further, a data import function is used for importing single space situation characteristic event data or importing space situation characteristic event data in batch regularly or in real time, the steps (1) - (7) are executed again on all current space situation characteristic event data, and updating of the database is completed.
Further, the steps (1) to (6) are executed by inputting data of a specific time period, and the keywords are sorted according to the weight in the step (6), so that the hotspot space situation vocabulary in the specific time period is determined.
Compared with the prior art, the invention has the beneficial effects that:
the invention aims to construct a better database mode, design the mutual relation between the storage structure of data and data objects according to the characteristics of space event intelligence information, and establish a space event database and an application system thereof, so that the space event database can effectively store multi-source and heterogeneous event data and meet various user requirements (comprising the functions of information inquiry, classification, statistical analysis and the like). Compared with the prior art, the invention has the beneficial effects that:
(1) heterogeneous event data can be stored in a uniform manner. The spatial situation characteristic events can be described in various ways according to different sources and fields, and a simple log-type storage way is not beneficial to later-stage searching and analysis. For example: the description of the spatial legislation may be that "a certain country promulgated a certain part of the spatial law in a certain month and a certain country in a certain year and a certain time of a certain day and a certain payload is carried in a certain country, and the description of the spatial characteristic event has a common field/keyword (such as time) and different parts (such as specific activity content). The field design workload of fine granularity is huge and it is difficult to ensure comprehensive coverage. The invention analyzes and extracts common fields/key words (including titles, nations, time and the like) of the event description, and completely reserves the content description part of each piece of event information, so that different types of event information can be stored in a uniform mode.
(2) A label library is constructed, the labeling of event information is realized, and the advanced and deep search is conveniently carried out. The above-mentioned unified storage of event information can only support simple search according to limited keywords. In order to realize the retrieval according to the content, the invention designs a label library for labeling the event information. The labels in the label library are classified names of the space events, empty keywords are reserved in the event items, the labels in the label library are selected for each piece of event information by a user and are endowed with self-defined labels, and the event information can be accurately searched by inputting the keywords, the labels or the combination of the keywords and the labels during searching. In addition, the label library is maintainable and expandable, and new labels are modified or added by users according to actual needs.
Drawings
FIG. 1 is a flow chart of the present invention;
FIG. 2 is a database structure of spatial situation characteristic events according to the present invention;
FIG. 3 is a schematic diagram of a spatial situation characteristic event database using tag search according to the present invention.
Detailed Description
A method for designing a spatial situation characteristic event database is shown in FIG. 1, and comprises the following steps:
(1) dividing all current spatial situation characteristic event data into normalized data and descriptive data; the normalized data is a spatial situation related data table issued by a third party, and the descriptive data is a spatial situation characteristic event reported by a news media;
for example: a Satellite Database (UCS Satellite Database) published by the united states hypochondriac scientist consortium is typical normalized data, which is in an Excel format, and lists on-orbit Satellite information, keywords such as Satellite name, registered country, all countries, orbit type, near-position height, far-position height, inclination angle, period, emission quality and the like, wherein each row corresponds to all relevant parameters of a specific on-orbit Satellite. On the other hand, according to the daily satellite news website report, on day 2/4/2014, the russian navigation satellite system consisting of 24 'glonass' satellites fails, resulting in service interruption for more than ten hours. … … "is a news report of spatial situation characteristic event, which is a descriptive text (also can contain graph, table, multimedia, etc.), without keyword decomposition.
(2) Establishing a corresponding empty table in a database according to the keywords of the normalized data, and importing the content of the normalized data into the empty table to form a normalized database table;
the following processing is performed on the descriptive data:
(3) preprocessing the descriptive data, and extracting titles, countries, time, keywords, contents and sources from the data;
for example: or "according to the daily satellite news website report, on 4 months and 2 days in 2014, the russian navigation satellite system consisting of 24 'glonass' satellites fails, resulting in service interruption for more than ten hours. … …, taking descriptive data as an example, the extraction title is that the Russian 'Grounels' satellite navigation system fails, the country is Russian, the time is 4 months and 2 days in 2014, the keywords are Grounels, navigation systems and failures, the content is complete text description of the news, and the source is a daily satellite news website.
(4) Establishing a database form, taking the title, country, time, keywords, content, source, tags and accessories as the form header, and finishing the initial filling of the form by using the information extracted in the step (3); wherein the content of the label part is empty, and the content part fills all the character information in the news media report; the attachment contains complete news media story information. As shown in the lower half of fig. 2;
(5) and establishing a tag library, wherein the header in the tag library comprises space laws and policies, space facilities and equipment, organizations and personnel, application strategies of the space equipment, trade-off strategies for civil and military use, longitudinal and transverse strategies for international interaction, perception and comprehensive evaluation of natural space resources and environment, social concepts and scientific and technological environments and space situations. As shown in the upper half of fig. 2;
(6) and (5) calculating the weight of the keyword according to the frequency of the keyword in the database form in the step (4), selecting the keyword with the weight exceeding a preset value as a label, and classifying and filling the label into the corresponding form header in the step (5). As indicated by the arrow on the right half of fig. 2;
for example: after the spatial situation characteristic event data is imported into the database, the keywords 'navigation system' appear in the whole database form together by n1Second (word frequency), other keywords respectively appear n2、n3、……nkNext, k is a keywordThe total number, the weight of the keyword 'navigation system' is calculated as x ═ n1/(n1+n2+n3+……nk) The weight presets y, if x>And y, classifying the keyword label under the head of the aerospace facility and equipment in the label library.
(7) And comparing the key words corresponding to each spatial situation characteristic event in the database form with the labels in the label library, and filling the matched labels into the label part of the event form header.
For example: or "according to the daily satellite news website report, on 4 months and 2 days in 2014, the russian navigation satellite system consisting of 24 'glonass' satellites fails, resulting in service interruption for more than ten hours. … …, the event contains the keyword "navigation system", and if the keyword is classified in the tag library by the step (6) processing, it shows that the keyword corresponding to the item of spatial situation characteristic event matches with the tag in the tag library, the "navigation system" is filled in the tag part of the item of descriptive data in the database form.
(8) And designing a visual interaction page for the database, wherein the visual page comprises two visual pages of normalized data and descriptive data, and each visual page comprises a data import function. The visual page directly reads the relevant database form from the database and displays the relevant database form in a row and column mode. The data import comprises single data import and batch import, the single data import is a pop-up dialog box, the title, country, time, keywords, content and source of the spatial situation characteristic event are manually filled in, and the attachment is uploaded, and the operations are all realized by using the universal control. Finally, the tag of the event data is filled in through step (7). The batch import can integrally import the spatial situation characteristic event data subjected to the normalized preprocessing at one time, and is realized by using a universal control.
(9) And (5) when new spatial situation characteristic event data are added in the database, automatically triggering the calculation operation in the step (6), updating the word frequency and the weight of the keywords, reordering the keywords, extracting the labels and updating the label database.
(10) And (3) executing the steps (1) to (6) by inputting data of a specific time period, and sequencing the keywords according to the weights in the step (6) so as to determine the hot spot space situation vocabulary in the specific time period.
For example: inputting the spatial situation characteristic event data from 1/2015 to 31/2015, and executing all the operations in the steps (1) - (6) to obtain the hot spot spatial situation vocabulary ordering in the time period as follows: the method comprises the steps of reusable, deep space exploration, electric propulsion, internet constellation and … …, and the first few words are annual hot spot space situation vocabularies.
(11) Search functions are designed for the database, including general searches and deep searches. The general search is to set a pull-down list, the user selects the 'country' and 'time' items in the header of the database form, the user inputs a single keyword to search the database and list an event list. The deep search can be carried out according to a common key word or a plurality of key word combinations, and the relation of logic 'AND' or 'OR' is selected by a user when the plurality of key words are subjected to the combined search; the combined search can also be carried out on single or a plurality of keywords and single or a plurality of labels, the number of the labels is user-defined, the relation between the keywords is selected as 'AND' or 'OR' by a user, the relation between the label items is selected as 'AND' or 'OR' by the user, and the relation between the keywords and the labels is fixed as the logical 'AND' relation. As shown in fig. 3. Implemented with a generic control.
For example: a separate search may be made for the keyword "country" term selection or for the input value "usa"; the keywords 'country' item and 'time' item can be respectively selected or input into the values 'American' and '2014.01.12', the logical relation is selected as 'AND', and the combined search is carried out; the search conditions may also be set to: the keyword "country" item is "United states", the label items are respectively set as "deep space", "asteroid" and "moon", the label items are in a logical "OR" relationship with each other, and database search is performed.
The invention has not been described in detail in part of the common general knowledge of those skilled in the art.

Claims (5)

1. A method for designing a spatial situation characteristic event database is characterized by comprising the following steps:
(1) dividing all current spatial situation characteristic event data into normalized data and descriptive data; the normalized data is a spatial situation related data table issued by a third party, and the descriptive data is a spatial situation characteristic event reported by a news media;
(2) establishing a corresponding empty table in a database according to the keywords of the normalized data, and importing the content of the normalized data into the empty table to form a normalized database table;
the following processing is performed on the descriptive data:
(3) preprocessing the descriptive data, and extracting titles, countries, time, keywords, contents and sources from the data;
(4) establishing a database form, taking the title, country, time, keywords, content, source, tags and accessories as the form header, and finishing the initial filling of the form by using the information extracted in the step (3); wherein the content of the label part is empty, and the content part fills all the character information in the news media report; the attachment contains complete news media report information;
(5) establishing a tag library, wherein a header in the tag library comprises space laws and policies, space facilities and equipment, organizations and personnel, application strategies of the space equipment, balance strategies of military and civil commerce, longitudinal and transverse strategies of international communication, natural space resources and environment, social concepts and scientific and technological environments, and spatial situation perception and comprehensive evaluation;
(6) according to the frequency of the keywords in the database form in the step (4), calculating the weight of the keywords according to the frequency of the keywords, selecting the keywords with the weight exceeding a preset value as tags, and classifying and filling the tags into the corresponding headers in the step (5);
(7) comparing the key words corresponding to each spatial situation characteristic event in the database form with the labels in the label library, and filling the matched labels into the label part of the event form header;
(8) designing search functions for the database, wherein the search functions comprise general search and deep search; setting a pull-down list in general searching, selecting the 'country' and 'time' items in the table header of a database table by a user, inputting a single keyword by the user to search the database and listing an event list; the deep search can be carried out according to a common key word or a plurality of key word combinations, and the relation of logic 'AND' or 'OR' is selected by a user when the plurality of key words are subjected to the combined search; or single or multiple keywords and single or multiple labels are searched in a combined mode, the number of the labels is user-defined, the relation between the keywords is selected as 'AND' or 'OR' by a user, the relation between the label items is selected as 'AND' or 'OR' by the user, and the relation between the keywords and the labels is fixed as the logical 'AND' relation.
2. The method of claim 1, wherein: and designing a visual interaction page for the database, wherein the visual page comprises two visual pages of normalized data and descriptive data, and each visual page comprises a data import function.
3. The method of claim 2, wherein: each visual interactive page also comprises two parts besides a data importing function, wherein one part displays the titles as a list, and the other part displays complete database list information associated with the titles.
4. The method of claim 2, wherein: and (3) introducing single space situation characteristic event data or introducing space situation characteristic event data in batch regularly or in real time by using a data introduction function, and re-executing the steps (1) - (7) on all current space situation characteristic event data to complete the updating of the database.
5. The method of claim 2, wherein: and (3) executing the steps (1) to (6) by inputting data of a specific time period, and sequencing the keywords according to the weights in the step (6) so as to determine the hot spot space situation vocabulary in the specific time period.
CN201711157764.1A 2017-11-20 2017-11-20 Method for designing spatial situation characteristic event database Active CN107748803B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711157764.1A CN107748803B (en) 2017-11-20 2017-11-20 Method for designing spatial situation characteristic event database

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711157764.1A CN107748803B (en) 2017-11-20 2017-11-20 Method for designing spatial situation characteristic event database

Publications (2)

Publication Number Publication Date
CN107748803A CN107748803A (en) 2018-03-02
CN107748803B true CN107748803B (en) 2021-02-09

Family

ID=61251584

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711157764.1A Active CN107748803B (en) 2017-11-20 2017-11-20 Method for designing spatial situation characteristic event database

Country Status (1)

Country Link
CN (1) CN107748803B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109509512B (en) * 2018-07-10 2021-04-30 北京大学 Clinical business automatic library building method based on Excel import
CN111078905A (en) * 2018-10-22 2020-04-28 北京国双科技有限公司 Data processing method, device, medium and equipment
CN109858869A (en) * 2018-12-21 2019-06-07 厦门市美亚柏科信息股份有限公司 Method and apparatus for handling event information
CN109933669B (en) * 2019-03-19 2023-04-21 南京大学 Matching method of battlefield situation data labels
CN110516048A (en) * 2019-09-02 2019-11-29 苏州朗动网络科技有限公司 The extracting method, equipment and storage medium of list data in pdf document
CN110543914B (en) * 2019-09-04 2022-06-24 软通智慧信息技术有限公司 Event data processing method and device, computing equipment and medium
CN110826194A (en) * 2019-10-18 2020-02-21 内蒙动力机械研究所 Modeling method for reliability data of solid rocket engine

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7958127B2 (en) * 2007-02-15 2011-06-07 Uqast, Llc Tag-mediated review system for electronic content
CN103106199B (en) * 2011-11-09 2016-03-02 中国移动通信集团四川有限公司 Text searching method and device
CN102831234B (en) * 2012-08-31 2015-04-22 北京邮电大学 Personalized news recommendation device and method based on news content and theme feature
CN103186662B (en) * 2012-12-28 2016-08-03 北京中油网资讯技术有限公司 A kind of dynamically public sentiment keyword abstraction system and method
US11281716B2 (en) * 2014-07-29 2022-03-22 DISH Technologies L.L.C. Apparatus, systems and methods for media content searching
CN106294742B (en) * 2016-08-10 2019-05-14 中国科学技术大学 A kind of space launching site security reliability database construction method and analysis and assessment system
CN106599174A (en) * 2016-12-12 2017-04-26 国云科技股份有限公司 Real-time news recommendation system and method thereof

Also Published As

Publication number Publication date
CN107748803A (en) 2018-03-02

Similar Documents

Publication Publication Date Title
CN107748803B (en) Method for designing spatial situation characteristic event database
US9652467B2 (en) Inline tree data structure for high-speed searching and filtering of large datasets
US8086592B2 (en) Apparatus and method for associating unstructured text with structured data
US9697250B1 (en) Systems and methods for high-speed searching and filtering of large datasets
CN101535945A (en) Full text query and search systems and method of use
WO2005010727A2 (en) Extracting data from semi-structured text documents
CN103548019A (en) Method and system for providing statistical from a data warehouse
WO2024065952A1 (en) Remote sensing satellite information recommendation method, system and device
CN102467544B (en) Information smart searching method and system based on space fuzzy coding
CN105183803A (en) Personalized search method and search apparatus thereof in social network platform
CN108052668A (en) The endowed method and system of intelligence based on commodity code
US9524341B2 (en) Retrieval system and method of searching of information in the internet
CN101853163B (en) Industry application software system construction method based on assembly business modeling
CN100421107C (en) Data structure and management system for a superset of relational databases
US20210240334A1 (en) Interactive patent visualization systems and methods
CN101133416A (en) Database management apparatus and method of managing database
WO2014113327A2 (en) Intellectual property asset information retrieval system
CN101088082A (en) Full text query and search systems and methods of use
Wang et al. AceMap: Knowledge Discovery through Academic Graph
KR100855238B1 (en) Patent Searching Method and Patent Search System Using Query Automatically Containing hierarchically Subordinate Patent Classifications
Sinif et al. Approaching an optimizing open linked government data portal
CN100496091C (en) System for making global search in wired TV one-way set-top box
Monaco Methods for in-sourcing authority control with MarcEdit, SQL, and regular expressions
US20080158161A1 (en) Data entry processing
Litvinov et al. Paradigm of controls concept for global information systems

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant