CN107748803B - Method for designing spatial situation characteristic event database - Google Patents
Method for designing spatial situation characteristic event database Download PDFInfo
- Publication number
- CN107748803B CN107748803B CN201711157764.1A CN201711157764A CN107748803B CN 107748803 B CN107748803 B CN 107748803B CN 201711157764 A CN201711157764 A CN 201711157764A CN 107748803 B CN107748803 B CN 107748803B
- Authority
- CN
- China
- Prior art keywords
- data
- database
- keywords
- event
- space
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2282—Tablespace storage structures; Management thereof
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
- G06F16/211—Schema design and management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
A design method of a spatial situation characteristic event database comprises the following steps of (1) dividing all current spatial situation characteristic event data into normalized data and descriptive data; (2) establishing a corresponding empty table in a database according to the keywords of the normalized data, and importing the content of the normalized data into the empty table to form a normalized database table; the following processing is performed on the descriptive data: (3) extracting titles, countries, time, keywords, contents and sources from the data; (4) establishing a database form; (5) establishing a label library; (6) according to the frequency of the keywords in the database form in the step (4), calculating the weight of the keywords according to the frequency of the keywords, selecting the keywords with the weight exceeding a preset value as tags, and classifying and filling the tags into the corresponding headers in the step (5); (7) and comparing the key words corresponding to each spatial situation characteristic event in the database form with the labels in the label library, and filling the matched labels into the label part of the event form header.
Description
Technical Field
The invention belongs to the technical field of space situation perception, space situation evaluation and the like. And aiming at the characteristics of multisource isomerism and distribution of the spatial situation data, a spatial data model is constructed on the basis of analyzing spatial situation constituent elements, and finally the construction of the spatial situation integrated data model is completed.
Background
The spatial situation event information mainly relates to news reports of space activities carried out by countries in the world, and the event database is used for classifying, storing, maintaining and managing the data information so as to be used for subsequent spatial situation evaluation and analysis.
"construction and application of national group event database" (public safety, 03 2017) "a sentence is based on a comet news database, 48 news reports are sampled, and ROST CM6.0 is adopted to perform word segmentation and high-frequency word analysis on the sampled news reports, so that 30 high-frequency words related to national group events are extracted. Then 5708 news reports about the national group event are screened from more than 600 ten thousand related news based on high-frequency words, and the news reports are encoded and checked for credibility, so that a national group event database is constructed (1998-. The database decomposes each event information into 17 fields for coding, but the method can not adopt a uniform coding format to code the event information with different sources and different data structures. The feature-based spatial situation integrated data model (survey and drawing project, volume 24, No. 8 in 2015) adopts a feature-based modeling method to carry out conceptual modeling, combines an object-oriented method to carry out logic model design, and constructs a spatial situation data physical model based on XML, thereby realizing the construction of the spatial situation integrated data model. The method has the advantages that the attention point is the comprehensive expression of the space situation information components, and the storage management of the event information is not involved.
The information of the space event mainly comes from nine fields of information sources: the method comprises the following steps of space law and policy, space facility and equipment, organization and personnel, application strategy of space equipment, balance strategy of military and civil business, cross strategy of international interaction, natural space resource and environment, social concept and scientific and technological environment and space situation perception and comprehensive evaluation. The description difference of various spatial events is large, different information contains different elements, the focus of attention is different, and a uniform event description format does not exist. Therefore, the current database design technology is not sufficient to directly support the design implementation of the spatial situation characteristic event database.
Disclosure of Invention
The technical problem to be solved by the invention is as follows: the defects of the prior art are overcome, and the method for preventing errors in the whole process of aerospace servo valve development is provided.
The technical solution of the invention is as follows: a method for designing a spatial situation characteristic event database comprises the following steps:
(1) dividing all current spatial situation characteristic event data into normalized data and descriptive data; the normalized data is a spatial situation related data table issued by a third party, and the descriptive data is a spatial situation characteristic event reported by a news media;
(2) establishing a corresponding empty table in a database according to the keywords of the normalized data, and importing the content of the normalized data into the empty table to form a normalized database table;
the following processing is performed on the descriptive data:
(3) preprocessing the descriptive data, and extracting titles, countries, time, keywords, contents and sources from the data;
(4) establishing a database form, taking the title, country, time, keywords, content, source, tags and accessories as the form header, and finishing the initial filling of the form by using the information extracted in the step (3); wherein the content of the label part is empty, and the content part fills all the character information in the news media report; the attachment contains complete news media report information;
(5) establishing a tag library, wherein a header in the tag library comprises space laws and policies, space facilities and equipment, organizations and personnel, application strategies of the space equipment, balance strategies of military and civil commerce, longitudinal and transverse strategies of international communication, natural space resources and environment, social concepts and scientific and technological environments, and spatial situation perception and comprehensive evaluation;
(6) according to the frequency of the keywords in the database form in the step (4), calculating the weight of the keywords according to the frequency of the keywords, selecting the keywords with the weight exceeding a preset value as tags, and classifying and filling the tags into the corresponding headers in the step (5);
(7) and comparing the key words corresponding to each spatial situation characteristic event in the database form with the labels in the label library, and filling the matched labels into the label part of the event form header.
Further, a visual interaction page is designed for the database, the visual page comprises two visual pages of normalized data and descriptive data, and each visual page comprises a data import function.
Furthermore, each visual interactive page also comprises two parts besides the data import function, wherein one part displays the titles as a list, and the other part displays the complete database list information associated with the titles.
Further, a data import function is used for importing single space situation characteristic event data or importing space situation characteristic event data in batch regularly or in real time, the steps (1) - (7) are executed again on all current space situation characteristic event data, and updating of the database is completed.
Further, the steps (1) to (6) are executed by inputting data of a specific time period, and the keywords are sorted according to the weight in the step (6), so that the hotspot space situation vocabulary in the specific time period is determined.
Compared with the prior art, the invention has the beneficial effects that:
the invention aims to construct a better database mode, design the mutual relation between the storage structure of data and data objects according to the characteristics of space event intelligence information, and establish a space event database and an application system thereof, so that the space event database can effectively store multi-source and heterogeneous event data and meet various user requirements (comprising the functions of information inquiry, classification, statistical analysis and the like). Compared with the prior art, the invention has the beneficial effects that:
(1) heterogeneous event data can be stored in a uniform manner. The spatial situation characteristic events can be described in various ways according to different sources and fields, and a simple log-type storage way is not beneficial to later-stage searching and analysis. For example: the description of the spatial legislation may be that "a certain country promulgated a certain part of the spatial law in a certain month and a certain country in a certain year and a certain time of a certain day and a certain payload is carried in a certain country, and the description of the spatial characteristic event has a common field/keyword (such as time) and different parts (such as specific activity content). The field design workload of fine granularity is huge and it is difficult to ensure comprehensive coverage. The invention analyzes and extracts common fields/key words (including titles, nations, time and the like) of the event description, and completely reserves the content description part of each piece of event information, so that different types of event information can be stored in a uniform mode.
(2) A label library is constructed, the labeling of event information is realized, and the advanced and deep search is conveniently carried out. The above-mentioned unified storage of event information can only support simple search according to limited keywords. In order to realize the retrieval according to the content, the invention designs a label library for labeling the event information. The labels in the label library are classified names of the space events, empty keywords are reserved in the event items, the labels in the label library are selected for each piece of event information by a user and are endowed with self-defined labels, and the event information can be accurately searched by inputting the keywords, the labels or the combination of the keywords and the labels during searching. In addition, the label library is maintainable and expandable, and new labels are modified or added by users according to actual needs.
Drawings
FIG. 1 is a flow chart of the present invention;
FIG. 2 is a database structure of spatial situation characteristic events according to the present invention;
FIG. 3 is a schematic diagram of a spatial situation characteristic event database using tag search according to the present invention.
Detailed Description
A method for designing a spatial situation characteristic event database is shown in FIG. 1, and comprises the following steps:
(1) dividing all current spatial situation characteristic event data into normalized data and descriptive data; the normalized data is a spatial situation related data table issued by a third party, and the descriptive data is a spatial situation characteristic event reported by a news media;
for example: a Satellite Database (UCS Satellite Database) published by the united states hypochondriac scientist consortium is typical normalized data, which is in an Excel format, and lists on-orbit Satellite information, keywords such as Satellite name, registered country, all countries, orbit type, near-position height, far-position height, inclination angle, period, emission quality and the like, wherein each row corresponds to all relevant parameters of a specific on-orbit Satellite. On the other hand, according to the daily satellite news website report, on day 2/4/2014, the russian navigation satellite system consisting of 24 'glonass' satellites fails, resulting in service interruption for more than ten hours. … … "is a news report of spatial situation characteristic event, which is a descriptive text (also can contain graph, table, multimedia, etc.), without keyword decomposition.
(2) Establishing a corresponding empty table in a database according to the keywords of the normalized data, and importing the content of the normalized data into the empty table to form a normalized database table;
the following processing is performed on the descriptive data:
(3) preprocessing the descriptive data, and extracting titles, countries, time, keywords, contents and sources from the data;
for example: or "according to the daily satellite news website report, on 4 months and 2 days in 2014, the russian navigation satellite system consisting of 24 'glonass' satellites fails, resulting in service interruption for more than ten hours. … …, taking descriptive data as an example, the extraction title is that the Russian 'Grounels' satellite navigation system fails, the country is Russian, the time is 4 months and 2 days in 2014, the keywords are Grounels, navigation systems and failures, the content is complete text description of the news, and the source is a daily satellite news website.
(4) Establishing a database form, taking the title, country, time, keywords, content, source, tags and accessories as the form header, and finishing the initial filling of the form by using the information extracted in the step (3); wherein the content of the label part is empty, and the content part fills all the character information in the news media report; the attachment contains complete news media story information. As shown in the lower half of fig. 2;
(5) and establishing a tag library, wherein the header in the tag library comprises space laws and policies, space facilities and equipment, organizations and personnel, application strategies of the space equipment, trade-off strategies for civil and military use, longitudinal and transverse strategies for international interaction, perception and comprehensive evaluation of natural space resources and environment, social concepts and scientific and technological environments and space situations. As shown in the upper half of fig. 2;
(6) and (5) calculating the weight of the keyword according to the frequency of the keyword in the database form in the step (4), selecting the keyword with the weight exceeding a preset value as a label, and classifying and filling the label into the corresponding form header in the step (5). As indicated by the arrow on the right half of fig. 2;
for example: after the spatial situation characteristic event data is imported into the database, the keywords 'navigation system' appear in the whole database form together by n1Second (word frequency), other keywords respectively appear n2、n3、……nkNext, k is a keywordThe total number, the weight of the keyword 'navigation system' is calculated as x ═ n1/(n1+n2+n3+……nk) The weight presets y, if x>And y, classifying the keyword label under the head of the aerospace facility and equipment in the label library.
(7) And comparing the key words corresponding to each spatial situation characteristic event in the database form with the labels in the label library, and filling the matched labels into the label part of the event form header.
For example: or "according to the daily satellite news website report, on 4 months and 2 days in 2014, the russian navigation satellite system consisting of 24 'glonass' satellites fails, resulting in service interruption for more than ten hours. … …, the event contains the keyword "navigation system", and if the keyword is classified in the tag library by the step (6) processing, it shows that the keyword corresponding to the item of spatial situation characteristic event matches with the tag in the tag library, the "navigation system" is filled in the tag part of the item of descriptive data in the database form.
(8) And designing a visual interaction page for the database, wherein the visual page comprises two visual pages of normalized data and descriptive data, and each visual page comprises a data import function. The visual page directly reads the relevant database form from the database and displays the relevant database form in a row and column mode. The data import comprises single data import and batch import, the single data import is a pop-up dialog box, the title, country, time, keywords, content and source of the spatial situation characteristic event are manually filled in, and the attachment is uploaded, and the operations are all realized by using the universal control. Finally, the tag of the event data is filled in through step (7). The batch import can integrally import the spatial situation characteristic event data subjected to the normalized preprocessing at one time, and is realized by using a universal control.
(9) And (5) when new spatial situation characteristic event data are added in the database, automatically triggering the calculation operation in the step (6), updating the word frequency and the weight of the keywords, reordering the keywords, extracting the labels and updating the label database.
(10) And (3) executing the steps (1) to (6) by inputting data of a specific time period, and sequencing the keywords according to the weights in the step (6) so as to determine the hot spot space situation vocabulary in the specific time period.
For example: inputting the spatial situation characteristic event data from 1/2015 to 31/2015, and executing all the operations in the steps (1) - (6) to obtain the hot spot spatial situation vocabulary ordering in the time period as follows: the method comprises the steps of reusable, deep space exploration, electric propulsion, internet constellation and … …, and the first few words are annual hot spot space situation vocabularies.
(11) Search functions are designed for the database, including general searches and deep searches. The general search is to set a pull-down list, the user selects the 'country' and 'time' items in the header of the database form, the user inputs a single keyword to search the database and list an event list. The deep search can be carried out according to a common key word or a plurality of key word combinations, and the relation of logic 'AND' or 'OR' is selected by a user when the plurality of key words are subjected to the combined search; the combined search can also be carried out on single or a plurality of keywords and single or a plurality of labels, the number of the labels is user-defined, the relation between the keywords is selected as 'AND' or 'OR' by a user, the relation between the label items is selected as 'AND' or 'OR' by the user, and the relation between the keywords and the labels is fixed as the logical 'AND' relation. As shown in fig. 3. Implemented with a generic control.
For example: a separate search may be made for the keyword "country" term selection or for the input value "usa"; the keywords 'country' item and 'time' item can be respectively selected or input into the values 'American' and '2014.01.12', the logical relation is selected as 'AND', and the combined search is carried out; the search conditions may also be set to: the keyword "country" item is "United states", the label items are respectively set as "deep space", "asteroid" and "moon", the label items are in a logical "OR" relationship with each other, and database search is performed.
The invention has not been described in detail in part of the common general knowledge of those skilled in the art.
Claims (5)
1. A method for designing a spatial situation characteristic event database is characterized by comprising the following steps:
(1) dividing all current spatial situation characteristic event data into normalized data and descriptive data; the normalized data is a spatial situation related data table issued by a third party, and the descriptive data is a spatial situation characteristic event reported by a news media;
(2) establishing a corresponding empty table in a database according to the keywords of the normalized data, and importing the content of the normalized data into the empty table to form a normalized database table;
the following processing is performed on the descriptive data:
(3) preprocessing the descriptive data, and extracting titles, countries, time, keywords, contents and sources from the data;
(4) establishing a database form, taking the title, country, time, keywords, content, source, tags and accessories as the form header, and finishing the initial filling of the form by using the information extracted in the step (3); wherein the content of the label part is empty, and the content part fills all the character information in the news media report; the attachment contains complete news media report information;
(5) establishing a tag library, wherein a header in the tag library comprises space laws and policies, space facilities and equipment, organizations and personnel, application strategies of the space equipment, balance strategies of military and civil commerce, longitudinal and transverse strategies of international communication, natural space resources and environment, social concepts and scientific and technological environments, and spatial situation perception and comprehensive evaluation;
(6) according to the frequency of the keywords in the database form in the step (4), calculating the weight of the keywords according to the frequency of the keywords, selecting the keywords with the weight exceeding a preset value as tags, and classifying and filling the tags into the corresponding headers in the step (5);
(7) comparing the key words corresponding to each spatial situation characteristic event in the database form with the labels in the label library, and filling the matched labels into the label part of the event form header;
(8) designing search functions for the database, wherein the search functions comprise general search and deep search; setting a pull-down list in general searching, selecting the 'country' and 'time' items in the table header of a database table by a user, inputting a single keyword by the user to search the database and listing an event list; the deep search can be carried out according to a common key word or a plurality of key word combinations, and the relation of logic 'AND' or 'OR' is selected by a user when the plurality of key words are subjected to the combined search; or single or multiple keywords and single or multiple labels are searched in a combined mode, the number of the labels is user-defined, the relation between the keywords is selected as 'AND' or 'OR' by a user, the relation between the label items is selected as 'AND' or 'OR' by the user, and the relation between the keywords and the labels is fixed as the logical 'AND' relation.
2. The method of claim 1, wherein: and designing a visual interaction page for the database, wherein the visual page comprises two visual pages of normalized data and descriptive data, and each visual page comprises a data import function.
3. The method of claim 2, wherein: each visual interactive page also comprises two parts besides a data importing function, wherein one part displays the titles as a list, and the other part displays complete database list information associated with the titles.
4. The method of claim 2, wherein: and (3) introducing single space situation characteristic event data or introducing space situation characteristic event data in batch regularly or in real time by using a data introduction function, and re-executing the steps (1) - (7) on all current space situation characteristic event data to complete the updating of the database.
5. The method of claim 2, wherein: and (3) executing the steps (1) to (6) by inputting data of a specific time period, and sequencing the keywords according to the weights in the step (6) so as to determine the hot spot space situation vocabulary in the specific time period.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711157764.1A CN107748803B (en) | 2017-11-20 | 2017-11-20 | Method for designing spatial situation characteristic event database |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711157764.1A CN107748803B (en) | 2017-11-20 | 2017-11-20 | Method for designing spatial situation characteristic event database |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107748803A CN107748803A (en) | 2018-03-02 |
CN107748803B true CN107748803B (en) | 2021-02-09 |
Family
ID=61251584
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711157764.1A Active CN107748803B (en) | 2017-11-20 | 2017-11-20 | Method for designing spatial situation characteristic event database |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107748803B (en) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109509512B (en) * | 2018-07-10 | 2021-04-30 | 北京大学 | Clinical business automatic library building method based on Excel import |
CN111078905A (en) * | 2018-10-22 | 2020-04-28 | 北京国双科技有限公司 | Data processing method, device, medium and equipment |
CN109858869A (en) * | 2018-12-21 | 2019-06-07 | 厦门市美亚柏科信息股份有限公司 | Method and apparatus for handling event information |
CN109933669B (en) * | 2019-03-19 | 2023-04-21 | 南京大学 | Matching method of battlefield situation data labels |
CN110516048A (en) * | 2019-09-02 | 2019-11-29 | 苏州朗动网络科技有限公司 | The extracting method, equipment and storage medium of list data in pdf document |
CN110543914B (en) * | 2019-09-04 | 2022-06-24 | 软通智慧信息技术有限公司 | Event data processing method and device, computing equipment and medium |
CN110826194A (en) * | 2019-10-18 | 2020-02-21 | 内蒙动力机械研究所 | Modeling method for reliability data of solid rocket engine |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7958127B2 (en) * | 2007-02-15 | 2011-06-07 | Uqast, Llc | Tag-mediated review system for electronic content |
CN103106199B (en) * | 2011-11-09 | 2016-03-02 | 中国移动通信集团四川有限公司 | Text searching method and device |
CN102831234B (en) * | 2012-08-31 | 2015-04-22 | 北京邮电大学 | Personalized news recommendation device and method based on news content and theme feature |
CN103186662B (en) * | 2012-12-28 | 2016-08-03 | 北京中油网资讯技术有限公司 | A kind of dynamically public sentiment keyword abstraction system and method |
US11281716B2 (en) * | 2014-07-29 | 2022-03-22 | DISH Technologies L.L.C. | Apparatus, systems and methods for media content searching |
CN106294742B (en) * | 2016-08-10 | 2019-05-14 | 中国科学技术大学 | A kind of space launching site security reliability database construction method and analysis and assessment system |
CN106599174A (en) * | 2016-12-12 | 2017-04-26 | 国云科技股份有限公司 | Real-time news recommendation system and method thereof |
-
2017
- 2017-11-20 CN CN201711157764.1A patent/CN107748803B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN107748803A (en) | 2018-03-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107748803B (en) | Method for designing spatial situation characteristic event database | |
US9652467B2 (en) | Inline tree data structure for high-speed searching and filtering of large datasets | |
US8086592B2 (en) | Apparatus and method for associating unstructured text with structured data | |
US9697250B1 (en) | Systems and methods for high-speed searching and filtering of large datasets | |
CN101535945A (en) | Full text query and search systems and method of use | |
WO2005010727A2 (en) | Extracting data from semi-structured text documents | |
CN103548019A (en) | Method and system for providing statistical from a data warehouse | |
WO2024065952A1 (en) | Remote sensing satellite information recommendation method, system and device | |
CN102467544B (en) | Information smart searching method and system based on space fuzzy coding | |
CN105183803A (en) | Personalized search method and search apparatus thereof in social network platform | |
CN108052668A (en) | The endowed method and system of intelligence based on commodity code | |
US9524341B2 (en) | Retrieval system and method of searching of information in the internet | |
CN101853163B (en) | Industry application software system construction method based on assembly business modeling | |
CN100421107C (en) | Data structure and management system for a superset of relational databases | |
US20210240334A1 (en) | Interactive patent visualization systems and methods | |
CN101133416A (en) | Database management apparatus and method of managing database | |
WO2014113327A2 (en) | Intellectual property asset information retrieval system | |
CN101088082A (en) | Full text query and search systems and methods of use | |
Wang et al. | AceMap: Knowledge Discovery through Academic Graph | |
KR100855238B1 (en) | Patent Searching Method and Patent Search System Using Query Automatically Containing hierarchically Subordinate Patent Classifications | |
Sinif et al. | Approaching an optimizing open linked government data portal | |
CN100496091C (en) | System for making global search in wired TV one-way set-top box | |
Monaco | Methods for in-sourcing authority control with MarcEdit, SQL, and regular expressions | |
US20080158161A1 (en) | Data entry processing | |
Litvinov et al. | Paradigm of controls concept for global information systems |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |