Nothing Special   »   [go: up one dir, main page]

CN110149139A - A kind of compression transmitting method of Big Dipper short message - Google Patents

A kind of compression transmitting method of Big Dipper short message Download PDF

Info

Publication number
CN110149139A
CN110149139A CN201910413080.6A CN201910413080A CN110149139A CN 110149139 A CN110149139 A CN 110149139A CN 201910413080 A CN201910413080 A CN 201910413080A CN 110149139 A CN110149139 A CN 110149139A
Authority
CN
China
Prior art keywords
short message
big dipper
compression
dipper short
algorithm
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910413080.6A
Other languages
Chinese (zh)
Inventor
王垚
杨赫
赵精博
郝硕
任鸿飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
CETC 54 Research Institute
Original Assignee
CETC 54 Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by CETC 54 Research Institute filed Critical CETC 54 Research Institute
Priority to CN201910413080.6A priority Critical patent/CN110149139A/en
Publication of CN110149139A publication Critical patent/CN110149139A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/3084Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method
    • H03M7/3088Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction using adaptive string matching, e.g. the Lempel-Ziv method employing the use of a dictionary, e.g. LZ78
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B7/00Radio transmission systems, i.e. using radiation field
    • H04B7/14Relay systems
    • H04B7/15Active relay systems
    • H04B7/185Space-based or airborne stations; Stations for satellite systems
    • H04B7/1851Systems using a satellite or space-based relay
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B7/00Radio transmission systems, i.e. using radiation field
    • H04B7/14Relay systems
    • H04B7/15Active relay systems
    • H04B7/185Space-based or airborne stations; Stations for satellite systems
    • H04B7/1851Systems using a satellite or space-based relay
    • H04B7/18517Transmission equipment in earth stations

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Astronomy & Astrophysics (AREA)
  • Aviation & Aerospace Engineering (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The invention discloses a kind of compression transmitting methods of Big Dipper short message, it effectively combines the lzw algorithm (Lempel-Ziv-Welch Encoding) in LZ series coding with Chinese Word Automatic Segmentation, according to Chinese use habit, the problem of construction is suitable for the dictionary for word segmentation of Big Dipper short message, and this method overcomes the length limited that Big Dipper short message occurs in transmission process to a certain extent.

Description

A kind of compression transmitting method of Big Dipper short message
Technical field
The invention belongs to compression coding technology field in digital communication more particularly to a kind of compression transmissions of Big Dipper short message Method.
Background technique
Chinese Beidou satellite navigation system is the Global Satellite Navigation System that China voluntarily develops.It is to be positioned after US Global The satellite navigation system of third maturation after system (GPS), Russian Glonass satellite navigation system (GLONASS).System System can round-the-clock in the world, round-the-clock be all types of user provide high-precision, it is highly reliable positioning, navigate, time service service, and With short message communication capacity.Big Dipper short message communication is the featured function of dipper system, can be subscriber computer and subscriber computer, user The communication service of 120 Chinese characters is at most provided every time between machine and ground central station.But as practical application, the limitation of capacity is not It is sufficient for the demand of reality.Therefore, it is necessary to combine the actual conditions of dipper system, a kind of data compression scheme is designed, that is, is existed Within the scope of limited message length, warning message is compressed as far as possible, conveys more letters with identical character quantity Breath, and reconfiguration information is identical with original information, i.e., lossless data compression.
Chinese word segmentation is the first step to Big Dipper short message compression processing, and traditional compression algorithm is to the north based on Chinese The short message compression effectiveness that struggles against is unobvious, therefore we convert Numerical Index code for Big Dipper short message first with Chinese word segmentation, Then compression processing is carried out again.Currently, common Chinese word cutting method includes: segmenting method based on dictionary, based on understanding Segmenting method and segmenting method based on statistics.Segmenting method based on dictionary is most widely used Chinese word cutting method, it It is divided into again: Forward Maximum Method method, reverse maximum matching method and bi-directional matching method.Forward Maximum Method algorithm refers to from left to right By wait segment several continuation characters and dictionary matching in text, if matched, it is syncopated as a word.Reverse maximum matching Algorithm is the reverse thinking of Forward Maximum Method, goes to match by entire character string to be slit first, if matching not at Function then falls the most previous word of matching field, is recycled to entire sentence cutting always and finishes.Bi-directional matching algorithm is the two In conjunction with the cutting result of the two being compared, to select optimal solution.According to SunM.S. and Benjamin K.T.'s (1995) Studies have shown that 90.0% or so sentence, two kinds of matching methods are completely coincident and correctly, and two kinds of sentence of only general 9.0% are cut The result that point method obtains is different, but it is correct for must wherein having one.It can be seen that bi-directional matching algorithm cuts Chinese Divide accuracy higher.Therefore, the Chinese segment method that computation complexity is low, algorithm accuracy is high, has a wide range of application has important Research significance and application value.
It is exactly by dynamically establishing a string table, the shorter code of use that lzw algorithm, which is called " string list compression algorithm ", Come indicate longer character string realize compression, lzw algorithm is a kind of lossless compression algorithm.Lzw algorithm can efficiently use character Frequency of occurrences redundancy is compressed, and dictionary is adaptive generation, it has the characteristics that below several: LZW compress technique The data little for predictability have good treatment effect, commonly use the compression of images with TJF format;For in data flow The byte and word string for continuously repeating appearance have very high compression ratio;LZW compress technique is also used for the data compressions such as text application Field;Common ARC, RKARC, PKZIP Efficient Compression program is all the variant of LZW compress technique;For any width and picture The image of plain bit length all has stable compression process;It is of less demanding to machine hardware, on the machine of Intel 80386 It can carry out compression and decompression.
Summary of the invention
The purpose of the invention is to overcome the problems, such as Big Dipper short message, there are length limiteds in communication process, propose one The compression transmitting method of kind Big Dipper short message.
The technical solution adopted by the present invention are as follows:
A kind of compression transmitting method of Big Dipper short message, includes the following steps:
Step 1, the dedicated vocabulary of statistical correlation writes Beidou everyday words dictionary, and assigns index code for each vocabulary;
It step 2, is reference with the Beidou everyday words dictionary that writes, using Chinese Word Automatic Segmentation by Big Dipper short message cutting At k phrase;K≤M, M are the number of characters of Big Dipper short message;
Step 3, respective Numerical Index code is converted according to Beidou everyday words dictionary by the phrase segmented;
Step 4, using lzw algorithm, Numerical Index code character obtained in step 3 is subjected to second-compressed, obtains compressing short Message;
Complete the compression processing to Big Dipper short message.
Wherein, the Chinese Word Automatic Segmentation in step 2 specifically: bi-directional matching algorithm is used, by positive maximum It is compared with algorithm with reverse maximum matching algorithm result, obtains optimal solution.
Wherein, the vocabulary in dictionary that lzw algorithm is established in real time in step 4 has prefix, the phrase quilt of same prefix Combination distributes new number.
The present invention is acquired compared with prior art to be had the beneficial effect that
A kind of compression transmitting method of Big Dipper short message proposed by the present invention, this method computation complexity is low, to Chinese Based on Big Dipper short message have good compression performance.And according to the characteristic of LZW compression algorithm, this method, which has, to be repeated The characteristics of vocabulary is more, and compression effectiveness is more obvious.
Detailed description of the invention
Fig. 1 is a kind of compression transmission flow chart of Big Dipper short message of the present invention;
Fig. 2 is the dictionary model figure of the Chinese Word Automatic Segmentation based on dictionary in the present invention;
Fig. 3 is LZW compression algorithm work flow diagram in the specific embodiment of the invention.
Specific embodiment
Objects and advantages in order to better illustrate the present invention, 1-3 and embodiment are to skill of the invention with reference to the accompanying drawing Art scheme is described further.In this embodiment, following short message information will be used to describe as example:
Short message []={ s1s2s3s4s5s6s7s4s5s6s7}
Big Dipper short message once can only at most transmit 120 characters in communication process, in order to clearly describe algorithm work Make principle, chooses 11 characters as compression goal.And according to compression algorithm characteristic, there are repeat character (RPT)s in target to be compressed s4s5s6s7
In order to describe the needs of algorithm, further reduce the scale of problem, it is assumed that Forward Maximum Method algorithm and it is reverse most Big matching algorithm result is identical, and algorithm description is carried out using Forward Maximum Method algorithm as example, problem scale diminution and The demonstration program of this example is not influenced, and Big Dipper short message compression transmission process is as shown in Figure 1.
A kind of compression transmitting method of Big Dipper short message, the Big Dipper short message to be compressed are as follows: short message []={ s1s2s3s4s5s6s7s4s5s6s7, include the following steps:
Step 1, the dedicated vocabulary of statistical correlation writes Beidou everyday words dictionary, and assigns index code for each vocabulary;
Step 2, according to the process sequence of Fig. 1, short message to be compressed is subjected to Chinese word segmentation processing first, it is assumed that participle institute The dictionary of reference are as follows:
Dic []={ " s1s2”“s1s2s3”“s4s5”“s6s7”}
By taking Forward Maximum Method algorithm as an example, short message to be slit is scanned from left to right, scanning to s1s2It is the discovery that Word in dictionary, but cannot determine whether to continue to scan on for maximum vocabulary, find s1s2s3It is the word in dictionary, does not can determine that still It whether is maximum vocabulary, until scanning is to the end, determination is syncopated as first word s1s2s3.It continues to scan on, until entire short message It is split as following phrase:
Word []={ s1s2s3 s4s5 s6s7 s4s5 s6s7}
Step 3, the phrase segmented is converted into respective index code;
The phrase that will have been segmented is converted to respective Numerical Index code shape according to transformation rule as shown in Figure 2 Formula, phrase will be converted into following form (inFor index code of the vocabulary in dictionary):
Word []={ ic ia ib ia ib}
Step 4, second-compressed is carried out to index code character using lzw algorithm;
The index code character that step 3 is generated is according to carry out second-compressed processing as shown in Figure 3, and lzw algorithm is to be compressed Phrase is scanned, and dynamically establishes phrase table, in scanning process, by prefix (ia ib) typing dictionary, it is to be scanned to next (ia ib) find that the prefix is already present in dictionary, use new number (n+1) (n is by the number of word in dictionary) to replace, directly It is finished to compression, index code character is compressed into following form:
Code []={ ic ia ib (n+1)}
So far, the process of Big Dipper short message compressed encoding is terminated.
Lzw algorithm will dynamically establish dictionary, and the vocabulary in dictionary has the property that
(the already present prefix of dictionary in ω expression)
Through lzw algorithm second-compressed, following compression short message is obtained:
Msg (A)=A1/A2/.../AN,(N≤k)
It is the concrete analysis compression method to the compression effectiveness of Big Dipper short message, chooses the alarm message of different length, And it is divided into: 10 word groups, 20 word groups, 50 word groups, 100 word groups, 120 word groups, 150 word groups and 200 word groups.Every group of selection 5 alarms Segment carries out coding compression, and every group of compressed average number of words and compression ratio is asked (to compress short message after preceding message number of words/compression Average number of words), and record data.The results are shown in Table 1, it can be seen that new method has good compression to imitate Big Dipper short message Fruit.Test allocation of computer used are as follows: Intel Core 3.30GHz processor, 8GB memory, Matlab R2016a software are flat Platform.
1 Big Dipper short message compression transmission experimental result of table
Short message number of words before compressing Short message average number of words after compression Compression ratio
10 word groups 9.4 1.064
20 word groups 18.6 1.075
50 word groups 46.4 1.078
100 word groups 92.4 1.082
120 word groups 110.8 1.083
150 word groups 138.2 1.085
Although the embodiments of the invention are described in conjunction with the attached drawings, it will be apparent to those skilled in the art that not taking off Under the premise of from the principle of the invention, various modifications and improvements can be made, these also should be regarded as belonging to protection model of the invention It encloses.

Claims (3)

1. a kind of compression transmitting method of Big Dipper short message, it is characterised in that include the following steps:
Step 1, the dedicated vocabulary of statistical correlation writes Beidou everyday words dictionary, and assigns index code for each vocabulary;
Step 2, it is reference with the Beidou everyday words dictionary write, Big Dipper short message is cut into k using Chinese Word Automatic Segmentation A phrase;K≤M, M are the number of characters of Big Dipper short message;
Step 3, respective Numerical Index code is converted according to Beidou everyday words dictionary by the phrase segmented;
Step 4, using lzw algorithm, Numerical Index code character obtained in step 3 is subjected to second-compressed, obtains compression short message;
Complete the compression processing to Big Dipper short message.
2. a kind of compression transmitting method of Big Dipper short message according to claim 1, it is characterised in that: the institute in step 2 The Chinese Word Automatic Segmentation stated specifically: bi-directional matching algorithm is used, by Forward Maximum Method algorithm and reverse maximum matching algorithm As a result it is compared, obtains optimal solution.
3. a kind of compression transmitting method of Big Dipper short message according to claim 1, it is characterised in that: LZW is calculated in step 4 The vocabulary in dictionary that method is established in real time has prefix, and the phrase of same prefix, which is combined, distributes new number.
CN201910413080.6A 2019-05-17 2019-05-17 A kind of compression transmitting method of Big Dipper short message Pending CN110149139A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910413080.6A CN110149139A (en) 2019-05-17 2019-05-17 A kind of compression transmitting method of Big Dipper short message

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910413080.6A CN110149139A (en) 2019-05-17 2019-05-17 A kind of compression transmitting method of Big Dipper short message

Publications (1)

Publication Number Publication Date
CN110149139A true CN110149139A (en) 2019-08-20

Family

ID=67595530

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910413080.6A Pending CN110149139A (en) 2019-05-17 2019-05-17 A kind of compression transmitting method of Big Dipper short message

Country Status (1)

Country Link
CN (1) CN110149139A (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107561564A (en) * 2017-09-08 2018-01-09 广州祺智通信科技股份有限公司 A kind of compression implementation method of big-dipper satellite information transfer

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107561564A (en) * 2017-09-08 2018-01-09 广州祺智通信科技股份有限公司 A kind of compression implementation method of big-dipper satellite information transfer

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
彭皓: "北斗系统用户通信数据扩容技术研究", <<中国优秀硕士学位论文全文数据库>> *

Similar Documents

Publication Publication Date Title
Matsumoto et al. Biological sequence compression algorithms
Nelson et al. The data compression book 2nd edition
US5532694A (en) Data compression apparatus and method using matching string searching and Huffman encoding
CN1167951A (en) Method of and apparatus for compressing and expanding data and data processing apparatus and network system using same
CN101783788A (en) File compression method, file compression device, file decompression method, file decompression device, compressed file searching method and compressed file searching device
CN1228887A (en) Data compression and decompression system with immediate dictionary updating interleaved with string search
CN112953550A (en) Data compression method, electronic device and storage medium
EP0127815B1 (en) Data compression method
CN102385609A (en) Enhancing search-result relevance ranking using uniform resource locators for queries containing non-encoding characters
CN111539199A (en) Text error correction method, device, terminal and storage medium
CN101469989B (en) Compression method for navigation data in mobile phone network navigation
CN101534124A (en) Compression algorithm for short natural language
CN110149139A (en) A kind of compression transmitting method of Big Dipper short message
CN117651076B (en) Adaptive cross-domain multichannel secret source coding compression and decompression method
CN110598057B (en) Data searching method and device for telemetering data
US20080270117A1 (en) Method and system for text compression and decompression
US5812076A (en) Data compressing apparatus, data restoring apparatus and recording medium
Brisaboa et al. New adaptive compressors for natural language text
US10506388B1 (en) Efficient short message compression
JPH09219650A (en) Data encoding device, data decoding device and its method
Klein et al. Using Fibonacci compression codes as alternatives to dense codes
US7193542B2 (en) Digital data compression robust relative to transmission noise
Brisaboa et al. Simple, fast, and efficient natural language adaptive compression
Shanmugasundaram et al. Text preprocessing using enhanced intelligent dictionary based encoding (EIDBE)
WO2000035098A1 (en) Apparatus and methods for real time lossless compression

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190820