
CN110147203A - A file management method, apparatus, electronic device and storage medium - Google Patents

A file management method, apparatus, electronic device and storage medium

Info

Publication number
CN110147203A
Authority
CN
China
Prior art keywords
storage
file
storage region
data
small files
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910411298.8A
Other languages
Chinese (zh)
Other versions
CN110147203B (en
Inventor
尹滔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Kingsoft Cloud Network Technology Co Ltd
Beijing Kingsoft Cloud Technology Co Ltd
Original Assignee
Beijing Kingsoft Cloud Network Technology Co Ltd
Beijing Kingsoft Cloud Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Kingsoft Cloud Network Technology Co Ltd, Beijing Kingsoft Cloud Technology Co Ltd filed Critical Beijing Kingsoft Cloud Network Technology Co Ltd
Priority to CN201910411298.8A priority Critical patent/CN110147203B/en
Publication of CN110147203A publication Critical patent/CN110147203A/en
Application granted granted Critical
Publication of CN110147203B publication Critical patent/CN110147203B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical


Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/13 File access structures, e.g. distributed indices
    • G06F16/134 Distributed indices
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06 Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601 Interfaces specially adapted for storage systems
    • G06F3/0628 Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0638 Organizing or formatting or addressing of data
    • G06F3/064 Management of blocks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06 Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601 Interfaces specially adapted for storage systems
    • G06F3/0628 Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0638 Organizing or formatting or addressing of data
    • G06F3/0643 Management of files
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06 Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601 Interfaces specially adapted for storage systems
    • G06F3/0668 Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/067 Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Embodiments of the present invention provide a file management method, apparatus, electronic device and storage medium. The file management method includes: obtaining a merged file to be written; storing the merged file in a first storage region and obtaining index data for each small file in the merged file; and storing the index data in a second storage region, where the data read/write performance of the second storage region is higher than that of the first storage region. Embodiments of the present invention can improve the data read performance of a cloud storage system while also saving storage space.

Description

A file management method, apparatus, electronic device and storage medium
Technical field
The present invention relates to the technical field of data storage, and in particular to a file management method, apparatus, electronic device and storage medium.
Background art
Cloud storage is a technology for storing data in the cloud: a cloud storage system receives data sent by clients over a network and stores it.
In a cloud storage system, data is usually stored in the form of files. Specifically, a storage server in the cloud storage system splits each file into blocks of a fixed size, each of which is called a data block, and stores each data block in a storage space block that has been partitioned in advance on the storage medium; that is, one storage space block stores one data block. A small file whose size is below this fixed size must still occupy a whole storage space block when stored, so when there are too many small files, storage space is wasted.
The prior art usually avoids this waste as follows: a large number of small files are merged into one large file, and the large file and the index information of each small file are each stored on the mechanical hard disk of a storage server. To read a small file, its index information is read first, and the data is then read from the large file based on that index information.
However, because the small files are numerous and every small-file read must first read the index information and then read the data, the mechanical hard disk is read twice per request. The single-machine QPS (Queries Per Second) demanded of a storage server can reach 1000 or more, whereas the maximum QPS a mechanical hard disk can provide is typically only 90. The disk's read capacity therefore cannot meet the read demand, and the data read performance of the storage server is poor.
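The throughput gap described above can be made concrete with a small calculation. The sketch below assumes, purely for illustration, that both reads of a small-file request land on the same mechanical disk and cost the same; the function names are hypothetical, and the figures 90 and 1000 are the ones quoted in the text.

```python
import math


def small_file_reads_per_second(disk_qps: float, reads_per_request: int = 2) -> float:
    """Small-file requests one disk can serve per second when every request
    needs `reads_per_request` disk reads (index read plus data read)."""
    return disk_qps / reads_per_request


def disks_needed(target_qps: float, disk_qps: float, reads_per_request: int = 2) -> int:
    """Number of disks needed to reach `target_qps` under the same assumption."""
    per_disk = small_file_reads_per_second(disk_qps, reads_per_request)
    return math.ceil(target_qps / per_disk)


if __name__ == "__main__":
    # Index and data both on the mechanical disk (assumed 90 QPS per disk).
    print(small_file_reads_per_second(90))
    # Disks needed to serve a demand of 1000 small-file reads per second.
    print(disks_needed(1000, 90))
```

Under these assumptions, taking the index read off the mechanical disk halves the number of disk reads each small-file request costs.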
Summary of the invention
The purpose of the embodiments of the present invention is to provide a file management method, apparatus, electronic device and storage medium that improve the data read performance of a storage server while saving storage space. The specific technical solutions are as follows:
In a first aspect, an embodiment of the present invention provides a file management method applied to a management server in a cloud storage system, where the management server is used to manage multiple storage servers in the cloud storage system and the storage servers are used to store data. The method comprises:
obtaining a merged file to be written, where the merged file is a file obtained by merging multiple small files, and a small file is a file whose size is below a preset threshold;
storing the merged file in a first storage region, and obtaining index data for each small file in the merged file, where each piece of index data carries the storage location information of the corresponding small file;
storing the index data in a second storage region, where the data read/write performance of the second storage region is higher than that of the first storage region.
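The three steps of the first aspect can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the two directories stand in for the first (slower) and second (faster) storage regions, and the function name, the JSON index format and the `.idx.json` suffix are all hypothetical choices made for the example.

```python
import json
from pathlib import Path
from typing import Dict, Tuple


def store_merged_file(
    merged_name: str,
    merged: bytes,
    layout: Dict[str, Tuple[int, int]],
    first_region: Path,
    second_region: Path,
) -> Tuple[Path, Path]:
    """Store a merged file and its per-small-file index in two regions.

    `layout` maps each small-file name to (offset, length) inside `merged`.
    """
    # Step 1: the merged file goes to the first (slower, cheaper) region.
    data_path = Path(first_region) / merged_name
    data_path.write_bytes(merged)

    # Step 2: build one index entry per small file, carrying its location.
    index = {
        name: {"merged_file": merged_name, "offset": offset, "length": length}
        for name, (offset, length) in layout.items()
    }

    # Step 3: the index goes to the second (faster) region.
    index_path = Path(second_region) / f"{merged_name}.idx.json"
    index_path.write_text(json.dumps(index))
    return data_path, index_path
```

In a deployment of the kind the text describes, `first_region` might be a mount point on a mechanical hard disk and `second_region` a mount point on an SSD.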
Optionally, the step of storing the merged file in the first storage region comprises:
storing the merged file in a storage region corresponding to a first storage medium;
the step of storing the index data in the second storage region comprises:
storing the index data in a storage region corresponding to a second storage medium, where the data read/write performance of the second storage medium is higher than that of the first storage medium.
Optionally, the first storage medium is a mechanical hard disk, and the second storage medium is a solid-state drive (SSD).
Optionally, the step of storing the merged file in the first storage region comprises:
storing the merged file in a first storage region of a first storage server, the first storage server being one of the multiple storage servers;
the step of storing the index data in the second storage region comprises:
storing the index data in a second storage region of the first storage server.
Optionally, the first storage region and the second storage region are each divided into storage space blocks of identical size, and each storage space block is used to store one data block. A data block includes multiple storage directories and a storage-location designation directory; the storage directories are used to store data, and a preset value is recorded under the storage-location designation directory, with different preset values designating different storage directories in which data is to be stored;
the step of storing the merged file in the first storage region comprises:
if the size of the merged file is not greater than the size of a storage space block, storing the merged file under the designated storage directory of the data block corresponding to one storage space block of the first storage region;
if the size of the merged file is greater than the size of a storage space block, storing the merged file under the designated storage directories of the data blocks corresponding to multiple storage space blocks of the first storage region.
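The size rule above (one storage space block if the merged file fits, several otherwise) can be sketched as a simple planning function. The function name and the relative block numbering are hypothetical, and the sketch does not model how the preset value selects a storage directory inside each data block.

```python
from typing import List, Tuple


def plan_blocks(merged_size: int, block_size: int) -> List[Tuple[int, int, int]]:
    """Return (block_number, offset_in_merged_file, length) for each block used."""
    if merged_size <= 0 or block_size <= 0:
        raise ValueError("sizes must be positive")

    # A merged file no larger than one storage space block uses a single block.
    if merged_size <= block_size:
        return [(0, 0, merged_size)]

    # A larger merged file is spread over as many blocks as it needs.
    plan = []
    for number, offset in enumerate(range(0, merged_size, block_size)):
        plan.append((number, offset, min(block_size, merged_size - offset)))
    return plan
```

For instance, with a block size of 4 units, a merged file of 10 units would occupy three blocks, the last of which is only partly filled.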
Optionally, the method further comprises:
reading, from the second storage region, the index data of a small file to be read from the first storage region;
locating, according to the index data, the storage location of the small file's data in the first storage region;
reading, according to the located storage location, the small file from the merged file stored in the first storage region.
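The read path described by these three steps can be sketched as below. The on-disk layout is an assumption made for the example only: the index is taken to be a JSON file named `<merged_name>.idx.json` in the second region, mapping each small-file name to a hypothetical record with `merged_file`, `offset` and `length` fields.

```python
import json
from pathlib import Path


def read_small_file(
    name: str, merged_name: str, first_region: Path, second_region: Path
) -> bytes:
    """Read one small file out of a merged file, using an index kept elsewhere."""
    # Step 1: read the small file's index entry from the faster second region.
    index_path = Path(second_region) / f"{merged_name}.idx.json"
    entry = json.loads(index_path.read_text())[name]

    # Step 2: locate the small file's data inside the first region.
    data_path = Path(first_region) / entry["merged_file"]

    # Step 3: read only that byte range from the merged file.
    with data_path.open("rb") as merged:
        merged.seek(entry["offset"])
        return merged.read(entry["length"])
```

Only the final step touches the slower region, so each small-file read costs the mechanical disk one read instead of two.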
Optionally, the step of reading, from the second storage region, the index data of the small file to be read from the first storage region comprises:
obtaining, from the storage region corresponding to the second storage medium, the index data of the small file to be read from the storage region corresponding to the first storage medium;
the step of reading the small file from the merged file stored in the first storage region comprises:
reading the small file from the merged file stored in the storage region corresponding to the first storage medium.
In a second aspect, an embodiment of the present invention provides a file management apparatus applied to a management server in a cloud storage system, where the management server is used to manage multiple storage servers in the cloud storage system and the storage servers are used to store data. The apparatus comprises:
an obtaining module, configured to obtain a merged file to be written, where the merged file is a file obtained by merging multiple small files, and a small file is a file whose size is below a preset threshold;
a first storage module, configured to store the merged file in a first storage region and obtain index data for each small file in the merged file, where each piece of index data carries the storage location information of the corresponding small file;
a second storage module, configured to store the index data in a second storage region, where the data read/write performance of the second storage region is higher than that of the first storage region.
Optionally, the first storage module is specifically configured to:
store the merged file in a storage region corresponding to a first storage medium;
the second storage module is specifically configured to:
store the index data in a storage region corresponding to a second storage medium, where the data read/write performance of the second storage medium is higher than that of the first storage medium.
Optionally, the first storage medium is a mechanical hard disk, and the second storage medium is a solid-state drive (SSD).
Optionally, the first storage module is specifically configured to:
store the merged file in a first storage region of a first storage server, the first storage server being one of the multiple storage servers;
the second storage module is specifically configured to:
store the index data in a second storage region of the first storage server.
Optionally, the first storage region and the second storage region are each divided into storage space blocks of identical size, and each storage space block is used to store one data block. A data block includes multiple storage directories and a storage-location designation directory; the storage directories are used to store data, and a preset value is recorded under the storage-location designation directory, with different preset values designating different storage directories in which data is to be stored;
the first storage module comprises:
a first storage submodule, configured to, if the size of the merged file is not greater than the size of a storage space block, store the merged file under the designated storage directory of the data block corresponding to one storage space block of the first storage region;
a second storage submodule, configured to, if the size of the merged file is greater than the size of a storage space block, store the merged file under the designated storage directories of the data blocks corresponding to multiple storage space blocks of the first storage region.
Optionally, the apparatus further comprises:
a first reading module, configured to read, from the second storage region, the index data of a small file to be read from the first storage region;
a locating module, configured to locate, according to the index data, the storage location of the small file's data in the first storage region;
a second reading module, configured to read, according to the located storage location, the small file from the merged file stored in the first storage region.
Optionally, the first reading module is specifically configured to:
obtain, from the storage region corresponding to the second storage medium, the index data of the small file to be read from the storage region corresponding to the first storage medium;
the second reading module is specifically configured to:
read the small file from the merged file stored in the storage region corresponding to the first storage medium.
In a third aspect, an embodiment of the present invention provides an electronic device including a processor and a machine-readable storage medium, where the machine-readable storage medium stores machine-executable instructions that can be executed by the processor, and the processor executes the machine-executable instructions to implement the steps of the file management method provided in the first aspect.
In a fourth aspect, an embodiment of the present invention provides a computer-readable storage medium storing a computer program which, when executed by a processor, implements the steps of the file management method provided in the first aspect.
In a fifth aspect, an embodiment of the present invention further provides a computer program product containing instructions which, when run on a computer, causes the computer to perform the steps of the file management method provided in the first aspect.
In a sixth aspect, an embodiment of the present invention further provides a computer program which, when run on a computer, causes the computer to perform the steps of the file management method provided in the first aspect.
With the file management method, apparatus, electronic device and storage medium provided by the embodiments of the present invention, after the merged file to be written is obtained, the merged file is stored in the first storage region, index data is obtained for each small file in the merged file, and the index data is then stored in the second storage region. Because the data read/write performance of the second storage region is higher than that of the first storage region, the index data of each small file can be read more quickly when small files are read, and each small file is then read from the merged file stored in the first storage region according to that index data. This avoids the poor data read performance of existing file management methods and improves the data read performance of the cloud storage system while saving storage space. Of course, a product or method implementing the present invention does not necessarily need to achieve all of the above advantages at the same time.
Detailed description of the invention
To describe the technical solutions in the embodiments of the present invention or in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Evidently, the drawings described below show only some embodiments of the present invention, and a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a schematic flowchart of a file management method provided by an embodiment of the present invention;
Fig. 2 is a schematic diagram of the data management topology of a cloud storage system in an embodiment of the present invention;
Fig. 3 is a schematic flowchart of a second file management method provided by an embodiment of the present invention;
Fig. 4 is a schematic flowchart of a third file management method provided by an embodiment of the present invention;
Fig. 5 is a schematic flowchart of a fourth file management method provided by an embodiment of the present invention;
Fig. 6 is a schematic diagram of the storage topology of a storage server in an embodiment of the present invention;
Fig. 7 is a schematic diagram of the stored content of a data block in an embodiment of the present invention;
Fig. 8 is a schematic flowchart of a fifth file management method provided by an embodiment of the present invention;
Fig. 9 is a schematic structural diagram of a file management apparatus provided by an embodiment of the present invention;
Fig. 10 is a schematic structural diagram of the first storage module in an embodiment of the present invention;
Fig. 11 is a schematic structural diagram of another file management apparatus provided by an embodiment of the present invention;
Fig. 12 is a schematic structural diagram of an electronic device provided by an embodiment of the present invention.
Specific embodiment
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. Evidently, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
Existing file management methods also commonly use the following schemes:
For a file to be written, determine whether its size exceeds a preset threshold. If the file is not larger than the threshold, it is stored directly on a storage server based on SSDs (Solid State Disks); if the file is larger than the threshold, it is stored on a storage server based on mechanical hard disks. However, this method requires all small files to be stored on SSDs, and because SSDs are currently expensive, it makes the cloud storage system costly to operate.
Alternatively, if the file is not larger than the threshold, a cache of a preset size is allocated for it in memory and the file is written to the cache; if the file is larger than the threshold, it is stored on a storage server based on mechanical hard disks. However, because the small files are held in memory, this method loses data when a machine fails.
Method embodiment 1
In view of this, as shown in Fig. 1, an embodiment of the present invention first provides a file management method. The method may be applied to a management server in a cloud storage system, where the management server may be used to manage multiple storage servers in the cloud storage system and the storage servers are used to store data. The method may include the following steps:
S101: obtain a merged file to be written.
Fig. 2 is a schematic diagram of the data management topology of a cloud storage system in an embodiment of the present invention. A cloud storage system may include multiple storage clusters, and each storage cluster contains one management server (the nameserver) and several storage servers (dataservers; N of them, with N greater than 2). The management server manages these storage servers in a unified way, and the storage servers store data. For example, after data to be written is obtained, the management server can direct the storage servers to store it.
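The topology described for Fig. 2 can be modelled with a few plain data classes. This is only an illustrative sketch: the class and field names are hypothetical, and the check on the number of storage servers simply encodes the "N greater than 2" wording of the text.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class DataServer:
    """A storage server (dataserver) that holds data."""
    host: str


@dataclass
class StorageCluster:
    """One management server (nameserver) plus the storage servers it manages."""
    nameserver: str
    dataservers: List[DataServer] = field(default_factory=list)

    def __post_init__(self) -> None:
        # The text describes N storage servers per cluster, with N greater than 2.
        if len(self.dataservers) <= 2:
            raise ValueError("a storage cluster needs more than 2 storage servers")


@dataclass
class CloudStorageSystem:
    """A cloud storage system made up of several storage clusters."""
    clusters: List[StorageCluster] = field(default_factory=list)
```

A cluster would then be built as `StorageCluster("ns-1", [DataServer("ds-1"), DataServer("ds-2"), DataServer("ds-3")])`, where the host names are placeholders.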
The merged file is a large file obtained by merging multiple small files. The merged file is larger than any of its constituent small files, and because the small files are stored together as a single file, the case where one small file fails to fill a storage space block is avoided, which avoids wasting storage space.
In addition, the data of the small files can be saved into the merged file in order, which suits traditional mechanical hard disks better (their random read performance is far below their sequential read performance).
Because the merged file has not yet been stored on a storage server at this point, it is referred to as the merged file to be written.
A small file is a file whose size is below a preset threshold, for example a file smaller than 2 KB (kilobytes). The developers of the cloud storage system can set this threshold flexibly according to actual business needs; the embodiments of the present invention do not limit its specific value. Multiple small files can be merged into one merged file by an existing file merging method, and the details are not repeated here.
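The text leaves the merging method to existing techniques. One simple possibility, shown here only as an assumption-laden sketch, is to concatenate the small files in order and remember where each one starts; the function name and the returned layout format are hypothetical, and the 2 KB default is the example threshold from the text.

```python
from typing import Dict, Tuple

SMALL_FILE_THRESHOLD = 2 * 1024  # 2 KB, the example threshold given in the text


def merge_small_files(
    files: Dict[str, bytes], threshold: int = SMALL_FILE_THRESHOLD
) -> Tuple[bytes, Dict[str, Tuple[int, int]]]:
    """Concatenate the small files in order and record (offset, length) for each."""
    merged = bytearray()
    layout: Dict[str, Tuple[int, int]] = {}
    for name, data in files.items():
        if len(data) >= threshold:
            continue  # not a small file; it would be handled by another path
        layout[name] = (len(merged), len(data))
        merged += data
    return bytes(merged), layout
```

Because the small files are appended one after another, their data lies contiguously in the merged file, which matches the sequential-access preference of mechanical hard disks noted above.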
S102: store the merged file in a first storage region, and obtain index data for each small file in the merged file.
Different storage regions can be set up in a storage cluster, and their data read/write performance can differ. The embodiment of the present invention therefore stores the merged file in the first storage region and, after storing it, obtains the index data of each small file in the merged file. Each piece of index data carries the storage location information of one small file; that is, one piece of index data corresponds to one small file. The storage location information records where the small file is stored in the first storage region (for example, the identifier of the storage space block holding it) and is used to locate the small file when it is read.
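A piece of index data of this kind can be represented as a small fixed-size record. In the sketch below, only the storage space block identifier comes from the text; the offset and length fields, the class name and the 20-byte binary layout are assumptions added to make the example complete.

```python
import struct
from dataclasses import dataclass

# Assumed layout: block id (8 bytes), offset (8 bytes), length (4 bytes),
# little-endian with no padding, so every record is 20 bytes long.
_RECORD = struct.Struct("<QQI")


@dataclass(frozen=True)
class IndexEntry:
    """Storage location information for one small file (hypothetical fields)."""
    block_id: int   # identifier of the storage space block holding the file
    offset: int     # assumed: byte offset of the small file within the block
    length: int     # assumed: size of the small file in bytes

    def pack(self) -> bytes:
        """Serialize the entry into its fixed-size binary form."""
        return _RECORD.pack(self.block_id, self.offset, self.length)

    @classmethod
    def unpack(cls, raw: bytes) -> "IndexEntry":
        """Rebuild an entry from its fixed-size binary form."""
        return cls(*_RECORD.unpack(raw))
```

A record this small is a tiny fraction of even a 2 KB file, which is consistent with the text's later point that index data needs far less space than file data.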
The index data of each small file can be obtained by an existing method for generating file index information, which is not repeated here.
S103: store the index data in a second storage region, where the data read/write performance of the second storage region is higher than that of the first storage region.
After the index data is obtained, it is stored in the second storage region. The second storage region is a region different from the first storage region, and its data read/write performance is higher than that of the first storage region. With the index data in the higher-performance second storage region, reading a large number of small files can exploit that performance to read each small file's index data more quickly and then read each small file from the merged file according to the index data, which avoids the poor read performance caused by the limits of traditional mechanical hard disks. In addition, because index data needs less storage space than file data, storing only the index data in the high-performance second storage region saves cost.
With the file management method provided by this embodiment of the present invention, after the merged file to be written is obtained, the merged file is stored in the first storage region, index data is obtained for each small file in the merged file, and the index data is then stored in the second storage region. Because the data read/write performance of the second storage region is higher than that of the first storage region, the index data of each small file can be read more quickly when small files are read, and each small file is then read from the merged file stored in the first storage region according to that index data. This avoids the poor data read performance of existing file management methods and improves the data read performance of the cloud storage system while saving storage space.
Method embodiment 2
As shown in Fig. 3, an embodiment of the present invention further provides a file management method, which may include the following steps:
S201: obtain a merged file to be written.
This step is the same as step S101 of method embodiment 1 and is not described again here.
S202: store the merged file in the storage region corresponding to a first storage medium, and obtain index data for each small file in the merged file.
This step is similar to step S102 of method embodiment 1. The difference is that here the merged file is stored in the storage region corresponding to the first storage medium; that is, the first storage region is specifically the storage region corresponding to the first storage medium.
As an optional implementation of this embodiment, the first storage medium may be, for example, a mechanical hard disk, in which case storing the merged file in the storage region corresponding to the first storage medium means storing it in the storage region corresponding to the mechanical hard disk.
S203: store the index data in the storage region corresponding to a second storage medium.
This step is similar to step S103 of method embodiment 1. The difference is that here the index data is stored in the storage region corresponding to the second storage medium; that is, the second storage region is specifically the storage region corresponding to the second storage medium.
As an optional implementation of this embodiment, the second storage medium may be, for example, an SSD, in which case storing the index data in the storage region corresponding to the second storage medium means storing it in the storage region corresponding to the SSD.
It can be understood that, because the data read/write performance of the second storage medium is higher than that of the first storage medium, the index data of each small file can be read more quickly when small files are read.
With the file management method provided by this embodiment of the present invention, because the data read/write performance of the second storage medium is higher than that of the first storage medium, the index data of each small file can be read more quickly when small files are read, and each small file is then read from the merged file stored in the first storage region according to that index data, which further improves the data read performance of the cloud storage system.
Method embodiment 3
As shown in Fig. 4, an embodiment of the present invention further provides a file management method, which may include the following steps:
S301: obtain a merged file to be written.
This step is the same as step S101 of method embodiment 1 and is not described again here.
S302: store the merged file in a first storage region of a first storage server.
This step is similar to step S102 of method embodiment 1. The difference is that here the management server stores the merged file on one of the storage servers in a storage cluster. Specifically, the merged file is stored in the first storage region of a first storage server, where the first storage server is any one of the storage servers in the storage cluster.
Optionally, the merged file may be stored in the storage region corresponding to a first storage medium of the first storage server; in that case the first storage region is specifically the storage region corresponding to the first storage medium in the first storage server.
Optionally, the first storage medium may be a mechanical hard disk.
S303: store the index data in a second storage region of the first storage server.
This step is similar to step S103 of method embodiment 1. The difference is that here the index data is stored on one of the storage servers in a storage cluster. Specifically, the index data is stored in the second storage region of the first storage server, where the first storage server is any one of the storage servers in the storage cluster and the data read/write performance of that server's second storage region is higher than that of its first storage region.
Optionally, the index data may be stored in the storage region corresponding to a second storage medium of the first storage server; in that case the second storage region is specifically the storage region corresponding to the second storage medium in the first storage server.
Optionally, the second storage medium may be an SSD.
In compared with the prior art, small documents are all stored in SSD disk, or are stored in point of SSD disk composition In cloth storage system, the cost of the embodiment of the present invention is lower, the reason is that SSD is more than mechanical hard disk price under identical capacity It is expensive, in a storage server, if disk is all changed to SSD, compared to the file management method of the embodiment of the present invention, Carrying cost at least rises 100 times.
As an optional implementation of this embodiment of the present invention, on the basis of the flow shown in FIG. 4 and as shown in FIG. 5, before step S302 the file management method may further include:
S301': determine whether the size of the merged file is greater than the size of a storage space block.
Step S302 may then specifically include:
S3021: if the size of the merged file is not greater than the size of a storage space block, store the merged file under the designated storage directory of the data block corresponding to one storage space block of the first storage region.
In this embodiment, if the size of the merged file is less than or equal to the size of a storage space block, the storage space provided by a single storage space block is enough to hold the merged file.
For example, the merged file may be stored under storage directory 0 or storage directory 1 of the data block corresponding to one storage space block.
S3022: if the size of the merged file is greater than the size of a storage space block, store the merged file under the designated storage directories of the data blocks corresponding to multiple storage space blocks of the first storage region.
In this embodiment, if the size of the merged file is greater than the size of a storage space block, the storage space provided by one storage space block cannot hold the merged file, so multiple contiguous storage space blocks may be used to store it.
Likewise, the data may be stored under the designated storage directories of the data blocks corresponding to those storage space blocks.
FIG. 6 is a schematic diagram of the storage topology of a storage server in this embodiment of the present invention. A storage server is usually provided with multiple disks that form a disk array. The storage region of each disk may be divided, according to a preset size, into multiple storage space blocks of the same size (N blocks, where N is greater than 2), and each storage space block can be used to store data.
In this embodiment, for example, the storage path on disk of a file to be written may be expressed as /data/vols/vol1/phenix_data/00000000000000709106501, where data/, vols/, vol1/ and phenix_data/ are the successive levels of the storage path. In the digit string after phenix_data/, the first digit indicates the type of the data block, with 0 indicating the three-replica type. The 20 digits after the first digit are the identification number of the storage space block, that is, the unique identifier of the storage space block within a storage cluster. When data (for example, data saved in idx format or data saved in dat format) is stored in the data block corresponding to a storage space block, the management server may record the identification number of the storage space block in which the data is stored, thereby generating index data for later lookup and reading. The last two digits are the unique identification number of the replica among the three replicas. The value is usually one of 0 to 2 and may be greater than 2 if there are more than three replicas.
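The naming scheme described above (1 type digit, 20 identifier digits, 2 replica digits) can be decoded with a few lines of code. The sketch below is illustrative only; `BlockName` and `parse_block_name` are hypothetical names, and the 23-digit length check is inferred from the 1 + 20 + 2 layout.

```python
from typing import NamedTuple


class BlockName(NamedTuple):
    block_type: int  # first digit; 0 means the three-replica type
    block_id: str    # next 20 digits: storage space block identifier
    replica_no: int  # last two digits: replica number


def parse_block_name(path: str) -> BlockName:
    """Split the last path component into type, block ID and replica number."""
    name = path.rstrip("/").rsplit("/", 1)[-1]
    if len(name) != 23 or not (name.isascii() and name.isdigit()):
        raise ValueError(f"expected a 23-digit name, got {name!r}")
    return BlockName(int(name[0]), name[1:21], int(name[21:]))


info = parse_block_name("/data/vols/vol1/phenix_data/00000000000000709106501")
print(info.block_type, info.block_id, info.replica_no)
```

For the example path, the type digit is 0 (three-replica), the block identifier is the 20-digit string 00000000000007091065, and the replica number is 1.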
FIG. 7 is a schematic diagram of the contents of a data block in this embodiment of the present invention. The directory of each data block may contain the following entries: a first storage directory, named with the digit 0; a second storage directory, named with the digit 1; and a storage location designation directory, named CURRENT.
The storage directories (storage directory 0 and storage directory 1) are used to store data. Each storage directory can hold index data, that is, data in idx format, and can also hold file data, that is, data in dat format.
The storage location designation directory (the CURRENT directory) specifies which storage directory data is to be stored in, that is, whether data is stored in storage directory 0 or in storage directory 1.
The designation rule is as follows. The value 0 or 1 can be saved under the storage location designation directory. When the saved value is 0, storage directory 0 is the current storage directory and current data can be stored in storage directory 0; when the saved value is 1, storage directory 1 is the current storage directory and current data can be stored in storage directory 1.
In this embodiment, for example, the storage path on disk of index data may be expressed as /data/phenix_idx/{0..35}/00000000000000709106501/0/idx, where data/ and phenix_idx/ are successive levels of the storage path, and {0..35} is the serial number of the disk that holds the file data corresponding to the index data. In the digit string that follows, the first digit indicates the type of the data block, with 0 indicating the three-replica type. The 20 digits after the first digit are the identification number of the storage space block that holds the corresponding dat data, that is, the unique identifier of the storage space block within the storage cluster. The last two digits are the unique identification number of the replica among the three replicas. The value is usually one of 0 to 2 and may be greater than 2 if there are more than three replicas.
The disk array may be composed of, for example, 36 mechanical hard disks and 1 SSD, where the 36 mechanical hard disks store files in dat format and the SSD stores files in idx format. Because this embodiment stores index data and file data separately, the storage directories of each data block on a mechanical hard disk may hold only file data, and the storage directories of each data block on the SSD may hold only index data. Optionally, the index data and the file data may be saved in binary format.
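As a rough illustration of the index path layout above, the helper below builds the idx path for a given data disk and data block. The function name `index_path` is hypothetical, and treating the path component before idx as the number of the storage directory (0 or 1) is an assumption; the text does not explain that component.

```python
NUM_DATA_DISKS = 36  # the example array: 36 mechanical hard disks plus 1 SSD


def index_path(disk_no: int, block_name: str, storage_dir: int = 0) -> str:
    """Build the SSD-side path of the index data for one data block."""
    if not 0 <= disk_no < NUM_DATA_DISKS:
        raise ValueError("disk_no must be in 0..35")
    if storage_dir not in (0, 1):
        raise ValueError("storage_dir must be 0 or 1")
    # Assumed meaning: <disk number>/<block name>/<storage directory>/idx
    return f"/data/phenix_idx/{disk_no}/{block_name}/{storage_dir}/idx"


print(index_path(7, "00000000000000709106501"))
```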
According to the designation rule of the storage location designation directory described above, when a compression operation is performed on the data under the current storage directory, the compressed data can be written under the other storage directory, and once the compression operation has finished, the value in the storage location designation directory is changed to point to the other storage directory. The data under the current storage directory is therefore not affected while the operation is in progress.
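The switch between storage directory 0 and storage directory 1 can be sketched as follows. This is a minimal illustration under several assumptions: CURRENT is modelled as a small file holding the text 0 or 1, zlib stands in for whatever compression the system actually applies, and the helper names `current_dir` and `compress_block` are hypothetical.

```python
import tempfile
import zlib
from pathlib import Path


def current_dir(block: Path) -> int:
    """Read which storage directory (0 or 1) is currently active."""
    return int((block / "CURRENT").read_text().strip())


def compress_block(block: Path) -> int:
    """Write compressed copies into the other directory, then flip CURRENT."""
    active = current_dir(block)
    other = 1 - active
    target = block / str(other)
    target.mkdir(exist_ok=True)
    for src in (block / str(active)).iterdir():
        if src.is_file():
            (target / src.name).write_bytes(zlib.compress(src.read_bytes()))
    # Flip the pointer only after all output exists; the rename replaces
    # CURRENT in one step, so readers see either the old or the new value.
    tmp = block / "CURRENT.tmp"
    tmp.write_text(str(other))
    tmp.replace(block / "CURRENT")
    return other


with tempfile.TemporaryDirectory() as d:
    block = Path(d)
    (block / "0").mkdir()
    (block / "1").mkdir()
    (block / "CURRENT").write_text("0")
    (block / "0" / "dat").write_bytes(b"a" * 1000)
    print(compress_block(block), current_dir(block))
```

Because the active directory is never modified, a reader that resolved CURRENT before the flip still finds intact data under the old directory.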
In the file management method provided by this embodiment of the present invention, after a merged file to be written is obtained, the merged file is stored in the first storage region of one of the storage servers in the cloud storage system, index data for each small file in the merged file is obtained, and the index data is then stored in the second storage region of that storage server. Because the data read/write performance of the second storage region of the storage server is higher than that of its first storage region, the index data of each small file can be read more quickly when small files are read, by exploiting the high read/write performance of the second storage region, and each small file is then read, according to that index data, from the merged file stored in the first storage region. This avoids the low data read performance of existing file management methods, so the data read performance of the cloud storage system is improved while storage space is saved.
Method Embodiment 4
An embodiment of the present invention further provides a file management method. On the basis of any of the foregoing embodiments, and as shown in FIG. 8, the method may include the following steps:
S401: read, from the second storage region, the index data of a small file that is to be read from the first storage region.
As the foregoing embodiments show, the first storage region stores the merged file and the second storage region stores the index data of the small files. Therefore, in this embodiment, for a small file to be read from the first storage region, the index data of that small file can first be read from the second storage region to obtain the storage location information of the small file.
As an optional implementation of this embodiment of the present invention, the index data of the small file to be read from the storage region corresponding to the first storage medium may be obtained from the storage region corresponding to the second storage medium, and the second storage medium may be an SSD.
S402: locate, according to the index data, the storage location of the small file's data in the first storage region.
After the index data is read, the storage location of the small file's data in the first storage region can be located according to the storage location information in the index data.
Specifically, as the foregoing embodiments show, the identification number of a storage space block can be recorded in the index data. Once the identification number of the storage space block has been determined, it can be used to locate the storage space block, and hence the corresponding data block, in which the file data of the small file is stored.
As an optional implementation of this embodiment of the present invention, the small file may be read from the merged file stored in the storage region corresponding to the first storage medium. Optionally, the first storage medium may be a mechanical hard disk.
S403: read the small file from the merged file stored in the first storage region, according to the located storage location.
The merged file stored in the first storage region is a file obtained by merging multiple small files, so it contains the file data of each small file. Once the data block, corresponding to a storage space block, that holds the file data of the small file has been located, the file data can be read from that data block; that is, the small file is read from the merged file stored in the first storage region.
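Steps S401 to S403 can be sketched as a two-stage lookup. In this illustration, plain dictionaries stand in for the two storage regions, and the `offset` and `length` fields of the index entry are assumptions: the text says only that index data carries storage location information such as the identification number of the storage space block.

```python
from typing import Dict, NamedTuple


class IndexEntry(NamedTuple):
    block_id: str  # storage space block identifier (mentioned in the text)
    offset: int    # hypothetical: byte offset inside the merged file
    length: int    # hypothetical: byte length of the small file


def read_small_file(name: str,
                    index_region: Dict[str, IndexEntry],
                    data_region: Dict[str, bytes]) -> bytes:
    entry = index_region[name]            # S401: read index data (fast region)
    merged = data_region[entry.block_id]  # S402: locate the data block by ID
    # S403: cut the small file out of the merged file.
    return merged[entry.offset:entry.offset + entry.length]


index_region = {
    "a.txt": IndexEntry("00000000000007091065", 0, 5),
    "b.txt": IndexEntry("00000000000007091065", 5, 3),
}
data_region = {"00000000000007091065": b"helloabc"}
print(read_small_file("b.txt", index_region, data_region))  # b'abc'
```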
In the file management method provided by this embodiment of the present invention, the merged file is stored in the first storage region and the index data is stored in the second storage region. Because the data read/write performance of the second storage region is higher than that of the first storage region, the index data of each small file can be read more quickly when small files are read, by exploiting the high read/write performance of the second storage region, and each small file is then read, according to that index data, from the merged file stored in the first storage region. This avoids the low data read performance of existing file management methods, so the data read performance of the cloud storage system is improved while storage space is saved.
Corresponding to the foregoing method embodiments, embodiments of the present invention also provide corresponding apparatus embodiments.
Apparatus Embodiment 1
As shown in FIG. 9, an embodiment of the present invention provides a file management apparatus, which can be applied to a management server in a cloud storage system. The management server is used to manage multiple storage servers in the cloud storage system, and the storage servers are used to store data. A cloud storage system may include multiple storage clusters, each of which is provided with one management server and several storage servers, and the management server manages those storage servers uniformly. For example, after obtaining data to be written, the management server can control the storage servers to store the data. The apparatus includes:
An obtaining module 501, configured to obtain a merged file to be written, where the merged file is a file obtained by merging multiple small files, and a small file is a file whose size is below a preset threshold.
The merged file in this embodiment is one large file obtained by merging multiple small files. The resulting merged file is larger than any of the individual small files, and because the small files are stored in the form of a single file, the situation in which a single small file fails to fill a storage space block is avoided; that is, storage space is not wasted.
In addition, the data of the small files can be saved into the merged file sequentially, which better suits reads and writes on a traditional mechanical hard disk (whose random read performance is far lower than its sequential read performance).
Because the merged file has not yet been stored in a storage server at this point, it is referred to as the merged file to be written.
A small file is a file whose size is below a preset threshold, for example a file smaller than 2 KB (kilobytes). The developer of the cloud storage system can set this threshold flexibly and reasonably according to actual business requirements, and this embodiment does not limit its specific value. The process of merging multiple small files into one merged file can use an existing file merging method and is not described in detail here.
A first storage module 502, configured to store the merged file in the first storage region and to obtain index data for each small file in the merged file, where each piece of index data carries the storage location information of one small file.
Different storage regions can be set up in a storage cluster, and different storage regions may have different data read/write performance. This embodiment can therefore store the merged file in the first storage region and, after storing it, obtain the index data of each small file in the merged file. Each piece of index data carries the storage location information of one small file; that is, one piece of index data corresponds to the storage location information of one small file. The storage location information records where the small file is stored in the first storage region, for example the identification number of the storage space block that holds it, and serves to locate the small file when it is read.
The process of obtaining the index data of each small file can use an existing file index information generation method and is not described again here.
A second storage module 503, configured to store the index data in the second storage region, where the data read/write performance of the second storage region is higher than that of the first storage region.
After the index data is obtained, this embodiment can store it in the second storage region. The second storage region is a region different from the first storage region, and its data read/write performance is higher than that of the first storage region. With the index data stored in the higher-performance second storage region, the index data of each small file can be read more quickly when a large number of small files are read, and each small file is then read from the merged file according to that index data. This avoids the low data read performance caused by the performance limits of traditional mechanical hard disks. In addition, because index data needs less storage space than file data, storing only the index data in the high-performance second storage region saves cost.
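One way the obtaining module, the first storage module and the second storage module could cooperate is sketched below. It is illustrative only: the function `merge_small_files` is hypothetical, the index entry format (block ID, offset, length) is an assumption, and the merged bytes and the index dictionary stand in for what would be written to the first and second storage regions respectively.

```python
from typing import Dict, Tuple

SMALL_FILE_THRESHOLD = 2 * 1024  # the 2 KB example threshold from the text

Index = Dict[str, Tuple[str, int, int]]  # name -> (block ID, offset, length)


def merge_small_files(files: Dict[str, bytes],
                      block_id: str) -> Tuple[bytes, Index]:
    """Concatenate small files in order and record where each one landed."""
    merged = bytearray()
    index: Index = {}
    for name, data in files.items():
        if len(data) >= SMALL_FILE_THRESHOLD:
            raise ValueError(f"{name} is not below the small-file threshold")
        index[name] = (block_id, len(merged), len(data))
        merged += data  # sequential layout suits mechanical hard disks
    return bytes(merged), index


merged, index = merge_small_files(
    {"a.txt": b"hello", "b.txt": b"abc"}, "00000000000007091065")
# merged would go to the first (slower, larger) storage region,
# index to the second (faster) storage region.
print(len(merged), index["b.txt"])
```

Keeping only the index in the fast region reflects the cost argument above: the index is much smaller than the file data it describes.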
As an optional implementation of this embodiment of the present invention, the first storage module may specifically be configured to:
store the merged file in the storage region corresponding to the first storage medium;
and the second storage module may specifically be configured to:
store the index data in the storage region corresponding to the second storage medium, where the data read/write performance of the second storage medium is higher than that of the first storage medium.
As an optional implementation of this embodiment of the present invention, the first storage medium is a mechanical hard disk and the second storage medium is a solid-state drive (SSD).
As an optional implementation of this embodiment of the present invention, the first storage module may specifically be configured to:
store the merged file in the first storage region of a first storage server, where the first storage server is one of the multiple storage servers;
and the second storage module may specifically be configured to:
store the index data in the second storage region of the first storage server.
As an optional implementation of this embodiment of the present invention, the first storage region and the second storage region are divided into storage space blocks of the same size, and each storage space block is used to store one data block. A data block includes multiple storage directories and a storage location designation directory. The storage directories are used to store data, and a preset value is recorded under the storage location designation directory, with different preset values designating different storage directories in which data is to be stored. As shown in FIG. 10, the first storage module may then include:
A first storage submodule 5021, configured to store the merged file under the designated storage directory of the data block corresponding to one storage space block of the first storage region if the size of the merged file is not greater than the size of a storage space block.
A second storage submodule 5022, configured to store the merged file under the designated storage directories of the data blocks corresponding to multiple storage space blocks of the first storage region if the size of the merged file is greater than the size of a storage space block.
With the file management apparatus provided by this embodiment of the present invention, after a merged file to be written is obtained, the merged file is stored in the first storage region, index data for each small file in the merged file is obtained, and the index data is then stored in the second storage region. Because the data read/write performance of the second storage region is higher than that of the first storage region, the index data of each small file can be read more quickly when small files are read, by exploiting the high read/write performance of the second storage region, and each small file is then read, according to that index data, from the merged file stored in the first storage region. This avoids the low data read performance of existing file management methods, so the data read performance of the cloud storage system is improved while storage space is saved.
Apparatus Embodiment 2
An embodiment of the present invention further provides a file management apparatus. On the basis of the embodiment shown in FIG. 9, and as shown in FIG. 11, the apparatus may further include:
A first reading module 601, configured to read, from the second storage region, the index data of a small file that is to be read from the first storage region.
A locating module 602, configured to locate, according to the index data, the storage location of the small file's data in the first storage region.
A second reading module 603, configured to read the small file from the merged file stored in the first storage region, according to the located storage location.
As an optional implementation of this embodiment of the present invention, the first reading module may specifically be configured to:
obtain, from the storage region corresponding to the second storage medium, the index data of the small file to be read from the storage region corresponding to the first storage medium;
and the second reading module may specifically be configured to:
read the small file from the merged file stored in the storage region corresponding to the first storage medium.
With the file management apparatus provided by this embodiment of the present invention, the merged file is stored in the first storage region and the index data is stored in the second storage region. Because the data read/write performance of the second storage region is higher than that of the first storage region, the index data of each small file can be read more quickly when small files are read, by exploiting the high read/write performance of the second storage region, and each small file is then read, according to that index data, from the merged file stored in the first storage region. This avoids the low data read performance of existing file management methods, so the data read performance of the cloud storage system is improved while storage space is saved.
An embodiment of the present invention further provides an electronic device, which may specifically be a server. As shown in FIG. 12, the device 700 includes a processor 701 and a machine-readable storage medium 702. The machine-readable storage medium stores machine-executable instructions that can be executed by the processor, and the processor executes the machine-executable instructions to implement the following steps:
obtaining a merged file to be written, where the merged file is a file obtained by merging multiple small files, and a small file is a file whose size is below a preset threshold;
storing the merged file in a first storage region and obtaining index data for each small file in the merged file, where each piece of index data carries the storage location information of one small file;
storing the index data in a second storage region, where the data read/write performance of the second storage region is higher than that of the first storage region.
The machine-readable storage medium may include random access memory (RAM) and may also include non-volatile memory, for example at least one disk storage device. Optionally, the memory may also be at least one storage device located remotely from the aforementioned processor.
The processor may be a general-purpose processor, including a central processing unit (CPU), a network processor (NP), and the like. It may also be a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component.
With the electronic device provided by this embodiment of the present invention, after a merged file to be written is obtained, the merged file is stored in the first storage region, index data for each small file in the merged file is obtained, and the index data is then stored in the second storage region. Because the data read/write performance of the second storage region is higher than that of the first storage region, the index data of each small file can be read more quickly when small files are read, by exploiting the high read/write performance of the second storage region, and each small file is then read, according to that index data, from the merged file stored in the first storage region. This avoids the low data read performance of existing file management methods, so the data read performance of the cloud storage system is improved while storage space is saved.
An embodiment of the present invention further provides a computer-readable storage medium. A computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, the following steps are performed:
obtaining a merged file to be written, where the merged file is a file obtained by merging multiple small files, and a small file is a file whose size is below a preset threshold;
storing the merged file in a first storage region and obtaining index data for each small file in the merged file, where each piece of index data carries the storage location information of one small file;
storing the index data in a second storage region, where the data read/write performance of the second storage region is higher than that of the first storage region.
With the computer-readable storage medium provided by this embodiment of the present invention, after a merged file to be written is obtained, the merged file is stored in the first storage region, index data for each small file in the merged file is obtained, and the index data is then stored in the second storage region. Because the data read/write performance of the second storage region is higher than that of the first storage region, the index data of each small file can be read more quickly when small files are read, by exploiting the high read/write performance of the second storage region, and each small file is then read, according to that index data, from the merged file stored in the first storage region. This avoids the low data read performance of existing file management methods, so the data read performance of the cloud storage system is improved while storage space is saved.
An embodiment of the present invention further provides a computer program product containing instructions which, when run on a computer, cause the computer to perform the following steps:
obtaining a merged file to be written, where the merged file is a file obtained by merging multiple small files, and a small file is a file whose size is below a preset threshold;
storing the merged file in a first storage region and obtaining index data for each small file in the merged file, where each piece of index data carries the storage location information of one small file;
storing the index data in a second storage region, where the data read/write performance of the second storage region is higher than that of the first storage region.
With the computer program product containing instructions provided by this embodiment of the present invention, after a merged file to be written is obtained, the merged file is stored in the first storage region, index data for each small file in the merged file is obtained, and the index data is then stored in the second storage region. Because the data read/write performance of the second storage region is higher than that of the first storage region, the index data of each small file can be read more quickly when small files are read, by exploiting the high read/write performance of the second storage region, and each small file is then read, according to that index data, from the merged file stored in the first storage region. This avoids the low data read performance of existing file management methods, so the data read performance of the cloud storage system is improved while storage space is saved.
An embodiment of the present invention further provides a computer program which, when run on a computer, causes the computer to perform the following steps:
obtaining a merged file to be written, where the merged file is a file obtained by merging multiple small files, and a small file is a file whose size is below a preset threshold;
storing the merged file in a first storage region and obtaining index data for each small file in the merged file, where each piece of index data carries the storage location information of one small file;
storing the index data in a second storage region, where the data read/write performance of the second storage region is higher than that of the first storage region.
With the computer program containing instructions provided by this embodiment of the present invention, after a merged file to be written is obtained, the merged file is stored in the first storage region, index data for each small file in the merged file is obtained, and the index data is then stored in the second storage region. Because the data read/write performance of the second storage region is higher than that of the first storage region, the index data of each small file can be read more quickly when small files are read, by exploiting the high read/write performance of the second storage region, and each small file is then read, according to that index data, from the merged file stored in the first storage region. This avoids the low data read performance of existing file management methods, so the data read performance of the cloud storage system is improved while storage space is saved.
Because the apparatus, electronic device, and storage medium embodiments are substantially similar to the method embodiments, they are described relatively briefly; for related details, refer to the corresponding description of the method embodiments.
It should be noted that, in this document, relational terms such as "first" and "second" are used only to distinguish one entity or operation from another and do not necessarily require or imply any actual relationship or order between those entities or operations. Moreover, the terms "include", "comprise", and any variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article, or device that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to that process, method, article, or device. Absent further limitation, an element defined by the phrase "including a ..." does not exclude the presence of additional identical elements in the process, method, article, or device that includes the element.
The embodiments in this specification are described in a related manner; for identical or similar parts, the embodiments may be referred to one another, and each embodiment focuses on its differences from the others. In particular, because the system embodiments are substantially similar to the method embodiments, they are described relatively briefly; for related details, refer to the description of the method embodiments.
The foregoing describes only preferred embodiments of the present invention and is not intended to limit its scope of protection. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention falls within the scope of protection of the present invention.

Claims (16)

1. A file management method, characterized in that it is applied to a management server in a cloud storage system, the management server being configured to manage multiple storage servers in the cloud storage system and the storage servers being configured to store data, the method comprising:
obtaining a merged file to be written, wherein the merged file is a file obtained by merging multiple small files, and each small file is a file whose size is below a preset threshold;
storing the merged file in a first storage region, and obtaining index data for each small file in the merged file, wherein each piece of index data carries storage location information of one of the small files;
storing the index data in a second storage region, wherein the data read/write performance of the second storage region is higher than that of the first storage region.
2. The method according to claim 1, characterized in that the step of storing the merged file in the first storage region comprises:
storing the merged file in a storage region corresponding to a first storage medium;
and the step of storing the index data in the second storage region comprises:
storing the index data in a storage region corresponding to a second storage medium, wherein the data read/write performance of the second storage medium is higher than that of the first storage medium.
3. The method according to claim 2, characterized in that the first storage medium is a mechanical hard disk and the second storage medium is a solid-state drive (SSD).
4. The method according to any one of claims 1-3, characterized in that the step of storing the merged file in the first storage region comprises:
storing the merged file in a first storage region of a first storage server, the first storage server being one of the multiple storage servers;
and the step of storing the index data in the second storage region comprises:
storing the index data in a second storage region of the first storage server.
5. The method according to claim 4, characterized in that the first storage region and the second storage region are each divided into storage space blocks of equal size, each storage space block being used to store one data block, and a data block comprising multiple storage directories and a storage-location designation directory, wherein the storage directories are used to store data, a preset value is recorded under the storage-location designation directory, and different preset values designate that data is to be stored under different storage directories;
the step of storing the merged file in the first storage region comprises:
if the size of the merged file is not greater than the size of a storage space block, storing the merged file under the designated storage directory of the data block corresponding to one storage space block of the first storage region;
if the size of the merged file is greater than the size of a storage space block, storing the merged file under the designated storage directories of the data blocks corresponding to multiple storage space blocks of the first storage region.
6. The method according to claim 5, characterized in that the method further comprises:
reading, from the second storage region, the index data of a small file to be read from the first storage region;
locating, according to the index data, the storage location of the data of the small file in the first storage region;
reading, according to the located storage location, the small file from the merged file stored in the first storage region.
7. The method according to claim 6, characterized in that the step of reading, from the second storage region, the index data of the small file to be read from the first storage region comprises:
obtaining, from the storage region corresponding to the second storage medium, the index data of the small file to be read from the storage region corresponding to the first storage medium;
and the step of reading the small file from the merged file stored in the first storage region comprises:
reading the small file from the merged file stored in the storage region corresponding to the first storage medium.
8. A file management apparatus, characterized in that it is applied to a management server in a cloud storage system, the management server being configured to manage multiple storage servers in the cloud storage system and the storage servers being configured to store data, the apparatus comprising:
an obtaining module, configured to obtain a merged file to be written, wherein the merged file is a file obtained by merging multiple small files, and each small file is a file whose size is below a preset threshold;
a first storage module, configured to store the merged file in a first storage region and to obtain index data for each small file in the merged file, wherein each piece of index data carries storage location information of one of the small files;
a second storage module, configured to store the index data in a second storage region, wherein the data read/write performance of the second storage region is higher than that of the first storage region.
9. The apparatus according to claim 8, characterized in that the first storage module is specifically configured to:
store the merged file in a storage region corresponding to a first storage medium;
and the second storage module is specifically configured to:
store the index data in a storage region corresponding to a second storage medium, wherein the data read/write performance of the second storage medium is higher than that of the first storage medium.
10. The apparatus according to claim 9, characterized in that the first storage medium is a mechanical hard disk and the second storage medium is a solid-state drive (SSD).
11. The apparatus according to any one of claims 8-10, characterized in that the first storage module is specifically configured to:
store the merged file in a first storage region of a first storage server, the first storage server being one of the multiple storage servers;
and the second storage module is specifically configured to:
store the index data in a second storage region of the first storage server.
12. The apparatus according to claim 11, characterized in that the first storage region and the second storage region are each divided into storage space blocks of equal size, each storage space block being used to store one data block, and the data block comprising multiple storage directories and a storage-location designation directory, wherein the storage directories are used to store data, a preset value is recorded under the storage-location designation directory, and different preset values designate that data is to be stored under different storage directories;
the first storage module comprises:
a first storage submodule, configured to, if the size of the merged file is not greater than the size of a storage space block, store the merged file under the designated storage directory of the data block corresponding to one storage space block of the first storage region;
a second storage submodule, configured to, if the size of the merged file is greater than the size of a storage space block, store the merged file under the designated storage directories of the data blocks corresponding to multiple storage space blocks of the first storage region.
13. The apparatus according to claim 12, characterized in that the apparatus further comprises:
a first reading module, configured to read, from the second storage region, the index data of a small file to be read from the first storage region;
a locating module, configured to locate, according to the index data, the storage location of the data of the small file in the first storage region;
a second reading module, configured to read, according to the located storage location, the small file from the merged file stored in the first storage region.
14. The apparatus according to claim 13, characterized in that the first reading module is specifically configured to:
obtain, from the storage region corresponding to the second storage medium, the index data of the small file to be read from the storage region corresponding to the first storage medium;
and the second reading module is specifically configured to:
read the small file from the merged file stored in the storage region corresponding to the first storage medium.
15. An electronic device, characterized by comprising a processor and a machine-readable storage medium, the machine-readable storage medium storing machine-executable instructions executable by the processor, wherein the processor executes the machine-executable instructions to implement the method steps of the file management method according to any one of claims 1-7.
16. A computer-readable storage medium, characterized in that a computer program is stored in the computer-readable storage medium, and the computer program, when executed by a processor, implements the method steps of the file management method according to any one of claims 1-7.
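The block placement rule recited in claims 5 and 12 can be illustrated with a short Python sketch. The claims state only that a merged file no larger than one storage space block goes into one block and a larger one is spread over multiple blocks; the ceiling-division block count below is an assumption, as are the hypothetical function names `blocks_needed` and `choose_storage_directory` and the idea of treating the preset value as a list index.

```python
from __future__ import annotations


def blocks_needed(merged_size: int, block_size: int) -> int:
    """Return how many equal-size storage space blocks a merged file occupies.

    A file no larger than one block uses exactly one block; a larger file
    is assumed to use the smallest number of blocks that can hold it.
    """
    if merged_size <= 0 or block_size <= 0:
        raise ValueError("sizes must be positive")
    if merged_size <= block_size:
        return 1
    # Integer ceiling division avoids float rounding for very large sizes.
    return -(-merged_size // block_size)


def choose_storage_directory(preset_value: int, storage_directories: list[str]) -> str:
    """Pick the storage directory designated by the recorded preset value.

    Assumption: the preset value is a zero-based index into the block's
    storage directories; the source does not specify the actual mapping.
    """
    if not 0 <= preset_value < len(storage_directories):
        raise ValueError("preset value does not designate a storage directory")
    return storage_directories[preset_value]
```

For example, with a block size of 8 bytes, a 9-byte merged file would occupy two blocks under this assumption, while an 8-byte file would occupy one.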
CN201910411298.8A 2019-05-16 2019-05-16 File management method and device, electronic equipment and storage medium Active CN110147203B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910411298.8A CN110147203B (en) 2019-05-16 2019-05-16 File management method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910411298.8A CN110147203B (en) 2019-05-16 2019-05-16 File management method and device, electronic equipment and storage medium

Publications (2)

Publication Number Publication Date
CN110147203A true CN110147203A (en) 2019-08-20
CN110147203B CN110147203B (en) 2022-11-04

Family

ID=67595693

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910411298.8A Active CN110147203B (en) 2019-05-16 2019-05-16 File management method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN110147203B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112416880A (en) * 2021-01-22 2021-02-26 南京群顶科技有限公司 Method and device for optimizing storage performance of mass small files based on real-time merging
CN113420025A (en) * 2021-06-11 2021-09-21 广联达科技股份有限公司 Component data processing method and device and electronic equipment
CN113495681A (en) * 2020-04-07 2021-10-12 杭州萤石软件有限公司 NAND FLASH file data access method, device and storage medium
CN114218161A (en) * 2021-12-29 2022-03-22 北京百度网讯科技有限公司 Index storage method and device, retrieval engine, electronic equipment and storage medium
WO2022083287A1 (en) * 2020-10-20 2022-04-28 百果园技术(新加坡)有限公司 Storage space management method and apparatus, device, and storage medium
CN117632039A (en) * 2024-01-25 2024-03-01 合肥兆芯电子有限公司 Memory management method, memory storage device and memory control circuit unit

Citations (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5454103A (en) * 1993-02-01 1995-09-26 Lsc, Inc. Method and apparatus for file storage allocation for secondary storage using large and small file blocks
CA2596434A1 (en) * 2006-08-25 2008-02-25 Dan Dodge File system having variable logical storage block size
US20130111182A1 (en) * 2011-10-26 2013-05-02 International Business Machines Corporation Storing a small file with a reduced storage and memory footprint
CN103577123A (en) * 2013-11-12 2014-02-12 河海大学 Small file optimization storage method based on HDFS
CN103605726A (en) * 2013-11-15 2014-02-26 中安消技术有限公司 Method and system for accessing small files, control node and storage node
CN103678579A (en) * 2013-12-12 2014-03-26 浪潮电子信息产业股份有限公司 Optimizing method for small-file storage efficiency
CN104462563A (en) * 2014-12-26 2015-03-25 浙江宇视科技有限公司 File storage method and system
CN104536959A (en) * 2014-10-16 2015-04-22 南京邮电大学 Optimized method for accessing lots of small files for Hadoop
CN104572670A (en) * 2013-10-15 2015-04-29 方正国际软件(北京)有限公司 Small file storage, query and deletion method and system
CN104991747A (en) * 2015-07-30 2015-10-21 湖南亿谷科技发展股份有限公司 Method and system for data management
CN105069048A (en) * 2015-07-23 2015-11-18 东方网力科技股份有限公司 Small file storage method, query method and device
CN105095421A (en) * 2015-07-14 2015-11-25 南京国电南自美卓控制系统有限公司 Distributed storage method for real-time database
CN105138571A (en) * 2015-07-24 2015-12-09 四川长虹电器股份有限公司 Distributed file system and method for storing lots of small files
US9286261B1 (en) * 2011-11-14 2016-03-15 Emc Corporation Architecture and method for a burst buffer using flash technology
CN105868286A (en) * 2016-03-23 2016-08-17 中国科学院计算技术研究所 Parallel adding method and system for merging small files on basis of distributed file system
CN105956183A (en) * 2016-05-30 2016-09-21 广东电网有限责任公司电力调度控制中心 Method and system for multi-stage optimization storage of a lot of small files in distributed database
CN106021585A (en) * 2016-06-02 2016-10-12 同济大学 Traffic incident video access method and system based on time-space characteristics
CN106294603A (en) * 2016-07-29 2017-01-04 北京奇虎科技有限公司 File memory method and device
CN106775446A (en) * 2016-11-11 2017-05-31 中国人民解放军国防科学技术大学 Based on the distributed file system small documents access method that solid state hard disc accelerates
CN107247714A (en) * 2016-06-01 2017-10-13 国家电网公司 A kind of small documents access system and method based on distributed storage technology
CN107766374A (en) * 2016-08-19 2018-03-06 上海凯翔信息科技有限公司 The optimization method and system that a kind of mass small documents storage is read
CN108234594A (en) * 2017-11-28 2018-06-29 北京市商汤科技开发有限公司 File memory method and device, electronic equipment, program and medium

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113495681A (en) * 2020-04-07 2021-10-12 杭州萤石软件有限公司 NAND FLASH file data access method, device and storage medium
WO2022083287A1 (en) * 2020-10-20 2022-04-28 百果园技术(新加坡)有限公司 Storage space management method and apparatus, device, and storage medium
CN112416880A (en) * 2021-01-22 2021-02-26 南京群顶科技有限公司 Method and device for optimizing storage performance of mass small files based on real-time merging
CN113420025A (en) * 2021-06-11 2021-09-21 广联达科技股份有限公司 Component data processing method and device and electronic equipment
CN114218161A (en) * 2021-12-29 2022-03-22 北京百度网讯科技有限公司 Index storage method and device, retrieval engine, electronic equipment and storage medium
CN117632039A (en) * 2024-01-25 2024-03-01 合肥兆芯电子有限公司 Memory management method, memory storage device and memory control circuit unit
CN117632039B (en) * 2024-01-25 2024-05-03 合肥兆芯电子有限公司 Memory management method, memory storage device and memory control circuit unit

Also Published As

Publication number Publication date
CN110147203B (en) 2022-11-04

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant