Nothing Special   »   [go: up one dir, main page]

CN106708622A - Cluster resource processing method and system, and resource processing cluster - Google Patents

Cluster resource processing method and system, and resource processing cluster Download PDF

Info

Publication number
CN106708622A
CN106708622A CN201610570094.5A CN201610570094A CN106708622A CN 106708622 A CN106708622 A CN 106708622A CN 201610570094 A CN201610570094 A CN 201610570094A CN 106708622 A CN106708622 A CN 106708622A
Authority
CN
China
Prior art keywords
node
resource
cluster
label
uml
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610570094.5A
Other languages
Chinese (zh)
Other versions
CN106708622B (en
Inventor
唐祥豪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201610570094.5A priority Critical patent/CN106708622B/en
Publication of CN106708622A publication Critical patent/CN106708622A/en
Application granted granted Critical
Publication of CN106708622B publication Critical patent/CN106708622B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5083Techniques for rebalancing the load in a distributed system

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention relates to a cluster resource processing method and system, and a resource processing cluster. The cluster resource processing method comprises the following steps that: monitoring node calculation resource information, and obtaining current idle calculation resources according to the node calculation resource information; receiving an operation request which carries resource configuration information, wherein the resource configuration information comprises a node label; according to the node label, obtaining a corresponding node queue, and distributing target calculation resources to the operation in the current idle calculation resources of the node in the node queue according to the resource configuration information; and on the node corresponding to the target calculation resources, starting a (User Mode Linux) resource isolation container, and executing the operation in a user space. A cluster use ratio can be improved, and operation throughput in the cluster can be improved.

Description

Cluster resource processing method and system, resource treatment cluster
Technical field
The present invention relates to field of computer technology, more particularly to a kind of cluster resource processing method and system, Energy Resources Service Reason cluster.
Background technology
Cluster is one group of computer that is separate, being interconnected by express network, and they constitute a group, and with list The pattern of one system is managed.Server in cluster cooperates, and provides a user with network english teaching, comes for user Say that cluster is like a triangular web.Special computer is rented in such as enterprise procurement, using linux type operating systems, is Customer group provides access service.
Traditional cluster resource processing method, when being managed container resource, generally requires to update kernel or application behaviour Make the administrator right of system, updating kernel can influence the service of enterprise's existing deployment, or need to operation maintenance personnel application pipe Reason person's authority, is unfavorable for that addition server node carries out related deployment, causes cluster server to be unable to flexible deployment, and average profit It is not high with rate.
The content of the invention
Based on this, it is necessary to for above-mentioned technical problem, there is provided a kind of cluster resource processing method and system, resource treatment Cluster, improves cluster utilization rate, improves the handling capacity of operation in cluster.
A kind of cluster resource processing method, methods described includes:
Monitor node computing resource information, obtains cluster current idle and calculates money according to the node computing resource information Source;
The job request for carrying resource allocation information is received, the resource allocation information includes node label;
Corresponding node queue is obtained according to the node label, according to the resource allocation information in the node queue In node current idle computing resource in be operation distribution target computational resource;
Start user model Linux UML resource isolation containers in the corresponding node of the target computational resource, in user Space performs operation.
A kind of cluster resource processing system, the system includes:
Computing resource monitoring management module, for monitor node computing resource information, believes according to the node computing resource Breath obtains cluster current idle computing resource;
Computational resource allocation module, the job request of resource allocation information, the resource allocation information are carried for receiving Including node label, corresponding node queue is obtained according to the node label, according to the resource allocation information in the section It is the operation distribution target computational resource in the current idle computing resource of the node in point queue;
Operation execution module, for starting user model Linux UML moneys in the corresponding node of the target computational resource Source spacing container, operation is performed in user's space.
Above-mentioned cluster resource processing method and system, by monitor node computing resource information, calculate according to the node Resource information obtains cluster current idle computing resource, receives the job request for carrying resource allocation information, resource allocation information Including node label, corresponding node queue, the section according to resource allocation information in node queue are obtained according to node label For operation distributes target computational resource in the current idle computing resource of point, user is started in the corresponding node of target computational resource Pattern Linux UML resource isolation containers, operation is performed in user's space, according to the node computing resource letter that monitor in real time is obtained Breath obtain real-time change idle computing resources so that the distribution of resource is flexible and efficient, and by UML resource isolations container with Family space performs operation, and on the premise of good isolation is ensured, it is not necessary to administrator right, energy flexible deployment clustered node will Operation is distributed to the node of available free computing resource, by introducing scattered service node, equivalent in existing machine upper portion A new computing cluster is affixed one's name to, the utilization rate of server resource can have been improved, so as to improve the handling capacity of operation in cluster.
A kind of resource processes cluster, and the cluster includes:
Calculate node, the calculate node node pre-conditioned for idle computing resources meet, the calculate node The external loading of node of the quantity in the cluster takes amount of computational resources and updates;
Resource management server, for monitor node computing resource information, obtains according to the node computing resource information Cluster current idle computing resource, receives the job request for carrying resource allocation information, and the resource allocation information includes node Label, corresponding calculate node queue is obtained according to the node label, and section is calculated described according to the resource allocation information It is the operation distribution target computational resource in the current idle computing resource of the target computing nodes in point queue;
The target computing nodes are used to start user model Linux UML resource isolation containers, are performed in user's space Operation.
Above-mentioned resource processes cluster, calculate node in the external loading occupancy amount of computational resources renewal cluster according to node Quantity, the idle computing resources of the node computing resource acquisition of information real-time change obtained according to monitor in real time so that resource Distribution is flexible and efficient;And operation is performed in user's space by UML resource isolations container, on the premise of good isolation is ensured, Administrator right is not needed, energy flexible deployment clustered node distributes operation to the node of available free computing resource, by introducing Scattered service node composition cluster, a new computing cluster has been affixed one's name to equivalent in existing machine upper portion, can improve service The utilization rate of device resource, so as to improve the handling capacity of operation in cluster.
Brief description of the drawings
Fig. 1 is the applied environment figure of cluster resource processing method operation in one embodiment;
Fig. 2 is the flow chart of cluster resource processing method in one embodiment;
Fig. 3 is one week interior cpu occupancy schematic diagram of cluster in one embodiment;
Fig. 4 is the flow chart of cluster resource processing method in a specific embodiment;
Fig. 5 is the structured flowchart of resource treatment cluster in one embodiment;
Fig. 6 is the structured flowchart of resource treatment cluster in another embodiment;
Fig. 7 is the structured flowchart of cluster resource processing system in one embodiment;
Fig. 8 is the structured flowchart of cluster resource processing system in another embodiment;
Fig. 9 is the structured flowchart of cluster resource processing system in further embodiment;
Figure 10 is the structured flowchart of cluster resource processing system in another embodiment;
Figure 11 is the structured flowchart of cluster resource processing system in further embodiment;
Figure 12 is the structured flowchart of operation execution module in one embodiment;
Figure 13 is the structured flowchart of cluster resource processing system in another embodiment;
Figure 14 is the structured flowchart of cluster resource processing system in another embodiment.
Specific embodiment
Fig. 1 is the applied environment figure of cluster resource processing method operation in one embodiment.As shown in figure 1, this applies ring Border includes that end 110, resource management server 120, calculate node 130 and dedicated node 140, wherein calculate node are submitted in operation to 130 include multiple nodes, including node 131 ... node 13n, dedicated node 140 can be one or more, and wherein operation is carried End 110, resource management server 120, calculate node 130 and dedicated node 140 composition cluster are handed over, can be led to by network Letter, can be Hadoop clusters or Spark clusters.
It can be notebook computer, desktop computer etc. that end 110 is submitted in operation to, but be not limited thereto.Calculate node 130 Quantity takes computing resource and updates according to the external loading of node, and each clustered node possesses monitoring module, for prison in real time Control node computing resource information, obtains cluster current idle computing resource, and resource management server 120 is taken according in job request The resource allocation information of band is that operation distributes destination node, and starts user model Linux UML resource isolations in destination node Container, operation is performed in user's space.Job manager, the progress of monitoring resultses can be disposed on dedicated node 140.
In one embodiment, as shown in Figure 2, there is provided a kind of cluster resource processing method, to be applied to above-mentioned application Come in environment for example, comprising the following steps:
Step S210, monitor node computing resource information obtains cluster current idle meter according to node computing resource information Calculate resource.
Specifically, money of the computing resource information including cpu occupancies, free memory, disk remaining space etc. for calculating Source relevant information, wherein can carry out being converted to occupancy cpu check figures according to cpu occupancies, and then obtains available cpu check figures, such as Current cpu occupancies 60%, the total check figures of cpu are 8, can be 8* (1-0.6), about 3 with cpu check figures.Idle free memory is By total internal memory Mem of nodetotalSubtract the internal memory Mem of external service occupancyexternalObtain, i.e. Memtotal- Memexternal.The circular of the current idle computing resource of each node can customize, such as when disk remaining space is small When predetermined threshold value, node is added into blacklist.Each deployment node calculates money by node manager to the idle of this node Source is timed monitoring and collects, and is pooled to explorer, obtains the total available resource amount of whole cluster, it is ensured that All Jobs Resource request be all to be divided away from idle computing resources.Wherein node manager is a clothes for operating in calculate node Business process, manages, distributes, reporting, reclaiming the computing resource of the calculate node.Explorer is to operate in resource management service One service processes of device, manage, distribute, reclaiming the computing resource of all calculate nodes.
In one embodiment, the current idle computing resource of node is that remaining disk space presets threshold more than disk space The available cpu check figures of the node of value, the summation of free memory.
Specifically, if the remaining disk space of node is less than disk space predetermined threshold value, this node does not perform calculating Task, it is ensured that the data storage of calculating task and external service.
In one embodiment, using platform monitoring software such as ganglia, the load condition of the whole cluster of deployment monitoring, To investigate node failure reason.According to platform identity, the measurement data that increased ganglia in resource management layer is reported patrols Volume, the utilization power of idle computing resources is visualized.If Fig. 3 is cpu occupancy schematic diagrames in cluster one week, 240 is outside The resource occupation ratio of service, 250 is the idling-resource ratio of the actual occupancy of operation, and 260 is the remaining available idle meter of operation Calculate resource ratio.
Step S220, receives the job request for carrying resource allocation information, and resource allocation information includes node label, according to Node label obtains corresponding node queue, and the current idle of the node according to resource allocation information in node queue calculates money For operation distributes target computational resource in source.
Specifically, the resource distribution that resource allocation information is used for needed for characterizing Job execution, such as operation can be with when submitting to Specify the memory size and node label of application container.Node label is used to characterize the type of node, can area by node label Divide the node of different computing capabilitys.The setting of node label can be self-defined as needed, such as sets different level distinguishing labels.
In one embodiment, before step S220, also include:Believed by the physical attribute information and load condition of node Breath, is that node sets label.
Specifically, physical attribute information is the physical attribute of node, and it is build-in attribute, such as big calculate node distribution of internal memory High_mem is identified.Load state information is the load condition of stabilization in the regular period, such as by counting preset time period internal segment The resource occupation of the external progress of point, compares with predetermined threshold value, external resource is taken high more than the external loading of predetermined threshold value Node identified with high_load.One node can be set one or more node labels.
Operation can specify the memory size and node label of application container when submitting to by resource allocation information, pass through Node label obtains corresponding target labels node queue, and suitable node is only selected in target labels node queue to perform Calculating task.If it is understood that the node mark of acquiescence can be used without specified node label in resource allocation information Sign.Operation is usually distributed treatment bulk data, and input data is cut into small data block, gives subtask treatment, is Suitable target computational resource is distributed in each subtask, and each subtask is assigned to each node for meeting computing resource condition In.By node label, the destination node for meeting condition can be screened in calculate node, to meet the calculating of different work needs Resource granularity and reduction Job Failure Rate.
Step S230, user model Linux UML resource isolation containers are started in the corresponding node of target computational resource, User's space performs operation.
Specifically, user model Linux UML (user mode linux) resource isolation container can be created by domestic consumer Startup, more versatility are built, user mode linux use mmap mode extended memory address space, therefore have little influence on outer The physical memory that portion's service is used, substantially allows data to be converted operation via cpu, there is the independent process space, protects Card container internal process cannot access physical machine external service process, it is ensured that the process of external service is not destroyed.In a reality Apply in example, Job execution terminates, the computing resource taken during release Job execution, it is ensured that the recurrence of idling-resource.
In one embodiment, user model Linux is started in the corresponding node of target computational resource in step S230 The step of UML resource isolation containers, includes:In perform script, the environmental variance of job dependence is set, and the file that operation is needed In resource carry to container, created with domestic consumer's identity and start UML resource isolation containers.
The root in container can be mapped as with newly-built private file catalogue, it is ensured that limited file access, it is to avoid not The unauthorized access of mandate, modification file, while result of calculation can be stored, so as to reach the requirement of resource isolation.Can be by modification The mixed-media network modules mixed-media code of user mode linux, realize UML resource isolations container do not need administrator right can be with extranets Network is communicated, and carries out data transmission, and operation is performed in user's space.Do not need applications management person's authority, it is possible to perform operation, it is complete Into communication, improve the flexibility of each calculate node deployment, can be random increase or decrease calculate node, without extra Application authority.
In the present embodiment, by monitor node computing resource information, cluster is obtained according to the node computing resource information Current idle computing resource, receives the job request for carrying resource allocation information, and resource allocation information includes node label, according to Node label obtains corresponding node queue, and the current idle of the node according to resource allocation information in node queue calculates money For operation distributes target computational resource in source, user model Linux UML resources are started in the corresponding node of target computational resource Spacing container, operation is performed in user's space, the node computing resource acquisition of information real-time change obtained according to monitor in real time Idle computing resources so that the distribution of resource is flexible and efficient, and operation is performed in user's space by UML resource isolations container, On the premise of good isolation is ensured, it is not necessary to administrator right, can flexible deployment clustered node, operation is distributed to available free The node of computing resource, by introducing scattered service node, a new calculating has been affixed one's name to equivalent in existing machine upper portion Cluster, can improve the utilization rate of server resource, so as to improve the handling capacity of operation in cluster.
In one embodiment, in step S230, when user's space performs operation, by using user model network Slirp network simulators start virtual network device and are bridged with physical network card with domestic consumer's identity, realize network service.
Specifically, by tcp/ip remote transmission input datas during Job execution, virtual network inside container and The physical network of back end needs foundation connection transceiving data bag to be communicated.User mode linux are generally by opening Dynamic virtual network device, such as tun equipment is bridged with physical network card, realizes the communication with external network, but start tun and set It is standby to need root authority, in the present embodiment, the mixed-media network modules mixed-media code of user mode linux is have modified, using slirp network moulds Intend device scheme, start virtual network device with domestic consumer's identity bridges with physical network card, it is ensured that the transmission of inputoutput data And the efficient convenient communication between UML resource isolation containers.
In one embodiment, user model Linux is started in the corresponding node of target computational resource in step S230 The step of UML resource isolation containers, includes:New files catalogue, if newly-built success, obtains lock, starts first user pattern Linux UML resource isolation containers, are a UML resource isolation container allocation IP address, and a UML resource isolations container starts Success, then delete file directory, release lock.
Specifically, each subtask performs to be required for starting a UML resource isolation container, calculating process may need far Journey accesses input data, it is necessary to is one sole internal ip of each UML resource isolations container allocation, need to ensure and physical node The network segment do not conflict, lock is obtained by way of new files catalogue, only obtain lock could start UML resource isolation containers And distributing IP address, it is ensured that it is serial that individual node inner pressurd vessel starts, so as to ensure that each IP address is unique and different.
In one embodiment, after step S230, also include:If external loading increases the resource for taking more than first The resource that predetermined threshold value or operation subtask take then is cleared up UML resource isolations container and shifts son more than the second predetermined threshold value Other nodes in task to node queue.
Specifically, collecting the computing resource that operation subtask takes by container process tree in individual node, work as subtask The computing resource of occupancy is then cleared up UML resource isolations container and shifts calculating task to node queue more than the second predetermined threshold value In other nodes.External loading increases the resource for taking more than the first predetermined threshold value so that remaining computing resource in node During less than the first predetermined threshold value, then clear up UML resource isolations container and shift other nodes in subtask to node queue, from And ensure that computing resource has cushion space to a certain extent.
In one embodiment, the effective transfer and operation in order to ensure task can finally be smoothly completed, for due to making The resource that industry subtask takes is not included into unsuccessfully number of retries more than mission failure caused by the second predetermined threshold value, so that Operation subtask can be assigned to the relatively low node of external service load and perform calculating.
In one embodiment, whether key service node is obtained, current time is judged in preset time period, if it is, It is 0 then to set the current idle computing resource of the key service node, described if current time is not in preset time period The node computing resource that the current idle computing resource of key service node is obtained for actual monitored.
Specifically, key service node is the node that critical services are provided for outside, such node general work Time period service request amount Relatively centralized, load is higher, in order to not influence service, when computing resource collection is carried out, with the addition of Time period limits, and the scope of preset time period can be self-defined as needed, such as can be arranged on one day words spoken by an actor from offstage by configuration file The available free resource of its time period key service node is 0, then will not distribute calculating task, to night or morning, load Take resource to decline, node reports actual idling-resource amount again.
In one embodiment, if the current idle computing resource of first node is less than resource predetermined threshold value, by the One node is temporarily disengaged from cluster.
If specifically, the current idle computing resource of first node be less than resource predetermined threshold value, first node is temporary When depart from cluster because the load of external service is uncertain, rise in external loading, current idle computing resource is too small When, cluster is temporarily disengaged from by by first node, job task of not reallocating temporarily, it is ensured that first node can be carried for external service For cushion space, while avoiding node uncontrollable because of load too high, it is ensured that the limitation of computing resource.When working as first node Preceding idle computing resources recover during more than resource predetermined threshold value, can recover to add cluster.
In one embodiment, method also includes:For the node that load condition meets stable node condition sets static Static labels, in the node initiating task manager of static labels, the node of the static labels is proprietary node, is protected Hold cluster state.
Specifically, the design parameter of stable node condition can be self-defined as needed, such as load relatively low, computing resource and fill Foot, can be considered the node for meeting stable node condition, static state static labels be set to this node, in the section of static labels Point initiating task manager, such node can be the proprietary node of cluster, keep cluster state, and service is provided always, can Job manager is avoided to be cleaned out because of computing resource deficiency, it is to avoid to lose job scheduling information.
In one embodiment, method also includes:The corresponding node blacklist of operation is obtained, if the in node blacklist The ratio of the corresponding all node numbers of the first label exceedes label threshold value in the corresponding node number of one label and cluster, then will The corresponding node of the first label removes node blacklist in node blacklist.
Specifically, explorer can be operation sets a interim blacklist, only to the operation effectively, when certain node Mission failure number of times it is excessive when, by node add blacklist in, when the node number in blacklist exceed cluster in node Several proportion threshold value p, then blacklist failure.If ratio of the node number of node label X in whole cluster is no more than p, Therefore blacklist will not fail, fluctuated under big mal-condition in the corresponding node loads of node label X, it is understood that there may be node label Each node of X is put on the blacklist because mission failure number of times is excessive, and cluster is added unless there are new X label nodes, no Then blacklist will not be failed, and operation will be waited infinitely, in order to avoid such case occurs, in the present embodiment, if the black name of node The ratio of the corresponding all node numbers of the first label exceedes label threshold in the corresponding node number of the first label and cluster in list Value, then remove node blacklist by the corresponding node of the first label in node blacklist, and when calculating proportionality coefficient, use is not Node in whole cluster, but the corresponding all nodes of the first label in cluster, it is to avoid the situation that operation is infinitely waited, more Rationally, fault freedom is good.
As shown in figure 4, being the flow chart of cluster resource processing method in a specific embodiment, detailed process is as follows:
Step S310, monitor node computing resource information obtains cluster current idle meter according to node computing resource information Calculate resource.
Step S320, receives the job request of the resource allocation information for carrying node label, is existed according to resource allocation information For operation distributes target computational resource in the current idle computing resource of the node in the corresponding node queue of node label.
Step S330, user model Linux UML resource isolation containers are started in the corresponding node of target computational resource, User's space performs operation.
Step S340, when external loading increases the resource for taking more than the money that the first predetermined threshold value or operation subtask take When source is more than the second predetermined threshold value, other sections in cleaning UML resource isolations container and transfer tasks subtask to node queue Point.
Step S350, Job execution terminates, the computing resource taken during release Job execution.
In one embodiment, as shown in Figure 5, there is provided a kind of resource processes cluster, including:
Calculate node 420, the node pre-conditioned for idle computing resources meet of calculate node 420, calculate node 420 The external loading that quantity processes the node in cluster according to resource takes amount of computational resources renewal.
Specifically, pre-conditioned can be self-defined as needed, idle computing resources meet pre-conditioned node can be at any time As calculate node, idle computing resources are unsatisfactory for pre-conditioned node and can at any time be temporarily disengaged from cluster, the number of calculate node The external loading for measuring the node processed according to resource in cluster takes amount of computational resources renewal, it is ensured that effective utilization of resource.
In one embodiment, if resource management server is additionally operable to the current idle computing resource of the first calculate node Less than resource predetermined threshold value, then the first calculate node is temporarily disengaged from cluster.
If specifically, the current idle computing resource of the first calculate node is less than resource predetermined threshold value, first counted Operator node is temporarily disengaged from cluster, because the load of external service is uncertain, rises in external loading, and current idle is calculated When resource is too small, cluster is temporarily disengaged from by by the first calculate node, job task of not reallocating, it is ensured that first node can be outer Portion's service provides cushion space, while avoiding node uncontrollable because of load too high, it is ensured that the limitation of computing resource.
Resource management server 410, for monitor node computing resource information, is obtained according to node computing resource information One cluster current idle computing resource, receives the job request for carrying resource allocation information, and resource allocation information includes node mark Sign, corresponding calculate node queue is obtained according to node label, according to target of the resource allocation information in calculate node queue For operation distributes target computational resource in the current idle computing resource of calculate node 421.
Target computing nodes 421 are used to start user model Linux UML resource isolation containers, and work is performed in user's space Industry.
In one embodiment, resource treatment cluster includes key service node, and key service node is used to judge current Whether the time is in preset time period, if it is, the current idle computing resource for setting key service node is 0, if currently Time is not in preset time period, then the node calculating money that the current idle computing resource of key service node is obtained for actual monitored Source..
Specifically, key service node is the node that critical services are provided for outside, such node general work Time period service request amount Relatively centralized, load is higher, in order to not influence service, when computing resource collection is carried out, with the addition of Time period limits, and the scope of preset time period can be self-defined as needed, such as can be arranged on one day words spoken by an actor from offstage by configuration file The available free resource of its time period key service node is 0, then will not distribute calculating task, to night or morning, load Take resource to decline, node reports actual idling-resource amount again, can again be its corresponding calculating task of distribution operation.
In one embodiment, target computing nodes 421 are additionally operable to when user's space performs operation, by using user Mode network slirp network simulators start virtual network device and are bridged with physical network card with domestic consumer's identity, realize network Communication.
Specifically, target computing nodes 421 pass through tcp/ip remote transmission input datas, container during Job execution Internal virtual network and the physical network of back end need foundation connection transceiving data bag to be communicated.user mode Linux is bridged generally by tun equipment is started with physical network card, realizes the communication with external network, but start tun Equipment needs root authority, in the present embodiment, the mixed-media network modules mixed-media code of user mode linux is have modified, using slirp networks Simulator scheme, starts virtual network device and is bridged with physical network card, it is ensured that the biography of inputoutput data with domestic consumer's identity Efficient convenient communication between defeated and UML resource isolation containers.
In one embodiment, as shown in fig. 6, cluster also includes:
Dedicated node 430, the node label of dedicated node is static labels, cluster state is kept, for initiating task Manager.
Specifically, dedicated node refers to the node that service-specific is provided for cluster, external service is not provided, it is such Node keeps providing the service of stabilization, and cluster is not departed from easily.In the node initiating task manager of static labels, collection is kept Group's state, can always provide service, job manager can be avoided to be cleaned out because of computing resource deficiency, it is to avoid from losing operation and enter Degree information.
In one embodiment, as shown in Figure 7, there is provided a kind of cluster resource processing system, including:
Computing resource monitoring management module 510, for monitor node computing resource information, according to node computing resource information Obtain cluster current idle computing resource.
Computational resource allocation module 520, the job request of resource allocation information, resource allocation information bag are carried for receiving Node label is included, corresponding node queue, the node according to resource allocation information in node queue are obtained according to node label Current idle computing resource in for operation distribute target computational resource.
Operation execution module 530, for starting user model Linux UML resources in the corresponding node of target computational resource Spacing container, operation is performed in user's space.
In one embodiment, as shown in figure 8, system also includes:
First resource limits module 540, if increase the resource for taking more than the first predetermined threshold value for external loading or The resource that operation subtask takes clear up UML resource isolations container more than the second predetermined threshold value, then and transfer tasks subtask extremely Other nodes in node queue.
In one embodiment, as shown in figure 8, system also includes:
Whether Secondary resource limits module 550, for obtaining key service node, judge current time in Preset Time Section, if it is, the current idle computing resource for setting key service node is 0, if current time is not in preset time period, The node computing resource that then the current idle computing resource of key service node is obtained for actual monitored.It is understood that being System may include first resource limitation module 540 and Secondary resource limitation any one module therein of module 550.
Although it is understood that included in Fig. 8 first resource limitation module 540 and Secondary resource limitation module 550, At least one module therein is may include in embodiment.
In one embodiment, as shown in figure 9, system also includes:
Cluster resource management module 560, if presetting threshold less than resource for the current idle computing resource of first node Value, then be temporarily disengaged from cluster by first node.
In one embodiment, as shown in Figure 10, system also includes:
Label setup module 570, is that node sets mark for physical attribute information and load state information according to node Sign.
In one embodiment, as shown in figure 11, system also includes:
Job management module 580, the node for meeting stable node condition for load condition sets static state static marks Sign, in the node initiating task manager of static labels, the node of static labels is proprietary node, keeps cluster state.
In one embodiment, operation execution module 530 is additionally operable to be set in perform script the environment of the job dependence In variable, and file resource carry to the container that operation is needed, created with domestic consumer's identity and start UML resource isolations appearance Device.
In one embodiment, as shown in figure 12, operation execution module 530 includes:
Lock acquisition module 531, for new files catalogue, if newly-built success, obtains lock.
Job initiation module 532, is UML moneys for starting first user pattern Linux UML resource isolation containers Source spacing container distribution IP address.
Lock release module 533, starts successfully for a UML resource isolation containers, then delete file directory, release lock.
In one embodiment, as shown in figure 13, system also includes:
UML network communication modules 590, for when user's space performs operation, by using user model network slirp Network simulator starts virtual network device and is bridged with physical network card with domestic consumer's identity, realizes network service.
In one embodiment, as shown in figure 14, system also includes:
Fault-tolerant module 600, for obtaining the corresponding node blacklist of operation, if the first label correspondence in node blacklist Node number and cluster in the corresponding all node numbers of the first label ratio exceed label threshold value, then by node blacklist In the corresponding node of the first label remove node blacklist.
One of ordinary skill in the art will appreciate that all or part of flow in realizing above-described embodiment method, can be The hardware of correlation is instructed to complete by computer program, described program can be stored in a computer read/write memory medium In, such as in the embodiment of the present invention, the program can be stored in the storage medium of computer system, and by the computer system At least one computing device, to realize including the flow of the embodiment such as above-mentioned each method.Wherein, the storage medium can be Magnetic disc, CD, read-only memory (Read-Only Memory, ROM) or random access memory (Random Access Memory, RAM) etc..
Each technical characteristic of embodiment described above can be combined arbitrarily, to make description succinct, not to above-mentioned reality Apply all possible combination of each technical characteristic in example to be all described, as long as however, the combination of these technical characteristics is not deposited In contradiction, the scope of this specification record is all considered to be.
Embodiment described above only expresses several embodiments of the invention, and its description is more specific and detailed, but simultaneously Can not therefore be construed as limiting the scope of the patent.It should be pointed out that coming for one of ordinary skill in the art Say, without departing from the inventive concept of the premise, various modifications and improvements can be made, these belong to protection of the invention Scope.Therefore, the protection domain of patent of the present invention should be determined by the appended claims.

Claims (27)

1. a kind of cluster resource processing method, methods described includes:
Monitor node computing resource information, cluster current idle computing resource is obtained according to the node computing resource information;
The job request for carrying resource allocation information is received, the resource allocation information includes node label;
Corresponding node queue is obtained according to the node label, according to the resource allocation information in the node queue It is the operation distribution target computational resource in the current idle computing resource of node;
Start user model Linux UML resource isolation containers in the corresponding node of the target computational resource, in user's space Perform operation.
2. method according to claim 1, it is characterised in that described after the step of user's space performs operation, also Including:
If external loading increases resource of the resource more than the first predetermined threshold value or operation subtask occupancy for taking more than second Predetermined threshold value, then other nodes cleared up in the UML resource isolations container and transfer tasks subtask to the node queue.
3. method according to claim 1, it is characterised in that methods described also includes:
Key service node is obtained, whether current time is judged in preset time period, if it is, setting the key service section The current idle computing resource of point is 0, if current time is not in preset time period, the current sky of the key service node The node computing resource that not busy computing resource is obtained for actual monitored.
4. method according to claim 1, it is characterised in that methods described also includes:
If the current idle computing resource of first node is less than resource predetermined threshold value, the first node is temporarily disengaged from collection Group.
5. method according to claim 1, it is characterised in that the reception carries the job request of resource allocation information Before step, also include:
It is that node sets label according to the physical attribute information and load state information of node.
6. method according to claim 1, it is characterised in that the current idle computing resource is that remaining disk space is big The summation of available cpu check figures, free memory in the node of disk space predetermined threshold value.
7. method according to claim 1, it is characterised in that methods described also includes:
For the node that load condition meets stable node condition sets static state static labels, in the node of the static labels Initiating task manager, the node of the static labels is proprietary node, keeps cluster state.
8. method according to claim 1, it is characterised in that described to start in the corresponding node of the target computational resource The step of user model Linux UML resource isolation containers, includes:
In perform script, the environmental variance of the job dependence is set, and in file resource carry to the container that operation is needed, Created with domestic consumer's identity and start UML resource isolation containers.
9. method according to claim 1, it is characterised in that described to start in the corresponding node of the target computational resource The step of user model Linux UML resource isolation containers, includes:
New files catalogue, if newly-built success, obtains lock, starts first user pattern Linux UML resource isolation containers, It is the UML resource isolation container allocation IP address;
The first UML resource isolation containers start successfully, then delete the file directory, release lock.
10. method according to claim 1, it is characterised in that methods described also includes:
When user's space performs operation, opened with domestic consumer's identity by using user model network slirp network simulators Dynamic virtual network device is bridged with physical network card, realizes network service.
11. methods according to claim 1, it is characterised in that methods described also includes:
Obtain the corresponding node blacklist of the operation, if in the node blacklist the corresponding node number of the first label with The ratio of the corresponding all node numbers of the first label exceedes label threshold value in cluster, then by the first mark in the node blacklist Sign corresponding node and remove the node blacklist.
12. methods according to claim 1, it is characterised in that described after the step of user's space performs operation, also Including:
The Job execution terminates, and discharges the computing resource taken during the Job execution.
13. a kind of cluster resource processing systems, it is characterised in that the system includes:
Computing resource monitoring management module, for monitor node computing resource information, obtains according to the node computing resource information To cluster current idle computing resource;
Computational resource allocation module, the job request of resource allocation information is carried for receiving, and the resource allocation information includes Node label, obtains corresponding node queue, according to the resource allocation information in the node team according to the node label It is the operation distribution target computational resource in the current idle computing resource of the node in row;
Operation execution module, for starting user model LinuxUML resource isolations in the corresponding node of the target computational resource Container, operation is performed in user's space.
14. systems according to claim 13, it is characterised in that the system also includes:
First resource limits module, if increase the resource for taking for external loading appointed more than the first predetermined threshold value or operation The resource for taking be engaged in more than the second predetermined threshold value, then clear up the UML resource isolations container and transfer tasks subtask to described Other nodes in node queue.
15. systems according to claim 13, it is characterised in that the system also includes:
Whether Secondary resource limits module, for obtaining key service node, judges current time in preset time period, if It is that then to set the current idle computing resource of the key service node be 0, if current time is not in preset time period, The node computing resource that the current idle computing resource of the key service node is obtained for actual monitored.
16. systems according to claim 13, it is characterised in that the system also includes:
Cluster resource management module, if being less than resource predetermined threshold value for the current idle computing resource of first node, will The first node is temporarily disengaged from cluster.
17. systems according to claim 13, it is characterised in that the system also includes:
Label setup module, is that node sets label for physical attribute information and load state information according to node.
18. systems according to claim 13, it is characterised in that the system also includes:
Job management module, the node for meeting stable node condition for load condition sets static state static labels, in institute The node initiating task manager of static labels is stated, the node of the static labels is proprietary node, keeps cluster state.
19. systems according to claim 13, it is characterised in that the operation execution module is additionally operable to be set in perform script The environmental variance of the job dependence is put, and in file resource carry to the container that operation is needed, is created with domestic consumer's identity Build startup UML resource isolation containers.
20. systems according to claim 13, it is characterised in that the operation execution module includes:
Lock acquisition module, for new files catalogue, if newly-built success, obtains lock;
Job initiation module, is a UML resources for starting first user pattern Linux UML resource isolation containers Spacing container distributes IP address;
Lock release module, starts successfully for the UML resource isolation containers, then delete the file directory, release lock.
21. systems according to claim 13, it is characterised in that the system also includes:
UML network communication modules, for when user's space performs operation, by using user model network slirp network moulds Intend device and virtual network device and physical network card bridge joint are started with domestic consumer's identity, realize network service.
22. systems according to claim 13, it is characterised in that the system also includes:
Fault-tolerant module, for obtaining the corresponding node blacklist of the operation, if the first label pair in the node blacklist The ratio of the corresponding all node numbers of the first label exceedes label threshold value in the node number and cluster answered, then by the node The corresponding node of the first label removes the node blacklist in blacklist.
23. a kind of resource treatment clusters, it is characterised in that the cluster includes:
Calculate node, the calculate node node pre-conditioned for idle computing resources meet, the quantity of the calculate node The external loading of the node in the cluster takes amount of computational resources and updates;
Resource management server, for monitor node computing resource information, cluster is obtained according to the node computing resource information Current idle computing resource, receives the job request for carrying resource allocation information, and the resource allocation information includes node label, Corresponding calculate node queue is obtained according to the node label, according to the resource allocation information in the calculate node queue In target computing nodes current idle computing resource in be operation distribution target computational resource;
The target computing nodes are used to start user model Linux UML resource isolation containers, and operation is performed in user's space.
24. clusters according to claim 23, it is characterised in that the cluster includes key service node, the key Whether service node is used to judge current time in preset time period, if it is, setting the current of the key service node Idle computing resources are 0, if current time is not in preset time period, the current idle of the key service node calculates money The node computing resource that source obtains for actual monitored.
25. clusters according to claim 23, it is characterised in that if the resource management server is additionally operable to the first meter The current idle computing resource of operator node is less than resource predetermined threshold value, then first calculate node is temporarily disengaged from into the collection Group.
26. clusters according to claim 23, it is characterised in that the target computing nodes are additionally operable to be held in user's space During row operation, by use user model network slirp network simulators with domestic consumer identity start virtual network device with Physical network card is bridged, and realizes network service.
27. clusters according to claim 23, it is characterised in that the cluster also includes:
Dedicated node, the node label of the dedicated node is static labels, keeps cluster state, for initiating task management Device.
CN201610570094.5A 2016-07-18 2016-07-18 Cluster resource processing method and system and resource processing cluster Active CN106708622B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610570094.5A CN106708622B (en) 2016-07-18 2016-07-18 Cluster resource processing method and system and resource processing cluster

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610570094.5A CN106708622B (en) 2016-07-18 2016-07-18 Cluster resource processing method and system and resource processing cluster

Publications (2)

Publication Number Publication Date
CN106708622A true CN106708622A (en) 2017-05-24
CN106708622B CN106708622B (en) 2020-06-02

Family

ID=58939713

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610570094.5A Active CN106708622B (en) 2016-07-18 2016-07-18 Cluster resource processing method and system and resource processing cluster

Country Status (1)

Country Link
CN (1) CN106708622B (en)

Cited By (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107273196A (en) * 2017-05-31 2017-10-20 中国科学院北京基因组研究所 Bioinformatics high-performance calculation job scheduling and system administration external member
CN108279982A (en) * 2018-02-27 2018-07-13 郑州云海信息技术有限公司 Pbs resources and hadoop method for managing resource, system and equipment
CN108509268A (en) * 2018-02-24 2018-09-07 宁波诺信睿聚投资有限责任公司 Cluster resource distribution method, device, equipment and computer readable storage medium
CN109086134A (en) * 2018-07-19 2018-12-25 郑州云海信息技术有限公司 A kind of operation method and device of deep learning operation
CN109104334A (en) * 2018-08-23 2018-12-28 郑州云海信息技术有限公司 The management method and device of monitoring system interior joint
WO2019041206A1 (en) * 2017-08-31 2019-03-07 Entit Software Llc Managing containers using attribute/value pairs
CN109471727A (en) * 2018-10-29 2019-03-15 北京金山云网络技术有限公司 A kind of task processing method, apparatus and system
CN109783237A (en) * 2019-01-16 2019-05-21 腾讯科技(深圳)有限公司 A kind of resource allocation method and device
CN110120940A (en) * 2019-04-12 2019-08-13 华中科技大学 A kind of file system resource partition method towards Docker container
CN110278257A (en) * 2019-06-13 2019-09-24 中信银行股份有限公司 A kind of method of mobilism configuration distributed type assemblies node label
CN110287000A (en) * 2019-05-29 2019-09-27 北京达佳互联信息技术有限公司 Data processing method, device, electronic equipment and storage medium
CN110389836A (en) * 2019-07-17 2019-10-29 腾讯科技(深圳)有限公司 A kind of more cluster management methods, device, server and storage medium
CN110413382A (en) * 2019-08-06 2019-11-05 山东超越数控电子股份有限公司 A kind of method, equipment and the readable medium of the resource dynamic adjustment of Docker container
CN110659127A (en) * 2018-06-29 2020-01-07 杭州海康威视数字技术股份有限公司 Method, device and system for processing task
CN110704177A (en) * 2019-09-04 2020-01-17 金蝶软件(中国)有限公司 Computing task processing method and device, computer equipment and storage medium
CN110719306A (en) * 2018-07-11 2020-01-21 阿里巴巴集团控股有限公司 Network request limiting method, computer equipment and storage medium
CN110928666A (en) * 2019-12-09 2020-03-27 湖南大学 Method and system for optimizing task parallelism based on memory in Spark environment
CN111078404A (en) * 2019-12-09 2020-04-28 腾讯科技(深圳)有限公司 Computing resource determination method and device, electronic equipment and medium
WO2020119117A1 (en) * 2018-12-14 2020-06-18 平安医疗健康管理股份有限公司 Distributed computing method, apparatus and system, device and readable storage medium
CN111340613A (en) * 2020-02-26 2020-06-26 中国邮政储蓄银行股份有限公司 Job processing method, system and storage medium
CN111767199A (en) * 2020-06-24 2020-10-13 中国工商银行股份有限公司 Resource management method, device, equipment and system based on batch processing operation
CN111813545A (en) * 2020-06-29 2020-10-23 北京字节跳动网络技术有限公司 Resource allocation method, device, medium and equipment
CN112148467A (en) * 2019-06-28 2020-12-29 微软技术许可有限责任公司 Dynamic allocation of computing resources
CN112445602A (en) * 2019-08-27 2021-03-05 阿里巴巴集团控股有限公司 Resource scheduling method, device and system and electronic equipment
CN112463290A (en) * 2020-11-10 2021-03-09 中国建设银行股份有限公司 Method, system, apparatus and storage medium for dynamically adjusting the number of computing containers
CN112527454A (en) * 2020-12-04 2021-03-19 上海连尚网络科技有限公司 Container group scheduling method and device, electronic equipment and computer readable medium
CN112597502A (en) * 2020-12-17 2021-04-02 山东乾云启创信息科技股份有限公司 Large-scale computing service configuration method and system based on trusted cloud
CN112835996A (en) * 2019-11-22 2021-05-25 北京初速度科技有限公司 Map production system and method thereof
CN113296929A (en) * 2020-06-29 2021-08-24 阿里巴巴集团控股有限公司 Resource matching method, device and system based on cloud computing
CN113703952A (en) * 2020-05-20 2021-11-26 山东省计算中心(国家超级计算济南中心) Resource allocation method for queue resource scheduling based on super computer
CN114168354A (en) * 2022-02-11 2022-03-11 北京易源兴华软件有限公司 Data-driven data cluster parallel computing allocation method and device
CN114640677A (en) * 2022-05-17 2022-06-17 云宏信息科技股份有限公司 Method for adding working nodes to cluster working nodes and APIServer by client configuration
CN114826964A (en) * 2022-04-11 2022-07-29 京东科技信息技术有限公司 Resource monitoring method, device and system
US11609837B2 (en) 2021-06-02 2023-03-21 Kyndryl, Inc. Calibration technique using computer analysis for ascertaining performance of containers
CN115951988A (en) * 2023-03-03 2023-04-11 北京并行科技股份有限公司 Job scheduling method, computing device and storage medium
CN117009060A (en) * 2023-09-27 2023-11-07 腾讯科技(深圳)有限公司 Resource scheduling method, device, equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101420458A (en) * 2008-12-09 2009-04-29 清华大学 Multimedia content monitoring system, method and device based on content distributing network
US20150095919A1 (en) * 2010-10-27 2015-04-02 Amazon Technologies, Inc. Methods and system for swapping memory in a virtual machine environment
CN104572306A (en) * 2015-01-28 2015-04-29 中国石油集团川庆钻探工程有限公司地球物理勘探公司 Method for managing resources of computer cluster and resource manager
CN105323282A (en) * 2014-07-28 2016-02-10 神州数码信息系统有限公司 Enterprise application deployment and management system for multiple tenants
CN105487930A (en) * 2015-12-01 2016-04-13 中国电子科技集团公司第二十八研究所 Task optimization scheduling method based on Hadoop
CN105512083A (en) * 2015-11-30 2016-04-20 华为技术有限公司 YARN based resource management method, device and system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101420458A (en) * 2008-12-09 2009-04-29 清华大学 Multimedia content monitoring system, method and device based on content distributing network
US20150095919A1 (en) * 2010-10-27 2015-04-02 Amazon Technologies, Inc. Methods and system for swapping memory in a virtual machine environment
CN105323282A (en) * 2014-07-28 2016-02-10 神州数码信息系统有限公司 Enterprise application deployment and management system for multiple tenants
CN104572306A (en) * 2015-01-28 2015-04-29 中国石油集团川庆钻探工程有限公司地球物理勘探公司 Method for managing resources of computer cluster and resource manager
CN105512083A (en) * 2015-11-30 2016-04-20 华为技术有限公司 YARN based resource management method, device and system
CN105487930A (en) * 2015-12-01 2016-04-13 中国电子科技集团公司第二十八研究所 Task optimization scheduling method based on Hadoop

Cited By (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107273196A (en) * 2017-05-31 2017-10-20 中国科学院北京基因组研究所 Bioinformatics high-performance calculation job scheduling and system administration external member
WO2019041206A1 (en) * 2017-08-31 2019-03-07 Entit Software Llc Managing containers using attribute/value pairs
US11816496B2 (en) 2017-08-31 2023-11-14 Micro Focus Llc Managing containers using attribute/value pairs
CN108509268A (en) * 2018-02-24 2018-09-07 宁波诺信睿聚投资有限责任公司 Cluster resource distribution method, device, equipment and computer readable storage medium
CN108279982A (en) * 2018-02-27 2018-07-13 郑州云海信息技术有限公司 Pbs resources and hadoop method for managing resource, system and equipment
CN110659127A (en) * 2018-06-29 2020-01-07 杭州海康威视数字技术股份有限公司 Method, device and system for processing task
CN110719306A (en) * 2018-07-11 2020-01-21 阿里巴巴集团控股有限公司 Network request limiting method, computer equipment and storage medium
CN110719306B (en) * 2018-07-11 2022-07-05 阿里巴巴集团控股有限公司 Network request limiting method, computer equipment and storage medium
CN109086134A (en) * 2018-07-19 2018-12-25 郑州云海信息技术有限公司 A kind of operation method and device of deep learning operation
CN109104334A (en) * 2018-08-23 2018-12-28 郑州云海信息技术有限公司 The management method and device of monitoring system interior joint
CN109471727A (en) * 2018-10-29 2019-03-15 北京金山云网络技术有限公司 A kind of task processing method, apparatus and system
WO2020119117A1 (en) * 2018-12-14 2020-06-18 平安医疗健康管理股份有限公司 Distributed computing method, apparatus and system, device and readable storage medium
CN109783237B (en) * 2019-01-16 2023-03-14 腾讯科技(深圳)有限公司 Resource allocation method and device
CN109783237A (en) * 2019-01-16 2019-05-21 腾讯科技(深圳)有限公司 A kind of resource allocation method and device
US11586468B2 (en) 2019-04-12 2023-02-21 Huazhong University Of Science And Technology Docker-container-oriented method for isolation of file system resources
CN110120940A (en) * 2019-04-12 2019-08-13 华中科技大学 A kind of file system resource partition method towards Docker container
CN110287000A (en) * 2019-05-29 2019-09-27 北京达佳互联信息技术有限公司 Data processing method, device, electronic equipment and storage medium
CN110287000B (en) * 2019-05-29 2021-08-17 北京达佳互联信息技术有限公司 Data processing method and device, electronic equipment and storage medium
CN110278257A (en) * 2019-06-13 2019-09-24 中信银行股份有限公司 A kind of method of mobilism configuration distributed type assemblies node label
CN112148467A (en) * 2019-06-28 2020-12-29 微软技术许可有限责任公司 Dynamic allocation of computing resources
CN110389836A (en) * 2019-07-17 2019-10-29 腾讯科技(深圳)有限公司 A kind of more cluster management methods, device, server and storage medium
CN110413382A (en) * 2019-08-06 2019-11-05 山东超越数控电子股份有限公司 A kind of method, equipment and the readable medium of the resource dynamic adjustment of Docker container
CN112445602A (en) * 2019-08-27 2021-03-05 阿里巴巴集团控股有限公司 Resource scheduling method, device and system and electronic equipment
CN110704177A (en) * 2019-09-04 2020-01-17 金蝶软件(中国)有限公司 Computing task processing method and device, computer equipment and storage medium
CN110704177B (en) * 2019-09-04 2022-06-10 金蝶软件(中国)有限公司 Computing task processing method and device, computer equipment and storage medium
CN112835996A (en) * 2019-11-22 2021-05-25 北京初速度科技有限公司 Map production system and method thereof
CN110928666A (en) * 2019-12-09 2020-03-27 湖南大学 Method and system for optimizing task parallelism based on memory in Spark environment
CN110928666B (en) * 2019-12-09 2022-03-22 湖南大学 Method and system for optimizing task parallelism based on memory in Spark environment
CN111078404A (en) * 2019-12-09 2020-04-28 腾讯科技(深圳)有限公司 Computing resource determination method and device, electronic equipment and medium
CN111340613A (en) * 2020-02-26 2020-06-26 中国邮政储蓄银行股份有限公司 Job processing method, system and storage medium
CN111340613B (en) * 2020-02-26 2023-10-03 中国邮政储蓄银行股份有限公司 Job processing method, job processing system and storage medium
CN113703952B (en) * 2020-05-20 2023-10-10 山东省计算中心(国家超级计算济南中心) Resource allocation method for queue resource scheduling based on supercomputer
CN113703952A (en) * 2020-05-20 2021-11-26 山东省计算中心(国家超级计算济南中心) Resource allocation method for queue resource scheduling based on super computer
CN111767199B (en) * 2020-06-24 2023-09-19 中国工商银行股份有限公司 Resource management method, device, equipment and system based on batch job
CN111767199A (en) * 2020-06-24 2020-10-13 中国工商银行股份有限公司 Resource management method, device, equipment and system based on batch processing operation
CN113296929A (en) * 2020-06-29 2021-08-24 阿里巴巴集团控股有限公司 Resource matching method, device and system based on cloud computing
CN111813545A (en) * 2020-06-29 2020-10-23 北京字节跳动网络技术有限公司 Resource allocation method, device, medium and equipment
CN112463290A (en) * 2020-11-10 2021-03-09 中国建设银行股份有限公司 Method, system, apparatus and storage medium for dynamically adjusting the number of computing containers
CN112527454A (en) * 2020-12-04 2021-03-19 上海连尚网络科技有限公司 Container group scheduling method and device, electronic equipment and computer readable medium
CN112597502B (en) * 2020-12-17 2023-02-10 山东乾云启创信息科技股份有限公司 Large-scale computing service configuration method and system based on trusted cloud
CN112597502A (en) * 2020-12-17 2021-04-02 山东乾云启创信息科技股份有限公司 Large-scale computing service configuration method and system based on trusted cloud
US11609837B2 (en) 2021-06-02 2023-03-21 Kyndryl, Inc. Calibration technique using computer analysis for ascertaining performance of containers
CN114168354B (en) * 2022-02-11 2022-05-03 北京易源兴华软件有限公司 Data-driven data cluster parallel computing allocation method and device
CN114168354A (en) * 2022-02-11 2022-03-11 北京易源兴华软件有限公司 Data-driven data cluster parallel computing allocation method and device
CN114826964A (en) * 2022-04-11 2022-07-29 京东科技信息技术有限公司 Resource monitoring method, device and system
CN114826964B (en) * 2022-04-11 2024-04-05 京东科技信息技术有限公司 Resource monitoring method, device and system
CN114640677A (en) * 2022-05-17 2022-06-17 云宏信息科技股份有限公司 Method for adding working nodes to cluster working nodes and APIServer by client configuration
CN115951988A (en) * 2023-03-03 2023-04-11 北京并行科技股份有限公司 Job scheduling method, computing device and storage medium
CN117009060A (en) * 2023-09-27 2023-11-07 腾讯科技(深圳)有限公司 Resource scheduling method, device, equipment and storage medium
CN117009060B (en) * 2023-09-27 2024-01-12 腾讯科技(深圳)有限公司 Resource scheduling method, device, equipment and storage medium

Also Published As

Publication number Publication date
CN106708622B (en) 2020-06-02

Similar Documents

Publication Publication Date Title
CN106708622A (en) Cluster resource processing method and system, and resource processing cluster
CN110290189B (en) Container cluster management method, device and system
CN108833197B (en) Active detection method and detection platform based on cloud
CN108399101A (en) The methods, devices and systems of scheduling of resource
CN109313564A (en) For supporting the server computer management system of the highly usable virtual desktop of multiple and different tenants
CN107968802A (en) The method, apparatus and filtering type scheduler of a kind of scheduling of resource
CN103399781B (en) Cloud Server and virtual machine management method thereof
CN110209492A (en) A kind of data processing method and device
CN105516267B (en) Cloud platform efficient operation method
CN107967175A (en) A kind of resource scheduling system and method based on multiple-objection optimization
CN112202879B (en) Middleware management method and device, electronic equipment and storage medium
CN115185697A (en) Cluster resource scheduling method, system, equipment and storage medium based on kubernets
CN112631680A (en) Micro-service container scheduling system, method, device and computer equipment
US8042158B2 (en) Management of user authorizations
CN109165135A (en) A kind of data managing method, computer readable storage medium and terminal device
CN112685157B (en) Task processing method, device, computer equipment and storage medium
CN112003931B (en) Method and system for deploying scheduling controller and related components
CN111866190B (en) Multi-tenant management method, device and system based on project hierarchical management
CN110225088A (en) A kind of cloud desktop management method and system
CN109818785A (en) A kind of data processing method, server cluster and storage medium
CN113760441A (en) Container creation method and device, electronic equipment and storage medium
US9942083B1 (en) Capacity pool management
CN110750350B (en) Large resource scheduling method, system, device and readable storage medium
CN102868594B (en) Method and device for message processing
CN115310835A (en) Dispatching method and device, computer equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant