CN116595102A - 一种改进聚类算法的大数据管理方法及系统 - Google Patents
一种改进聚类算法的大数据管理方法及系统 Download PDFInfo
- Publication number
- CN116595102A CN116595102A CN202310868599.XA CN202310868599A CN116595102A CN 116595102 A CN116595102 A CN 116595102A CN 202310868599 A CN202310868599 A CN 202310868599A CN 116595102 A CN116595102 A CN 116595102A
- Authority
- CN
- China
- Prior art keywords
- data
- nodes
- node
- load
- result
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 41
- 238000004422 calculation algorithm Methods 0.000 title claims abstract description 40
- 238000013523 data management Methods 0.000 title claims abstract description 25
- 238000011156 evaluation Methods 0.000 claims abstract description 38
- 238000012545 processing Methods 0.000 claims abstract description 24
- 238000004364 calculation method Methods 0.000 claims abstract description 16
- 238000007621 cluster analysis Methods 0.000 claims description 16
- 238000012544 monitoring process Methods 0.000 claims description 14
- 238000004458 analytical method Methods 0.000 claims description 12
- 239000011159 matrix material Substances 0.000 claims description 12
- 238000006243 chemical reaction Methods 0.000 claims description 10
- 238000007781 pre-processing Methods 0.000 claims description 10
- 238000012216 screening Methods 0.000 claims description 9
- 238000004140 cleaning Methods 0.000 claims description 7
- 230000006870 function Effects 0.000 claims description 5
- 238000013501 data transformation Methods 0.000 claims description 4
- 238000010606 normalization Methods 0.000 claims description 4
- 238000000605 extraction Methods 0.000 claims description 3
- 239000013598 vector Substances 0.000 claims description 3
- 230000008569 process Effects 0.000 abstract description 5
- 230000000007 visual effect Effects 0.000 abstract description 2
- 230000006872 improvement Effects 0.000 description 7
- 238000010586 diagram Methods 0.000 description 4
- 238000000513 principal component analysis Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 238000007619 statistical method Methods 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 230000003750 conditioning effect Effects 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000005192 partition Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/27—Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
- G06F16/258—Data format conversion from or to a database
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/26—Visual data mining; Browsing structured data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Computing Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310868599.XA CN116595102B (zh) | 2023-07-17 | 2023-07-17 | 一种改进聚类算法的大数据管理方法及系统 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310868599.XA CN116595102B (zh) | 2023-07-17 | 2023-07-17 | 一种改进聚类算法的大数据管理方法及系统 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN116595102A true CN116595102A (zh) | 2023-08-15 |
CN116595102B CN116595102B (zh) | 2023-10-17 |
Family
ID=87608480
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310868599.XA Active CN116595102B (zh) | 2023-07-17 | 2023-07-17 | 一种改进聚类算法的大数据管理方法及系统 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116595102B (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118260624A (zh) * | 2024-05-29 | 2024-06-28 | 山东优数网络科技有限公司 | 一种面向物联网的感知数据智能汇聚分析方法及系统 |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103838863A (zh) * | 2014-03-14 | 2014-06-04 | 内蒙古科技大学 | 一种基于云计算平台的大数据聚类算法 |
CN107291847A (zh) * | 2017-06-02 | 2017-10-24 | 东北大学 | 一种基于MapReduce的大规模数据分布式聚类处理方法 |
US20180260246A1 (en) * | 2017-03-07 | 2018-09-13 | International Business Machines Corporation | Runtime piggybacking of concurrent jobs in task-parallel machine learning programs |
CN109359679A (zh) * | 2018-10-10 | 2019-02-19 | 洪月华 | 适用于广域网的分布式交通大数据并行聚类方法 |
CN109445936A (zh) * | 2018-10-12 | 2019-03-08 | 深圳先进技术研究院 | 一种云计算负载聚类方法、系统及电子设备 |
CN109657712A (zh) * | 2018-12-11 | 2019-04-19 | 浙江工业大学 | 一种基于Spark改进的K-Means算法的电商餐饮数据分析方法 |
CN109858518A (zh) * | 2018-12-26 | 2019-06-07 | 中译语通科技股份有限公司 | 一种基于MapReduce的大型数据集聚类方法 |
CN110704542A (zh) * | 2019-10-15 | 2020-01-17 | 南京莱斯网信技术研究院有限公司 | 一种基于节点负载的数据动态分区系统 |
CN114077912A (zh) * | 2020-08-14 | 2022-02-22 | 华为技术有限公司 | 数据预测方法以及数据预测装置 |
-
2023
- 2023-07-17 CN CN202310868599.XA patent/CN116595102B/zh active Active
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103838863A (zh) * | 2014-03-14 | 2014-06-04 | 内蒙古科技大学 | 一种基于云计算平台的大数据聚类算法 |
US20180260246A1 (en) * | 2017-03-07 | 2018-09-13 | International Business Machines Corporation | Runtime piggybacking of concurrent jobs in task-parallel machine learning programs |
CN107291847A (zh) * | 2017-06-02 | 2017-10-24 | 东北大学 | 一种基于MapReduce的大规模数据分布式聚类处理方法 |
CN109359679A (zh) * | 2018-10-10 | 2019-02-19 | 洪月华 | 适用于广域网的分布式交通大数据并行聚类方法 |
CN109445936A (zh) * | 2018-10-12 | 2019-03-08 | 深圳先进技术研究院 | 一种云计算负载聚类方法、系统及电子设备 |
CN109657712A (zh) * | 2018-12-11 | 2019-04-19 | 浙江工业大学 | 一种基于Spark改进的K-Means算法的电商餐饮数据分析方法 |
CN109858518A (zh) * | 2018-12-26 | 2019-06-07 | 中译语通科技股份有限公司 | 一种基于MapReduce的大型数据集聚类方法 |
CN110704542A (zh) * | 2019-10-15 | 2020-01-17 | 南京莱斯网信技术研究院有限公司 | 一种基于节点负载的数据动态分区系统 |
CN114077912A (zh) * | 2020-08-14 | 2022-02-22 | 华为技术有限公司 | 数据预测方法以及数据预测装置 |
Non-Patent Citations (2)
Title |
---|
UTKARSHA BAGDE,等: "An Analytic Survey on MapReduce based K-Means and its Hybrid Clustering Algorithms", 《2018 SECOND INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC)》, pages 32 - 36 * |
刘光宗: "基于MapReduce数据倾斜的负载均衡算法研究", 《中国知网》, pages 1 - 37 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118260624A (zh) * | 2024-05-29 | 2024-06-28 | 山东优数网络科技有限公司 | 一种面向物联网的感知数据智能汇聚分析方法及系统 |
Also Published As
Publication number | Publication date |
---|---|
CN116595102B (zh) | 2023-10-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110389950B (zh) | 一种快速运行的大数据清洗方法 | |
CN109934301B (zh) | 一种电力负荷聚类分析方法、装置和设备 | |
CN111259933B (zh) | 基于分布式并行决策树的高维特征数据分类方法及系统 | |
CN111489201A (zh) | 一种客户价值分析的方法、设备、存储介质 | |
CN111723862B (zh) | 开关柜状态评估方法和装置 | |
CN116595102B (zh) | 一种改进聚类算法的大数据管理方法及系统 | |
CN114637263B (zh) | 一种异常工况实时监测方法、装置、设备及存储介质 | |
CN117112871B (zh) | 基于fcm聚类算法模型的数据实时高效融合处理方法 | |
CN114861788A (zh) | 一种基于dbscan聚类的负荷异常检测方法及系统 | |
CN117113235A (zh) | 一种云计算数据中心能耗优化方法及系统 | |
CN117743870B (zh) | 一种基于大数据的水利数据管理系统 | |
CN111680852A (zh) | 地区整体能耗监控方法及其监控系统 | |
CN108596227B (zh) | 一种用户用电行为主导影响因素挖掘方法 | |
CN116883065A (zh) | 商户风险预测方法及装置 | |
CN116561230B (zh) | 一种基于云计算的分布式存储与检索系统 | |
CN118115098A (zh) | 基于深度学习的大数据分析与处理系统 | |
CN117290405A (zh) | 一种大规模设备数据快速查询的物联网系统 | |
CN112100177A (zh) | 数据存储方法、装置、计算机设备及存储介质 | |
CN111858530A (zh) | 一种基于海量日志的实时关联分析方法及系统 | |
CN117171244A (zh) | 基于数据中台构建的企业数据管理系统及其数据分析方法 | |
CN115358797A (zh) | 基于聚类分析法的综合能源用户用能行为分析方法、系统及存储介质 | |
CN113177613A (zh) | 系统资源数据分配方法及装置 | |
CN112925689A (zh) | 一种多路监控数据传输优化方法 | |
CN113792749A (zh) | 时间序列数据异常检测方法、装置、设备及存储介质 | |
CN112330136A (zh) | 一种大用户异常用电分析数据集的关联性挖掘方法和装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20241023 Address after: 250000, Floor 3, Building 1, Baowei Science and Technology Park, No. 3003 Xinluo Street, Jinan Area, China (Shandong) Pilot Free Trade Zone, Jinan City, Shandong Province, China Patentee after: Jinan Jubang Information Technology Co.,Ltd. Country or region after: China Address before: 814, Building D, Sanqing Century Wealth Center, No. 359 Shunhua Road, Jinan City, Shandong Province, 250013 Patentee before: Fano Information Industry Co.,Ltd. Country or region before: China |