CN106649513B - 基于谱聚类的音频数据聚类方法 - Google Patents
基于谱聚类的音频数据聚类方法 Download PDFInfo
- Publication number
- CN106649513B CN106649513B CN201610899028.2A CN201610899028A CN106649513B CN 106649513 B CN106649513 B CN 106649513B CN 201610899028 A CN201610899028 A CN 201610899028A CN 106649513 B CN106649513 B CN 106649513B
- Authority
- CN
- China
- Prior art keywords
- audio
- audio data
- clustering
- frequency
- calculating
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 34
- 230000003595 spectral effect Effects 0.000 title claims abstract description 23
- 239000011159 matrix material Substances 0.000 claims abstract description 20
- 239000013598 vector Substances 0.000 claims abstract description 10
- 238000009432 framing Methods 0.000 claims abstract description 6
- 238000012545 processing Methods 0.000 claims abstract description 5
- 238000004422 calculation algorithm Methods 0.000 claims description 5
- 238000001228 spectrum Methods 0.000 claims description 3
- 239000011435 rock Substances 0.000 description 6
- 230000008859 change Effects 0.000 description 4
- 238000004364 calculation method Methods 0.000 description 3
- 238000007781 pre-processing Methods 0.000 description 3
- 238000005311 autocorrelation function Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000007621 cluster analysis Methods 0.000 description 1
- 238000005314 correlation function Methods 0.000 description 1
- 238000007418 data mining Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000008451 emotion Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 230000005236 sound signal Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/683—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2323—Non-hierarchical techniques based on graph theory, e.g. minimum spanning trees [MST] or graph cuts
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Computational Biology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Library & Information Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Databases & Information Systems (AREA)
- Multimedia (AREA)
- Discrete Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Auxiliary Devices For Music (AREA)
Abstract
Description
Claims (3)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610899028.2A CN106649513B (zh) | 2016-10-14 | 2016-10-14 | 基于谱聚类的音频数据聚类方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610899028.2A CN106649513B (zh) | 2016-10-14 | 2016-10-14 | 基于谱聚类的音频数据聚类方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106649513A CN106649513A (zh) | 2017-05-10 |
CN106649513B true CN106649513B (zh) | 2020-03-31 |
Family
ID=58856490
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610899028.2A Active CN106649513B (zh) | 2016-10-14 | 2016-10-14 | 基于谱聚类的音频数据聚类方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106649513B (zh) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108537254A (zh) * | 2018-03-23 | 2018-09-14 | 浙江工业大学 | 一种基于绘画时间的笔划线条全局聚类方法 |
CN111243618B (zh) * | 2018-11-28 | 2024-03-19 | 阿里巴巴集团控股有限公司 | 用于确定音频中的特定人声片段的方法、装置和电子设备 |
CN109788308B (zh) * | 2019-02-01 | 2022-07-15 | 腾讯音乐娱乐科技(深圳)有限公司 | 音视频处理方法、装置、电子设备及存储介质 |
CN111613244A (zh) * | 2020-05-20 | 2020-09-01 | 北京搜狗科技发展有限公司 | 一种扫描跟读处理的方法及相关装置 |
CN112015925B (zh) * | 2020-08-27 | 2021-04-23 | 上海松鼠课堂人工智能科技有限公司 | 多媒体文件合并生成教学素材包的方法和系统 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102543063A (zh) * | 2011-12-07 | 2012-07-04 | 华南理工大学 | 基于说话人分割与聚类的多说话人语速估计方法 |
US9124981B2 (en) * | 2012-11-14 | 2015-09-01 | Qualcomm Incorporated | Systems and methods for classification of audio environments |
CN105959270A (zh) * | 2016-04-25 | 2016-09-21 | 盐城工学院 | 一种基于谱聚类算法的网络攻击检测方法 |
-
2016
- 2016-10-14 CN CN201610899028.2A patent/CN106649513B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102543063A (zh) * | 2011-12-07 | 2012-07-04 | 华南理工大学 | 基于说话人分割与聚类的多说话人语速估计方法 |
US9124981B2 (en) * | 2012-11-14 | 2015-09-01 | Qualcomm Incorporated | Systems and methods for classification of audio environments |
CN105959270A (zh) * | 2016-04-25 | 2016-09-21 | 盐城工学院 | 一种基于谱聚类算法的网络攻击检测方法 |
Also Published As
Publication number | Publication date |
---|---|
CN106649513A (zh) | 2017-05-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106649513B (zh) | 基于谱聚类的音频数据聚类方法 | |
US10664539B2 (en) | Text mining-based attribute analysis method for internet media users | |
Roma et al. | Recurrence quantification analysis features for environmental sound recognition | |
CN103971689B (zh) | 一种音频识别方法及装置 | |
CN111400543B (zh) | 音频片段的匹配方法、装置、设备及存储介质 | |
CN107767869A (zh) | 用于提供语音服务的方法和装置 | |
CN105355214A (zh) | 测量相似度的方法和设备 | |
WO2019233361A1 (zh) | 对音乐进行音量调节的方法及设备 | |
CN112750442B (zh) | 一种具有小波变换的朱鹮种群生态体系监测系统及其方法 | |
CN109408660A (zh) | 一种基于音频特征的音乐自动分类的方法 | |
CN108615532A (zh) | 一种应用于声场景的分类方法及装置 | |
Seyerlehner et al. | Frame level audio similarity-a codebook approach | |
Neammalai et al. | Speech and music classification using hybrid form of spectrogram and fourier transformation | |
CN108538312A (zh) | 基于贝叶斯信息准则的数字音频篡改点自动定位的方法 | |
TW202217597A (zh) | 圖像的增量聚類方法、電子設備、電腦儲存介質 | |
Bhatia et al. | Music genre classification | |
CN111859011B (zh) | 音频处理方法、装置、存储介质及电子设备 | |
Ghosal et al. | Song/instrumental classification using spectrogram based contextual features | |
Jun et al. | Music structure analysis using self-similarity matrix and two-stage categorization | |
Shen et al. | Towards efficient automated singer identification in large music databases | |
Genussov et al. | Musical genre classification of audio signals using geometric methods | |
Siddiquee et al. | An Effective Machine Learning Approach for Music Genre Classification with Mel Spectrograms and KNN | |
George et al. | Unsupervised analysis of similarities between musicians and musical genres using spectrograms. | |
KR20200118587A (ko) | 음악의 내재적 정보를 이용한 음악 추천 시스템 | |
CN115329125A (zh) | 一种歌曲串烧拼接方法和装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20231129 Address after: 201100 room 1001, 1st floor, building B, 555 Dongchuan Road, Minhang District, Shanghai Patentee after: Shanghai Enterprise Information Technology Co.,Ltd. Address before: 200120 building C, No. 888, Huanhu West 2nd Road, Lingang New Area, China (Shanghai) pilot Free Trade Zone, Pudong New Area, Shanghai Patentee before: Shanghai Xuncha Technology Co.,Ltd. Effective date of registration: 20231129 Address after: 200120 building C, No. 888, Huanhu West 2nd Road, Lingang New Area, China (Shanghai) pilot Free Trade Zone, Pudong New Area, Shanghai Patentee after: Shanghai Xuncha Technology Co.,Ltd. Address before: No. 1166 Century Avenue, Yancheng City, Jiangsu Province, 224051 Patentee before: YANCHENG INSTITUTE OF TECHNOLOGY |
|
TR01 | Transfer of patent right |