Comparative Study of Dimensionality Reduction Techniques for Spectral–Temporal Data
Abstract
:1. Introduction
2. Music Identification and MPEG-7 Audio Signature Descriptors
2.1. Music Identification
2.2. Overview of MPEG-7 Audio Signature Descriptors
- Perform time to frequency conversion. This step uses 4096-point FFT (fast Fourier Transformation) to obtain the spectral components of every segment of windowed audio samples. The chosen window is a Hamming window. The default window duration is 90 ms with 30 ms hopping step.
- Compute the flatness measure for each spectral band. A spectral band has a bandwidth of (1/4) octave, starting from 250 Hz and ending at 16 kHz. The flatness measure is the ratio of the geometric average to the arithmetic average of spectral components in a band.
- Computing the statistics of the flatness measure. Once the flatness measure is obtained, the final step is to compute the average and standard deviation (SD) of the measures over a period of time for each spectral band. The obtained average and SD values are called audio signature descriptors. In this paper, we only use the average values and discard the SD values in the experiments.
3. Dimensionality Reduction Approaches to Be Compared
3.1. Overview of 2-D PCA Computation
- (a)
- Remove means from the dataset. Let be the vector of average for all . We compute:
- (b)
- Construct matrix as:
- (c)
- Decompose matrix as:
- (d)
- Use only in (6) to perform reduction and denote the resultant matrix as . Then, compute dimension-reduced matrix , size of , from full-dimensional features by:
- (e)
- After this step, the reduction in row direction of is complete. The next step is to perform a reduction in the horizontal direction. To do so, we treat as in (5) and repeat the computational steps one more time to complete the 2-D reduction.
3.2. ICA Reduction Technique
3.2.1. Introduction to ICA
- (a)
- Centering: Centering is to remove the average from the received samples. This step is equal to (3) in PCA.
- (b)
- Whitening: This step is used to remove the correlation between components through the covariance matrix. Its computational steps are similar to that of the PCA except a scaling. Specifically, we use:
- (c)
- Maximizing non-Gaussianity: Since we assume that the components of interests are not Gaussian distributed, it is natural to maximize non-Gaussian components. There are two widely used methods to measure the level of non-Gaussian, namely, kurtosis and negentropy. Please note that maximizing non-Gaussianity is not the only approach to perform ICA. Another widely used criterion is minimizing mutual information [7].
3.2.2. Proposed ICA Reduction
- Perform the centering and whitening steps. Simply following (8), we have:
- Keep only eight components in to reduce the dimensionality. As before, the eigenvalues in (8) are arranged from large to small. Thus, we have a 29 × 8 matrix:
- The next step is supposed to compute the whitening version of . However, we are unable to use in the term because is not a square matrix. To overcome this problem, we follow (24) in the proposed FA approach (to be discussed later), which in the present form is:
- Perform the maximizing non-Gaussianity step. In this step, eight vectors are obtained to form a transformation matrix (size of ). Then, compute:
- Treat as and repeating the whitening procedure again. However, this time we keep only one vector in (18) to compute the final low-dimensional features.
3.3. Overview of 2-D FA Computation
3.3.1. Introduction to Factor Analysis
3.3.2. Proposed FA Reduction
- (a)
- Follow the ICA computation in (17) to obtain matrix, and then choose components to construct matrix.
- (b)
- Construct the loading matrix by:
- (c)
- Compute the factor scores for one vector by:
- (d)
- To further reduce the dimensionality of the features in another direction, we perform the same steps a second time by treating as .
4. Experiments and Results
4.1. Experimental Settings
4.2. Comparison between Kurtosis and Negentropy Criteria
4.3. Comparison between Various Approaches
4.4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Grauman, K.; Leibe, B. Visual Object Recognition; Morgan & Claypool Publishers: San Rafael, CA, USA, 2010. [Google Scholar]
- You, S.D.; Chen, W.-H.; Chen, W.-K. Music identification system using MPEG-7 audio signature descriptors. Sci. World J. 2013. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- You, S.D.; Chen, W.-H. Comparative study of methods for reducing dimensionality of MPEG-7 audio signature descriptors. Multimed. Tools Appl. 2015, 74, 3579–3598. [Google Scholar] [CrossRef]
- Gold, B.; Morgan, N. Speech and Audio Signal Processing; John Wiley & Sons: Hoboken, NJ, USA, 2000. [Google Scholar]
- International Organization for Standardization (ISO); International Electrotechnical Commission (IEC). Information Technology—Multimedia Content Description Interface—Part 4: Audio; ISO/IEC 15938-4; ISO/IEC: Genève, Switzerland, 2002. [Google Scholar]
- Hyvärinen, A.; Oja, E. Independent component analysis: Algorithms and applications. Neur. Netw. 2000, 13, 411–430. [Google Scholar] [CrossRef] [Green Version]
- Hyvärinen, A. Survey on independent component analysis. Neur. Comput. Surv. 1999, 2, 94–128. [Google Scholar]
- Mognon, A.; Jovicich, J.; Bruzzone, L.; Buiatti, M. ADJUST: An Automatic EEG artifact Detector based on the Joint Use of Spatial and Temporal features. Psychophysiology 2011, 48, 229–240. [Google Scholar] [CrossRef] [PubMed]
- You, S.D.; Li, Y.-C. Predicting Viewer’s Preference for Music Videos Using EEG Dataset. In Proceedings of the 2020 IEEE International Conference on Consumer Electronics—Asia (ICCE-Asia), Seoul, Korea, 1–3 November 2020; pp. 1–2. [Google Scholar]
- Johnson, R.J.; Wichern, D.W. Applied Multivariate Statistical Analysis, 6th ed.; Pearson Prentice hall: Upper Saddle River, NJ, USA, 2007; p. 481. [Google Scholar]
- Wu, N.; Zhang, J. Factor analysis based anomaly detection. In Proceedings of the IEEE Systems, Man and Cybernetics Society Information Assurance Workshop, West Point, NY, USA, 18–20 June 2003; pp. 108–115. [Google Scholar] [CrossRef]
- Wu, N.; Zhang, J. Factor-analysis based anomaly detection and clustering. Decis. Support Syst. 2006, 42, 375–389. [Google Scholar] [CrossRef]
- Fodor, I.K. A Survey of Dimension Reduction Techniques; Lawrence Livermore National Laboratory: Livermore, CA, USA, 2002.
- You, S.D.; Hung, M.-J. Reducing Dimensionality of Spectro-Temporal Data by Independent Component Analysis. In Proceedings of the 2020 2nd International Conference on Computer Communication and the Internet (ICCCI), Nagoya, Japan, 26–29 June 2020; pp. 93–97. [Google Scholar]
- You, S.D.; Pu, Y.-H. Using Paired Distances of Signal Peaks in Stereo Channels as Fingerprints for Copy Identification. ACM Trans. Multimed. Comput. Commun. Appl. 2015, 12, 1–22. [Google Scholar] [CrossRef]
- Wang, A. The Shazam music recognition service. Commun. ACM 2006, 49, 44–48. [Google Scholar] [CrossRef]
- Shlens, J. A tutorial on principal component analysis. arXiv 2014, arXiv:1404.1100. [Google Scholar]
- Jöreskog, K.G. A general approach to confirmatory maximum likelihood factor analysis. Psychometrika 1969, 34, 183–202. [Google Scholar] [CrossRef]
- Cheng, Y.; Sharma, S.; Sharma, P.; Kulathunga, K. Sharma Role of Personalization in Continuous Use Intention of Mobile News Apps in India: Extending the UTAUT2 Model. Information 2020, 11, 33. [Google Scholar] [CrossRef] [Green Version]
- Yong, A.G.; Pearce, S. A Beginner’s Guide to Factor Analysis: Focusing on Exploratory Factor Analysis. Tutor. Quant. Methods Psychol. 2013, 9, 79–94. [Google Scholar] [CrossRef]
- International Organization for Standardization (ISO); International Electrotechnical Commission (IEC). Information Technology—Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to About 1.5 Mbit/s—Part 3: Audio; ISO/IEC 11172-3; ISO/IEC: Genève, Switzerland, 1993. [Google Scholar]
Distortion | i-th Match | Kurtosis | Negentropy |
---|---|---|---|
192 k MP-3 | 1 | 94.0 | 74.9 |
15 | 99.9 | 98.0 | |
96 k MP-3 | 1 | 91.6 | 66.4 |
15 | 99.7 | 97.5 |
Distortion | i-th Match | FD | PCA | ICA | FA |
---|---|---|---|---|---|
Original | 1 | 100.0 | 99.3 | 94.3 | 95.5 |
15 | 100.0 | 99.9 | 99.9 | ||
192 k MP-3 | 1 | 100.0 | 99.1 | 94.0 | 95.5 |
15 | 100.0 | 99.9 | 99.9 | ||
96 k MP-3 | 1 | 100.0 | 90.1 | 91.6 | 93.5 |
15 | 99.6 | 99.7 | 99.9 | ||
Wire Rec | 1 | 97.5 | 75.2 | 75.6 | 75.7 |
15 | 89.9 | 91.3 | 91.2 | ||
Air Rec | 1 | 93.6 | 26.9 | 44.3 | 45.2 |
15 | 63.6 | 77.5 | 76.9 | ||
−20 dB AWGN | 1 | 95.3 | 19.3 | 39.5 | 37.3 |
15 | 44.0 | 69.9 | 67.6 | ||
−10 dB AWGN | 1 | 45.7 | 1.87 | 6.67 | 5.5 |
15 | 8.67 | 23.6 | 17.6 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
You, S.D.; Hung, M.-J. Comparative Study of Dimensionality Reduction Techniques for Spectral–Temporal Data. Information 2021, 12, 1. https://doi.org/10.3390/info12010001
You SD, Hung M-J. Comparative Study of Dimensionality Reduction Techniques for Spectral–Temporal Data. Information. 2021; 12(1):1. https://doi.org/10.3390/info12010001
Chicago/Turabian StyleYou, Shingchern D., and Ming-Jen Hung. 2021. "Comparative Study of Dimensionality Reduction Techniques for Spectral–Temporal Data" Information 12, no. 1: 1. https://doi.org/10.3390/info12010001
APA StyleYou, S. D., & Hung, M. -J. (2021). Comparative Study of Dimensionality Reduction Techniques for Spectral–Temporal Data. Information, 12(1), 1. https://doi.org/10.3390/info12010001