Analysis of symbolic sequences using the Jensen-Shannon divergence

I Grosse, P Bernaola-Galván, P Carpena… - Physical Review E, 2002 - APS
I Grosse, P Bernaola-Galván, P Carpena, R Román-Roldán, J Oliver, HE Stanley
Physical Review E, 2002APS
We study statistical properties of the Jensen-Shannon divergence D, which quantifies the
difference between probability distributions, and which has been widely applied to analyses
of symbolic sequences. We present three interpretations of D in the framework of statistical
physics, information theory, and mathematical statistics, and obtain approximations of the
mean, the variance, and the probability distribution of D in random, uncorrelated sequences.
We present a segmentation method based on D that is able to segment a nonstationary …
Abstract
We study statistical properties of the Jensen-Shannon divergence D, which quantifies the difference between probability distributions, and which has been widely applied to analyses of symbolic sequences. We present three interpretations of D in the framework of statistical physics, information theory, and mathematical statistics, and obtain approximations of the mean, the variance, and the probability distribution of D in random, uncorrelated sequences. We present a segmentation method based on D that is able to segment a nonstationary symbolic sequence into stationary subsequences, and apply this method to DNA sequences, which are known to be nonstationary on a wide range of different length scales.
American Physical Society