Open Access Article

Multivariate Krawtchouk Polynomials and Composition Birth and Death Processes

Robert Griffiths

Department of Statistics, University of Oxford, Oxford OX1 3LB, UK

Symmetry 2016, 8(5), 33; https://doi.org/10.3390/sym8050033

Submission received: 28 February 2016 / Revised: 27 April 2016 / Accepted: 29 April 2016 / Published: 9 May 2016

(This article belongs to the Special Issue Symmetry in Orthogonal Polynomials)

Abstract

This paper defines the multivariate Krawtchouk polynomials, orthogonal on the multinomial distribution, and summarizes their properties as a review. The multivariate Krawtchouk polynomials are symmetric functions of orthogonal sets of functions defined on each of N multinomial trials. The dual multivariate Krawtchouk polynomials, which also have a polynomial structure, are seen to occur naturally as spectral orthogonal polynomials in a Karlin and McGregor spectral representation of transition functions in a composition birth and death process. In this Markov composition process in continuous time, there are N independent and identically distributed birth and death processes each with support

0, 1, \dots

. The state space in the composition process is the number of processes in the different states

0, 1, \dots

. Dealing with the spectral representation requires new extensions of the multivariate Krawtchouk polynomials to orthogonal polynomials on a multinomial distribution with a countable infinity of states.

Keywords:

Bernoulli trials and orthogonal polynomials; birth and death processes; composition Markov processes; Karlin and McGregor spectral representation; multivariate Krawtchouk polynomials

MSC:

33D52; 60J27

1. Introduction

Griffiths [1] and Diaconis and Griffiths [2] construct multivariate Krawtchouk polynomials orthogonal on the multinomial distribution and study their properties. Recent representations and derivations of the orthogonality of these polynomials are in [3,4,5,6].

The authors emphasise different approaches to the multivariate orthogonal polynomials. The approach of Diaconis and Griffiths [2] is probabilistic and directed to Markov chain applications; the approach of Iliev [5] is via Lie groups; and the physics approach of Genest et al. [3] is as matrix elements of group representations on oscillator states. Xu [7] studies discrete multivariate orthogonal polynomials, which have a triangular construction of products of one-dimensional orthogonal polynomials. They are particular cases of the polynomials in this paper; see Diaconis and Griffiths [2]. These polynomials extend the Krawtchouk polynomials on the binomial distribution to a general class of multi-dimensional orthogonal polynomials on the multinomial distribution. They appear naturally in composition Markov chains as eigenfunctions in a diagonal expansion of the transition functions. There are many interesting examples of these Markov chains in Zhou and Lange [8]. Binomial and multinomial random variables can be constructed as a sum of independent and identically distributed random variables, which are indicator functions of the events that occur on each of N trials. The Krawtchouk and multivariate Krawtchouk polynomials are symmetric functions of orthogonal function sets on each of the trials. The simplest case is that of the Krawtchouk polynomials, whose representation is explained in Section 2. In the multivariate Krawtchouk polynomials, there is not a unique orthogonal function set on trials with more than two outcomes, so the polynomials depend on which orthogonal function set is taken as a basis on the trials.

A well-known spectral expansion by Karlin and McGregor [9,10,11] for the transition functions

{p_{i j} (t)}_{i, j = 0}^{\infty}

of a birth and death process with rates

λ_{i}, μ_{i}

i = 0, 1, \dots

is that:

p_{i j} (t) = π_{j} \int_{0}^{\infty} e^{- z t} Q_{i} (z) Q_{j} (z) ψ (d z), i, j = 0, 1, \dots

(1)

where

{Q_{i}}_{i = 0}^{\infty}

are orthogonal polynomials on the spectral measure ψ, which is a probability measure, and:

π_{j} = \frac{λ_{0} \dots λ_{j - 1}}{μ_{1} \dots μ_{j}}, j = 1, 2, \dots

(2)
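
As a small illustration of Equation (2), the following Python sketch computes the coefficients π_j in exact arithmetic. The function name `potential_coefficients`, the convention π_0 = 1 and the example rates λ_i = N - i, μ_i = i are assumptions made for illustration only; they are not taken from the paper.

```python
from fractions import Fraction


def potential_coefficients(birth, death, J):
    """Return [pi_0, ..., pi_J] from Equation (2), assuming pi_0 = 1.

    birth(i) and death(i) are callables giving the rates lambda_i and mu_i.
    """
    pi = [Fraction(1)]
    for j in range(1, J + 1):
        # pi_j = pi_{j-1} * lambda_{j-1} / mu_j
        pi.append(pi[-1] * Fraction(birth(j - 1), death(j)))
    return pi


# Hypothetical example: lambda_i = N - i and mu_i = i on the states 0, ..., N.
N = 4
pi = potential_coefficients(lambda i: N - i, lambda i: i, N)
```

With these example rates the coefficients are the binomial coefficients of N, and they satisfy the detailed balance relation π_i λ_i = π_{i+1} μ_{i+1}.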

A number of classical birth and death processes have a spectral expansion where the orthogonal polynomials are constructed from the Meixner class. This class has a generating function of the form:

G (v, z) = h (v) e^{z u (v)} = \sum_{m = 0}^{\infty} Q_{m} (z) v^{m} / m!

(3)

where

h (v)

is a power series in v with

h (0) = 1

and

u (v)

is a power series with

u (0) = 0

and

u^{'} (0) \neq 0

. Meixner [12] characterizes the class of weight functions and orthogonal polynomials with the generating function Equation (3). They include the Krawtchouk polynomials, Poisson–Charlier polynomials, scaled Meixner polynomials and Laguerre polynomials (the Meixner orthogonal polynomials are a specific set belonging to the Meixner class with a name in common). A general reference to these orthogonal polynomials is Ismail [13].

In this paper, the spectral expansion is extended to composition birth and death processes, where there are N independent and identically distributed birth and death processes operating and

{X (t)}_{t \geq 0}

is such that the i-th element

{X_{i} (t)}_{t \geq 0}

counts the number of processes in state i at time t. In the analogue of Equation (1), the spectral polynomials are the dual multivariate Krawtchouk polynomials. The dual polynomial system is therefore very important, and attention is paid to describing it.

There are extensions of the multivariate Krawtchouk polynomials to multivariate orthogonal polynomials on the multivariate Meixner distribution and multivariate product Poisson distribution, where they occur as eigenfunctions of multi-type birth and death processes [14].

This paper defines the multivariate Krawtchouk polynomials, summarizes their properties, then considers how they are found in spectral expansions of composition birth and death processes. It is partly a review of these polynomials and is self-contained. For a fuller treatment, see Diaconis and Griffiths [15]. The polynomials are naturally defined by a generating function, and so, generating function techniques are used extensively in the paper. Probabilistic notation is used, particularly the expectation operator

E

, which is a linear operator acting on functions of random variables, which take discrete values in this paper. If

X_{1}, \dots, X_{d}

are random variables, then:

E [f (X_{1}, \dots, X_{d})] = \sum_{x_{1}, \dots, x_{d}} f (x_{1}, \dots, x_{d}) P (X_{1} = x_{1}, \dots, X_{d} = x_{d})

(4)

Often, orthogonal polynomials are regarded as random variables. For example,

{K_{n} (X; N, p)}_{n = 0}^{N}

are the one-dimensional Krawtchouk polynomials as random variables and:

\begin{matrix} E [K_{n} (X; N, p) K_{m} (X; N, p)] & = & \sum_{x = 0}^{N} K_{n} (x; N, p) K_{m} (x; N, p) \binom{N}{x} p^{x} q^{N - x} \\ & = & δ_{m n} {(n!)}^{2} \binom{N}{n} {(p q)}^{n}, \quad m, n = 0, 1, \dots, N \end{matrix}

(5)

where

q = 1 - p

. A convention of using capital letters for random variables and lower case for values that they take is used, except when the random variables are denoted by Greek letters, when they have to be considered in context.

Section 2, Theorem 1, shows how the Krawtchouk polynomials can be expressed as elementary symmetric functions of N Bernoulli trials, centred at their mean p. The Meixner orthogonal polynomials on the geometric distribution are also expressed as functions of an infinity of centred Bernoulli trials in Theorem 2. There is some, but not total symmetry in this expression. Krawtchouk polynomials occur naturally as eigenfunctions in Ehrenfest urn processes, and the eigenfunction expansion of their transition functions is explained in Section 2.3. Section 3 introduces the multivariate Krawtchouk polynomials, explaining how they are constructed in a symmetric way from a product set of orthogonal functions on N independent multinomial trials. The dual orthogonal system is described and a scaling found, so that they are multivariate Krawtchouk polynomials on a different multinomial distribution in Theorem 3. The polynomial structure of the multivariate Krawtchouk polynomials is described in Theorem 4 and the structure in the dual system in Theorem 5. Recurrence relationships are found for the system in Theorem 6 and for the dual system in Theorem 7. The dual recurrence relationship is used to identify the polynomials as eigenfunctions in a d-type Ehrenfest urn in Theorem 8. In Section 3.2, a new extension is made to multivariate Krawtchouk polynomials where there are an infinite number of possibilities in each multinomial trial. These polynomials occur naturally as eigenfunctions in composition birth and death processes in a Karlin and McGregor spectral expansion in Theorem 9. Theorem 10 considers the polynomial structure of the dual polynomials in the spectral expansion. Theorem 11 gives an interesting identity for these spectral polynomials in composition birth and death processes when the spectral polynomials in the individual processes belong to the Meixner class.

2. Bernoulli Trials and Orthogonal Polynomials

The paper begins with expressing the one-dimensional Krawtchouk polynomials as symmetric functions of Bernoulli trials. The multivariate Krawtchouk polynomials are extensions of this construction in higher dimensions.

2.1. Krawtchouk Orthogonal Polynomials

The Krawtchouk orthogonal polynomials

{K_{n} (x; N, p)}_{n = 0}^{N}

are orthogonal on the binomial

(N, p)

distribution:

(\binom{N}{x}) p^{x} q^{N - x}, x = 0, 1, \dots, N

(6)

They have a generating function:

G (z; x) = \sum_{n = 0}^{N} K_{n} (x; N, p) \frac{z^{n}}{n!} = {(1 + q z)}^{x} {(1 - p z)}^{N - x}

(7)

The scaling is such that the polynomials

K_{n} (x; N, p)

are monic and:

\begin{matrix} E [K_{n} {(X; N, p)}^{2}] & = & \sum_{x = 0}^{N} \binom{N}{x} p^{x} q^{N - x} K_{n} {(x; N, p)}^{2} \\ & = & {(n!)}^{2} \binom{N}{n} {(p q)}^{n} \end{matrix}

(8)
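
The generating function Equation (7) gives a direct way to tabulate the polynomials and to check Equations (5) and (8) numerically. The sketch below is a minimal illustration in exact rational arithmetic; the function names `krawtchouk` and `inner_product` and the parameter values are chosen here for illustration.

```python
from fractions import Fraction
from math import comb, factorial


def krawtchouk(n, x, N, p):
    """K_n(x; N, p): n! times the coefficient of z^n in (1 + q z)^x (1 - p z)^(N - x)."""
    q = 1 - p
    coeff = sum(comb(x, k) * q**k * comb(N - x, n - k) * (-p)**(n - k)
                for k in range(n + 1))
    return factorial(n) * coeff


def inner_product(n, m, N, p):
    """E[K_n(X) K_m(X)] for X binomial (N, p), as in Equations (5) and (8)."""
    q = 1 - p
    return sum(comb(N, x) * p**x * q**(N - x)
               * krawtchouk(n, x, N, p) * krawtchouk(m, x, N, p)
               for x in range(N + 1))


# Illustrative parameters; any 0 < p < 1 and small N would do.
N, p = 4, Fraction(1, 3)
gram = [[inner_product(n, m, N, p) for m in range(N + 1)]
        for n in range(N + 1)]
```

The matrix `gram` should be diagonal, with the n-th diagonal entry equal to the right side of Equation (8).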

If the Krawtchouk polynomials are scaled to be

Q_{n} (x)

, so that

Q_{n} (0) = 1

, then there is a duality that

Q_{n} (x) = Q_{x} (n)

. A binomial random variable X counts the number of successes in N independent trials, each with a probability p of success. Let

ξ_{i} = 1

if the i-th trial is a success and

ξ_{i} = 0

otherwise. Then,

{ξ_{i}}_{i = 1}^{N}

is a sequence of Bernoulli trials with

P (ξ_{i} = 1) = p

,

P (ξ_{i} = 0) = q

, and

X = \sum_{i = 1}^{N} ξ_{i}

. It is interesting to express the Krawtchouk polynomials as symmetric functions of

{ξ_{i}}_{i = 1}^{N}

. If there is just one trial, with

N = 1

, then

X = ξ_{1}

, and the orthogonal polynomial set on X is

{1, ξ_{1} - p}

. There can only be a constant function and a linear function if there are just two values that

ξ_{1}

can take. A product set of orthogonal functions on

{ξ_{i}}_{i = 1}^{N}

is

⨂_{i = 1}^{N} {1, ξ_{i} - p}

, and we want to form from these functions a smaller basis of orthogonal polynomials in

X = \sum_{i = 1}^{N} ξ_{i}

.

Theorem 1.

The Krawtchouk polynomials are proportional to the elementary symmetric functions of

{ξ_{i} - p}_{i = 1}^{N}

;

K_{n} (X; N, p) = n! \sum_{σ \in S_{N}} (ξ_{σ (1)} - p) \dots (ξ_{σ (n)} - p)

(9)

where

S_{N}

is the symmetric group on

{1, 2, \dots, N}

Proof.

A generating function for the symmetric functions on the right of Equation (9) is:

\sum_{n = 0}^{N} \frac{z^{n}}{n!} n! \sum_{σ \in S_{N}} (ξ_{σ (1)} - p) \dots (ξ_{σ (n)} - p) = \prod_{i = 1}^{N} (1 + z (ξ_{i} - p))

(10)

If x of the

ξ_{i}

are one and

N - x

are zero, then Equation (10) is equal to:

{(1 + z (1 - p))}^{x} {(1 - z p)}^{N - x}

(11)

identical to the right side of Equation (7). Therefore, Equation (9) holds since the generating functions of both sides, regarding X as a random variable, are the same. ☐

The representation Equation (9) of the Krawtchouk polynomials appears with a full treatment in Diaconis and Griffiths [15] and very briefly as the generating function proof above in Griffiths [1]. The author is not aware of any other appearances of Equation (9).
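
Theorem 1 can be checked by brute force for small N. The sketch below is an illustration under one stated assumption: the sum in Equation (9) is read as the elementary symmetric function of degree n, with each set of n distinct trials counted once, which is the reading implied by the generating function Equation (10). The function names are hypothetical.

```python
from fractions import Fraction
from itertools import combinations, product
from math import comb, factorial, prod


def krawtchouk(n, x, N, p):
    """K_n(x; N, p): n! times the coefficient of z^n in (1 + q z)^x (1 - p z)^(N - x)."""
    q = 1 - p
    coeff = sum(comb(x, k) * q**k * comb(N - x, n - k) * (-p)**(n - k)
                for k in range(n + 1))
    return factorial(n) * coeff


def elementary_symmetric(values, n):
    """Sum of products over all n-element subsets of the given values."""
    return sum(prod(subset) for subset in combinations(values, n))


def check_theorem_1(N, p):
    """Compare n! e_n(xi - p) with K_n(x; N, p) for every outcome of N trials."""
    for xi in product((0, 1), repeat=N):
        centred = [Fraction(v) - p for v in xi]
        x = sum(xi)  # number of successes
        for n in range(N + 1):
            lhs = factorial(n) * elementary_symmetric(centred, n)
            if lhs != krawtchouk(n, x, N, p):
                return False
    return True


theorem_1_holds = check_theorem_1(5, Fraction(2, 7))
```

The check runs over all 2^N outcomes, so it is only practical for small N; it is meant as a sanity check of the representation rather than as an efficient way to evaluate the polynomials.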

2.2. Meixner Polynomials on the Geometric Distribution

The Meixner orthogonal polynomials on the geometric distribution are orthogonal on:

p q^{x}, x = 0, 1, \dots

(12)

Let

{ξ_{i}}_{i = 1}^{\infty}

be a sequence of Bernoulli trials. Let X count the number of trials with

ξ_{i} = 0

before the first trial with

ξ_{i} = 1

, which is trial

X + 1

. That is,

X = \sum_{j = 1}^{\infty} (j - 1) \prod_{i = 1}^{j - 1} (1 - ξ_{i}) ξ_{j} .

X is clearly not a symmetric function of

{ξ_{i}}_{i = 1}^{\infty}

. The orthogonal polynomials on the geometric distribution are a special case of the general Meixner polynomials and have a generating function:

G (z; x) = \sum_{n = 0}^{\infty} M_{n} (x; 1, q) z^{n} = {(1 - q^{- 1} z)}^{x} {(1 - z)}^{- (x + 1)}

(13)
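
The coefficients in Equation (13) can be extracted explicitly from the two binomial series, which gives a quick numerical check of orthogonality on the geometric distribution. The sketch below is illustrative only: the function names, the truncation point `x_max` and the value of q are assumptions, and the comparison value q^{-n} for the squared norm is the standard Meixner normalisation, which is not stated in the text above.

```python
from fractions import Fraction
from math import comb


def meixner(n, x, q):
    """M_n(x; 1, q): coefficient of z^n in (1 - z/q)^x (1 - z)^(-(x + 1))."""
    # (1 - z/q)^x         = sum_k C(x, k) (-1/q)^k z^k
    # (1 - z)^(-(x + 1))  = sum_m C(x + m, m) z^m
    return sum(comb(x, k) * (-1 / q)**k * comb(x + n - k, n - k)
               for k in range(n + 1))


def truncated_inner_product(n, m, q, x_max=300):
    """Approximate E[M_n(X) M_m(X)] when P(X = x) = p q^x, truncating at x_max."""
    p = 1 - q
    total = sum(p * q**x * meixner(n, x, q) * meixner(m, x, q)
                for x in range(x_max + 1))
    return float(total)


q = Fraction(1, 2)
gram = [[truncated_inner_product(n, m, q) for m in range(4)] for n in range(4)]
```

Up to the truncation error, which is negligible for these parameters, `gram` should be diagonal with entries q^{-n}.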

A product set of orthogonal functions on the trials is:

⨂_{i = 1}^{\infty} {1, ξ_{i} - p}

(14)

It is of interest to express the orthogonal polynomial set

{M_{n} (x; 1, q)}_{n = 0}^{\infty}

as a series expansion in the product set Equation (14) as a comparison of what happens with the Krawtchouk polynomials. A calculation is now made of

E [(ξ_{i_{1}} - p) \dots (ξ_{i_{r}} - p) G (z; X)]

leading to coefficients in the expansion of the Meixner polynomials in the product set of orthogonal functions. Given

X = x

, it must be that

ξ_{j} = 0

for

j = 1, \dots, x

,

ξ_{x + 1} = 1

and

{ξ_{j}}_{j = x + 2}^{\infty}

are distributed as Bernoulli trials. Therefore:

E [(ξ_{i_{1}} - p) \dots (ξ_{i_{r}} - p) ∣ X = x] = \begin{cases} 0 & \text{if } i_{r} \geq x + 2, \\ q {(- p)}^{r - 1} & \text{if } i_{r} = x + 1, \\ {(- p)}^{r} & \text{if } i_{r} \leq x . \end{cases}

(15)

Taking an expectation conditional on X, then over X,

\begin{matrix} E [(ξ_{i_{1}} - p) \dots (ξ_{i_{r}} - p) G (z; X)] \\ = q {(- p)}^{r - 1} G (z; i_{r} - 1) p q^{i_{r} - 1} + \sum_{j = i_{r}}^{\infty} {(- p)}^{r} G (z; j) p q^{j} \\ = {(- 1)}^{r - 1} p^{r} q^{i_{r}} z G (z; i_{r} - 1) \end{matrix}

(16)

Simplification to the last line is straightforward, so it is omitted. Considering the coefficients of

z^{n}

in Equation (16) and using Theorem 1 gives the following theorem.

Theorem 2.

Let

{ξ_{i}}_{i = 1}^{\infty}

be a sequence of Bernoulli

(p)

trials and

X = \sum_{j = 1}^{\infty} (j - 1) \prod_{i = 1}^{j - 1} (1 - ξ_{i}) ξ_{j}

. Then, X has a geometric distribution, and the Meixner polynomials on this geometric distribution have a representation for

n \geq 1

of:

\begin{matrix} M_{n} (X; 1, q) & = & \sum_{r = 1}^{\infty} \sum_{i_{1} < \dots < i_{r}} (ξ_{i_{1}} - p) \dots (ξ_{i_{r}} - p) {(- 1)}^{r - 1} q^{i_{r} - r} M_{n - 1} (i_{r} - 1; 1, q) \\ = & \sum_{r = 1}^{\infty} \sum_{l = r}^{\infty} \frac{1}{(r - 1)!} K_{r - 1} (X_{l}; l, p) (ξ_{l} - p) {(- 1)}^{r - 1} q^{l - r} M_{n - 1} (l - 1; 1, q) \end{matrix}

(17)

where

X_{l} = ξ_{1} + \dots ξ_{l}

2.3. An Ehrenfest urn

The Krawtchouk polynomials appear naturally as eigenfunctions in an Ehrenfest urn model. This is explored in Diaconis and Griffiths [15]. An urn has N balls coloured red or blue. Transitions occur at rate one when a ball is chosen at random, and the colour of the ball is changed according to a transition matrix:

P = [\begin{matrix} 0 & 1 \\ q / p & 1 - q / p \end{matrix}]

(18)

where

p, q > 0

,

p + q = 1

and

q \leq p

. That is, if a blue ball is chosen, it is changed to red with probability one, whereas if a red ball is chosen, it is changed to blue with probability

q / p

. Let

{X (t)}_{t \geq 0}

be the number of red balls in the urn at time t. Then,

{X (t)}_{t \geq 0}

is a reversible Markov process, which is a birth and death process, with a Binomial

(N, p)

stationary distribution.

The process is a composition Markov process in the following sense. Label the balls

1, 2, \dots N

at time

t = 0

and keep the labels over time as their colours change. Let

{ξ_{i} (t)}_{t \geq 0}

describe the colour of ball i at time t:

ξ_{i} (t) = 1

if the i-th ball is red or zero if the ball is blue. The processes

{ξ_{i} (t)}_{t \geq 0}

i = 1, \dots, N

are independent; each has a rate of events

1 / N

when the specified ball is chosen; and

X (t) = \sum_{i = 1}^{N} ξ_{i} (t)

. Denote

p_{i j} (t) = P (ξ_{k} (t) = j ∣ ξ_{k} (0) = i)

, for

i, j = 0, 1

. Standard Markov process theory gives that:

\begin{matrix} P (t) = [\begin{matrix} q + p e^{- λ t} & p (1 - e^{- λ t}) \\ q (1 - e^{- λ t}) & p + q e^{- λ t} \end{matrix}] \end{matrix}

(19)

where

λ = 1 / (N p)

. It is immediate that the stationary distribution of each of the labelled processes is

(π_{0}, π_{1}) = (q, p)

. An eigenvalue-eigenfunction expansion of

P (t)

is:

P_{η, ξ} (t) = π_{ξ} \{1 + e^{- λ t} {(p q)}^{- 1} (η - p) (ξ - p)\}, ξ, η = 0, 1

(20)

where

π_{ξ}

is the stationary distribution with

π_{0} = q

π_{1} = p

. It is straightforward to check the agreement with

P (t)

by substituting the four values of

η, ξ = 0, 1
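
The agreement between Equations (19) and (20) can also be checked numerically. The sketch below is a minimal illustration in floating point; the function names and the parameter values are chosen here and are not part of the paper.

```python
from math import exp, isclose


def transition_matrix(t, N, p):
    """P(t) from Equation (19); rows and columns are ordered as states 0, 1."""
    q = 1 - p
    decay = exp(-t / (N * p))  # e^{-lambda t} with lambda = 1 / (N p)
    return [[q + p * decay, p * (1 - decay)],
            [q * (1 - decay), p + q * decay]]


def spectral_entry(eta, xi, t, N, p):
    """P_{eta, xi}(t) from the eigenfunction expansion in Equation (20)."""
    q = 1 - p
    stationary = (q, p)  # pi_0 = q, pi_1 = p
    decay = exp(-t / (N * p))
    return stationary[xi] * (1 + decay * (eta - p) * (xi - p) / (p * q))


# Illustrative parameters with q <= p, as required for the matrix in (18).
N, p, t = 5, 0.6, 1.3
P = transition_matrix(t, N, p)
agree = all(isclose(P[eta][xi], spectral_entry(eta, xi, t, N, p), abs_tol=1e-12)
            for eta in (0, 1) for xi in (0, 1))
```

Here `agree` records whether all four entries coincide; each row of `P` should also sum to one.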

In the Ehrenfest urn composition process, the transitions are made from

X (0) = x

to

X (t) = y

, where the initial and final colour indicators satisfy

\sum_{i = 1}^{N} η_{i} = x

and

\sum_{i = 1}^{N} ξ_{i} = y

. The transition probabilities are:

\begin{aligned} P (X (t) = y ∣ X (0) = x) &= \sum_{σ \in S_{N}} P_{η_{1} ξ_{σ (1)}} (t) \dots P_{η_{N} ξ_{σ (N)}} (t) \\ &= \binom{N}{y} p^{y} {(1 - p)}^{N - y} \Big\{ 1 + \sum_{n = 1}^{N} e^{- λ n t} {(p q)}^{- n} {\binom{N}{n}}^{- 1} \\ &\qquad \times \sum_{σ \in S_{N}} (η_{σ (1)} - p) \dots (η_{σ (n)} - p) \sum_{τ \in S_{N}} (ξ_{τ (1)} - p) \dots (ξ_{τ (n)} - p) \Big\} \\ &= \binom{N}{y} p^{y} {(1 - p)}^{N - y} \\ &\qquad \times \Big\{ 1 + \sum_{n = 1}^{N} e^{- λ n t} {(p q)}^{- n} {(n!)}^{- 2} {\binom{N}{n}}^{- 1} K_{n} (x; N, p) K_{n} (y; N, p) \Big\} \end{aligned}

(21)

The Krawtchouk polynomials thus appear naturally as elementary symmetric functions of the individual labelled indicator functions in the Markov process.
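
The last line of Equation (21) can be compared with a direct calculation in which the x red balls and the N - x blue balls evolve independently according to Equation (19). The sketch below is an illustration in exact arithmetic: the argument `decay` stands for e^{-λt} and is passed as a rational number in (0, 1), which is a device used here to avoid rounding; the function names are hypothetical.

```python
from fractions import Fraction
from math import comb, factorial


def krawtchouk(n, x, N, p):
    """K_n(x; N, p): n! times the coefficient of z^n in (1 + q z)^x (1 - p z)^(N - x)."""
    q = 1 - p
    coeff = sum(comb(x, k) * q**k * comb(N - x, n - k) * (-p)**(n - k)
                for k in range(n + 1))
    return factorial(n) * coeff


def direct_transition(x, y, N, p, decay):
    """P(X(t) = y | X(0) = x) from N independent two-state processes."""
    q = 1 - p
    stay_red = p + q * decay       # entry (1, 1) of P(t) in (19)
    blue_to_red = p * (1 - decay)  # entry (0, 1) of P(t) in (19)
    total = Fraction(0)
    for k in range(min(x, y) + 1):  # k of the x red balls are red at time t
        m = y - k                   # m of the N - x blue balls are red at time t
        if m > N - x:
            continue
        total += (comb(x, k) * stay_red**k * (1 - stay_red)**(x - k)
                  * comb(N - x, m) * blue_to_red**m
                  * (1 - blue_to_red)**(N - x - m))
    return total


def spectral_transition(x, y, N, p, decay):
    """Last line of Equation (21), with e^{-lambda n t} written as decay**n."""
    q = 1 - p
    series = 1 + sum(decay**n * krawtchouk(n, x, N, p) * krawtchouk(n, y, N, p)
                     / ((p * q)**n * factorial(n)**2 * comb(N, n))
                     for n in range(1, N + 1))
    return comb(N, y) * p**y * q**(N - y) * series


N, p, decay = 4, Fraction(3, 5), Fraction(2, 7)
expansion_matches = all(
    direct_transition(x, y, N, p, decay) == spectral_transition(x, y, N, p, decay)
    for x in range(N + 1) for y in range(N + 1))
```

Because both sides are polynomials in `decay`, agreement at a rational value is a genuine exact check for the chosen N and p, not an approximation.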

3. Multivariate Krawtchouk Polynomials

The multivariate Krawtchouk polynomials with elementary basis u were first constructed by Griffiths [1]. A recent introduction to them is Diaconis and Griffiths [2]. They play an important role in the spectral expansion of transition functions of composition Markov processes. Zhou and Lange [8], Khare and Zhou [16] have many interesting examples of such Markov processes. Later in this paper, we consider the particular composition processes where there are N particles independently performing birth and death processes.

The multivariate Krawtchouk polynomials are orthogonal on the multinomial distribution:

m (x; p) = (\binom{N}{x}) \prod_{j = 1}^{d} p_{j}^{x_{j}}, x_{j} \geq 0, j = 1, \dots, d, | x | = N

(22)

with

p = {p_{j}}_{j = 1}^{d}

a probability distribution. Let

J_{1}, \dots, J_{N}

be independent and identically distributed random variables specifying outcomes on the N trials, such that:

P (J = k) = p_{k}, k = 1, \dots, d

(23)

Then:

X_{i} = | {J_{k} : J_{k} = i, k = 1, \dots, N} |

(24)

Let

u = {u^{(l)}}_{l = 0}^{d - 1}

be an orthogonal set of functions on

p = {p_{k}}_{k = 1}^{d}

with

u^{(0)} = 1

satisfying:

\sum_{i = 1}^{d} u_{i}^{(l)} u_{i}^{(m)} p_{i} = a_{l} δ_{l m}, l, m = 0, \dots d - 1

(25)

This notation for the orthogonal set of functions follows Lancaster [17]. There is an equivalence that:

h_{i l} = u_{i}^{(l - 1)} \sqrt{p_{i} / a_{l - 1}}, i, l = 1, \dots, d

(26)

are elements of a

d \times d

orthogonal matrix H. In this paper,

{u^{(l)}}_{l = 0}^{d - 1}

are usually orthonormal functions with

a_{l} = 1

l = 0, 1, \dots, d - 1

, unless stated otherwise. The one-dimensional Krawtchouk polynomials are constructed from a symmetrized product set of orthogonal functions

⨂ {1, ξ_{i} - p}_{i = 1}^{N}

, and the construction of the multivariate polynomials follows a similar, but more complicated procedure. Instead of having two unique elements in each orthogonal function set, there is a choice of orthogonal basis, and the construction is from the product set

⨂_{i = 1}^{N} {u_{J_{i}}^{(l_{i})}}_{l_{i} = 0}^{d - 1}

. The orthogonality Equation (25) is equivalent to:

E [u_{J_{k}}^{(l)} u_{J_{k}}^{(m)}] = a_{l} δ_{l m}

(27)

for

k = 1, \dots, N

. Define a collection of orthogonal polynomials

\{Q_{n} (X; u)\}

with

n = (n_{1}, \dots n_{d - 1})

and

| n | \leq N

on the multinomial distribution as symmetrized elements from the product set, such that the sum is over products

u_{J_{1}}^{(l_{1})} \dots u_{J_{N}}^{(l_{N})}

with

n_{k} = | {l_{i} : l_{i} = k, i = 1, \dots, N} |

for

k = 1, \dots, d - 1

. That is,

Q_{n} (X; u)

is the coefficient of

w_{1}^{n_{1}} \dots w_{d - 1}^{n_{d - 1}}

in the generating function:

\begin{matrix} G (x, w, u) & = & \prod_{i = 1}^{N} (1 + \sum_{l_{i} = 1}^{d - 1} w_{l_{i}} u_{J_{i}}^{(l_{i})}) \\ = & \prod_{j = 1}^{d} {(1 + \sum_{l = 1}^{d - 1} w_{l} u_{j}^{(l)})}^{x_{j}} \end{matrix}

(28)

In the one-dimensional case,

u_{1}^{(1)} = 0 - p_{1} = - p_{1}

and

u_{2}^{(1)} = 1 - p_{1}

, orthogonal on

1 - p_{1}, p_{1}

, so the generating function is:

{(1 - p_{1} w_{1})}^{x_{1}} {(1 + (1 - p_{1}) w_{1})}^{x_{2}}

(29)

which is, of course, the generating function of the Krawtchouk polynomials.

x_{1}, x_{2}

are respectively the number of zero and one values in the N trials. It is straightforward to show, by using the generating function Equation (28), that:

\begin{matrix} E [Q_{m} (X; u) Q_{n} (X; u)] & = & \sum_{{x : | x | = N}} Q_{m} (x; u) Q_{n} (x; u) m (x; p) \\ = & δ_{m n} (\binom{N}{n^{+}}) \prod_{j = 1}^{d - 1} {a_{j}}^{n_{j}} \end{matrix}

(30)

where

n^{+} = (n_{0}, \dots, n_{d - 1})

, with

n_{0} = N - \sum_{j = 1}^{d - 1} n_{j}

. Instead of indexing the polynomials by

n = (n_{1}, \dots, n_{d - 1})

, they could be indexed by

n^{+}

. This notation is sometimes convenient to use in the paper. The dual orthogonality relationship is, immediately from Equation (30),

\sum_{{n : | n | \leq N}} {(\binom{N}{n^{+}})}^{- 1} \prod_{j = 1}^{d - 1} {a_{j}}^{- n_{j}} Q_{n} (x; u) Q_{n} (y; u) = δ_{x y} m {(x, p)}^{- 1}

(31)
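
The orthogonality relations Equations (30) and (31) can be verified for a small example by reading the polynomials off the generating function Equation (28). The sketch below is illustrative: the choice d = 3, N = 3, the probabilities p = (4/9, 4/9, 1/9) and the rational orthonormal basis `U` are assumptions made here so that all arithmetic is exact, and the helper names are hypothetical.

```python
from fractions import Fraction as F
from math import factorial, prod

P = [F(4, 9), F(4, 9), F(1, 9)]   # p_1, ..., p_d with d = 3
U = [[F(1), F(1), F(1)],          # u^(0) = 1
     [F(1, 2), F(-1), F(2)],      # u^(1)
     [F(1), F(-1, 2), F(-2)]]     # u^(2); U[l][j] holds u_{j+1}^(l)
D = len(P)


def compositions(total, parts):
    """All tuples of `parts` non-negative integers summing to `total`."""
    if parts == 1:
        yield (total,)
        return
    for first in range(total + 1):
        for rest in compositions(total - first, parts - 1):
            yield (first,) + rest


def multinomial(counts):
    """Multinomial coefficient sum(counts)! / prod(counts_i!)."""
    out = factorial(sum(counts))
    for c in counts:
        out //= factorial(c)
    return out


def q_table(x):
    """Map n = (n_1, ..., n_{d-1}) to Q_n(x; u), the coefficients of (28)."""
    poly = {(0,) * (D - 1): F(1)}
    for j in range(D):
        for _ in range(x[j]):
            new = dict(poly)  # contribution of the constant term 1
            for n, c in poly.items():
                for l in range(1, D):  # contribution of w_l u_j^(l)
                    m = n[:l - 1] + (n[l - 1] + 1,) + n[l:]
                    new[m] = new.get(m, F(0)) + c * U[l][j]
            poly = new
    return poly


N = 3
states = list(compositions(N, D))
tables = {x: q_table(x) for x in states}
weight = {x: multinomial(x) * prod(P[j]**x[j] for j in range(D)) for x in states}
indices = [n_plus[1:] for n_plus in compositions(N, D)]        # all n with |n| <= N
norm = {n: multinomial((N - sum(n),) + n) for n in indices}    # (30) with a_l = 1


def inner(m, n):
    """E[Q_m(X; u) Q_n(X; u)] under the multinomial (N, p)."""
    return sum(weight[x] * tables[x][m] * tables[x][n] for x in states)


def dual(x, y):
    """Left side of Equation (31) with a_l = 1."""
    return sum(tables[x][n] * tables[y][n] / norm[n] for n in indices)
```

With these definitions, `inner(m, n)` should vanish unless m = n, where it equals `norm[n]`, and `dual(x, y)` should vanish unless x = y, where it equals `1 / weight[x]`.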

Expanding the generating function Equation (28) shows that:

Q_{n} (x; u) = \sum_{{r : r_{\cdot k} = n_{k}}} \frac{\prod_{j = 1}^{d} {x_{j}}_{[r_{j \cdot}]}}{\prod_{j = 1}^{d} \prod_{k = 1}^{d - 1} r_{j k}!} \prod_{j = 1}^{d} \prod_{k = 1}^{d - 1} {(u_{j}^{(k)})}^{r_{j k}}

(32)

where · indicates summation over an index and

a_{[b]} = a (a - 1) \dots (a - b + 1)

for non-negative integers b. The dual generating function is:

\begin{matrix} \sum_{{x : | x | = N}} {(\binom{N}{n^{+}})}^{- 1} (\binom{N}{x}) v_{1}^{x_{1}} \dots v_{d}^{x_{d}} Q_{n} (x; u) \\ = {(\sum_{j = 1}^{d} v_{j})}^{n_{0}} \prod_{i = 1}^{d - 1} {(\sum_{j = 1}^{d} v_{j} u_{j}^{(i)})}^{n_{i}} \end{matrix}

(33)

Expanding the generating function:

{(\binom{N}{n^{+}})}^{- 1} (\binom{N}{x}) Q_{n} (x; u) = \sum_{{r : r_{i \cdot} = n_{i}, r_{\cdot j} = x_{j}}} \frac{\prod_{i = 0}^{d - 1} n_{i}!}{\prod_{i = 0}^{d - 1} \prod_{j = 1}^{d} r_{i j}!} \prod_{i = 1}^{d - 1} \prod_{j = 1}^{d} {(u_{j}^{(i)})}^{r_{i j}}

(34)

The two generating function Equations (28) and (33) are similar, and there is a form of self-duality for the polynomials. Let:

ω_{i}^{(j)} = u_{j + 1}^{(i - 1)}, j = 0, \dots, d - 1, i = 1, \dots, d

(35)

Then, because of Equation (25):

\sum_{l = 1}^{d} ω_{l}^{(i)} ω_{l}^{(k)} a_{l - 1}^{- 1} = δ_{i k} p_{i}^{- 1}

(36)

The right side of Equation (33) is equal to:

\prod_{i = 1}^{d} {(\sum_{j = 0}^{d - 1} ω_{i}^{(j)} v_{j + 1})}^{n_{i - 1}}

(37)

which, apart from the different indexing and non-constant function

ω^{(0)}

, generates multivariate Krawtchouk polynomials. Suppose that

ω_{i}^{(0)} \neq 0

for

i = 1, \dots, d

. Scale by letting

{\hat{ω}}_{i}^{(j)} = ω_{i}^{(j)} / ω_{i}^{(0)}

, so that

{\hat{ω}}_{i}^{(0)} = 1

. The orthogonality of these functions is:

\sum_{l = 1}^{d} {\hat{ω}}_{l}^{(i)} {\hat{ω}}_{l}^{(j)} a_{l - 1}^{- 1} {ω_{l}^{(0)}}^{2} = δ_{i j} p_{i}^{- 1}

(38)

Let

b = {b_{l}}_{l = 1}^{d}

be the scaled probability distribution of

{a_{l - 1}^{- 1} {ω_{l}^{(0)}}^{2}}_{l = 1}^{d}

, so:

\sum_{l = 1}^{d} {\hat{ω}}_{l}^{(i)} {\hat{ω}}_{l}^{(j)} b_{l} = δ_{i j} {(p_{i} \sum_{l = 1}^{d} a_{l - 1}^{- 1} {ω_{l}^{(0)}}^{2})}^{- 1}

(39)

The following theorem is evident from Equations (33) and (37), once the indexing is sorted out.

Theorem 3.

There is a duality

{(\binom{N}{n^{+}})}^{- 1} (\binom{N}{x}) Q_{n} (x; u) = \prod_{i = 1}^{d} {(ω_{i}^{(0)})}^{n_{i - 1}} Q_{x^{-}}^{*} (n^{+}; \hat{ω})

(40)

where

Q_{x^{-}}^{*} (n^{+}; \hat{ω})

, with

x^{-} = (x_{2}, \dots, x_{d})

\sum_{j = 2}^{d} x_{j} \leq N

, are multivariate Krawtchouk polynomials, orthogonal on

m (n^{+}; b)

There is an interesting identity when u is self-dual with an indexing of j beginning from zero instead of one. That is:

u_{j}^{(l)} = u_{l}^{(j)}, j, l = 0, 1, \dots, n

(41)

Then indexing

x = (x_{0}, \dots, x_{n})

{(\binom{N}{n^{+}})}^{- 1} Q_{n} (x; u) = {(\binom{N}{x})}^{- 1} Q_{x} (n^{+}; u^{*})

(42)

where

{u_{j}^{(l)}}^{*} = u_{l}^{(j)}

. This duality occurs in the scaled Krawtchouk polynomial basis, orthogonal on a binomial

(n, p)

distribution.

The emphasis in Theorem 3 is on considering the dual system, obtaining

\hat{ω}

from u; however, sometimes, it is natural to construct u from an orthogonal set

\hat{ω}

, particularly when

ω_{i}^{(0)} = 1

i = 1, \dots, d

and

\hat{ω} = ω

. Then, the polynomials on the left of Equation (40) are defined by the dual polynomials on the right. Later in the paper, it will be seen that this is natural in composition birth and death Markov processes.

The polynomial structure of the multivariate Krawtchouk polynomials is detailed in the next theorem.

Theorem 4.

Define

U_{l} = \sum_{k = 1}^{N} u_{J_{k}}^{(l)} = \sum_{j = 1}^{d} u_{j}^{(l)} X_{j}

for

l = 1, \dots, d - 1

. Then,

Q_{n} (X; u)

is a polynomial of degree

| n |

in

(U_{1}, \dots, U_{d - 1})

whose only term of maximal degree

| n |

is

\prod_{1}^{d - 1} U_{k}^{n_{k}}

Proof.

A method of proof is to consider the transform of

Q_{n} (X; u)

, which is given by:

E [\prod_{j = 1}^{d} ϕ_{j}^{X_{j}} Q_{n} (X; u)] = (\binom{N}{| n |}) (\binom{| n |}{n}) T_{0} {(ϕ)}^{N - | n |} T_{1} {(ϕ)}^{n_{1}} \dots T_{d - 1} {(ϕ)}^{n_{d - 1}}

(43)

where:

T_{i} (ϕ) = \sum_{j = 1}^{d} p_{j} ϕ_{j} u_{j}^{(i)}, i = 0, \dots, d - 1

(44)

This transform is easily found by taking the transform of the generating function Equation (28). One can see directly that

Q_{n} (X; u)

is an orthogonal polynomial by considering the transform:

E (\prod_{j = 1}^{d} {X_{j}}_{[k_{j}]} ϕ_{j}^{X_{j}}) = N_{[| k |]} \prod_{j = 1}^{d} {(ϕ_{j} p_{j})}^{k_{j}} \cdot {(\sum_{j = 1}^{d} p_{j} ϕ_{j})}^{N - | k |}

(45)

From Equations (43) and (45),

Q_{n} (x)

is a polynomial of degree

| n |

, whose only leading term is:

\frac{\prod_{i = 1}^{d - 1} U_{i}^{n_{i}}}{\prod_{i = 1}^{d - 1} n_{i}!}

(46)

This is seen by noting that the leading term is found by replacing

ϕ_{j} p_{j}

by

X_{j}

in:

N_{[| n |]}^{- 1} (\binom{N}{| n |}) (\binom{| n |}{n}) T_{1} {(ϕ)}^{n_{1}} \dots T_{d - 1} {(ϕ)}^{n_{d - 1}}

(47)

This is valid since we can replace

{X_{j}}_{[k_{j}]}

by

X_{j}^{k_{j}}

in considering the leading term of Equation (43), and set

ϕ_{i} = 1

for

i = 1, \dots, d

. ☐

The next theorem explains the polynomial structure in the dual system.

Theorem 5.

Let

{u^{(j)}}_{j = 0}^{d - 1}

be such that

u_{1}^{(j)} = 1

for

j = 0, \dots, d - 1

, as well as the usual assumption that

u_{i}^{(0)} = 1

for

i = 1, \dots, d

. Define

κ_{l} = \sum_{j = 0}^{d - 1} u_{l}^{(j)} n_{j}

for

l = 2, \dots, d

. Then,

{(\binom{N}{n^{+}})}^{- 1} Q_{n} (x; u)

is a polynomial of total degree

\sum_{i = 2}^{d} x_{i}

in

{κ_{l}}_{l = 2}^{d}

whose only term of maximal degree is

\prod_{l = 2}^{d} κ_{l}^{x_{l}}

Proof.

This follows from Theorem 3, with

ω_{i}^{(0)} = 1

i = 1, \dots d

, and Theorem 4. ☐

There are recurrence relationships for the multivariate Krawtchouk polynomials, which are found here from a generating function approach; for a different proof, see Theorem 6.1 in Iliev [5]. Note that his multivariate Krawtchouk polynomials are scaled differently as

Q_{n} (x; u) {(\binom{N}{n^{+}})}^{- 1}

(48)

In Theorems 6–8, u is taken to be orthonormal on p, so

a_{l} = 1

l = 0, 1, \dots d - 1

in Equation (25).

Theorem 6.

Denote, for

i, l, k = 0, \dots, d - 1

,

c (i, l, k) = \sum_{j = 1}^{d} u_{j}^{(i)} u_{j}^{(l)} u_{j}^{(k)} p_{j},

and

u_{i} = \sum_{j = 1}^{d} u_{j}^{(i)} x_{j}

for

i = 1, \dots, d - 1

. Two recursive systems are:

\begin{matrix} x_{j} Q_{n} (x; u) & = & \sum_{k = 1}^{d - 1} (n_{k} + 1) p_{j} u_{j}^{(k)} Q_{n + e_{k}} (x; u) \\ & & + (N - | n | + 1) \sum_{l = 1}^{d - 1} p_{j} u_{j}^{(l)} Q_{n - e_{l}} (x; u) \\ & & + \sum_{l, k = 1}^{d - 1} (n_{k} + 1 - δ_{l k}) p_{j} u_{j}^{(l)} u_{j}^{(k)} Q_{n - e_{l} + e_{k}} (x; u) + p_{j} (N - | n |) Q_{n} (x; u) \end{matrix}

(49)

and:

\begin{matrix} u_{i} Q_{n} (x; u) & = & (n_{i} + 1) Q_{n + e_{i}} (x; u) + (N - | n | + 1) Q_{n - e_{i}} (x; u) \\ & & + \sum_{l, k = 1}^{d - 1} c (i, l, k) (n_{k} + 1 - δ_{k l}) Q_{n - e_{l} + e_{k}} (x; u) \end{matrix}

(50)

Proof.

Consider:

E [X_{j} G (X, w, u) G (X, z, u)] = N p_{j} (1 + \sum_{i = 1}^{d - 1} w_{i} u_{j}^{(i)}) (1 + \sum_{i = 1}^{d - 1} z_{i} u_{j}^{(i)}) {(1 + \sum_{i = 1}^{d - 1} w_{i} z_{i})}^{N - 1}

(51)

Equating coefficients of

\prod_{1}^{d - 1} w_{j}^{n_{j}} \prod_{1}^{d - 1} z_{j}^{n_{j}^{'}}

;

E [X_{j} Q_{n} (X; u) Q_{n^{'}} (X; u)] = \{\begin{matrix} \frac{N!}{(N - | n | - 1)! \prod_{1}^{d - 1} n_{i}!} p_{j} u_{j}^{(k)} & n^{'} = n + e_{k} \\ \frac{N!}{(N - | n |)! \prod_{1}^{d - 1} n_{i}!} n_{l} p_{j} u_{j}^{(l)} & n^{'} = n - e_{l} \\ \frac{N!}{(N - | n |)! \prod_{1}^{d - 1} n_{i}!} n_{l} p_{j} u_{j}^{(l)} u_{j}^{(k)} & n^{'} = n - e_{l} + e_{k}, l \neq k \\ \frac{N!}{(N - | n | - 1)! \prod_{1}^{d - 1} n_{i}!} p_{j} + \sum_{l = 1}^{d - 1} \frac{N!}{(N - | n |)! \prod_{1}^{d - 1} n_{i}!} n_{l} p_{j} {u_{j}^{(l)}}^{2} & n^{'} = n \end{matrix}

(52)

The first recursive Equation (49) then follows by an expansion of

x_{j} Q_{n} (x; u)

as a series in

Q_{n^{'}} (x; u)

dividing the cases in Equation (52) to obtain the coefficients by:

(\binom{N}{{n^{'}}^{+}}) = \{\begin{matrix} \frac{N!}{(N - | n | - 1)! \prod_{1}^{d - 1} (n_{i} + δ_{i k})!} & n^{'} = n + e_{k} \\ \frac{N!}{(N - | n | + 1)! \prod_{1}^{d - 1} (n_{i} - δ_{i l})!} & n^{'} = n - e_{l} \\ \frac{N!}{(N - | n |)! \prod_{1}^{d - 1} (n_{i} - δ_{i l} + δ_{i k})!} & n^{'} = n - e_{l} + e_{k}, l \neq k \\ \frac{N!}{(N - | n |)! \prod_{1}^{d - 1} n_{i}!} & n^{'} = n . \end{matrix}

(53)

The second recursion Equation (50) is found by summation, using the orthogonality of u. ☐
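
Both recursions in Theorem 6 can be checked exhaustively for a small example. The sketch below is illustrative only: the case d = 3, N = 3, the probabilities, the rational orthonormal basis `U` and all helper names are assumptions made here, and polynomials whose index has a negative entry or satisfies |n| > N are taken to be zero.

```python
from fractions import Fraction as F

P = [F(4, 9), F(4, 9), F(1, 9)]            # p_1, ..., p_d with d = 3
U = [[F(1), F(1), F(1)],                   # u^(0) = 1
     [F(1, 2), F(-1), F(2)],               # u^(1)
     [F(1), F(-1, 2), F(-2)]]              # u^(2); orthonormal on P
D = len(P)
LS = range(1, D)                           # basis indices 1, ..., d - 1


def compositions(total, parts):
    """All tuples of `parts` non-negative integers summing to `total`."""
    if parts == 1:
        yield (total,)
        return
    for first in range(total + 1):
        for rest in compositions(total - first, parts - 1):
            yield (first,) + rest


def q_table(x):
    """Map n = (n_1, ..., n_{d-1}) to Q_n(x; u), the coefficients of (28)."""
    poly = {(0,) * (D - 1): F(1)}
    for j in range(D):
        for _ in range(x[j]):
            new = dict(poly)
            for n, c in poly.items():
                for l in LS:
                    m = n[:l - 1] + (n[l - 1] + 1,) + n[l:]
                    new[m] = new.get(m, F(0)) + c * U[l][j]
            poly = new
    return poly


def shift(n, minus=None, plus=None):
    """Return n - e_minus + e_plus for basis indices in 1, ..., d - 1."""
    m = list(n)
    if minus is not None:
        m[minus - 1] -= 1
    if plus is not None:
        m[plus - 1] += 1
    return tuple(m)


def Q(table, n):
    """Q_n(x; u), zero when n is outside the index range of the table."""
    return table.get(n, F(0))


def rhs_49(table, n, j, N):
    """Right side of Equation (49) for colour j (0-based here)."""
    size = sum(n)
    total = sum((n[k - 1] + 1) * P[j] * U[k][j] * Q(table, shift(n, plus=k)) for k in LS)
    total += (N - size + 1) * sum(P[j] * U[l][j] * Q(table, shift(n, minus=l)) for l in LS)
    total += sum((n[k - 1] + 1 - (l == k)) * P[j] * U[l][j] * U[k][j]
                 * Q(table, shift(n, minus=l, plus=k)) for l in LS for k in LS)
    return total + P[j] * (N - size) * Q(table, n)


def c(i, l, k):
    """c(i, l, k) as defined in Theorem 6."""
    return sum(U[i][j] * U[l][j] * U[k][j] * P[j] for j in range(D))


def rhs_50(table, n, i, N):
    """Right side of Equation (50)."""
    total = (n[i - 1] + 1) * Q(table, shift(n, plus=i))
    total += (N - sum(n) + 1) * Q(table, shift(n, minus=i))
    total += sum(c(i, l, k) * (n[k - 1] + 1 - (k == l))
                 * Q(table, shift(n, minus=l, plus=k)) for l in LS for k in LS)
    return total


N = 3
states = list(compositions(N, D))
tables = {x: q_table(x) for x in states}
indices = [n_plus[1:] for n_plus in states]
ok_49 = all(x[j] * Q(tables[x], n) == rhs_49(tables[x], n, j, N)
            for x in states for n in indices for j in range(D))
ok_50 = all(sum(U[i][j] * x[j] for j in range(D)) * Q(tables[x], n)
            == rhs_50(tables[x], n, i, N)
            for x in states for n in indices for i in LS)
```

The flags `ok_49` and `ok_50` record whether the two recursions hold at every state x and index n of this example.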

The dual orthogonal system when u is orthonormal is:

\sum_{{n^{+} : | n^{+} | = N}} Q_{n^{+}} (x; u) Q_{n^{+}} (y; u) {(\binom{N}{n^{+}})}^{- 1} = m {(x, p)}^{- 1} δ_{x y}

(54)

A dual generating function is:

H (n, v, u) = \sum_{{x : | x | = N}} {(\binom{N}{n^{+}})}^{- 1} (\binom{N}{x}) Q_{n} (x; u) \prod_{i = 1}^{d} v_{i}^{x_{i}} = \prod_{l = 0}^{d - 1} {(\sum_{j = 1}^{d} u_{j}^{(l)} v_{j})}^{n_{l}}

(55)

The generating function Equation (55) arises from considering the coefficient of

\prod_{i = 1}^{d - 1} w_{i}^{n_{i}}

in:

\sum_{{x : | x | = N}} {(\binom{N}{n^{+}})}^{- 1} (\binom{N}{x}) G (x, w, u) \prod_{i = 1}^{d} v_{i}^{x_{i}} = {(\binom{N}{n^{+}})}^{- 1} {(\sum_{j = 1}^{d} v_{j} + \sum_{l = 1}^{d - 1} w_{l} \sum_{j = 1}^{d} v_{j} u_{j}^{(l)})}^{N}

(56)

Theorem 7.

A dual recurrence system is, for

i = 0, \dots, d - 1

n_{i} Q_{n} (x; u) = \sum_{j, l = 1}^{d} x_{j} u_{j}^{(i)} u_{l}^{(i)} p_{l} Q_{n} (x - e_{j} + e_{l}; u)

(57)

Proof.

A derivation of the recurrence system uses a transform method. Consider:

\begin{matrix} \sum_{{n^{+} : | n^{+} | = N}} n_{i} E [\prod_{i = 1}^{d} ϕ_{i}^{X_{i}} φ_{i}^{Y_{i}} Q_{n} (X; u) Q_{n} (Y; u)] {(\binom{N}{n^{+}})}^{- 1} \\ = N T_{i} (ϕ) T_{i} (φ) {[\sum_{j = 1}^{d} p_{j} ϕ_{j} φ_{j}]}^{N - 1} \end{matrix}

(58)

Therefore, non-zero terms with

y = x - e_{j} + e_{l}

are:

\begin{matrix} \sum_{{n^{+} : | n^{+} | = N}} n_{i} Q_{n} (x; u) Q_{n} (y; u) {(\binom{N}{n^{+}})}^{- 1} & = & \frac{N (\binom{N - 1}{x - e_{j}}) \prod_{k = 1}^{d} p_{k}^{x_{k} - δ_{j k}} p_{j} u_{j}^{(i)} p_{l} u_{l}^{(i)}}{m (x, p) m (y, p)} \\ = & \frac{x_{j} u_{j}^{(i)} p_{l} u_{l}^{(i)}}{m (y, p)} \end{matrix}

(59)

The dual recurrence is therefore Equation (57). ☐
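The dual relations can be checked numerically on a small case. The sketch below is illustrative only: it assumes that Q_n(x; u) is the coefficient of \prod_{i} w_{i}^{n_{i}} in the generating function \prod_{j} (1 + \sum_{l} w_{l} u_{j}^{(l)})^{x_{j}}, builds one arbitrary orthonormal set u on p by a QR factorization (the values of p, N and d and all function names are choices made here, not taken from the paper), and then measures the largest deviation from the dual orthogonality relation (54), read with Q(x; u) Q(y; u) on the left, and from the dual recurrence (57).

```python
import itertools
from math import factorial

import numpy as np


def orthonormal_functions(p):
    """Return u with u[j, l] = u_j^{(l)}: columns orthonormal on p, u[:, 0] = 1."""
    d = len(p)
    a = np.eye(d)
    a[:, 0] = np.sqrt(p)              # first column spans the constant function
    q, _ = np.linalg.qr(a)            # columns orthonormal in the ordinary sense
    u = q / np.sqrt(p)[:, None]       # now sum_j p_j u_j^(k) u_j^(l) = delta_kl
    return u * np.sign(u[0, 0])       # fix the overall sign so that u^{(0)} = 1


def multinomial(counts):
    out = factorial(sum(counts))
    for c in counts:
        out //= factorial(c)
    return out


def krawtchouk(x, u):
    """Return {n: Q_n(x; u)} for n = (n_1, ..., n_{d-1}), from the generating function."""
    d = len(x)
    poly = {(0,) * (d - 1): 1.0}
    for j in range(d):
        for _ in range(x[j]):         # one factor (1 + sum_l w_l u_j^{(l)}) per ball
            new = {}
            for n, c in poly.items():
                new[n] = new.get(n, 0.0) + c
                for l in range(1, d):
                    m = n[:l - 1] + (n[l - 1] + 1,) + n[l:]
                    new[m] = new.get(m, 0.0) + c * u[j, l]
            poly = new
    return poly


p = np.array([0.2, 0.3, 0.5])         # illustrative cell probabilities
N, d = 3, 3
u = orthonormal_functions(p)
states = [x for x in itertools.product(range(N + 1), repeat=d) if sum(x) == N]
Q = {x: krawtchouk(x, u) for x in states}
indices = list(Q[states[0]])          # every n with |n| <= N


def m(x):
    """Multinomial probability m(x; p)."""
    return multinomial(x) * float(np.prod(p ** np.array(x)))


def n_plus(n):
    """n^+ = (N - |n|, n_1, ..., n_{d-1})."""
    return (N - sum(n),) + n


def shifted(x, j, l):
    """The point x - e_j + e_l."""
    y = list(x)
    y[j] -= 1
    y[l] += 1
    return tuple(y)


# Largest deviation from the dual orthogonality relation (54).
dual_orth_err = max(
    abs(sum(Q[x][n] * Q[y][n] / multinomial(n_plus(n)) for n in indices) - (x == y) / m(x))
    for x in states for y in states
)

# Largest deviation from the dual recurrence (57), over all x, n and i = 0, ..., d - 1.
dual_rec_err = max(
    abs(n_plus(n)[i] * Q[x][n]
        - sum(x[j] * u[j, i] * u[l, i] * p[l] * Q[shifted(x, j, l)][n]
              for j in range(d) if x[j] > 0 for l in range(d)))
    for x in states for n in indices for i in range(d)
)

print(dual_orth_err, dual_rec_err)
```

Both deviations should be at the level of floating-point rounding. The case i = 0 uses n_0 = N - |n| and u^{(0)} = 1.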

The reproducing kernel polynomials:

Q_{n} (x, y) = \sum_{{n : | n | = n}} {(\binom{N}{n})}^{- 1} Q_{n} (x; u) Q_{n} (y; u)

(60)

do not depend on which set of orthonormal functions u is used. They have an explicit form; see Diaconis and Griffiths [2] and Xu [7] for details.

3.1. An Ehrenfest Urn with d-Types

A d-type Ehrenfest urn has N balls of d colours

{1, \dots, d}

. At rate one, a ball is chosen, and if it is of type j, it is changed to colour l with probability

p_{j l}

,

l = 1, \dots, d

.

{X (t)}_{t \geq 0}

, with

| X (t) | = N

, is the number of balls of the different colours at time t, which can be regarded as a d-dimensional random walk on

| x | = N

. The transition functions have an eigenfunction expansion in the multivariate Krawtchouk polynomials, extending the case Equation (21) with two colours.

Theorem 8.

Let

{X (t)}_{t \geq 0}

be a d-dimensional random walk on x,

| x | = N

, where transitions are made from

x \to x - e_{j} + e_{l}

at rate

r (x, x - e_{j} + e_{l}) = (x_{j} / N) p_{j l}

. P is a

d \times d

transition matrix, with stationary distribution p, such that:

p_{j l} = p_{l} \{1 + \sum_{i = 1}^{d - 1} ρ_{i} u_{j}^{(i)} u_{l}^{(i)}\}

(61)

Then, the transition functions of

X (t)

have an eigenfunction expansion:

\begin{matrix} p (x, y; t) = m (y, p) \\ \times \{1 + \sum_{{n : 0 < | n | \leq N}} e^{- t \sum_{i = 1}^{d - 1} n_{i} (1 - ρ_{i}) / N} {(\binom{N}{n})}^{- 1} Q_{n} (x; u) Q_{n} (y; u)\} \end{matrix}

(62)

Proof.

{X (t)}_{t \geq 0}

is a reversible Markov process with stationary distribution

m (x; p)

, because it satisfies the balance equation:

m (x; p) r (x, x - e_{j} + e_{l}) = m (x - e_{j} + e_{l}; p) r (x - e_{j} + e_{l}, x)

(63)

The reversibility is a consequence of assuming that P is a reversible transition matrix. The generator of the process acting on

f (x)

is specified by:

L f (x) = \sum_{j, l} r (x, x - e_{j} + e_{l}) (f (x - e_{j} + e_{l}) - f (x))

(64)

so the eigenvalues and eigenvectors

(λ_{n}, g_{n} (x))

satisfy:

L g_{n} (x) = - λ_{n} g_{n} (x)

(65)

Now, from Equation (57):

- \sum_{i = 1}^{d - 1} (n_{i} (1 - ρ_{i}) / N) Q_{n} (x; u) = \sum_{j, l = 1}^{d} (x_{j} / N) p_{j l} Q_{n} (x - e_{j} + e_{l}; u) - Q_{n} (x; u)

(66)

which is the same as Equation (65), noting that the total rate is one away from x. Then, Equation (62) holds immediately. ☐
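The eigenvalues appearing in the exponent of Equation (62) can be compared with a direct numerical diagonalization of the generator (64). The following is a minimal sketch under stated assumptions: the values of p, ρ, N and d are arbitrary illustrative choices, the orthonormal functions u are one particular set obtained from a QR factorization, and the variable names are not from the paper.

```python
import itertools

import numpy as np

p = np.array([0.2, 0.3, 0.5])        # stationary distribution of P (illustrative)
rho = np.array([0.2, -0.1])          # rho_1, ..., rho_{d-1} (illustrative)
N, d = 3, len(p)

# Orthonormal functions on p: u[j, i] = u_j^{(i)}, with u^{(0)} constant.
a = np.eye(d)
a[:, 0] = np.sqrt(p)
q, _ = np.linalg.qr(a)
u = q / np.sqrt(p)[:, None]

# Transition matrix (61): p_jl = p_l (1 + sum_i rho_i u_j^{(i)} u_l^{(i)}).
P = p[None, :] * (1.0 + (u[:, 1:] * rho) @ u[:, 1:].T)

states = [x for x in itertools.product(range(N + 1), repeat=d) if sum(x) == N]
index = {x: k for k, x in enumerate(states)}

# Generator (64) with rates r(x, x - e_j + e_l) = (x_j / N) p_jl.
L = np.zeros((len(states), len(states)))
for x in states:
    for j in range(d):
        for l in range(d):
            if j != l and x[j] > 0:
                y = list(x)
                y[j] -= 1
                y[l] += 1
                rate = x[j] / N * P[j, l]
                L[index[x], index[tuple(y)]] += rate
                L[index[x], index[x]] -= rate

# Eigenvalues of the generator against -sum_i n_i (1 - rho_i) / N for |n| <= N.
computed = np.sort(np.linalg.eigvals(L).real)
predicted = np.sort([
    -sum(n_i * (1.0 - r_i) for n_i, r_i in zip(n, rho)) / N
    for n in itertools.product(range(N + 1), repeat=d - 1) if sum(n) <= N
])

print(np.max(np.abs(computed - predicted)))
```

The number of compositions x with |x| = N equals the number of indices n with |n| ≤ N, so the two sorted lists have the same length and can be compared entry by entry.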

3.2. Extensions to the Multivariate Krawtchouk Polynomials

It is useful in considering spectral expansions of composition Markov processes to allow the following generalizations of the multivariate Krawtchouk polynomials.

Allow $d = \infty$ as a possibility, and let ${u^{(j)}}_{j = 0}^{\infty}$ be a complete orthogonal set of functions on $p_{1}, p_{2}, \dots$ . The multinomial distribution is still well defined as:

$m (x; p) = \frac{N!}{x_{1}! x_{2}! \dots} p_{1}^{x_{1}} p_{2}^{x_{2}} \dots, | x | = N$

(67)

and the generating function for the multivariate Krawtchouk polynomials still holds with $d = \infty$ .
When $d = \infty$ , take ${u^{(j)}}_{j = 1}^{\infty}$ to be orthogonal on a discrete measure $π$ , which is non-negative, but not a probability measure, because $\sum_{i = 1}^{\infty} π_{i} = \infty$ .
Allow the basis functions u to be orthogonal on $π$ , and take the dual functions ${u_{i}^{(z)}}_{i = 0}^{\infty}$ to be orthogonal on a continuous distribution. An example that occurs naturally in composition birth and death chains is when $u_{i}^{(z)} = L_{i}^{(α)} (z)$ , $z \geq 0$ , $i = 0, 1, \dots$ are the Laguerre polynomials, orthogonal on the density:

$\frac{z^{α}}{Γ (α + 1)} e^{- z}, z > 0$

(68)

3.3. Karlin and McGregor Spectral Theory

Consider a birth and death process

{ξ (t)}_{t \geq 0}

on

{- 1, 0, 1, \dots}

with birth and death rates

λ_{i}, μ_{i}

from state i and transition probabilities

p_{i j} (t)

. Negative one is an absorbing state, which can be reached if

μ_{0} > 0

. We assume that the process is non-explosive, so only a finite number of events will take place in any finite time interval. Define orthogonal polynomials

{Q_{n} (z)}_{n = 0}^{\infty}

by:

- z Q_{n} (z) = - (λ_{n} + μ_{n}) Q_{n} (z) + λ_{n} Q_{n + 1} (z) + μ_{n} Q_{n - 1} (z)

(69)

for

n \in Z_{+}

with

Q_{0} = 1

and

Q_{- 1} = 0

. The polynomials are defined by recursion from Equation (69) with

Q_{n + 1}

defined by knowing

Q_{n}

and

Q_{n - 1}

. If

μ_{0} = 0

, then

Q_{n} (0) = 1

. There is a spectral measure ψ with support on the non-negative axis and total mass one, so that:

p_{i j} (t) = π_{j} \int_{0}^{\infty} e^{- z t} Q_{i} (z) Q_{j} (z) ψ (d z)

(70)

for

i, j = 0, 1, \dots

where:

π_{j} = \frac{λ_{0} \dots λ_{j - 1}}{μ_{1} \dots μ_{j}}

(71)

If

μ_{0} > 0

, then

\sum_{j = 0}^{\infty} p_{i j} (t) < 1

because of possible absorption into state

- 1

. If

μ_{0} = 0

, but there is no stationary distribution, because

\sum_{j = 0}^{\infty} π_{j} = \infty

, then also, possibly,

\sum_{j = 0}^{\infty} p_{i j} (t) < 1

. Placing

t = 0

shows the orthogonality of the polynomials

{Q_{i} (z)}_{i \geq 0}

on the measure ψ because

p_{i j} (0) = δ_{i j}

.

{ξ (t)}_{t \geq 0}

is clearly reversible with respect to

{π_{j}}_{j \geq 0}

when a stationary distribution exists, or before absorption at zero if it does not exist, since

π_{i} p_{i j} (t) = π_{j} p_{j i} (t)

. As

t \to \infty

the limit stationary distribution, if

μ_{0} = 0

and

\sum_{k = 0}^{\infty} π_{k} < \infty

, is:

p_{j} = \frac{π_{j}}{\sum_{k = 0}^{\infty} π_{k}} = π_{j} ψ ({0})

(72)

Suppose a stationary distribution exists, and there is a discrete spectrum with support

{ζ_{l}}_{l \geq 0}

,

ζ_{0} = 0

. Then:

\begin{matrix} p_{i j} (t) & = & π_{j} \sum_{l = 0}^{\infty} e^{- ζ_{l} t} Q_{i} (ζ_{l}) Q_{j} (ζ_{l}) ψ ({ζ_{l}}) \\ = & p_{j} \{1 + \sum_{l = 1}^{\infty} e^{- ζ_{l} t} Q_{i} (ζ_{l}) Q_{j} (ζ_{l}) ψ ({ζ_{l}}) / ψ ({0})\} \end{matrix}

(73)

This is an eigenfunction expansion:

p_{i j} (t) = p_{j} \{1 + \sum_{l \geq 1} e^{- ζ_{l} t} u_{i}^{(l)} u_{j}^{(l)}\}, i, j = 0, 1 \dots

(74)

where u is a set of orthonormal functions on p defined by:

u_{i}^{(l)} = Q_{i} (ζ_{l}) \sqrt{ψ ({ζ_{l}}) / ψ ({0})}, i, l = 0, 1, \dots

(75)

Several well-known birth and death processes give rise to classical orthogonal polynomial systems. In this paper, only processes where

μ_{0} = 0

are considered, so there is no absorbing state at

- 1

, and the state space is

{0, 1, \dots}

. Classical papers in which the theory is developed and particular spectral expansions are derived are Karlin and McGregor [9,10,11,18]. Schoutens [19] details the birth and death processes and spectral expansions nicely, and we summarize from that account.

The $M / M / \infty$ queue where $λ_{n} = λ$ , $μ_{n} = n μ$ , $n \geq 0$ . The process has a stationary Poisson distribution:

$p_{j} = e^{- λ / μ} {(λ / μ)}^{j} / j!, j = 0, 1, \dots$

(76)

The orthogonal polynomials are the Poisson–Charlier polynomials:

$Q_{n} (z) = C_{n} (z / μ; λ / μ), n \geq 0$

(77)

where ${C_{n} (z; ν)}_{n = 0}^{\infty}$ has a generating function:

$\sum_{n = 0}^{\infty} C_{n} (z; ν) \frac{w^{n}}{n!} = e^{w} {(1 - w / ν)}^{z}$

(78)
The linear birth and death process where $λ_{n} = (n + β) λ$ , $μ_{n} = n μ$ , with $λ, μ, β > 0$ . The process arises when individuals split at rate λ and die at rate μ, and immigration of individuals occurs at rate $λ β$ . Then:

$π_{j} = \frac{β_{(j)}}{j!} {(\frac{λ}{μ})}^{j}, j = 0, 1, \dots$

(79)
There are three cases to consider.
- $λ < μ$ . The spectral polynomials are related to the Meixner polynomials by:
  
  $Q_{n} (z) = M_{n} (\frac{z}{μ - λ}; β, \frac{λ}{μ}), n = 0, 1, \dots$
  
  (80)
  
  The polynomials are orthogonal on:
  
  ${(1 - \frac{λ}{μ})}^{β} \frac{β_{(z)}}{z!} {(\frac{λ}{μ})}^{z}, z = 0, 1, \dots$
  
  (81)
  
  at points $(μ - λ) z$ , $z = 0, 1, \dots$ . The first point of increase is zero corresponding to $e^{0 t} = 1$ in the spectrum. There is a negative binomial stationary distribution for the process:
  
  $p_{i} = {(1 - \frac{λ}{μ})}^{β} \frac{β_{(i)}}{i!} {(\frac{λ}{μ})}^{i}, i = 0, 1, \dots$
  
  (82)
  
  The Meixner polynomials have a generating function:
  
  $\sum_{n = 0}^{\infty} M_{n} (x; a, q) \frac{a_{(n)}}{n!} z^{n} = {(1 - q^{- 1} z)}^{x} {(1 - z)}^{- x - a}$
  
  (83)
- $λ > μ$ . The spectral polynomials are again related to the Meixner polynomials, by:
  
  $Q_{n} (z) = {(\frac{λ}{μ})}^{n} M_{n} (\frac{z}{λ - μ} - β; β, \frac{μ}{λ}), n = 0, 1, \dots$
  
  (84)
  
  The polynomials are orthogonal on:
  
  ${(1 - \frac{μ}{λ})}^{β} \frac{β_{(z)}}{z!} {(\frac{μ}{λ})}^{z}, z = 0, 1, \dots$
  
  (85)
  
  at points $(z + β) (λ - μ)$ , $z = 0, 1, \dots$ . The first point of increase is $β (λ - μ)$ , corresponding to a spectral term $e^{- β (λ - μ) t}$ . There is not a stationary distribution for the process in this case, with $\sum_{j = 0}^{\infty} π_{j} = \infty$ .
- $λ = μ$ . The spectral polynomials are related to the Laguerre polynomials by:
  
  $Q_{n} (z) = \frac{n!}{β_{(n)}} L_{n}^{(β - 1)} (z / λ), n \geq 0$
  
  (86)
  
  In this case, there is a continuous spectrum, and the polynomials are orthogonal on the gamma distribution:
  
  $\frac{1}{λ^{β} Γ (β)} z^{β - 1} e^{- z / λ}, z > 0$
  
  (87)
  
  There is no stationary distribution of the process in this case. The Laguerre polynomials have a generating function:
  
  $\sum_{n = 0}^{\infty} L_{n}^{(β - 1)} (x) z^{n} = {(1 - z)}^{- β} \exp \{- x z / (1 - z)\}$
  
  (88)
A two-urn model with $λ_{n} = (N - n) (a - n)$ , $μ_{n} = n (b - (N - n))$ , $n = 0, 1, \dots, N$ , $a, b \geq N$ . The process arises from a model with two urns with a and b balls, with N tagged balls. At an event, two balls are chosen at random from the urns and interchanged. The state of the process is the number of tagged balls in the first urn. The spectral polynomials are related to the dual Hahn polynomials by:

$Q_{n} (z) = R_{n} (λ (z); a, b, N), n = 0, 1, \dots$

(89)

where:

$R_{n} (λ (z); a, b, N) = {}_{3} F_{2} (- n, - z, z - a - b - 1; - a, - N; 1)$

(90)

orthogonal on:

$\frac{(\binom{N - b - 1}{N}) N! N_{[z]} a_{[z]} (2 z - a - b - 1)}{z! b_{[z]} {(z - a - b - 1)}_{(N + 1)}}$

(91)

with $λ (z) = z (z - a - b - 1)$ . There is a hypergeometric stationary distribution in the process of:

$p_{i} = \frac{(\binom{a}{i}) (\binom{b}{N - i})}{(\binom{a + b}{N})}, i = 0, 1, \dots, N$

(92)
An Ehrenfest urn where $λ_{n} = (N - n) p$ , $μ_{n} = n q$ , $0 \leq n \leq N$ , $0 < p < 1$ and $q = 1 - p$ . The spectral polynomials are the Krawtchouk polynomials:

$Q_{n} (z) = K_{n} (z; N, p), 0 \leq n \leq N$

(93)

orthogonal on the Binomial $(N, p)$ distribution:

$(\binom{N}{z}) p^{z} q^{N - z}, z = 0, 1, \dots N$

(94)

which is also the stationary distribution in the process.
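For the Ehrenfest urn, the spectral representation (70) can be verified directly, since the spectrum is finite. The sketch below is illustrative and rests on stated assumptions: the values N = 4 and p = 0.3 are arbitrary, the spectral points are taken to be z = 0, 1, ..., N with the Binomial weights (94), and the polynomials are generated from the recurrence (69) rather than from a closed form. Names such as `transition` are chosen here for the example.

```python
from math import comb

import numpy as np

N, p = 4, 0.3                         # illustrative values
q = 1.0 - p
n = np.arange(N + 1)
lam = (N - n) * p                     # birth rates λ_n
mu = n * q                            # death rates μ_n, with μ_0 = 0
z = n.astype(float)                   # spectral points 0, 1, ..., N
psi = np.array([comb(N, k) * p**k * q**(N - k) for k in range(N + 1)])  # weights (94)
pi = np.array([comb(N, k) * (p / q)**k for k in range(N + 1)])          # π_j from (71)

# Q[n, k] = Q_n(z_k), built from the three-term recurrence (69).
Q = np.zeros((N + 1, N + 1))
Q[0] = 1.0
for k in range(N):
    previous = Q[k - 1] if k > 0 else 0.0
    Q[k + 1] = ((lam[k] + mu[k] - z) * Q[k] - mu[k] * previous) / lam[k]


def transition(t):
    """Matrix of p_ij(t) computed from the spectral representation (70)."""
    return ((Q * (np.exp(-z * t) * psi)) @ Q.T) * pi[None, :]


# Generator of the birth and death process, for comparison.
G = np.diag(lam[:-1], 1) + np.diag(mu[1:], -1) - np.diag(lam + mu)

# Derivative of (70) at t = 0, which should reproduce the generator.
derivative_at_zero = -((Q * (z * psi)) @ Q.T) * pi[None, :]

print(np.max(np.abs(transition(0.0) - np.eye(N + 1))))
print(np.max(np.abs(derivative_at_zero - G)))
```

Setting t = 0 recovers the orthogonality of the polynomials on ψ, and differentiating at t = 0 recovers the birth and death rates, which together identify (70) as the transition function of the process in this finite case.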

3.4. Composition Birth and Death Processes

Consider N independent and identically distributed birth and death processes

{ξ_{i} (t)}_{t \geq 0}

,

i = 1, \dots, N

, each with state space

0, 1, \dots

. It is assumed that there is no absorbing state at

- 1

and

λ_{0} > 0

. The transition functions for the labelled processes are

p_{i j} (t) := \prod_{k = 1}^{N} p_{i_{k}, j_{k}} (t)

. In composition Markov processes, interest is in the unlabelled configuration of

ξ (t)

specified by

X (t)

, where:

X_{k} (t) = | {i_{j} = k, j = 1, \dots, N} |

(95)

for

k = 0, 1 \dots

. The probability generating function of

X (t)

conditional on

X (0) = x

is:

E [\prod_{k = 0}^{d} s_{k}^{X_{k} (t)}] = \prod_{i = 0}^{d} {(\sum_{j = 0}^{d} p_{i j} (t) s_{j})}^{x_{i}}

(96)

where possibly, there is a countable infinity of states with

d = \infty

. Transitions and rates are, for

j = 0, 1, \dots

x \to \{\begin{matrix} x + e_{j + 1} - e_{j} & rate x_{j} λ_{j}, \\ x + e_{j - 1} - e_{j} & rate x_{j} μ_{j} . \end{matrix}

(97)

The total rate from x is

\sum_{j \geq 0} x_{j} (λ_{j} + μ_{j})

.

{X (t)}_{t \geq 0}

is reversible with respect to

\tilde{m} (x; π) = (\binom{N}{x}) \prod_{j = 1}^{d} π_{j}^{x_{j}}

in the sense that:

\begin{matrix} \tilde{m} (x; π) λ_{j} x_{j} & = & \tilde{m} (x - e_{j} + e_{j + 1}; π) μ_{j + 1} (x_{j + 1} + 1), j = 0, 1, \dots \\ \tilde{m} (x; π) μ_{j} x_{j} & = & \tilde{m} (x - e_{j} + e_{j - 1}; π) λ_{j - 1} (x_{j - 1} + 1), j = 1, 2, \dots . \end{matrix}

(98)
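The reversibility can be checked numerically by writing the reverse move out explicitly: a birth from level j takes x to y = x - e_{j} + e_{j + 1} at rate x_{j} λ_{j}, and the reverse death takes y back to x at rate y_{j + 1} μ_{j + 1}. The sketch below compares the two fluxes under \tilde{m}. The rates, N and the three-level state space are arbitrary illustrative choices, with λ_{2} = 0 assumed only to keep the state space finite.

```python
import itertools
from math import factorial

import numpy as np

lam = np.array([1.0, 0.5, 0.0])      # λ_0, λ_1, λ_2; λ_2 = 0 caps the state space
mu = np.array([0.0, 0.7, 1.3])       # μ_0 = 0, μ_1, μ_2
N, d = 3, len(lam)

# π_j = λ_0 ... λ_{j-1} / (μ_1 ... μ_j), with π_0 = 1.
pi = np.ones(d)
for j in range(1, d):
    pi[j] = pi[j - 1] * lam[j - 1] / mu[j]


def m_tilde(x):
    """Unnormalized multinomial weight: multinomial coefficient times prod_j π_j^{x_j}."""
    coeff = factorial(N)
    for c in x:
        coeff //= factorial(c)
    return coeff * float(np.prod(pi ** np.array(x)))


states = [x for x in itertools.product(range(N + 1), repeat=d) if sum(x) == N]

worst = 0.0
for x in states:
    for j in range(d - 1):
        if x[j] > 0:
            y = list(x)
            y[j] -= 1
            y[j + 1] += 1
            forward = m_tilde(x) * x[j] * lam[j]                  # flux x -> y by a birth
            backward = m_tilde(tuple(y)) * y[j + 1] * mu[j + 1]   # flux y -> x by a death
            worst = max(worst, abs(forward - backward))

print(worst)
```

Every pair of neighbouring states is joined by one birth and the matching death, so checking the birth direction from each x covers both balance relations.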

Theorem 9.

If the spectrum is discrete, with support

{ζ_{l}}_{l \geq 0}

,

μ_{0} = 0

,

ζ_{0} = 0

, and a stationary distribution exists, then:

p (x, y; t) = m (y, p) \{1 + \sum_{{n : 0 < | n | \leq N}} e^{- t \sum_{i \geq 1} n_{i} ζ_{i}} {(\binom{N}{n})}^{- 1} Q_{n} (x; u) Q_{n} (y; u)\}

(99)

where

{Q_{n} (x; u)}

are the multivariate Krawtchouk polynomials with:

u_{i}^{(l)} = Q_{i} (ζ_{l}) \sqrt{ψ (ζ_{l}) / ψ (0)}, i, l = 0, 1, \dots .

(100)

The indexing in elements of

x, y

now begins at zero. If the spectrum is discrete, with support

{ζ_{l}}_{l \geq 0}

,

μ_{0} = 0

, then:

p (x, y; t) = \tilde{m} (y; π) \sum_{{n : 0 \leq | n | \leq N}} e^{- t \sum_{i \geq 0} n_{i} ζ_{i}} {(\binom{N}{n})}^{- 1} Q_{n} (x; u) Q_{n} (y; u)

(101)

where

{Q_{n} (x; u)}

are the multivariate Krawtchouk polynomials with:

u_{i}^{(l)} = Q_{i} (ζ_{l}) \sqrt{ψ (ζ_{l})}, i, l = 0, 1, \dots .

(102)

In this case,

ζ_{0} > 0

,

u^{(0)}

is not identically one, and:

\sum_{i \geq 0} u_{i}^{(k)} u_{i}^{(l)} π_{i} = δ_{k l}, k, l = 0, 1, \dots .

(103)

This covers the case when a stationary distribution does exist and also when a stationary distribution does not exist, because

\sum_{k = 0}^{\infty} π_{k} = \infty

.

Proof.

The probabilistic structure of

{X (t)}_{t \geq 0}

with probability-generating function Equation (96) implies that the multivariate Krawtchouk polynomials are the eigenfunctions of the transition distribution. Indexing in

X (t)

is from zero, rather than the usual indexing from one. From the Karlin and McGregor spectral expansion Equation (70):

\begin{matrix} p_{i j} (t) & = & π_{j} \{\sum_{k \geq 0} e^{- t ζ_{k}} Q_{i} (ζ_{k}) Q_{j} (ζ_{k}) ψ ({ζ_{k}})\} \\ p_{i j} (t) & = & ψ ({0}) π_{j} \{1 + \sum_{k \geq 1} e^{- t ζ_{k}} Q_{i} (ζ_{k}) Q_{j} (ζ_{k}) ψ ({ζ_{k}}) / ψ ({0})\} \\ = & p_{j} \{1 + \sum_{k \geq 1} e^{- t ζ_{k}} u_{i}^{(k)} u_{j}^{(k)}\} \end{matrix}

(104)

where

{u_{i}^{(k)}}

is defined in Equation (100) and satisfies:

\sum_{i \geq 0} u_{i}^{(k)} u_{i}^{(l)} p_{i} = δ_{k l}, k, l \geq 0

(105)

The second case Equation (101) follows similarly. The multivariate Krawtchouk polynomials then have a generating function:

G (x, w, u) = \prod_{j \geq 0} {(u_{j}^{(0)} + \sum_{l \geq 1}^{d - 1} w_{l} u_{j}^{(l)})}^{x_{j}} .

(106)

☐
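The exponents in Equation (99) say that the spectrum of the composition process consists of the sums \sum_{i \geq 1} n_{i} ζ_{i} over indices n with | n | \leq N. A small numerical comparison is sketched below, assuming a three-level birth and death process with arbitrary illustrative rates (λ_{2} = 0 is assumed only to make the state space finite) and N = 3 particles. The single-particle spectrum is computed numerically rather than taken from a closed form.

```python
import itertools

import numpy as np

lam = np.array([1.0, 0.5, 0.0])      # λ_0, λ_1, λ_2 (illustrative)
mu = np.array([0.0, 0.7, 1.3])       # μ_0 = 0, μ_1, μ_2 (illustrative)
N, d = 3, len(lam)

# Single-particle generator and its spectrum 0 = ζ_0 < ζ_1 < ζ_2.
G = np.diag(lam[:-1], 1) + np.diag(mu[1:], -1) - np.diag(lam + mu)
zeta = np.sort(-np.linalg.eigvals(G).real)

# Generator of the composition process with the transitions and rates (97).
states = [x for x in itertools.product(range(N + 1), repeat=d) if sum(x) == N]
index = {x: k for k, x in enumerate(states)}
A = np.zeros((len(states), len(states)))
for x in states:
    for j in range(d):
        for step, rate in ((1, lam[j]), (-1, mu[j])):
            if x[j] > 0 and rate > 0:
                y = list(x)
                y[j] -= 1
                y[j + step] += 1
                A[index[x], index[tuple(y)]] += x[j] * rate
                A[index[x], index[x]] -= x[j] * rate

# Spectrum of the composition process against the sums n_1 ζ_1 + n_2 ζ_2, |n| <= N.
computed = np.sort(-np.linalg.eigvals(A).real)
predicted = np.sort([
    sum(n_i * z_i for n_i, z_i in zip(n, zeta[1:]))
    for n in itertools.product(range(N + 1), repeat=d - 1) if sum(n) <= N
])

print(np.max(np.abs(computed - predicted)))
```

The agreement reflects the fact that the N particles move independently, so eigenvalues of the unlabelled process are sums of single-particle eigenvalues, one for each particle.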

The transition probability expansion Equation (101) can be written in a Karlin and McGregor spectral expansion form where the dual polynomials are important. Denote

{\tilde{u}}_{i}^{(l)} = Q_{i} (ζ_{l})

,

i, l = 0, 1, \dots

;

Q_{x} (ν; \tilde{u}) = {(\binom{N}{ν})}^{- 1} Q_{ν} (x; \tilde{u})

; and a multinomial spectral measure (which is a probability measure):

\tilde{m} (ν; ψ) = (\binom{N}{ν}) ψ {(ζ_{0})}^{ν_{0}} ψ {(ζ_{1})}^{ν_{1}} \dots, ν_{0} + ν_{1} + \dots = N

(107)

Then, Equation (101) can be expressed as a spectral expansion:

p (x, y; t) = \tilde{m} (y; π) \sum_{{ν; 0 \leq | ν | \leq N}} e^{- t \sum_{i \geq 0} ν_{i} ζ_{i}} Q_{x} (ν; \tilde{u}) Q_{y} (ν; \tilde{u}) \tilde{m} (ν; ψ)

(108)

The generating function of the dual polynomials:

\begin{matrix} H (n, v, \tilde{u}) & = & \sum_{{x : | x | = N}} {(\binom{N}{n^{+}})}^{- 1} (\binom{N}{x}) Q_{n} (x, \tilde{u}) \prod_{i \geq 0} v_{i}^{x_{i}} \\ = & \prod_{l \geq 0} {(v_{0} + \sum_{j \geq 1} Q_{j} (ζ_{l}) v_{j})}^{n_{l}} \\ = & \prod_{k = 1}^{N} (v_{0} + \sum_{j \geq 1} Q_{j} (Z_{k}) v_{j}) \end{matrix}

(109)

where in this generating function

n (Z)

is regarded as a random variable by taking:

n_{l} (Z) = | {Z_{k} : Z_{k} = ζ_{l}, k = 1, \dots, N} |

(110)

where

{Z_{k}}_{k = 1}^{N}

are independent and identically distributed random variables with probability measure ψ. Without loss of generality, take

v_{0} = 1

in Equation (109) and consider coefficients of

\prod_{i \geq 1} v_{i}^{x_{i}}

, indexing the dual polynomial by

(x_{1}, x_{2}, \dots)

with

x_{1} + x_{2} + \dots \leq N

. Note the scaling: the dual polynomial is one when

x_{i} = 0

,

i \geq 1

.

Theorem 10.

Define:

N_{j} = \sum_{k = 1}^{N} Q_{j} (Z_{k}) = \sum_{l \geq 0} n_{l} Q_{j} (ζ_{l}), j \geq 1

(111)

Then,

{(\binom{N}{n^{+}})}^{- 1} Q_{n} (x, \tilde{u})

is a polynomial of degree

x_{1} + x_{2} + \dots

in

{N_{j}}_{j \geq 1}

whose only term of maximal degree is

\prod_{j \geq 1} N_{j}^{x_{j}}

. The total degree in Z of the dual polynomial indexed by

(x_{1}, x_{2}, \dots)

is

\sum_{j \geq 1} j x_{j}

, with a single leading term of this degree.

Proof.

The proof of the first statement follows from Theorem 5. The proof of the second statement is immediate by knowing that

N_{j}

is of degree j in Z. ☐

The composition Markov chain in the third case of the linear birth and death process, where λ = μ, is interesting, as it has a continuous spectral measure, which is a product measure of N gamma distribution measures. The spectral polynomials are well defined by a generating function as coefficients of

\prod_{j = 1}^{\infty} v_{j}^{x_{j}}

in:

\prod_{k = 1}^{N} (1 + \sum_{j \geq 1} Q_{j} (Z_{k}) v_{j})

(112)

however, elements of

{Z_{k}}_{k = 1}^{N}

are distinct, being continuous random variables, and the dual of the dual system consists of products of dual Laguerre polynomials, which are not grouped by an index n, as they are when there is a discrete spectrum.

The polynomials in the Meixner class Equation (3) are additive in the sense that if

{Q_{m}^{N} (| z |)}

are the orthogonal polynomials on the distribution of

| Z |

, then the generating function for these polynomials is:

G^{N} (v, | z |) = h {(v)}^{N} e^{| z | u (v)} = \sum_{m = 0}^{\infty} Q_{m}^{N} (| z |) v^{m} / m!

(113)

and:

Q_{m}^{N} (| z |) = \sum_{{m : | m | = m}} (\binom{m}{m}) \prod_{j = 1}^{N} Q_{m_{j}} (z_{j})

(114)

This additivity implies an interesting identity.

Theorem 11.

The dual multivariate Krawtchouk polynomials with generating function Equation (109) satisfy the identity:

{(\binom{N}{n})}^{- 1} \sum_{{x : \sum_{j = 1}^{\infty} j x_{j} = m}} (\binom{N}{x}) \frac{m!}{\prod_{j = 1}^{\infty} {j!}^{x_{j}}} Q_{n} (x, \tilde{u}) = Q_{m}^{N} (| Z |)

(115)

where

x = (x_{1}, x_{2}, \dots)

. In this equation,

n = n (Z)

is regarded as a random variable in the sense of Equation (110).

Proof.

Set

v_{j} = v^{j} / j!

,

j = 0, 1, \dots

in Equation (109). Then:

\begin{matrix} \sum_{{x : | x | = N}} {(\binom{N}{n^{+}})}^{- 1} (\binom{N}{x}) Q_{n} (x, \tilde{u}) \frac{v^{\sum_{j = 1}^{\infty} j x_{j}}}{\prod_{j = 1}^{\infty} {j!}^{x_{j}}} & = & \prod_{k = 1}^{N} (\sum_{j \geq 0} Q_{j} (Z_{k}) v^{j} / j!) \\ = & h {(v)}^{N} e^{| Z | u (v)} \\ = & \sum_{m = 0}^{\infty} Q_{m}^{N} (| Z |) v^{m} / m! \end{matrix}

(116)

The theorem then follows by equating coefficients of

v^{m}

on both sides of the generating function. ☐

References

1. Griffiths, R.C. Orthogonal polynomials on the multinomial distribution. Aust. J. Stat. 1971, 13, 27–35.
2. Diaconis, P.; Griffiths, R.C. An introduction to multivariate Krawtchouk polynomials and their applications. J. Stat. Plan. Inference 2014, 154, 39–53.
3. Genest, V.X.; Vinet, L.; Zhedanov, A. The multivariate Krawtchouk polynomials as matrix elements of the rotation group representations on oscillator states. J. Phys. A Math. Theor. 2013, 46, 505203.
4. Grunbaum, F.; Rahman, M. A system of multivariable Krawtchouk polynomials and a probabilistic application. SIGMA 2011, 7, 119–136.
5. Iliev, P. A Lie-theoretic interpretation of multivariate hypergeometric polynomials. Compos. Math. 2012, 148, 991–1002.
6. Mizukawa, H. Orthogonality relations for multivariate Krawtchouk polynomials. SIGMA 2011, 7, 17–22.
7. Xu, Y. Hahn, Jacobi, and Krawtchouk polynomials of several variables. J. Approx. Theory 2015, 195, 19–42.
8. Zhou, H.; Lange, K. Composition Markov chains of multinomial type. Adv. Appl. Probab. 2009, 41, 270–291.
9. Karlin, S.; McGregor, J.L. The differential equations of birth-and-death processes, and the Stieltjes moment problem. Trans. Am. Math. Soc. 1957, 85, 489–546.
10. Karlin, S.; McGregor, J.L. The classification of birth and death processes. Trans. Am. Math. Soc. 1957, 86, 366–400.
11. Karlin, S.; McGregor, J.L. Linear growth, birth and death processes. J. Math. Mech. 1958, 7, 643–662.
12. Meixner, J. Orthogonale Polynomsysteme mit einer besonderen Gestalt der erzeugenden Funktion. J. Lond. Math. Soc. 1934, 9, 6–13.
13. Ismail, M.E.H. Classical and Quantum Orthogonal Polynomials in One Variable. In Encyclopedia of Mathematics and Its Applications; Cambridge University Press: Cambridge, UK, 2005; Volume 98.
14. Griffiths, R.C. Lancaster distributions and Markov chains with multivariate Poisson-Charlier, Meixner and Hermite-Chebycheff polynomial eigenfunctions. J. Approx. Theory 2016, 207, 139–164.
15. Diaconis, P.; Griffiths, R.C. Exchangeable pairs of Bernoulli random variables, Krawtchouk polynomials, and Ehrenfest urns. Aust. N. Z. J. Stat. 2012, 54, 81–101.
16. Khare, K.; Zhou, H. Rates of convergence of some multivariate Markov chains with polynomial eigenfunctions. Ann. Appl. Probab. 2009, 19, 737–777.
17. Lancaster, H. The Chi-Squared Distribution; John Wiley & Sons: New York, NY, USA, 1969.
18. Karlin, S.; McGregor, J.L. Ehrenfest urn models. J. Appl. Probab. 1965, 19, 477–487.
19. Schoutens, W. Stochastic Processes and Orthogonal Polynomials; Lecture Notes in Mathematics 146; Springer-Verlag: New York, NY, USA, 2000.

© 2016 by the author; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).
