1. Introduction
Given
and discrete random vector
, the set function
defined by
is called the entropy function of
, where
and
by convention. We also say
characterizes
, or
is the characterizing random vector of
. An entropy function
can be considered as a vector in the entropy space
. (For a set
A and a finite set
B,
denotes the
Cartesian product of
A with each coordinate indexed by
. When
is a field,
is a
-dimensional vector space over
with each coordinate indexed by
.) We say
and the vectors in it have order
n. The set of all entropy functions of order
n, denoted by
, is called the entropy region of order
n. The closure of
, denoted by
, is called almost entropic region. It is a convex cone [
1]. A vector
is called entropic if
, almost entropic if
, and non-entropic if
. Characterization of entropy functions, i.e., for a vector
, determining whether it is in
or
, is of fundamental importance in information theory.
For a vector
, if it is nonnegative, i.e.,
for all
, monotone, i.e.,
for all
, and submodular, i.e.,
for all
, the pair
is called a polymatroid, where
N is the ground set and
is the rank function of the polymatroid. For a polymatroid
, if
and
for all
,
is called a matroid. By frequent abuse of terminology, we do not distinguish a (poly)matroid and its rank function if there is no ambiguity. See
Section 2.1 for a more detailed discussion on matroids.
The set of all polymatroids in
, denoted by
, is called the polymatroidal region of order
n. It is proved in [
2] that any entropy function is a polymatroid, thus
is an outer bound of
. As
is closed, it is also an outer bound of
. As inequalities bounding
are equivalent to the nonnegativity of Shannon information measures, they are called Shannon-type information inequalities, and
is also called the Shannon outer bound of
and
. For more about entropy functions and information inequalities, readers are referred to [
3], (Chapter 13–15) [
4,
5].
It is well known that
when
due to the existence of non-Shannon-type inequalities, e.g., Zhang-Yeung inequality [
6]. However, though
, Zhang and Yeung also discovered that on an extreme ray of
, only countably many vectors are entropic, which implies that
, and therefore there exists a gap between
and
[
1]. Given a random vector
with
mutually independent and each of them the function of the other two, it is proved in [
1] that
must be uniformly distributed on a finite set, say
, thus
. On the other hand, for each integer
, they proved that polymatroid
with
is entropic: let
and
uniformly distributed on
and
, then
is the entropy function of
.
As the rank function of
is equal to
, Zhang-Yeung indeed proved that for any vector
on the ray
,
is entropic if and only if
for some positive integer
v. In [
7], Matúš proved that, for any extreme ray
of
containing a connected matroid
M with rank
,
is entropic only if
for some positive integer
v. However, on the other hand,
is not entropic for all positive integers. For example, we will see in
Section 4 that
is non-entropic when
.
Definition 1. For a connected matroid M with rank , we call the set of all positive integers v such that is entropic theprobabilistically (p-)characteristicset of M.
The term p-characteristic set of a matroid
M is first coined in [
7]. As discussed above,
, the set of all positive integers, and
. In this paper, we study the p-characteristic set of an arbitrary connected matroid with rank
.
Definition 2. For a connected matroid M with rank and a positive integer v, if , we call the entropy function a matroidal entropy function induced by M with degree v.
It can be seen in the proof of Zhang-Yeung, characterizing random vectors of matroidal entropy functions on
is constructed by the multiplication table of an additive group on
. It is not difficult to see that a random vector constructed by any quasigroup on
, or equivalently, a Latin square with symbols in
, or equivalently, an orthogonal array
, characterizes
. (See
Section 2.2 for the definition of an orthogonal array.) More generally, an
can be used to construct a characterizing random vector of the matroidal entropy function
with
. It is a natural question to ask whether such construction can be generalized to an arbitrary connected matroid
M with rank
? In [
8], partition-representations
of a matroid
with degree
v was defined, where each
is partition of a set
with cardinality
. See more details in
Section 3.1.2. Characterizing random vectors of
can be obtained by the uniform distributions on the blocks of
. In [
9], an equivalent definition in coding theory called almost affine code was defined. In this paper, in coordinate with the language in combinatorial design, we introduce variable strength orthogonal arrays(VOA) indexed by matroid
M and integer
, which is equivalent to a partition-representation of
M with degree
v and an
almost affine code. We denoted it by
. A
can be regarded as expanding the concept of orthogonal array. If a
exists, we will prove that the matroidal entropy function
is entropic and a characterizing random vector of
can be constructed by
. On the other hand, if
does not exist,
is non-entropic.
It is well known that orthogonal arrays with index unity in design theory are equivalent to maximum distance saperable (MDS) codes in coding theory. In discussions of our paper, we also see a more generalized equivalence, i.e., the equivalence between a VOA and an almost affine code. Thus, we review and develop the correspondences and equivalences in literatures such as [
8,
9] among four fields, i.e., information theory, matroid theory, combinatorial design, and coding theory, which may help them benefit from each other. In this paper, VOAs are also leveraged to characterize matroidal entropy functions induced by matroids of order
.
The rest of this paper is organized as follows.
Section 2 gives the preliminaries on matroid theory and orthogonal arrays. In
Section 3, we first define variable strength orthogonal arrays and show their equivalence to the partition representation of a matroid and almost affine codes. Then we characterize matroid entropy functions via variable strength orthogonal arrays. in
Section 4, we characterize matroidal entropy functions of order
. A discussion of the applications and further research is in
Section 5.
2. Preliminary
2.1. Matroids
There exist various cryptomorphic definitions of a matroid. In this paper we discuss matroid theory mainly from the perspective of rank functions. For a detailed treatment of matroid theory, readers are referred to [
10,
11]. In
Section 1, we defined matroids as special cases of polymatroids. Here we restate the definition in the following.
Definition 3. A matroid M is an ordered pair , where the ground set N is a finite set and the rank function is a set function on , and they satisfy the conditions that: for any ,
and ,
,
.
The value is called the rank of M.
With a slight abuse of terminology and notations, we do not distinguish a matroid and its rank function. So and may all denote the rank function of M when there is no ambiguity.
Definition 4. For integer and , the uniform matroid
with rank t and order n is defined by Given a matroid , for , if , element i is called a loop of M. For , if , we call A a parallel class. If , the parallel class is called non-trivial. A matroid is called simple if it contains no loops and no non-trivial classes. For a matroid M, if we delete its loops and in each non-trivial parallel class, we delete all elements but one, then we obtain a simple matroid . We call the simplification of M.
For a matroid , a nonempty is called a circuit with size of M if for any . It can be seen that any loop of M is a circuit of size 1 and a parallel pair is a circuit of size 2. For a uniform matroid , circuits are exactly those -subsets C of N. In particular, contains n loops, any two elements of are parallel, and the ground set of forms the unique circuit of .
Definition 5. A matroid is connected if any two elements in the ground set are contained in a circuit.
It is easy to be verified that any uniform matroid with is connected. This is because any is contained in a subset of N which is a circuit of .
An extreme ray R of a convex cone C is a subset of C and for any such that and , we have , where and for some .
Lemma 1. [12] A loopless matroid is connected if and only if M is contained in an extreme ray of . Each extreme ray of contains an integer-valued polymatroid, some of which are matroids. Such a matroid on an extreme ray is either a loopless connected matroid as stated in the above lemma, or a matroid obtained by adding loops to a connected matroids.
2.2. Orthogonal Arrays
Orthogonal array is a well studied topic in design theory. In this paper, orthogonal arrays are leveraged to characterize matroidal entropy functions. For a detailed treatment of this topic, readers are referred to [
13].
Definition 6. A array T with entries from is called an orthogonal array of strength t, factor n, level v and index λ if for any subarray of T, each t-tuple in occurs in the rows of exactly λ times. We call T an . When , we say such orthogonal array has index unity and call it an for short.
By the definition, for any , an is also an , where . In this paper, we only consider the strength of the orthogonal array largest possible.
An important research problem of orthogonal arrays is the existence of an
. The following lemmas state some results of this problem, in which Lemmas 2–4 can be found in Handbook [
14].
Lemma 2 ([
14], (III.7.16)).
There exists an for any . Lemma 3 ([
14], (III.3.28, III.3.39)).
For , an exists if and only if . The nonexistence of in Lemma 3 is the famous Euler’s 36 officer problem.
Lemma 4 ([
14], (III.3.28, III.3.36, III.3.39)).
An exists for all with three exceptions and one possible exception . Lemma 5. For , there does not exist an .
This lemma is a folklore in the combinatorial design community. For self-contain, we prove it in the following.
Proof. We prove the non-existence of for by contradiction. Assume there exists an , i.e., a array whose each subarray contains each 3-tuple in as a row exactly one time. By permuting the rows of , we obtain an such that the entries in the first rows and the 5-th column of are all 0. Let be the 5 columns of and be the vector consisting of the first entries in . Now consider the subarray with . As its rows are exactly all 3-tuples in and is a zero vetor, it can be seen that the rows of are exactly all 2-tuples in . Thus, forms an which contradicts Lemma 3. The non-existence of can be proved similarly.
For , assume such an array exists. As each subarray of contains each 3-tuple in as a row exactly one time, for each two rows of the rows of , their Hamming distance must be . Therefore, any two Hamming balls with center a row of and radius 1 are disjoint. As there are 27 such Hamming balls with each size 11, there are at least 5-tuples, which contradicts the fact that only 5-tuples exist. □
Lemma 6 ([
15]).
If and , then there is an . Lemma 7 ([
16]).
Let x be an arbitrary odd positive integer. Let g be an arbitrary positive integer whose prime power factors are all such that . Then- 1.
there is an with , if ;
- 2.
there is an with , if .
4. P-Characteristic Set of Matroids with Order
Rank 1 matroids of order
n are exactly those matroids containing
on
as a submatroid and other elements loops. Let
X be an arbitrary random variable. Let
be defined by
It can be seen that characterizes on the ray as long as we let .
Armed with the results of orthogonal arrays in
Section 2.2 and Theorem 1, we can characterize the matroidal entropy functions
for a connected matroid
M with rank
. In this section, we determine the p-characteristic set
for all connected matroids
with rank
and order
. For a disconnected matroid
M with each connected component
rank
,
is the intersections of all
. Thus, it is sufficient to consider connected matroids and take them as building blocks. It matches the fact that matroidal entropy functions indexed by a connected matroid live on an extreme rays of
(see Lemma 1), while those indexed by a disconnected matroid can be written as the sum of the matroidal entropy functions indexed by its connected components.
Among all connected matroids, we only need to consider those simple matroids since the p-characteristic set of a matroid is the same as its simplification. For a matroid and its simplification with , if characterizes , for each parallel class A, let where j is the only element in , and let be a constant if i is a loop of M. Then characterizes . On the other hand, if characterizes , by the reverse method, we obtain characterizing . Hence they have the same p-characteristic set.
Non-isomorphic simple matroids with order
is listed in [
17] (A simple matroid is also called a combinatorial geometry.). Here we consider connected simple matroids with rank
and order
. Before that we first consider
for general
. By Lemma 2 and Theorem 1, we have the following proposition.
When
, the case
is also proved by Zhang-Yeung [
1] as we discussed in
Section 1. As
is the only case we need to consider for matroids with oder
, in the following we discuss the cases for
and 5.
4.1.
For , besides , one more matroid we need to consider is . By Theorem 1 together and Lemma 3, we have the following propositions.
4.2.
For , besides , there are four more matroids we need to consider, namely, , , defined in Example 1 and defined in Example 2.
For , by Theorem 1 and Lemma 4, we have the following propositions.
Proposition 3. For , and .
For , by Theorem 1 and Lemmas 5–7, we have the following propositions.
Proposition 4. For , and , where and
- 1.
- 2.
is the set of such that
, if ;
, if
where x is an arbitrary odd positive integer, and g is an arbitrary positive integer whose prime power factors are all such that .
For , we give a in Example 1, thus . We will have in the following proposition on the existence of for an arbitrary .
Proof. For any , let be any 3-tuple in . Now given , let , , , and , we obtain a 5-tuple . Run out of all , it can be checked that the resulting 5-tuples form a . Since is arbitrary, the proposition holds. □
The following proposition determines the p-characteristic set of .
Proof. We prove that if there exist an , then there exists a , and vice versa. It implies that and hence the proposition.
Now assume there is an
with columns
. So each
is a
-vector. Let
be
-vectors defined as follows.
for each
and
. It can be checked that
form a
.
On the other hand, assume there is a with columns . As and . The fifth column of contains each times. Rearrange the rows of such that the first entries of are zeros, i.e., for . Let be -vectors and for . Then it can be checked that form an . □