A method to simulate fuzzy random variables
González-Rodrı́guez. G.1 , Colubi, A.1 , Gil, M.A.1 and Coppi, R.2
1
2
Dpto. de Estadı́stica e I.O. Universidad de Oviedo. 33007 Spain
{gil,colubi,magil}@uniovi.es
Dpto. di Statistica, Probabilit e Statistiche Applicate, Universit degli Studi di
Roma, ”La Sapienza” renato.coppi@uniroma1.it
In this paper a method is introduced to simulate fuzzy random variables by
using the support function. On the basis of the support function, the class
of values of a fuzzy random variable can be ‘identified’ with a closed convex
cone of a Hilbert space, and we now suggest to simulate Hilbert space-valued
random elements and to project later into such a cone. To make easier the
projection above we will consider isotonic regression. The procedure will be
illustrated by means of several examples.
1 Introduction
In the literature on fuzzy-valued random variables, there are only a few references to modeling the distribution of these random elements. These models
(for instance, see [12]) are theoretically well stated, but they are not soundly
supported by empirical evidence, since they correspond to quite restrictive
random mechanisms and hence they are not realistic in practice (see [4]).
Nevertheless, many probabilistic and statistical studies on fuzzy random
variables would be better developed if simulation studies could be carried out
(cf. [8], [9], [11]).
A similar situation arises in connection with functional data, to which a lot
of attention is being paid in the last years, especially in which concerns random elements taking on values in Hilbert spaces (see, for instance, [14], [15]).
The assumption of the Hilbert space structure is very helpful for simulation
purposes (see [7], [1] or [16]).
The key idea in the methodology to be presented is first based on passing
from the space of fuzzy random variable values into the Hilbert space of the
corresponding integrable functions through the support function; then, one
can generate Hilbert space-valued random elements and project them into
the convex cone of the image of the space of fuzzy values. The projection
theorem in Hilbert spaces validates the way to proceed and theoretically it
would be possible to simulate all possible distributions on the space.
2
González-Rodrı́guez. G., Colubi, A., Gil, M.A. and Coppi, R.
This idea can be easily implemented from a theoretical viewpoint. In practice, when fuzzy values to be dealt with are fuzzy sets of the one-dimensional
Euclidean space the implementation does not entail important difficulties,
since the support function of a fuzzy value is characterized by two real-valued
functions on the unit interval, namely, the one associated with the infima
and that associated with the suprema. These two functions are in the cone of
the monotonic functions, and they are subject to the constraint of the infimum being lower than the supremum for each level. They have been analyzed
in connection with some probabilistic problems (see [2]). However, for fuzzy
sets of multi-dimensional Euclidean spaces, the practical developments become much more complex, although some alternatives to simplify them will
be commented along the paper.
In this paper a procedure to simulate fuzzy random variables for which the
shape of fuzzy values is not constrained will be introduced. In case there are
some preferences on the shape of the considered fuzzy values the procedure
could also adapted.
2 Preliminaries
Let Kc (Rp ) be the class of the nonempty compact convex subsets of Rp
endowed with the Minkowski sum and the product by a scalar, that is,
A + B = {a + b | a ∈ A, b ∈ B} and λA = {λa | a ∈ A} for all A, B ∈ Kc (Rp )
and λ ∈ R. We will consider the class of fuzzy sets
¯
©
ª
Fc (Rp ) = U : Rp → [0, 1] ¯ Uα ∈ Kc (Rp ) for all α ∈ [0, 1]
where Uα is the α-level of U (i.e. Uα = {x ∈ Rp | U (x) ≥ α}) for all α ∈ (0, 1],
and U0 is the closure of the support of U . The space Fc (Rp ) can be endowed
with the sum and the product by a scalar based on Zadeh’s extension principle
[17], which satisfies that (U + V )α = Uα + Vα and (λU )α = λUα for all
U, V ∈ Fc (Rp ), λ ∈ R and α ∈ [0, 1].
The support function of a fuzzy set U ∈ Fc (Rp ) is sU (u, α) = supw∈Uα hu, wi
for any u ∈ Sp−1 and α ∈ [0, 1], where Sp−1 is the unit sphere in Rp and h·, ·i
denotes the inner product. The support function allows us to embed Fc (Rp )
onto a cone of the continuous and Lebesgue integrable functions L(Sp−1 ) by
means of the mapping s : Fc (Rp ) → L(Sp−1 × [0, 1]) where s(U ) = sU (see
[5]).
We will consider the generalized metric by Körner and Näther [10] DK ,
which is defined so that
2
[DK (U, V )] =
Z
¡
¢¡
¢
sU (u, α) − sV (u, α) sU (v, β) − sV (v, β) dK(u, α, v, β),
(Sp−1 )2 ×[0,1]2
for all U, V ∈ Fc (Rp ), where K is a positive definite and symmetric kernel;
thus, DK coincides with a generic L2 distance k · k2 on the Hilbert space
L(Sp−1 × [0, 1]).
A method to simulate fuzzy random variables
3
Let (Ω, A, P ) be a probability space. A fuzzy random variable (FRV) in
p
Puri & Ralescu’s sense [13] is a mapping X : Ω → Fc (R
¡ ) so¢that the α-level
p
mappings Xα : Ω → Kc (R ), defined so that Xα (ω) = X (ω) α for all ω ∈ Ω,
are random sets (that is, Borel-measurable mappings with the Borel σ-field
generated by the topology associated with the well-known Hausdorff metric
dH on K(Rp )). Alternatively, an FRV is an Fc (Rp )-valued random element
(i.e. a Borel-measurable mapping) when the Skorokhod metric is considered
on Fc (Rp ) (see [3]).
¡
¢
If X : Ω → Fc (Rp ) is a fuzzy random variable such that dH {0}, X0
∈ L1 (Ω, A, P ), then
value (or mean) of X is the unique E(X ) ∈
¡ the expected
¢
Fc (Rp ) such that E(X ) α = Aumman’s integral of the random set Xα for
all α ∈ [0, 1], that is,
¯
©
ª
¡
¢
E(X ) α = E(X|P ) ¯ X : Ω → Rp , X ∈ L1 (Ω, A, P ), X ∈ Xα a.s. [P ] .
3 Simulation of fuzzy random variables
through functional random variables
The space of fuzzy values Fc (Rp ) is a closed convex cone of the Hilbert space
L(Sp−1 × [0, 1]), and hence there exists a unique projection. As a consequence,
given an arbitrary f ∈ L(Sp−1 × [0, 1]) there is a unique fuzzy set P (f ) = Af
which corresponds to the anti-image of the support function of the projection
of f onto the cone s(Fc (Rp )). We will denote by P : L(Sp−1 × [0, 1]) →
s(Fc (Rp )) the projection function.
For any random element X taking on values in L(Sp−1 ×[0, 1]), the mapping
−
s 1 ◦ P ◦ X is a fuzzy random variable. In this way, if random elements of
L(Sp−1 ×[0, 1]) are generated, random elements of s(Fc (Rp )) could be obtained
through the projection P . Due to the fact that s(Fc (Rp )) ⊂ L(Sp−1 × [0, 1]),
we can guarantee that this method involves all the possible distributions on
s(Fc (Rp )) and, since s is an isometry, by applying s−1 we would get all the
possible distributions on Fc (Rp ).
The theoretical method to generate Fc (Rp )-valued fuzzy random variables
consists in
Step 1
Simulating random elements on L(Sp−1 × [0, 1]) by following the current
directions in Functional Data Analysis (i.e., by considering bases either
from a given function plus a noise term, or from discretized brownian
motions, and so on).
Step 2
Projecting the simulated elements into the isometric cone of Fc (Rp ).
Step 3
Identifying the fuzzy set associated with the generated support function.
4
González-Rodrı́guez. G., Colubi, A., Gil, M.A. and Coppi, R.
This theoretical method seems to be complex to implement in practice,
although it would be feasible in some particular cases. Thus, in case p = 1,
the unit sphere S p−1 reduces to the set {−1, 1} whence the fuzzy set A ∈
Fc (R) can be characterized by means of two monotonic functions sA (−1, ·) and
sA (1, ·) (see [2]) which satisfy certain constraints (since the infimum should
always be lower than the supremum).
To make the problem easy to handle, fuzzy values can be reparameterized
in terms of the left and right spreads with respect to the center of the 1level. Once fuzzy values are reparameterized in such a way, arbitrary functions
can be generated to construct later the function of the left spreads (for the
infima) and the function of the right spreads (for the suprema). Since these
two functions are monotonic and nonnegative, we can apply an algorithm
of the isotonic regression restricted to positive values (see [6]). Later, the
mid point of the 1-level would be generated at random and, along with the
spreads simulated before, the infimum and supremum functions defining the
fuzzy value would be obtained.
The ‘practical’ method to generate Fc (R)-valued fuzzy random variables
we suggest in this paper can be summarized as follows:
Step Fc (R)-1
To generate at random the mid-point of the 1-level, x0 , as well as two
random functions on the Hilbert space L([0, 1]), fl , fr : [0, 1] → R (there
is no need for these functions to be generated independently).
Step Fc (R)-2
To find the antitonic regressions of fl∗ and fr∗ to get the left and right
spreads sl , sr : [0, 1] → [0, ∞), respectively.
Step Fc (R)-3
The α-levels of the fuzzy value A generated through Steps Fc (R)-1 and
Fc (R)-2 would be given by Aα = [x0 − sl (α), x0 + sr (α)] (which is welldefined).
As we have commented before, the procedure above does not involve constraints on the shape of fuzzy values to be generated, although this type of
constraint (like, for instance, to assume that x0 is deterministic, functions fi
are linear functions, etc.) could be incorporated if required.
4 Some illustrative examples
We now illustrate the ideas in Section 3 by means of two examples. Since Steps
Fc (R)-2 and Fc (R)-3 do not involve any random process, the differences in
applying the algorithm are restricted to Step Fc (R)-1. There are many ways
of simulating random functions in the Hilbert space L([0, 1]). Some of them,
A method to simulate fuzzy random variables
5
as those based on a function plus a noise term or considering a class depending on real random parameters, can be easily imitated in Fc (R). However, the
Hilbert spaces present some distinguishing characteristics, such as the generating basis, that can be taken into account to simulate random elements in a
wider context.
In this section two ways of simulating from generating bases the functions
f1 and f2 in Step Fc (R)-1 of the above-described procedure are detailed.
Consider a referential triangular fuzzy set Tri(−1, 0, 1), which is equivalent
to consider the spread functions f1 (α) = f2 (α) = 1 − α for all α ∈ [0, 1]. Since
these spread functions correspond to linear functions, the trigonometric basis
will be suitable to represent them. This basis is given by
½
1
if j = 0
φj (x) = √
2 cos(πjx) if j = 1, 2, . . .
Coefficients of the spread functions in this basis are given by
.5
if j = 0
0√
if j is an even number
θj =
2 2
2 2 if j is an odd number
π j
For practical purposes we will consider the approximation of the function corresponding to the first 21 terms of the linear combination (i.e.,
j = 0, . . . , 20). Coefficients are distorted in a random way so that all the
generated random functions follow the expression
20
X
(θj + εj )φj
j=0
where (ε0 , . . . , ε20 ) is a random vector.
The way of distorting the coefficients is crucial, since small perturbations
can produce shapes completely different from the original one. It should be
recalled that, in order to get well-defined fuzzy sets, we will need to apply an
antitonic regression algorithm after the simulation of the functions in L([0, 1]).
Thus, if the simulated functions are highly variable (in the sense of showing
many monotonicity changes), the antitonic regression corresponding to the
spreads will have many constant parts, and hence the obtained fuzzy set will
present a lot of discontinuities. In order to illustrate this behaviour, we will
firstly consider the following:
Case A. For the left spread a sequence of independent realizations, εl0 , . . . , εl20 ,
are simulated from the normal distribution N (0, .01), and for the right spread
a sequence of independent realizations, εr0 , . . . , εr20 , are simulated from the
normal distribution N (0, .1). Thus, we get two random functions
6
González-Rodrı́guez. G., Colubi, A., Gil, M.A. and Coppi, R.
fl =
20
X
(θj + εlj )φj and fr =
j=0
20
X
(θj + εrj )φj .
j=0
The mid-point of the 1-level is chosen at random from a normal distribution N (2, 1). By applying Steps Fc (R)-2 and Fc (R)-3, a random fuzzy set is
obtained.
In order to compare some particular realizations of the simulated fuzzy random variable with the expected value of such an element, we have made 10,000
simulations and we have approximated the (fuzzy) mean value by Monte Carlo
method. In Figures 1 and 2 three simulated values and the corresponding mean
value are shown. We can see that, although the perturbations were chosen to
follow distributions with a relative small variability, the simulated fuzzy sets
are quite different from the referential triangular fuzzy number and have many
discontinuities. Nonetheless, the shape of the expected value is quite smooth
and more similar to the referential fuzzy number. The difference between this
mean value and the original triangular one is mainly due to the application
of the antitonic regression algorithm (the expected value of the antitonic regression can be different from the antitonic regression of the expected value).
1
1
1
.5
.5
.5
0
0
0
1
1.5
2
2.5
3
3.5
1.5
2
2.5
3
3.5
.5
4
1
1.5
2
2.5
3
Fig. 1. Simulated values in Case A
1
.5
0
.5
1
1.5
2
2.5
3
3.5
Fig. 2. Approximated mean value in the simulation in Case A
In order to obtain smoother shapes, we can simulate the perturbations in
the coefficients with a decreasing weight as follows.
l
Case B. For the left spread a sequence of independent realizations U0l , . . . , U20
from the uniform distribution U(0,1) are simulated, and the perturbations are
A method to simulate fuzzy random variables
7
considered so that εl0 = U0l , εlj = Ujl · εlj−1 . For the right spread the same process is followed but using the beta distribution β(5, 3) instead of the uniform
one. Again, the mid-point is chosen at random from a normal distribution
N (2, 1) and Steps Fc (R)-2 and Fc (R)-3 are followed to get the random fuzzy
set. In Figures 3 and 4 three simulated values and the corresponding mean
value (approximated by 10,000 realizations of the process) are shown. As expected, we can see smoother shapes than those in Case A, although they are
also quite different and the greater the magnitude of right perturbations the
greater the probability of discontinuities.
1
1
1
.5
.5
.5
0
0
0
-1 0
1
2
3
4
5
6
7
-3
8
-2
-1
0
1
2
-1.5
3
-.5
.5
1.5
2.5
3.5
Fig. 3. Simulated values in Case B
1
.5
0
-1
0
1
2
3
4
5
6
Fig. 4. Approximated mean value in the simulation in Case B
Acknowledgement
The research in this paper has been partially supported by the Spanish
Ministry of Education and Science Grant MTM2005-00045. Its financial support is gratefully acknowledged.
References
1. H. Cardot. Nonparametric regression for functional responses with application
to conditional functional principal components analysis. Technical Report LSP2005-01, Universite Paul Sabatier, 2005.
2. A. Colubi, J. S. Domı́nguez-Menchero, M. López-Dı́az, and R. Körner. A method
to derive strong laws of large numbers for random upper semicontinuous functions. Statist. Probab. Lett., 53:269–275, 2001.
8
González-Rodrı́guez. G., Colubi, A., Gil, M.A. and Coppi, R.
3. A. Colubi, J. S. Domı́nguez-Menchero, M. López-Dı́az, and D. A. Ralescu. A
de [0, 1]-representation of random upper semicontinuous functions. Proc. Amer.
Math. Soc., 130:3237–3242, 2002.
4. A. Colubi, C. Fernández-Garcı́a, M. López-Dı́az, , and M. A. Gil. Simulation
of random fuzzy variables: an empirical approach to statistical/probabilistic
studies with fuzzy experimental data. IEEE Trans. Fuzzy Syst., 10:384–390,
2002.
5. P. Diamond and P. Kloeden. Metric Spaces of Fuzzy Sets: Theory and Applications. World Scientific, Singapore, 1994.
6. J.S. Domı́nguez-Menchero and G. González-Rodrı́guez. Analyzing an extension
of the isotonic regression problem. 2006. (submitted).
7. R. Fraiman and G. Muñiz. Trimmed means for functional data. Test, 10:419–
440, 2001.
8. M.A. Gil, M. Montenegro, G. González-Rodrı́guez, A. Colubi, and M.R. Casals.
Bootstrap approach to the multi-sample test of means with imprecise data.
Comput. Statist. Data Anal., 2006. (accepted, in press).
9. G. González-Rodrı́guez, A. Colubi, and M.A. Gil. A fuzzy representation of
random variables: an operational tool in exploratory analysis and hypothesis
testing. Comput. Statist. Data Anal., 2006. (accepted, in press).
10. R. Körner and W. Näther. On the variance of random fuzzy variables. In
C. Bertoluzza, M.A. Gil, and D.A. Ralescu, editors, Statistical Modeling, Analysis and Management of Fuzzy Data, pages 22–39. Physica-Verlag, Heidelberg,
2002.
11. M. Montenegro, A. Colubi, M. R. Casals, and M. A. Gil. Asymptotic and
bootstrap techniques for testing the expected value of a fuzzy random variable.
Metrika, 59:31–49, 2004.
12. M. L. Puri and D. A. Ralescu. The concept of normality for fuzzy random
variables. Ann. Probab., 11:1373–1379, 1985.
13. M. L. Puri and D. A. Ralescu. Fuzzy random variables. J. Math. Anal. Appl.,
114:409–422, 1986.
14. J. O. Ramsay and B. W. Silverman. Functional data analysis. Springer Series
in Statistics. Springer–Verlag, New York, 1997.
15. J. O. Ramsay and B. W. Silverman. Applied functional data analysis. Methods
and case studies. Springer Series in Statistics. Springer–Verlag, New York, 2002.
16. F. Yao, H.-G. Müller, and J.-L. Wang. Functional data analysis for sparse
longitudinal data. J. Amer. Statist. Assoc., 100:577–590, 2005.
17. L.A. Zadeh. The concept of a linguistic variable and its application to approximate reasoning, II. Inform. Sci., 8:301–353, 1975.