Statistical inference on quantiles of two independent populations under uncertainty

Gholamreza Hesamian¹,
Nataliya Chukhrova² &
Arne Johannssen ORCID: orcid.org/0000-0002-4265-3996³

960 Accesses
Explore all metrics

Abstract

Statistical inference is the process of drawing conclusions about underlying population(s) using sample data to either confirm or falsify hypotheses. However, the complexity of real-life problems often makes the underlying statistical models inadequate, as information is often imprecise in many respects. To address this common problem, some papers have been published on modifications and extensions of test concepts by employing tools of fuzzy statistics. In this paper, we present a non-parametric test for the difference between quantiles of two independent populations based on fuzzy random variables. For this purpose, we consider the fuzzy quantile function and its estimation based on $\alpha $-values of fuzzy random variables. We then provide a fuzzy test based on the fuzzy empirical distribution function for the difference of fuzzy order statistics from these independent populations. We also suggest a specific degree-based criterion to compare the fuzzy test statistics at a specific significance level to decide whether the underlying fuzzy null hypothesis can be rejected or not. The effectiveness of the proposed two-sample test on quantiles is investigated via numerical examples.

Parametric testing statistical hypotheses for fuzzy random variables

Article Open access 05 February 2015

Testing statistical hypotheses for intuitionistic fuzzy data

Article 20 October 2018

Bootstrapped Kolmogorov-Smirnov Test for Epistemic Fuzzy Data

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Hypothesis testing regarding an assumption about the probability distribution of one random variable or a set of random variables is a main field of statistical inference. Those tests demand a well-defined modeling of the tested hypothesis, i.e., precisely stated requirements in relation to the distribution of the underlying random variable(s) (Chukhrova and Johannssen 2020a). In many practical applications, it is necessary to compare two independent populations concerning their central tendencies or other distribution-related criteria (O’Gorman 2004; Shi and Tao 2008; Taff 2018). With the aim to conclude whether the difference of interest between both these populations is significant or not, techniques of statistical inference are employed. Considering two-sample hypothesis tests, the populations are usually compared in terms of their central tendency measures (like the means) to conclude if there are significant differences in these populations. Although comparing two populations concerning their means is a common problem, there are situations where one needs to compare other characteristics of the populations such as quantiles (see, e.g., Hutson 2009; Viertl 2006; Kosorok 1999). When using quantiles (e.g., quartiles) instead of the means, the test decision is based on robust location parameters and outliers have no impact on the test decision (Chukhrova and Johannssen 2021b).

Further, censoring or truncation can complicate estimation of entire distribution functions and an examination of a collection of quantiles is a reasonable alternative (Gözde and Özdemir 2018). In addition, in many real-life applications (like psychology, biology, medicine, economics), the quantiles of the underlying characteristic variables are important boundaries for decision-making (Wang and Hettmansperger 1990; Farrell et al. 1997). For instance, the determination of the differences in the tails (by employing quantiles) is often of interest in such cases. But, the main advantage of non-parametric testing on quantiles compared to common tests such as Student’s t-test is that there is no need for additional assumptions related to homoscedasticity or normality of two population distributions, which are often not fulfilled in practical applications (for instance, when the distributions are asymmetric or irregularly shaped). That is, the results of Student’s t-test can be misleading when the assumptions are not satisfied. For these reasons, several non-parametric two-sample quantile-based tests have been developed (Heinzl and Mittlboeck 2017; Hutson 2009).

However, in situations, where point- or interval-valued formulations of hypotheses appear too rigid for real-life problems, the above limitations make the practitioner do decision procedures in a non-reasonable way. In this case, common statistical inference techniques are inappropriate for testing a hypothesis. Moreover, there are many situations in practical applications where the observations cannot be measured as crisp quantities, because information is often imprecise, incomplete, linguistic, noisy, qualitative, or vague (Chukhrova and Johannssen 2019). In contrast, fuzzy hypothesis testing provides a more realistic framework for such hypothesis testing problems, as fuzzy set theory is a natural tool for modeling and analyzing subjective and imprecise concepts. A fuzzy hypothesis allows for a more appropriate treatment of the unknown parameter(s), i.e., instead of specifying its (their) hypothesized values over a crisp interval, it allows, for example, specifying a smooth transition from “preferred” to “non-preferred” or from “possible” to “impossible” values in terms of an appropriate modeled membership function (Chukhrova and Johannssen 2020a). Furthermore, hypothesis testing in fuzzy environments facilitates to add available expert knowledge to the test procedure, taking into consideration the economic context or possibilistic aspects. Therefore, fuzzy modeling approaches provide appropriate techniques for dealing with those various types of uncertain information (Chukhrova and Johannssen 2021a).

Through the years, various techniques have been developed for testing hypotheses based on fuzzy information. Such methods can be decomposed in parametric tests (Arnold 1998; Filzmoser and Viertl 2004; Chen and Chang 2020; Haktanir and Kahraman 2019; Hesamian and Akbari 2021; Hryniewicz 2006b; Mylonas and Papadopoulos 2021; Parchami 2020; Rodríguez et al. 2006; Viertl 2006; Wu 2005; Akbari and Rezaei 2010; Chukhrova and Johannssen 2020a, b, c; Hesamian and Shams 2016; Montenegro et al. 2004; Kahraman et al. 2004) and non-parametric tests (Chukhrova and Johannssen 2021b, c, 2022; Grzegorzewski 2000; Denoeux et al. 2005; Gil et al. 2006; Grzegorzewski 2004, 2009, 2020; Hesamian and Chachi 2015; Hesamian and Taheri 2013; Hryniewicz 2006a; Lin et al. 2010; Hesamian and Taheri 2013; Kahraman et al. 2004). Further, Akbari and Rezaei (2010); Chukhrova and Johannssen (2020a, 2020b, 2020c, 2021b, 2021c, 2022, 2023); Denoeux et al. (2005); Filzmoser and Viertl (2004); Grzegorzewski (2000); Hesamian and Shams (2016); Hesamian and Taheri (2013); Hryniewicz (2006b); Montenegro et al. (2004); Rodríguez et al. (2006); Viertl (2006); Wu (2005); Kahraman et al. (2004) proposed one-sample tests, while Denoeux et al. (2005); Gajivaradhan and Parthiban (2015); Grzegorzewski (2005); Hesamian and Chachi (2015); Montenegro et al. (2001); Kahraman et al. (2004) discussed two-sample tests. Some of these approaches mainly rely on fuzzy data and exact hypotheses (Hesamian and Akbari 2021; Parchami 2020; Kahraman et al. 2004; Grzegorzewski 2000; Denoeux et al. 2005; Grzegorzewski 2004, 2009; Filzmoser and Viertl 2004; Chen and Chang 2020; Haktanir and Kahraman 2019; Hryniewicz 2006b; Mylonas and Papadopoulos 2021; Rodríguez et al. 2006; Viertl 2006; Akbari and Rezaei 2010; Chukhrova and Johannssen 2020a, b; Montenegro et al. 2004; Grzegorzewski 2020; Hryniewicz 2006a; Lin et al. 2010), fuzzy data and fuzzy hypotheses (Wu 2005; Hesamian and Shams 2016; Gil et al. 2006; Hesamian and Chachi 2015; Hesamian and Taheri 2013) or crisp data and fuzzy hypotheses (Arnold 1998; Chukhrova and Johannssen 2020a, b, c, 2021c). In addition, several studies rely on univariate hypothesis testing with parametric or non-parametric intuitionistic fuzzy information (Akbari and Hesamian 2019a, b; Hesamian and Akbari 2017; Zainali et al. 2014). For a comprehensive review on the topic of fuzzy hypothesis testing, we refer to Chukhrova and Johannssen (2021a).

Considering previous fuzzy non-parametric tests for the two-sample case, they essentially rely on comparing fuzzy medians. As comparing any quantiles of two populations is an important issue, it is necessary to develop a methodology to compare quantiles of two populations based on fuzzy data. In this study, therefore, we introduce a new idea of non-parametric testing for comparing fuzzy quantiles of two independent populations based on fuzzy random variables. Since the observed data are fuzzy quantities, it is a natural step to consider the components of the population such as distribution function, order statistics and quantile functions as fuzzy quantities as well. In this regard, we extend the concept of the fuzzy quantile function of fuzzy random variables and their empirical estimation based on fuzzy data. We also construct the respective hypotheses to compare fuzzy quantiles of the populations via ranking criteria and introduce a test statistic that employs the $\alpha $-cuts of fuzzy numbers. Then, a procedure for constructing a fuzzy test function to reject or not reject the underlying null hypothesis related to the comparison of fuzzy quantiles is presented. Therefore, besides testing on the medians of both populations, the introduced method can be applied for any fuzzy quantile functions of two populations related to, e.g., percentiles or deciles. For practical reasons, the proposed method is illustrated via some application examples.

The rest of this paper is organized as follows. Section 2 reviews essential concepts related to fuzzy numbers and fuzzy random variables. Section 3 introduces the notion of fuzzy quantiles and discusses their empirical estimation considering fuzzy random variables. In Sect. 4, the non-parametric hypothesis test for comparing fuzzy quantiles of two independent populations is developed. In Sect. 5, practical applications of the proposed test are illustrated. Finally, conclusions are provided in Sect. 6.

2 Preliminaries

This section reviews some necessary basic definitions of fuzzy numbers and fuzzy random variables.

2.1 Fuzzy sets and fuzzy numbers

A fuzzy set $\widetilde{A}$ on the real line $\mathbb {R}$ is defined by the membership function $\mu _{\widetilde{A}}:\mathbb {R}\rightarrow [0,1]$ (Lee 2005). The subset $\{x\in \mathbb {R} \mid \mu _{\widetilde{A}}(x)\ge \alpha \}$, $\alpha \in (0,1]$, is referred to as $\alpha $-cut of $\widetilde{A}$, i.e., $\widetilde{A}[\alpha ]$. The set $\widetilde{A}[0]=$ $\overline{\{x\in \mathbb {R}:\mu _{\widetilde{A}}(x)>0\}}$ is the support of $\widetilde{A}$, while $\overline{A}$ is the closure of A. The lower and upper bounds of $\widetilde{A}[\alpha ]$, $\alpha \in [0,1]$, are denoted by $\widetilde{A}^L[\alpha ]$ and $\widetilde{A}^U[\alpha ]$, respectively. Further, a fuzzy set $\widetilde{A}$ is a fuzzy number (FN) when $\widetilde{A}[\alpha ]$ is a non-empty, bounded interval in $\mathbb {R}$ for all $\alpha \in [0,1]$. As for the practical handling of FNs, they are often modeled via a functional parametric form called LR-FN $\widetilde{A} = (a; l_a, r_a)_{LR}$. The membership function of an LR-FN $\widetilde{A}$ is given by

$$\begin{aligned} \mu _{\widetilde{A}}(x)=\left\{ \begin{array}{lll} L\left( \dfrac{a-x}{l_a}\right) &{}\text { if}\quad &{} a-l_a\le x\le a,\\ R\left( \dfrac{x-a}{r_a}\right) &{}\text { if}\quad &{} a \le x\le a+r_a, \end{array} \right. \end{aligned}$$

where $l_a>0$ is the left spread, $r_a>0$ is the right spread, and L and R are reference functions defining the left and the right shapes of the FN, respectively, where $L,R:[0,1]\rightarrow [0,1]$ should satisfy the following conditions:

1.
$L(1) = R(1) = 0$,
2.
$L(0) =R(0) = 1$, and
3.
L(x) and R(x) are continuous and monotone-decreasing functions on [0, 1].

The set of all LR-FNs is represented by $\mathcal {F}(\mathbb {R})$. Furthermore, the most commonly used (unimodal) LR-FNs (with $L(x)=R(x)=\max \{0,1-x\}$) are triangular FNs (TFNs). The membership function of a TFN, denoted by $\widetilde{A}=(a;l_a,r_a)_{T}$, is given by:

$$\begin{aligned} \mu _{\widetilde{A}}(x)=\left\{ \begin{array}{ll} \frac{x-(a-l_a)}{l_a} \quad &{}\text {if } a-l_a \le x \le a\\ \frac{a+r_a-x}{r_a} \quad &{}\text {if } a < x \le a+r_a\\ 0 \quad &{} \text {otherwise}.\\ \end{array} \right. \end{aligned}$$

Remark 1

(Hesamian and Shams 2016) For a given $\widetilde{A}\in \mathcal {F}(\mathbb {R})$, the mapping $\widetilde{A}_{\alpha }:[0,1]\rightarrow \mathbb {R}$ is called $\alpha $-cut of $\widetilde{A}$ defined by

$$\begin{aligned} \widetilde{A}_{\alpha }=\left\{ \begin{array}{ll} \widetilde{A}^L[2\alpha ] &{}\quad \text {if }\alpha \in [0,0.5],\\ \widetilde{A}^U[2(1-\alpha )] &{}\quad \text {if } \alpha \in , (0.5,1],\end{array}\right. \end{aligned}$$

where $\widetilde{A}^L[\alpha ]$ and $\widetilde{A}^U[\alpha ]$ denote the lower and upper limits of $\alpha $-cuts of $\widetilde{A}$, respectively. Then, it follows:

$$\begin{aligned} \widetilde{A}[\alpha ]=[\widetilde{A}^L[\alpha ],\widetilde{A}^U[\alpha ]]=[\widetilde{A}_{\alpha /2}, \widetilde{A}_{1-\alpha /2}]. \end{aligned}$$

For instance, $\alpha $-cuts of an LR-fuzzy number $\widetilde{A} = (a;l_a,r_a)_{LR}$ can be calculated as:

$$\begin{aligned} \widetilde{A}_{\alpha }=\left\{ \begin{array}{ll} a-l_aL^{-1}(2\alpha ) \qquad &{} \text {if }\alpha \in [0,0.5]\\ a+r_aR^{-1}(2(1-\alpha )) \qquad &{} \text {if }\alpha \in (0.5,1]. \end{array} \right. \end{aligned}$$

Specifically, if $\widetilde{A}=(a;l_a,r_a)_T$ is a TFN, then:

$$\begin{aligned} \widetilde{A}_{\alpha }=\left\{ \begin{array}{ll} (a-l_a)+2l_a\alpha \qquad &{} \text {if }\alpha \in [0,0.5]\\ a+r_a-2r_a(1-\alpha ) \qquad &{} \text {if }\alpha \in (0.5,1]. \end{array} \right. \end{aligned}$$

Remark 2

(Hesamian et al. 2019) Note that for all $\widetilde{A}, \widetilde{B}\in {{\mathcal {F}}}({\mathbb {R}})$, $\lambda \in {\mathbb {R}}$ and $\alpha \in [0,1]$, the following arithmetic operations on fuzzy numbers can be defined:

$$\begin{aligned} (\widetilde{A}\oplus \widetilde{B})_{\alpha }= & {} \widetilde{A}_{\alpha }+\widetilde{B}_{\alpha },\\ (\lambda \otimes \widetilde{A})_{\alpha }= & {} \left\{ \begin{array}{ll} \lambda \widetilde{A}_{\alpha } &{}\text{ if }\ ~~\lambda > 0,\\ 0&{}\text{ if }\ ~~\lambda =0, \\ \lambda \widetilde{A}_{1-\alpha }&{}\text{ if }\ ~~\lambda <0,\\ \end{array}\right. \end{aligned}$$

where $\oplus $ and $\otimes $ denote common arithmetic operators of fuzzy numbers.

Definition 1

(Yuan 1991) For two FNs $\widetilde{A} $ and $\widetilde{B}\in \mathcal {F}(\mathbb {R})$, let

$$\begin{aligned} \varDelta _{\widetilde{A}\widetilde{B}}=\int _{\{\alpha :\widetilde{A}_{\alpha }\ge \widetilde{B}_{\alpha }\}}(\widetilde{A}_{\alpha }-\widetilde{B}_{\alpha })d\alpha . \end{aligned}$$

The preference degree “$\widetilde{A}$ is larger than $\widetilde{B}$” is defined by:

$$\begin{aligned} P_d(\widetilde{A}\succ \widetilde{B})=\left\{ \begin{array}{ll} 0.5 &{} ~~~~~~~~~ \widetilde{A}=\widetilde{B}\\ \frac{\varDelta _{\widetilde{A}\widetilde{B}}}{\varDelta _{\widetilde{A}\widetilde{B}}+\varDelta _{\widetilde{B}\widetilde{A}}} &{} ~~~~~~~~~\widetilde{A}\ne \widetilde{B}.\\ \end{array} \right. \end{aligned}$$

Definition 2

For two FNs $\widetilde{A}$ and $\widetilde{B}$, it holds that:

1.
$\widetilde{A}$ is larger than $\widetilde{B}$, denoted by $\widetilde{A}\succ _{P_d}\widetilde{B}$, if $P_d(\widetilde{A}\succ \widetilde{B})> 0.5$.
2.
$\widetilde{A}$ is equivalent to $\widetilde{B}$, denoted by $\widetilde{A}\simeq _{P_d}\widetilde{B}$, if $P_d(\widetilde{A}\succ \widetilde{B})=P_d(\widetilde{B}\succ \widetilde{A})= 0.5$.

The preference criterion $P_d$ meets the following properties:

Proposition 1

Let $\widetilde{A}$, $\widetilde{B}$, $\widetilde{C}$ be three FNs in $\mathcal {F}(\mathbb {R})$. Then, it holds:

(1)
$P_d$ is reciprocal, i.e., $P_d(\widetilde{A}\succ \widetilde{B})=1-P_d(\widetilde{B}\succ \widetilde{A})$.
(2)
$P_d$ is reflexive, i.e., $\widetilde{A}\succ _{P_d}\widetilde{A}$.
(3)
$P_d$ is transitive, i.e., $\widetilde{A}\succ _{P_d}\widetilde{B}$ and $\widetilde{B}\succ _{P_d} \widetilde{C}$ imply $\widetilde{A}\succ _{P_d}\widetilde{C}$.
(4)
$P_d(\widetilde{A}\succeq \widetilde{B})=1$ if and only if $\widetilde{B}_{\alpha }\le \widetilde{A}_{\alpha }$ for all $\alpha \in [0,1]$.

Proof

See Yuan (1991). $\square $

Definition 3

(Hesamian and Akbari 2018) The absolute error distance between two FNs $\widetilde{A}$ and $\widetilde{B}$ is defined as follows:

$$\begin{aligned} D(\widetilde{A},\widetilde{B})=\sup _{\alpha \in [0,1]}|\widetilde{A}_{\alpha }-\widetilde{B}_{\alpha }|. \end{aligned}$$

The TFNs $\widetilde{A}$, $\widetilde{B}$, $\widetilde{C}$ satisfy the following conditions:

(1)
$D(\widetilde{A},\widetilde{B})=0$, if and only if $\widetilde{A}=\widetilde{B},$
(2)
$D(\widetilde{A},\widetilde{B})=D(\widetilde{B},\widetilde{A}),$
(3)
$D(\widetilde{A},\widetilde{C})\le D(\widetilde{A},\widetilde{B})+ D(\widetilde{B},\widetilde{C}).$

2.2 Fuzzy random variables

In the following, we briefly give common definitions of fuzzy random variables, fuzzy cumulative distribution function and its estimator.

Definition 4

(Hesamian and Shams 2016) Let $(\varOmega ,\mathcal {A},P)$ be a probability space. The fuzzy-valued mapping $\widetilde{X}: \varOmega \rightarrow \mathcal {F}(\mathbb {R})$ is called a fuzzy random variable (FRV), if for any $\alpha \in [0,1]$ the real-valued mapping $\widetilde{X}_\alpha : \varOmega \rightarrow \mathbb {R}$ is a real-valued random variable.

Definition 5

Two FRVs $\widetilde{X}$ and $\widetilde{Y}$ are called identically distributed and independent, if $\widetilde{X}_{\alpha }$ and $\widetilde{Y}_{\alpha }$ are identically distributed and independent for all $\alpha \in [0,1]$. Similarly, it can be said that $\widetilde{X}_1,\ldots ,\widetilde{X}_n$ is a fuzzy random sample (FRS) of size n if all $\widetilde{X}_i$ are independent and identically distributed FRVs. An observed fuzzy random sample can denoted by $\widetilde{x}_1,\ldots ,\widetilde{x}_n$.

Definition 6

(Hesamian et al. 2019) Let $\widetilde{X}$ be a FRV and $\{\widetilde{X}_n\}_{n=1}^{\infty }$ a collection of FRVs defined on the same probability space. Then, $\widetilde{X}_n$ converges almost surely to $\widetilde{X}$, denoted by $\widetilde{X}_n{\mathop {\rightarrow }\limits ^{a.s.}} \widetilde{X}$. For every $\varepsilon >0$, it holds

$$\begin{aligned} P\left( \lim _{n\rightarrow \infty }D(\widetilde{X}_n,\widetilde{X})>\varepsilon \right) =0. \end{aligned}$$

Definition 7

Let $\widetilde{X}_1,\ldots ,\widetilde{X}_m$ be a FRS of $\widetilde{X}$. The $j^{th}$ order statistic of $\widetilde{X}_1,\ldots ,\widetilde{X}_m$ is defined to be a FN with $\alpha $-cuts $(\widetilde{X}_{(j)})_{\alpha }=(\widetilde{X}_{\alpha })_{(j)}$.

Lemma 1

Let $\widetilde{X}_1,\ldots ,\widetilde{X}_m$ be a FRS of $\widetilde{X}$. Then, $\widetilde{X}_{(j+1)}\succ _{P_d} \widetilde{X}_{(j)}$ for every $j=1,2,\ldots ,m-1$.

Definition 8

(Hesamian and Chachi 2015) The fuzzy number $\widetilde{F}_{\widetilde{X}}(x)$ is said to be a fuzzy cumulative distribution function (FCDF) of $\widetilde{X}$, if its $\alpha $-cuts are defined by $(\widetilde{F}_{\widetilde{X}}(x))_{\alpha }=P(\widetilde{X}_{1-\alpha }\le x)$.

Definition 9

(Hesamian and Chachi 2015) Let $\widetilde{x}_1,\ldots ,\widetilde{x}_n$ be a FRS of $\widetilde{X}$. The fuzzy number $\widetilde{\widehat{F}}_{n}(x)$ is said to be a fuzzy empirical cumulative distribution function, if its $\alpha $-cuts are defined by $(\widetilde{\widehat{F}}_{n}(x))_{\alpha }=\frac{1}{n}\sum _{i=1}^{n}I((\widetilde{x}_i)_{1-\alpha }\le x)$.

Lemma 2

Suppose that $\widetilde{X}_1,\ldots ,\widetilde{X}_n$ is a fuzzy random sample with FCDF $\widetilde{F}_{\widetilde{X}}(x)$. Then,

$$\begin{aligned} P\left( \sup _{x\in \mathbb {R}}D(\widetilde{\widehat{F}}_{n}(x),\widetilde{F}_{\widetilde{X}}(x))\rightarrow 0\right) =1. \end{aligned}$$

Proof

See Hesamian and Chachi (2015). $\square $

3 Fuzzy quantile function

In this section, the notions of fuzzy quantile function and fuzzy empirical quantile are introduced and discussed.

Definition 10

Let $\widetilde{X}$ be a FRV. The fuzzy quantile function (FQF) of $\widetilde{X}$ at level $\tau $ is defined by a FN with the following $\alpha $-cuts:

$$\begin{aligned} (\widetilde{Q}_{\widetilde{X}}(\tau ))_{\alpha }=\inf \{x:(\widetilde{F}_{\widetilde{X}}(x))_{\alpha }>\tau \}. \end{aligned}$$

Example 1

Consider the FRV $\widetilde{X}=(0.99X,X,1.1X)_T$, where $X\sim \exp (\lambda )$. Then, it holds that $\widetilde{Q}_{\widetilde{X}}(\tau )=-\ln (1-\tau )/\lambda \otimes (0.99,1,1.1)_T$.

Example 2

Let $\widetilde{X}$ be a (normal) FRV (Puri and Ralescu 1985) with $\widetilde{X}=\widetilde{\mu }\oplus \epsilon $, where $\epsilon \sim N(0,\sigma ^2)$. Then, it holds that $(\widetilde{Q}_{\widetilde{X}}(\tau ))_{\alpha }=\inf \{x:(\widetilde{F}_{\widetilde{X}}(x))_{1-\alpha }>\tau \}=F^{-1}_{\widetilde{X}_{\alpha }}(\tau )=\widetilde{\mu }_{\alpha }+Z_{\tau } \sigma $, where $Z_{\tau }$ denotes the $\tau ^{th}$ quantile of the standard normal distribution. Therefore, the FQF of $\widetilde{X}$ can be evaluated by $\widetilde{Q}_{\widetilde{X}}(\tau )=\widetilde{\mu }\oplus Z_{\tau } \sigma $.

Definition 11

Let $\widetilde{x}_1,\ldots ,\widetilde{x}_m$ be a FRS of $\widetilde{X}$. The fuzzy empirical quantile function (FEQF) of $\widetilde{X}$ at level $\tau $ is defined by a FN with the following $\alpha $-cuts:

$$\begin{aligned} (\widetilde{\widehat{Q}}(\tau ))_{\alpha }=\inf \{x:(\widetilde{\widehat{F}}_n(x))_{\alpha }>\tau \}. \end{aligned}$$

Lemma 3

Let $\widetilde{X}_1,\ldots ,\widetilde{X}_n$ be a FRS of $\widetilde{X}$. Then, $\widetilde{\widehat{Q}}_{n}(\tau )=\widetilde{X}_{([n\tau ]+1)}$, where [k] represents the integer part of k.

Proof

The claim is immediately verified via

$$\begin{aligned} (\widetilde{\widehat{Q}}_n(\tau ))_{\alpha }=\inf \{x:(\widetilde{\widehat{F}}_n(x))_{\alpha }>\tau \}=(\widetilde{X}_{\alpha })_{([n\tau ]+1)}. \end{aligned}$$

$\square $

Table 1 Data set in Example 3

Full size table

Example 3

Consider the data set given in Table 1 with $L(x)=1-x^3$ and $R(x)=\sqrt{1-x^5}$. From Definition 9, first note that

$$\begin{aligned} (\widetilde{\widehat{F}}_{n}(x))_{\alpha }=\frac{1}{30}\sum _{i=1}^{30}I((\widetilde{x}_i)_{1-\alpha }\le x), \end{aligned}$$

where

$$\begin{aligned} (\widetilde{x}_i)_{\alpha }=\left\{ \begin{array}{ll} x_i-l_{x_i}\root 3 \of {1-2\alpha } \qquad &{} \text {if }\alpha \in [0,0.5],\\ x_i+r_{x_i}\root 5 \of {1-4(1-\alpha )^2} \qquad &{} \text {if }\alpha \in (0.5,1]. \end{array} \right. \end{aligned}$$

Therefore, at quantile level $\tau \in (0,1)$, the FEQF of $\widetilde{X}$ is given as follows:

$$\begin{aligned} (\widetilde{\widehat{Q}}_{n}(\tau ))_{\alpha }=\inf \left\{ x:\frac{1}{30}\sum _{i=1}^{30}I((\widetilde{x}_i)_{1-\alpha }\le x)>\tau \right\} . \end{aligned}$$

Table 2 shows the lower and upper bounds of $\widetilde{\widehat{Q}}_{n}(\tau )[\alpha ]$ for $\tau =0.25,0.50$ and 0.75 and some values of $\alpha $. The plots of $\widetilde{\widehat{Q}}_{n}(0.25),\widetilde{\widehat{Q}}_{n}(0.5)$ and $\widetilde{\widehat{Q}}_{n}(0.75)$ are presented in Fig. 1.

Table 2 Some $\alpha $-cuts of $\widetilde{\widehat{Q}}_{n}(0.25)$, $\widetilde{\widehat{Q}}_{\widetilde{X}}(0.50)$ and $\widetilde{\widehat{Q}}_{\widetilde{X}}(0.75)$ in Example 3

Full size table

Lemma 4

Suppose that $\widetilde{X}_1,\ldots ,\widetilde{X}_n$ is a fuzzy random sample with FQF $\widetilde{Q}_{\widetilde{X}}(\tau )$. Then, $\widetilde{\widehat{Q}}_n(\tau ){\mathop {\rightarrow }\limits ^{a.s.}}\widetilde{Q}_{\widetilde{X}}(\tau )$.

Proof

$$\begin{aligned} (\widetilde{\widehat{Q}}_n(\tau ))_{\alpha }=\inf \{x:(\widetilde{\widehat{F}}_n(x))_{\alpha }>\tau \}{\mathop {\rightarrow }\limits ^{a.s.}}(\widetilde{Q}_{\widetilde{X}}(\tau ))_{\alpha }, \end{aligned}$$

holds for every $\alpha \in [0,1]$, it follows that

$$\begin{aligned}{} & {} P\left( \lim _{n\rightarrow \infty }D(\widetilde{\widehat{Q}}_n(\tau ),\widetilde{Q}_{\widetilde{X}}(\tau ))>\varepsilon \right) \le \\{} & {} P\left( \lim _{n\rightarrow \infty }|(\widetilde{\widehat{Q}}_n(\tau ))_{\alpha }-(\widetilde{Q}_{\widetilde{X}}(\tau ))_{\alpha }|>\varepsilon \right) =0 \end{aligned}$$

is satisfied for every $\varepsilon >0$, which completes the proof. $\square $

4 Hypothesis test for comparing fuzzy quantiles of two populations

Let $X_1,\ldots ,X_m$ and $Y_1,\ldots ,Y_n$ be random samples from two independent populations with absolutely continuous distribution functions $F_X$ and $G_Y$, respectively. Also, let $X_{(1)},\ldots ,X_{(m)}$ and $Y_{(1)},\ldots ,Y_{(n)}$ be the corresponding order statistics. The null hypothesis of interest is $H_0:Q_X(\tau )=Q_Y(\nu )$, where $\tau $ and $\nu $ are two quantile levels. A test statistic for the difference of the empirical quantile functions $\widehat{Q}_X(\tau )=X_{([m\tau +1])}$ and $\widehat{Q}_Y(\tau )=Y_{([n\tau +1])}$ from different populations can be defined by Heinzl and Mittlboeck (2017); Hutson (2009)

$$\begin{aligned} T^{m,n} =&({[}m\tau ]+1){\left( {\begin{array}{c}m\\ {[}m\tau ]+1\end{array}}\right) }\\&\sum _{k=1}^{n}\varvec{\beta }(\widehat{F}_X(Y_{(k)})),{[}m\tau ]+1,m-{[}m\tau ])w_{{[}m\tau ]+1}(k) \end{aligned}$$

with

$$\begin{aligned} \varvec{\beta }(x,p,q)= & {} \int _{0}^{x}s^{p-1}(1-s)^{q-1}ds,\\ w_{{[}n\nu ]+1}(k)= & {} ({[}n\nu ]+1){\left( {\begin{array}{c}n\\ {[}n\nu ]+1\end{array}}\right) }\left( \varvec{\beta }\bigg (\frac{k}{n},{[}n\nu ]+1,n-{[}n\nu ]\right) \\{} & {} -\varvec{\beta }\left( \frac{k-1}{n},{[}n\nu ]+1,n-{[}n\nu ]\right) \bigg ), \end{aligned}$$

where

$$\begin{aligned} \widehat{F}_X(Y_{(k)})=\frac{1}{m}\sum _{i=1}^{m}I(X_i\le Y_{(k)}). \end{aligned}$$

Now, let $\tilde{X}_1,\ldots ,\tilde{X}_m$ and $\tilde{Y}_1,\ldots ,\tilde{Y}_n$ be two independent FRSs from two populations with FCDFs $\widetilde{F}_{\widetilde{X}}$ and $\widetilde{F}_{\widetilde{Y}}$. In the following, a procedure is established for comparing fuzzy quantiles of two populations. For this purpose, consider the following fuzzy hypotheses concerning quantiles of two populations:

Definition 12

Let $\widetilde{X}$ and $\widetilde{Y}$ be two FRVs. The hypotheses of interest are defined as

$$\begin{aligned}{} & {} H_0: \widetilde{Q}_{\widetilde{X}}(\tau )=_{P_d} \widetilde{Q}_{\widetilde{Y}}(\nu )~\text {with}~ P_d(\widetilde{Q}_{\widetilde{X}}(\tau )\succ \widetilde{Q}_{\widetilde{Y}}(\nu ))\\{} & {} \quad =P_d(\widetilde{Q}_{\widetilde{Y}}(\nu )\succ \widetilde{Q}_{\widetilde{X}}(\tau ))=0.5 \end{aligned}$$

versus

$$\begin{aligned} H^A_1: \widetilde{Q}_{\widetilde{X}}(\tau )\succ _{P_d} \widetilde{Q}_{\widetilde{Y}}(\nu )~\text {with}~ P_d(\widetilde{Q}_{\widetilde{X}(\tau )}\succ \widetilde{Q}_{\widetilde{Y}(\nu )})>0.5, \end{aligned}$$

$$\begin{aligned} H^B_1: \widetilde{Q}_{\widetilde{Y}}(\nu )\succ _{P_d} \widetilde{Q}_{\widetilde{X}}(\tau )~\text {with}~ P_d(\widetilde{Q}_{\widetilde{Y}(\nu )}\succ \widetilde{Q}_{\widetilde{X}(\tau )})>0.5. \end{aligned}$$

For testing the above hypotheses, we employ the following test statistic.

Definition 13

Let $\tilde{X}_1,\ldots ,\tilde{X}_m$ and $\tilde{Y}_1,\ldots ,\tilde{Y}_n$ be two independent FRS from two FCDFs $\widetilde{F}_{\widetilde{X}}$ and $\widetilde{F}_{\widetilde{Y}}$. The $\alpha $-cuts of the fuzzy test statistic are defined by

$$\begin{aligned} \widetilde{T}^{m,n}[\alpha ]= \left[ \inf _{\beta \in I_{\alpha }}\widetilde{T}^{m,n}_{\beta },\sup _{\beta \in I_{\alpha }}\widetilde{T}^{m,n}_{\beta }\right] , \end{aligned}$$

where

$$\begin{aligned} (\widetilde{\widehat{F}}_{\widetilde{X}})_{\beta }((\widetilde{Y}_{\beta })_{(k)})=\frac{1}{m}\sum _{i=1}^{m}I((\widetilde{X}_i)_{1-\beta }\le (\widetilde{Y}_{\beta })_{(k)}). \end{aligned}$$

The test decision on rejecting or non-rejecting $H_0$ can be made as follows:

Definition 14

Let us consider the problem of testing the fuzzy hypothesis $H_0$ versus $H_1^A$ or $H_1^B$ based on two independent FRS $\tilde{x}_1,\ldots ,\tilde{x}_m$ and $\tilde{y}_1,\ldots ,\tilde{y}_n$. Then, at significance level $\delta $, the fuzzy test is defined as a fuzzy set:

1.
As for testing $H_0$ versus $H_1^A$, we use
$$\begin{aligned} \widetilde{\varphi }^A_{\delta }[(\widetilde{x}_1,\ldots ,\widetilde{x}_m),(\tilde{y}_1,\ldots ,\tilde{y}_n)]= \bigg \{\frac{Reject}{\widetilde{\varphi }^A_{\delta }(Reject)},\frac{Accept}{\widetilde{\varphi }^A_{\delta }(Accept)}\bigg \}, \end{aligned}$$
where $\widetilde{\varphi }^A_{\delta }(Reject)=P_d(\widetilde{T}^{m,n}\succ 1-\delta /2)$ is called the degree of rejection of $H_0$ and $\widetilde{\varphi }^A_{\delta }(Accept)=1-\widetilde{\varphi }^A_{\delta }(1)$ is the degree of non-rejection of $H_0$.
2.
As for testing $H_0$ versus $H_1^B$, we use
$$\begin{aligned} \widetilde{\varphi }^B_{\delta }[(\widetilde{x}_1,\ldots ,\widetilde{x}_m),(\tilde{y}_1,\ldots ,\tilde{y}_n)]= \bigg \{\frac{Reject}{\widetilde{\varphi }^B_{\delta }(Reject)},\frac{Accept}{\widetilde{\varphi }^B_{\delta }(Accept)}\bigg \}, \end{aligned}$$
where $\widetilde{\varphi }^B_{\delta }(Reject)=P_d(\delta /2\succ \widetilde{T}^{m,n})$ is called the degree of rejection of $H_0$ and $\widetilde{\varphi }^B_{\delta }(Accept)=1-\widetilde{\varphi }^B_{\delta }(Reject)$ is the degree of non-rejection of $H_0$.

Here, “Accept” and “Reject” stand for non-rejection and rejection of $H_0$, respectively. Since $\widetilde{\varphi }_{\delta }(Reject)+\widetilde{\varphi }_{\delta }(Accept)=1$, at significance level $\delta $, one cannot reject the null hypothesis if $\widetilde{\varphi }_{\delta }(Accept)>\widetilde{\varphi }_{\delta }(Reject)$ or $\widetilde{\varphi }_{\delta }(Accept)\ge 0.5$.

Remark 3

Since the decision to reject or non-reject $H_0$ versus $H_1^A$ or $H_1^B$ is made via a fuzzy test, this motivates to defuzzify the decision in order to get an exact decision similar to classical statistical hypothesis testing. For this purpose, note that $\widetilde{A}\succ _{P_D}k$ if and only if $M_{\widetilde{A}}=0.5\int _0^1(\widetilde{A}_{\alpha }+\widetilde{A}_{1-\alpha })>k$. As for the interpretation of the test decision, it can be done similar to the classical approach as follows:

1.
Testing $H_0$ versus $H_1^A$: if $M_{\widetilde{T}^{m,n}}> 1-\delta /2$, then $H_0$ is rejected; otherwise it cannot be rejected
2.
Testing $H_0$ versus $H_1^B$: if $M_{\widetilde{T}^{m,n}}<\delta /2$, then $H_0$ is rejected; otherwise, it cannot be rejected

Remark 4

As a special case of the proposed method, it can be employed for comparing the fuzzy medians of two populations. In this regard, Grzegorzewski (2005) and Grzegorzewski (2009) introduced a fuzzy test for comparing two and k (crisp) population medians based on fuzzy random variables, respectively. He developed a fuzzy test statistic by employing fuzzy random variables and proposed a fuzzy test based on the necessity ranking criterion. However, the method presented in this paper follows a different strategy for comparing fuzzy quantiles of two populations based on fuzzy random variables. The proposed fuzzy quantile technique includes the following procedure:

1.
Extending the quantile of a FRV as a FN
2.
Extending the empirical estimator of a fuzzy quantile based on a FRS
3.
Investigating the relationship between a fuzzy quantile and its corresponding estimator for large sample cases

Then, a non-parametric statistical hypothesis test was developed for comparing any fuzzy quantiles of two populations based on two independent fuzzy random samples.

5 Numerical examples

In this section, the feasibility and effectiveness of the proposed non-parametric two-sample test on quantiles are examined via numerical examples. Note that there is no method for a reasonable comparison, as other two-sample fuzzy tests rely on comparing the means or medians (or variances) of two populations.

Table 3 Data set in Example 4

Full size table

Example 4

A random sample of 30 identical twins underwent psychological tests to measure their aggressiveness. We are interested in comparing the twins to see if the firstborn twin tends to be more aggressive than the other one. Assume that, due to limitations in psychological measurements, the results of the evaluations are reported as TFNs for the first born as $(x; 0.02x)_L$ and the second born as $(y;0.02y)_L$ with $L(x)=1-x^2$, where the observations $x_i$ and $y_i$, $i=1,\ldots ,30$, are shown in Table 3. At significance level $\delta =0.05$ we intend to test:

$$\begin{aligned} \begin{array}{ll} H_0: &{} \widetilde{Q}_{\widetilde{X}}(0.5)\simeq _{P_d} \widetilde{Q}_{\widetilde{Y}}(0.5),\\ H^A_1: &{} \widetilde{Q}_{\widetilde{X}}(0.5)\succ _{P_d} \widetilde{Q}_{\widetilde{Y}}(0.5). \end{array} \end{aligned}$$

The fuzzy test statistic can be determined based on Definition 13. The lower and upper bounds of $\widetilde{T}^{30,30}$ are shown in Table 4 based on various values of $\alpha \in (0,1)$. The plot of $\widetilde{T}^{30,30}$ is shown in Fig. 2. According to Definition 14, the fuzzy test can be performed via

$$\begin{aligned}{} & {} \widetilde{\varphi }^A_{0.05}[(\widetilde{x}_1,\ldots ,\widetilde{x}_{30}),(\tilde{y}_1,\ldots ,\tilde{y}_{30})]\\{} & {} =\bigg \{\frac{Reject}{\widetilde{\varphi }^A_{0.05}(Reject)},\frac{Accept}{\widetilde{\varphi }^A_{0.05}(Accept)}\bigg \}, \end{aligned}$$

where $\widetilde{\varphi }^A_{0.05}(Reject)=P_d(\widetilde{T}^{30,30}\succ 0.975)=0.72$. Therefore, at significance level 0.05, the fuzzy null hypothesis is rejected with a degree of 0.72 and non-rejected with a degree of 0.28. As for the defuzzification of this test decision, note that it holds $M_{\widetilde{T}^{30,30}}=0.984>1-\delta /2=0.975$. Following this approach, the final decision is to reject the null hypothesis at level $\delta = 0.05$.

Table 4 Some $\alpha $-cuts of $\widetilde{T}^{30,30}$ in Example 4

Full size table

Table 5 Data set in Example 5

Full size table

Example 5

Let us consider the rocket-motor experiment data set based on Weerahandi and Johnson (1992). It is of interest to make inference on the reliability of the rocket motor at the highest operating temperature of 59$^oC$. At this temperature, the distribution of the operating pressure Y tends to be closest to the distribution of the chamber burst strength X. It is assumed that the observations can be reported as TFNs via $\widetilde{x}_i=(x_i;0.05x_i)_T$ and $\widetilde{y}_i=(y_i;0.1y_i)_T$, where the observations $x_i$ and $y_i$ are given in Table 5. At significance level $\delta =0.05$, we test the following pair of hypotheses:

$$\begin{aligned} \begin{array}{ll} H_0: &{} \widetilde{Q}_{\widetilde{X}}(0.10)\simeq _{P_d} \widetilde{Q}_{\widetilde{Y}}(0.90),\\ H^B_1: &{} \widetilde{Q}_{\widetilde{X}}(0.10)\prec _{P_d} \widetilde{Q}_{\widetilde{Y}}(0.90). \end{array} \end{aligned}$$

According to Definition 13, the plot of $\widetilde{T}^{17,23}$ is shown in Fig. 3. Table 6 also shows the lower and upper bounds of $\widetilde{T}^{17,23}$ based on specific values of $\alpha \in (0,1)$. According to Definition 14, the fuzzy test can be conducted via

$$\begin{aligned}{} & {} \widetilde{\varphi }^B_{0.05}[(\widetilde{x}_1,\ldots ,\widetilde{x}_{17}),(\tilde{y}_1,\ldots ,\tilde{y}_{23})]\\{} & {} =\bigg \{\frac{Reject}{\widetilde{\varphi }^B_{0.05}(Reject)},\frac{Accept}{\widetilde{\varphi }^B_{0.05}(Accept)}\bigg \}, \end{aligned}$$

where $\widetilde{\varphi }^B_{0.05}(Reject)=P_d(\widetilde{T}^{17,23}\prec 0.025)=0.21$ and $\widetilde{\varphi }^B_{0.05}(Accept)=0.79$. Thus, at significance level 0.05, the fuzzy null hypothesis is not rejected with a degree of 0.79 and rejected with a degree of 0.21. Furthermore, the defuzzified value related to $\widetilde{T}^{17,23}$ ($M_{\widetilde{T}^{17,23}}=0.057$) is larger than $\delta /2=0.025$, so the decision will be to non-reject the null hypothesis at level $\delta = 0.05$.

Table 6 Some $\alpha $-cuts of $\widetilde{T}^{17,23}$ in Example 5

Full size table

6 Conclusions

In this article, an inferential procedure was developed for comparing fuzzy quantiles of two independent populations. For this purpose, the notion of the fuzzy quantile of a fuzzy random variable was introduced. Then, an estimator of the proposed fuzzy quantile function was developed according to a fuzzy random sample. The estimation procedure was illustrated based on some numerical examples. Further, the large sample property of the proposed fuzzy empirical quantile function was analyzed based on an absolute error distance for fuzzy numbers. In addition, the concept of the fuzzy test statistic was introduced based on fuzzy order statistics of two fuzzy random samples. To test the fuzzy hypotheses on quantiles of two populations, the obtained fuzzy test statistic and the crisp significance level were compared using an criterion called preference degree. As the proposed fuzzy test leads to a degree of rejection or non-rejection of the underlying fuzzy null hypothesis, we also proposed an approach to defuzzify the fuzzy test decision in order to reach a crisp test decision that is important for practical usage.

The results of the practical applications indicate that the proposed method is effective for comparing fuzzy quantiles of two independent populations. One of the advantages of the proposed method is that it can be applied to all kind of fuzzy numbers. As for potential future investigations, employing the proposed method to other types of imprecision such as intuitionistic fuzzy data and/or intuitionistic fuzzy hypotheses could be a promising direction. As another idea for future research, the presented methodology could also be extended to the case when more than two populations need to be compared in terms of their quantiles.

Data Availability

The data that support the findings of this study are available from the respective references as mentioned in the main text.

References

Akbari MG, Hesamian G (2019a) Testing statistical hypotheses for intuitionistic fuzzy data. Soft Comput 23:10385–10392
MATH Google Scholar
Akbari MG, Hesamian G (2019b) Neyman-Pearson lemma based on intuitionistic fuzzy parameters. Soft Comput 23:5905–5911
MATH Google Scholar
Akbari MG, Rezaei A (2010) Bootstrap testing fuzzy hypotheses and observations on fuzzy statistic. Expert Syst Appl 37:5782–5787
Google Scholar
Arnold BF (1998) Testing fuzzy hypothesis with crisp data. Fuzzy Sets Syst 9:323–333
MATH Google Scholar
Chen KS, Chang TC (2020) Construction and fuzzy hypothesis testing of Taguchi Six Sigma quality index. Int J Prod Res 58:3110–3125
Google Scholar
Chukhrova N, Johannssen A (2019) Fuzzy Regression Analysis: Systematic Review and Bibliography. Appl Soft Comput 84:105708
Google Scholar
Chukhrova N, Johannssen A (2020a) Fuzzy Hypothesis Testing for a Population Proportion Based on Set-Valued Information. Fuzzy Sets Syst 387:127–157
MathSciNet MATH Google Scholar
Chukhrova N, Johannssen A (2020b) Generalized One-Tailed Hypergeometric Test with Applications in Statistical Quality Control. J Qual Technol 52(1):14–39
MATH Google Scholar
Chukhrova N, Johannssen A (2020c) Randomized vs. Non-Randomized Hypergeometric Hypothesis Testing with Crisp and Fuzzy Hypotheses. Stat Pap 61(6):2605–2641
MATH Google Scholar
Chukhrova N, Johannssen A (2021a) Fuzzy Hypothesis Testing: Systematic Review and Bibliography. Appl Soft Comput 106:107331
Google Scholar
Chukhrova N, Johannssen A (2021b) Non-parametric Fuzzy Hypothesis Testing for Quantiles applied to Clinical Characteristics of COVID-19. Int J Intell Syst 36(6):2922–2963
Google Scholar
Chukhrova N, Johannssen A (2021c) Generalized two-tailed hypothesis testing for quantiles applied to the psychosocial status during the COVID-19 pandemic. Int J Intell Syst 36(12):7412–7442
Google Scholar
Chukhrova N, Johannssen A (2022) Two-Tailed Hypothesis Testing for the Median with Fuzzy Categories applied to the Detection of Health Risks. Expert Syst Appl 192:116362
Google Scholar
Chukhrova N, Johannssen A (2023) Employing fuzzy hypothesis testing to improve modified $p$ charts for monitoring the process fraction nonconforming. Inf Sci 633:141–157
Google Scholar
Denoeux T, Masson MH, Herbert PH (2005) Non-parametric rank-based statistics and significance tests for fuzzy data. shape Fuzzy Sets and Systems 153:1–28
MATH Google Scholar
Farrell PM, Kosorok MR, Laxova A, Shen G, Koscik RE, Bruns T, Splaingard M, Mischler EH (1997) Nutritional benefits of newborn screening for cystic fibrosis. N Engl J Med 337:963–969
Google Scholar
Filzmoser P, Viertl R (2004) Testing hypotheses with fuzzy data: the fuzzy p-value. shape Metrika 59:21–29
MathSciNet MATH Google Scholar
Gajivaradhan P, Parthiban P (2015) Two sample statistical hypothesis test for trapezoidal fuzzy interval data. International Journal of Applied Mathematics and Statistical Sciences 4:11–24
Google Scholar
Gil MA, Montenegro M, Rodríguez G, Colubi A, Casals MR (2006) Bootstrap approach to the multi-sample test of means with imprecise data. Computational Statistics & Data Analysis 51:148–162
MathSciNet MATH Google Scholar
Gözde N, Özdemir AF (2018) Quantile estimation and comparing two independent groups with an approach based on percentile bootstrap. Communications in Statistics - Simulation and Computation 47:2119–2138
MathSciNet MATH Google Scholar
Grzegorzewski P (2000) Testing statistical hypotheses with vague data. Fuzzy Sets Syst 11:501–510
MathSciNet MATH Google Scholar
Grzegorzewski P (2004) Distribution-free tests for vague data, In: (Lopez-Diaz M, et al. (Eds)) Soft Methodology and Random Information Systems, Springer, Heidelberg, 495–502
Grzegorzewski P (2005) Two-sample median test for vague data, In: Proceedings of the 4th Conference European Society for Fuzzy Logic and Technology-Eusflat, Barcelona, 621–626
Grzegorzewski P (2009) $K$-sample median test for vague data. Int J Intell Syst 24:529–539
MATH Google Scholar
Grzegorzewski P (2020) Two-sample dispersion problem for fuzzy data. Information Processing and Management of Uncertainty in Knowledge-Based Systems 1239:82–96
MATH Google Scholar
Haktanir E, Kahraman C (2019) $Z$-fuzzy hypothesis testing in statistical decision making. Journal of Intelligent and Fuzzy Systems 37:6545–6555
Google Scholar
Hesamian G, Akbari MG (2017) Statistical test based on intuitionistic fuzzy hypotheses. Communications in Statistics - Theory and Methods 46:9324–9334
MathSciNet MATH Google Scholar
Hesamian G, Akbari MG (2018) Fuzzy absolute error distance measure based on a generalised difference operation. Int J Syst Sci 49:2454–2462
MathSciNet MATH Google Scholar
Hesamian G, Akbari MG (2021) Testing hypotheses for multivariate normal distribution with fuzzy random variables. Int J Syst Sci 53:43–58
MathSciNet MATH Google Scholar
Hesamian G, Akbari MG, Ranjbar V (2019) Some inequalities and limit theorems for fuzzy random variables adopted with a-values of fuzzy numbers. Soft Comput 24:3797–3807
MATH Google Scholar
Hesamian G, Akbari MG, Yaghoobpoor R (2019) Quality control process based on fuzzy random variables. IEEE Trans Fuzzy Syst 27:671–685
Google Scholar
Hesamian G, Chachi J (2015) Two-sample Kolmogorov-Smirnov fuzzy test for fuzzy random variables. Stat Pap 56:61–82
MathSciNet MATH Google Scholar
Hesamian G, Shams M (2016) Parametric testing statistical hypotheses for fuzzy random variables. Soft Comput 20:1537–1548
MATH Google Scholar
Hesamian G, Taheri SM (2013) Fuzzy empirical distribution: properties and applications. Kybernetika 49:962–982
MathSciNet MATH Google Scholar
Heinzl H, Mittlboeck M (2017) Assessing a hypothesis test for the difference between two quantiles from independent populations. Communications in Statistics - Simulation and Computation 46:1–10
MathSciNet MATH Google Scholar
Hryniewicz O (2006a) Goodman-Kruskal measure of dependence for fuzzy ordered categorical data. Comput Stat Data Anal 51:323–334
MathSciNet MATH Google Scholar
Hryniewicz O (2006b) Possibilistic decisions and fuzzy statistical tests. Fuzzy Sets Syst 157:2665–2673
MathSciNet MATH Google Scholar
Hutson AD (2009) A distribution function estimator for the difference of order statistics from two independent samples. Stat Pap 50:203–208
MathSciNet MATH Google Scholar
Kahraman C, Bozdag CF, Ruan D (2004) Fuzzy sets approaches to statistical parametric and non-parametric tests. shape International Journal of Intelligent Systems 19:1069–1078
MATH Google Scholar
Kosorok MR (1999) Trust two-sample quantile tests under general conditions. Biometrika 86:909–921
MathSciNet MATH Google Scholar
Lee KH (2005) First course on fuzzy theory and applications. Springer-Verlag, Berlin
MATH Google Scholar
Lin P, Wu B, Watada J (2010) Kolmogorov-Smirnov two sample test with continuous fuzzy data. Advances in Intelligent and Soft Computing 68:175–186
MATH Google Scholar
Montenegro M, Casals MR, Lubiano MA, Gil MA (2001) Two-sample hypothesis tests of means of a fuzzy random variable. Inf Sci 133:89–100
MathSciNet MATH Google Scholar
Montenegro M, Colubi A, Casals MR, Gil MA (2004) Asymptotic and Bootstrap techniques for testing the expected value of a fuzzy random variable. Metrika 59:31–49
MathSciNet MATH Google Scholar
Mylonas N, Papadopoulos B (2021) Unbiased fuzzy estimators in fuzzy hypothesis testing. Algorithms 14:185–192
Google Scholar
O’Gorman TW (2004) Applied adaptive statistical methods. Tests of significance and confidence intervals, SIAM, USA
MATH Google Scholar
Parchami A (2020) Fuzzy decision in testing hypotheses by fuzzy data: Two case studies. Iranian Journal of Fuzzy Systems 17:127–136
MathSciNet MATH Google Scholar
Puri ML, Ralescu DA (1985) The concept of normality for fuzzy random variables. Ann Probab 13:1373–1379
MathSciNet MATH Google Scholar
Rodríguez G, Montenegro M, Colubi A, Gil MA (2006) Bootstrap techniques and fuzzy random variables: Synergy in hypothesis testing with fuzzy data. Fuzzy Sets Syst 157:2608–2613
MathSciNet MATH Google Scholar
Shi NZ, Tao J (2008) Statistical hypothesis testing: Theory and methods. World Scientific Publishing Company, USA
MATH Google Scholar
Taff A (2018) Hypothesis testing: The Ultimate Beginner’s Guide to Statistical Significance. Create Space Independent Publishing Platform, USA
Google Scholar
Viertl R (2006) Univariate statistical analysis with fuzzy data. Comput Stat Data Anal 51:133–147
MathSciNet MATH Google Scholar
Wang JL, Hettmansperger TP (1990) Two-sample inference for median survival times based on one-sample procedures for censored survival data. J Am Stat Assoc 85:529–536
MathSciNet MATH Google Scholar
Weerahandi S, Johnson RA (1992) Testing reliability in a stress-strength model when $X$ and $Y$ are normally distributed. Technometrics 34:83–91
MathSciNet Google Scholar
Wu HC (2005) Statistical hypotheses testing for fuzzy data. Inf Sci 175:30–57
MathSciNet MATH Google Scholar
Yu CM, Luo WJ, Hsu TH, Lai KK (2020) Two-Tailed Fuzzy Hypothesis Testing for Unilateral Specification Process Quality Index. Mathematics 8:1–10
Google Scholar
Yuan Y (1991) Criteria for evaluating fuzzy ranking methods. Fuzzy Sets Syst 43:139–157
MathSciNet MATH Google Scholar
Zainali AZ, Akbari MG, Noughabi A (2014) Intuitionistic fuzzy random variable and testing hypothesis about its variance. Soft Comput 19:2681–2689
MATH Google Scholar

Download references

Acknowledgements

The authors would like to thank both anonymous reviewers for their constructive suggestions and comments that improved the presentation of this paper.

Funding

Open Access funding enabled and organized by Projekt DEAL. The authors declare that no funds, grants, or other support were received during the preparation of this manuscript.

Author information

Authors and Affiliations

Department of Statistics, Payame Noor University, Tehran, 19395-3697, Iran
Gholamreza Hesamian
HafenCity University of Hamburg, 20457, Hamburg, Germany
Nataliya Chukhrova
Faculty of Business Administration, University of Hamburg, 20146, Hamburg, Germany
Arne Johannssen

Authors

Gholamreza Hesamian
View author publications
You can also search for this author in PubMed Google Scholar
Nataliya Chukhrova
View author publications
You can also search for this author in PubMed Google Scholar
Arne Johannssen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The authors have contributed equally to this paper.

Corresponding author

Correspondence to Arne Johannssen.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Human Participants and/or Animals

This article does not contain any studies with human participants or animals performed by the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hesamian, G., Chukhrova, N. & Johannssen, A. Statistical inference on quantiles of two independent populations under uncertainty. Soft Comput 27, 17573–17583 (2023). https://doi.org/10.1007/s00500-023-09202-9

Download citation

Accepted: 31 August 2023
Published: 23 September 2023
Issue Date: December 2023
DOI: https://doi.org/10.1007/s00500-023-09202-9

Statistical inference on quantiles of two independent populations under uncertainty

Abstract

Similar content being viewed by others

Parametric testing statistical hypotheses for fuzzy random variables

Testing statistical hypotheses for intuitionistic fuzzy data

Bootstrapped Kolmogorov-Smirnov Test for Epistemic Fuzzy Data

Explore related subjects

1 Introduction

2 Preliminaries

2.1 Fuzzy sets and fuzzy numbers

Remark 1

Remark 2

Definition 1

Definition 2

Proposition 1

Proof

Definition 3

2.2 Fuzzy random variables

Definition 4

Definition 5

Definition 6

Definition 7

Lemma 1

Definition 8

Definition 9

Lemma 2

Proof

3 Fuzzy quantile function

Definition 10

Example 1

Example 2

Definition 11

Lemma 3

Proof

Example 3

Lemma 4

Proof

4 Hypothesis test for comparing fuzzy quantiles of two populations

Definition 12

Definition 13

Definition 14

Remark 3

Remark 4

5 Numerical examples

Example 4

Example 5

6 Conclusions

Data Availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Human Participants and/or Animals

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation