Nothing Special   »   [go: up one dir, main page]

Skip to main content

Online auction-based relay selection for cooperative communication in CR networks

Abstract

Cognitive radio and cooperative communication are two new network technologies. So, the combination of these two new technologies is a novel solution to solve the problem of spectrum scarcity. Two main challenges exist in the integration of cognitive radio and cooperative communication. First, there is a lack of incentives for the participating wireless devices to serve as relay nodes. Second, there is not an effective relay selection policy. In this paper, we propose an online auction-based relay selection scheme for cooperative communication in cognitive radio (CR) networks. Specifically, we design an auction scheme through adopting stopping theory. The proposed scheme ensures that the primary user (PU) can effectively select a CR relay to transmit its packets in a given time bound. In addition, we have analytically proven the truthfulness and the individual rationality of our online auction scheme. Extensive simulations demonstrate that the proposed relay selection scheme can always successfully and efficiently select a proper relay for a PU and can achieve a higher cooperative communication throughput compared with the conventional schemes.

1 Introduction

Cognitive radio technology has been recognized to be able to improve the spectrum utilization [1] through opportunistic spectrum access without interfering with the primary communications. Cooperative communication is an emerging technology that has the great potential to increase the throughput between two wireless devices [2] without requiring the support of infrastructure. Intuitively, the integration of these two hot technologies is expected to be a solution for the problems of spectrum scarcity and capacity limitation. In this paper, we particularly target on the scenario where the primary CR users (PU) seek to find a suitable cognitive radio (CR) relay to help transmit data, so that its end-to-end throughput can be increased and its end-to-end delay can be reduced.

Certainly, this hybrid cognitive-cooperative communication architecture is facing both technical and practical challenges. For instances, the relays should get sufficient incentives for participating in the cooperative communications. In addition, if there are multiple candidate relays, it may be impossible/impractical for the primary user to collect information from all the candidate relays to make a decision of relay selection. The reason is twofold: (i) The time for collecting information from all the candidate relays may exceed the PU’s end-to-end delay tolerance, and (ii) the availability of candidate relays may change during the collecting time as CR links are generally not stable. Therefore, an idea cognitive-cooperative scheme should be able to well incentivize the CR users and select a proper relay in a given time.

To design such a scheme, we conservatively consider to collect a candidate relay information and make the relay selection decision for this candidate relay online so that the collected information is valid and end-to-end delay requirement can be satisfied. In other words, the candidate relays send their information (including its sealed bid for cooperative transmissions) to the PU one by one. After receiving the information from a candidate relay, the PU immediately decides whether to choose this relay for cooperative transmission. Once a relay has been selected, the PU stops considering the rest of the candidate relays and pays the chosen relay a compensation based on its bid. In the paper, we consider the candidate relays as bidders and the PU as buyer, respectively.

Specifically, we design an online auction-based relay selection scheme for cooperative CR communication. The main contributions of this paper can be summarized as follows:

  1. 1.

    We formulate the online relay selection problem as an optimal stopping problem and design an optimal stopping policy, which can guarantee the success of relay selection in a given time while control the cost of PU for utilizing cooperative communication. To the best of our knowledge, this is the first work to jointly adopt stopping theory and design auction scheme for relay selection in CR networks.

  2. 2.

    We analytically prove the truthfulness of the proposed online auction scheme without requiring any information about bidders behavior. This result guarantees the bid truthfulness of the candidate relays and thereby eliminates the possibility of market manipulation.

  3. 3.

    Our proposed online auction scheme is also individual rational. This feature indicates that both the PU and relay can be benefited from cooperative transmission, which guarantees the incentive of participation for both the PU and the relay.

  4. 4.

    Extensive experiments are conducted to study the performance of our online auction scheme. We investigate the impact of several parameters and compare our scheme with the optimal relay selection (ORS) scheme, which will look at all the candidate relays for minimizing the PU cost. The results demonstrate that the PU can always select an appropriate relay in the given time, and our scheme achieves a higher throughput than ORS with a tiny cost increase.

The rest of the paper is organized as follows: The related work is presented in Section 3. Preliminary is illustrated in Section 3. In Section 3, we describe the proposed stopping policy and the online auction scheme for cooperative communication. The analysis of our auction scheme appears in Section 3. The simulation results are reported and analyzed in Section 3. We conclude our paper in Section 3.

2 Related work

In this section, we briefly review the most related works to our research in the area of auction-based cooperative communication.

In [3], Huang et al. treated relay nodes as sellers and source nodes as buyers. Two auction mechanisms were proposed: the SNR auction and the power auction. In each auction mechanism, each user iteratively updates its bid according to the others’ previous bids to maximize its own profit. The existence and the uniqueness of the Nash equilibrium in a single-relay network has been proven. In networks with multiple relays, the Nash equilibrium only exists under certain conditions. Zhang et al. considered the network scenario where there is a single relay and multiple user nodes in [4]. The authors formulated this problem as a sellers’ market competition. A distributed algorithm has been developed to search the Nash equilibrium. In [5], Wang et al. studied the situation of one source node and multiple relay nodes. The authors modeled it as a Stackelberg game which is divided into two levels of hierarchy. The benefits of the source node and relay nodes were jointly considered. In [6], the authors formed a cooperative cognitive radio network (CCRN) where each PU employs a set of appropriate second users (SUs) for relaying the PU’s information in an energy-efficiency solution. The relay selection and parameter optimization have been formulated as two Stackelberg games. A unique Nash equilibrium is achieved and proved for each game. In [7], the authors supposed that SUs can make rational choice between cooperation and band leasing. This novel CCRN model has been modeled as a Stackelberg game and analyzed. In [8], the impact of spectrum leasing was studied with a game-theoretic analysis. The authors first illustrated the significance of the problem with a two-secondary-user game. Subsequently, a n-secondary-user game was studied and analyzed given the occupancy model of primary users. However, none of the above work had considered the truthfulness. As a result, these mechanisms are vulnerable to market manipulation and may produce poor outcome [9].

In [10], the authors considered the problem of cooperative spectrum sensing scheduling in a CR network. The author assumed that each SU has the freedom to decide whether or not to participate in cooperative spectrum sensing; if not, the SU can wiretap the decision on channel status made by other SUs. This mechanism strived the balance between conserving the energy for spectrum sensing and bring down the spectrum sensing performance. In [11], Yang et al. designed an optimal relay assignment algorithm for cooperative communication. In [12], Klemperer designed a double auction mechanism for cooperative communication. These two works satisfied the truthfulness requirements. They both needed to collect the information from all the source nodes and all the relay nodes, and their auction mechanisms were performed in a single-round fashion. As a comparison, our approach does not require the information from all relay nodes, and the auction is performed in an online manner.

3 Preliminary

In this section, we first give a description of online cooperative communication. Then, we present our system model and depict the two common economic properties (truthfulness and individual rationality) of an auction.

3.1 Online cooperative communication

In a CR network, it often happens that a PU (source node) wants to send its data in a limited period of time. As the source node is far away from the destination, it needs to find a proper PU/SU as a relay to help it to transmit the data, as shown in Figure 1. The source node returns the relay node with monetary reward (the PU’s payment to the selected node for relaying data). As the number of wireless users is rapidly increasing due to the fast growing number of mobile communication device and the source node needs to obey the delay constraint, it is impossible for the source node to detect all candidate relays. Besides, the state of candidate relays change quickly with time. So, after detecting a relay, the source node needs to make an online decision whether to adopt this relay or not. In this paper, we assume that the transmitter can only choose one relay for cooperative transmission. A work by Zhao et al. showed that it is sufficient for a source node to choose the best relay even when multiple relay nodes are available [13]. So, it is reasonable to assume that the PU transmitter will choose only one relay node for cooperative transmission.

Figure 1
figure 1

A model of online cooperative communication.

There exist two types of cooperative communication pattern: amplify-and-forward (AF) and decode-and-forward (DF) [2]. In AF, the relay amplifies the received data and transmits it to the destination. In DF, the relay fully decodes the data from the source node and then delivers it after re-encoding the data. Our proposed scheme can be applied to both of these two cooperative communication patterns.

3.2 System model

In this paper, we consider a time-slotted system consisted by a pair of primary transmitter and receiver (PU) and a number of R candidate relays. The PU needs to send out a packet within T time slots, and it values the packet by β. The time needed for collecting information from each relay is α time slots. Then, the PU can check at most M=⌊T/α⌋ relays within T. The PU randomly collects information from the R candidate relays one by one. The ith candidate relay X i has a private true valuation v i and a bid price b i for forwarding the packet. When the PU collects the information from X i , X i returns its bid b i . Note that, the bid b i is valid for α time slots, and each candidate relay bids in a sealed-bid manner. After receiving b i from X i , the PU needs to immediately make a decision on whether to select X i as the relay to transmit the packet or continue to check X i+1. Table 1 summarizes the notations used through this paper.

Table 1 Notations

3.3 Economic properties

3.3.1 Truthfulness

An auction is truthful if revealing the private true value is the dominate strategy for each bidder. This means that no bidder can raise its utility by bidding b i ≠v i , no matter how others bids. This property can resist market manipulation and ensure the auction fairness and efficiency. In untruthful auctions, a selfish bidder can alter its bid to obtain extra outcome which will benefit itself but hurt the others.

3.3.2 Individual rationality

An auction is individual rational if a winner (the selected relay) is always paid by no less than its bid and the PU’s cost is no more than the value of the packet. This property can ensure that the utilities of both the PU and the selected relay are no less than 0. In other words, both the PU and the selected relay have incentive to participate in the cooperative communication.

4 Online auction scheme for relay selection

In this section, we formulate the relay selection process as an optimal stopping problem and present our proposed scheme in detail.

4.1 Problem formulation

In the following, we first give a brief introduction of the finite stopping problems and then formulate the relay selection process accordingly.

Definition 1.

Finite stopping problems: Given a sequence of random variables, X 1,X 2,…,X M , and a sequence of real-valued reward functions, y 0(),y 1(X 1),y 2(x 1,X 2),…,y M (x 1,x 2,…,X M ), the objective is to find a n, so that the reward function y n (x 1,x 2,…,X n ) is the maximal. Note that the joint distribution of the random variables and the reward functions are known and that X denotes the random variable and x denotes the value of a random variable, respectively.

In order to maximize the PU’s utility, the PU makes each decision by comparing the instantaneous reward of selecting X i with the expected reward of checking X i+1. Accordingly, we formulate the relay selection process as a sequence of decision problem which is defined based on the theory of optimal stopping [14] below.

According to [15], the PU knows (or can learn) the distribution of the bidding price of the bidders (candidate relays). We assume that there is no collusion among the candidate relays, and the bidding price of the candidate relays is independently distributed. In this paper, we particularly assume the bidding price follows the normal distribution with the mean value μ and the standard deviation σ, i.e., \(f\left (b_{i}\right)=\dfrac {1} {\sqrt {2\pi }\sigma }e^{-\dfrac {\left (b_{i}-\mu \right)^{2}} {2\sigma ^{2}}}\), where f(b i ) is the probability density function of b i . Note that, our proposed scheme can be generally applied to any bidding price distribution.

We divide the bidding price range into J intervals so that the price is within a finite-state space. We use S to represent the finite-state space of the bidding price.

$$ {S=\left\{s_{1}=0, s_{2}, s_{3},\ldots,s_{J}=\beta, s_{J+1}=+\infty\right\}}. $$

If the bidding price b i ∈[s j ,s j+1), we say that b i is in state s j . The probability that b i is in state s j (j=1,2,…,J) is:

$$ {p_{j}=\int_{s_{j}}^{s_{j+1}}f\left(b_{i}\right)db_{i}=\int_{s_{j}}^{s_{j+1}}\dfrac {1} {\sqrt {2\pi}\sigma}e^{-\dfrac {\left(b_{i}-\mu\right)^{2}} {2\sigma^{2}}}db_{i}}. $$
((1))

We use X n to denote the bidding price of the nth checked candidate relay. As a result, the probability that X n bid for s j is:

$$ {P(X_{n}=s_{j})=p_{j}, 1\leq j\leq J, 1\leq n\leq M}. $$
((2))

Note that, in order to ensure that the PU’s cost is no larger than β, we set s J =β and s J+1=∞.

Considering the fact that the time consumed for checking candidate relays are also valuable to the PU (because the PU can use them to transmit other packets, and the PU has a time limit to transmit a packet), checking more candidate relays should result in less PU’s utility. Therefore, we use a decreasing function \(f(n)=C^{\frac {n\cdot \alpha }{T}}\) to represent the time’s impact, where C∈(0,1]. Accordingly the PU’s real-valued reward function for checking the nth candidate relay is:

$$ \begin{aligned} Y_{n}=\,\,&y_{n}(X_{1},X_{2},\ldots,X_{n})=(\beta-X_{n})\cdot f(n)\\ =\,\,&(\beta-X_{n})\cdot C^{\frac{n\cdot \alpha}{T}}\\ &\text{for}\,\,{1\leq n \leq M}, {C\in (0,1]}. \end{aligned} $$
((3))

According to [14], a necessary condition for the existence of optimal stopping rules to solve the problem is that Y n should satisfy the following two requirements:

$$ E\{\text{sup}_{n} Y_{n}\}<+\infty. $$
((4))
$$ \text{limsup}_{n\rightarrow +\infty} Y_{n}\leq Y_{+\infty}. $$
((5))

We have proven that the Y n defined in (3) satisfies the requirements. Due to the page limit, we will report the proof in our technical report.

4.2 Online auction-based selection scheme

We use \(V_{n}^{(M)}(x_{1}, x_{2},\ldots,x_{n})\) to represent the maximum reward that could be obtained after checking n candidate relays:

$$ \begin{aligned} V_{n}^{(M)}(x_{1}, x_{2},\ldots,x_{n})=&~\text{max}\left\{\vphantom{V_{n+1}^{(M)}}y_{n}(x_{1}, x_{2},\ldots,x_{n}),\right.\\ &E\left(\vphantom{V_{n+1}^{(M)}}V_{n+1}^{(M)}\left(x_{1}, x_{2},\ldots,x_{n},X_{n+1}\right)\right.\\ &\left.\left.\!\mid X_{1}\!=x_{1},X_{2}\,=\,x_{2},\ldots,X_{n}=x_{n}\vphantom{V_{n+1}^{(M)}}\right)\right\}. \end{aligned} $$
((6))

In the above equation, y n (x 1,x 2,…,x n ) means the instantaneous reward if we stop at stage n, \(E\left (V_{n+1}^{(M)}(x_{1}, x_{2},\ldots,x_{n},X_{n+1})\mid X_{1}=x_{1},X_{2}=x_{2}\right.,\ldots, \left.{\vphantom {V_{n+1}^{(M)}}}X_{n}=~x_{n}\right)\) represents the expected maximum reward by proceeding to stage n+1. Thus, it is better for the PU to stop checking the next candidate relay when the instantaneous reward is larger than the expected maximum reward. Accordingly, the stopping rule is:

$$ \begin{aligned} \textrm{If:} y_{n}(x_{1}, x_{2},\ldots,x_{n})\geq E\left(V_{n+1}^{(M)}(x_{1}, x_{2},\ldots,x_{n},X_{n+1})\right.\\ \left.\mid X_{1}=x_{1},X_{2}=x_{2},\ldots,X_{n}=x_{n}{\vphantom{V_{n+1}^{(M)}}}\right), \end{aligned} $$
((7))

the PU stops at stage n (selects the nth candidate relay); otherwise, it continues to check the next candidate relay.

We use the method of backward induction to solve the finite stopping problem under the above stopping rule. Since the PU has to stop at stage M, we first calculate the optimal stopping rule at stage M−1, then conclude the optimal rule at stage M−2, and so on until we get the optimal rule at stage 1. We use Z M−n to represent the maximum expected reward at stage n+1, i.e., \(Z_{M-n}=E\left (V_{n+1}^{(M)}\right)\). We define that Z 0=−∞. As the bidding price of each candidate relay is independent and identically distributed, the value of the Z u , u=M−n, only depends on the remaining number of steps u. Therefore, the valuation of Z u can be calculated as follows:

$$ \begin{aligned} Z_{1}&=E\left\{V_{M}^{(M)}\right\}=E\{(\beta-X_{M})\cdot C^{\frac{M\cdot \alpha}{T}}\}\\ &=C^{\frac{M\cdot \alpha}{T}}\cdot (\beta-E(X_{M}))\\ &=C^{\frac{M\cdot \alpha}{T}}\cdot \beta-C^{\frac{M\cdot \alpha}{T}}\cdot \sum\limits_{j=1}^{J}s_{j}\cdot p_{j}. \end{aligned} $$
((8))

When u≥1, we can get:

$$ \begin{aligned} Z_{u+1}&=E\text{max}\{Y_{M-u}, Z_{u}\}\\ &=\sum\limits_{m}C^{\frac{(M-u)\cdot \alpha}{T}}\cdot (\beta-s_{m})\cdot p_{m}+\sum\limits_{n}Z_{u}\cdot p_{n}, \end{aligned} $$
((9))

where the values of m and n need to satisfy the following condition:

$$ \begin{aligned} &m\in\left\{j\mid C^{\frac{(M-u)\cdot \alpha}{T}}\cdot (\beta-s_{j})\geq Z_{u}, j=1,2,\ldots,J\right\},\\ &n\in\left\{j\mid C^{\frac{(M-u)\cdot \alpha}{T}}\cdot (\beta-s_{j})< Z_{u}, j=1,2,\ldots,J\right\},\\ &m+n=J. \end{aligned} $$
((10))

Once a relay has been selected, the PU needs to calculate its payment to the selected relay. In order to frustrate the selfish candidate relays who bid untruthful, the payment that is paid to the winner should be bid-independent. As Z u is a static value for each u∈[1,M], we can have the highest bidding price for the checked candidate relay to win at each step. Specifically, at step n, if y n (x 1,x 2,…,x n )≥Z M−n , the winner gets payment p n , which is calculated as follows:

$$ \text{As}: y_{n}(x_{1}, x_{2},\ldots,x_{n})=(\beta-p_{n})\cdot C^{\frac{n\cdot \alpha}{T}}=Z_{M-n}, $$
((11))
$$ \textrm{So:} p_{n}=\beta-\frac{Z_{M-n}}{C^{\frac{n\cdot \alpha}{T}}}. $$
((12))

Figure 2 briefly demonstrates the model of relay selection. Our proposed scheme is described in Algorithm 1. When the PU needs to send a packet in a given time, it first calculate M. The PU calculates M based on (1) the delay requirement of a packet, (2) the time of checking each candidate relay, and (3) the expected transmission delay between the PU and the relays. After checking the nth candidate relay, the PU computes y n and the expected reward Z M−n . According to the comparison of y n and Z M−n , the PU makes a decision whether to select X n for cooperative communication immediately. If the PU does not chose any of the first M−1 candidate relay, it has to select the last candidate relay. Under this worst case, the PU needs to pay the Mth candidate relay’s bidding price b M . Note that, the decreasing function f(n) can prevent the happening of the worst case, which has been verified by the simulations.

Figure 2
figure 2

The model of online relay selection.

5 Rationality and truthfulness analysis

In this section, we justify that our proposed online auction-based relay selection scheme is both truthful and individual rational. Due to the page limit, we only present the main results. The detailed proof of the lemmas will be given in our technical report.

Lemma 1.

Z u ≥0, for u∈[1,M].

Theorem 1.

The proposed online auction scheme is individual rational.

Proof.

For winner n, 1≤n≤M−1, the payment it can get is:

$$\begin{array}{@{}rcl@{}} &p_{n}=\beta-\frac{Z_{M-n}}{C^{\frac{n\cdot \alpha}{T}}}\\ &\textrm{As:} (\beta-b_{n})\cdot C^{\frac{n\cdot \alpha}{T}}\geq Z_{M-n}\\ &\textrm{So:} b_{n}\leq \beta-\frac{Z_{M-n}}{C^{\frac{n\cdot \alpha}{T}}}=p_{n} \end{array} $$

We can get that utility=p n −b n ≥0. If the winner is M, its utility=0. Therefore, we can conclude that, for any winner, its utility≥0.

For the PU, the value of the packet is β, and it has to pay p n to winner n. When 1≤n≤M−1:

$$\begin{array}{@{}rcl@{}} \text{utility}&=\beta-p_{n}\\ &\!\!\!\!=\frac{Z_{M-n}}{C^{\frac{n\cdot \alpha}{T}}} \end{array} $$

According to Lemma 1, we have Z u ≥0,u∈[1,M]. Thus, utility≥0. When n=M, the utility=β−b M . As the PU has to transmit the packet in this worst case, it is also rational.

Lemma 2.

The payment to winner n will be the same whatever it had bidden for b n or v n .

Lemma 3.

If bidder n wins the auction when bidding truthfully, the payment from the PU is no less than its evaluation, i.e., p n ≥v n .

Lemma 4.

If bidder n cannot win the auction when bidding truthfully, the possible payment from the PU is less than its evaluation, i.e., p n ≤v n .

Theorem 2.

The auction scheme is truthful.

Proof.

We use u n and u n′ to represent the utilities when bidder n bids with v n and b n , respectively.

I. We consider the case that b n >v n .

  1. 1.

    Bidder n wins when bidding with either b n or v n : according to Lemma 2 and Lemma 3, we have \(u_{n}^{\prime }=p_{n}-v_{n}=u_{n}\geq 0\);

  2. 2.

    Bidder n loses with b n but wins with v n : according to Lemma 3, we can get \(u_{n}^{\prime }=0\leq p_{n}-v_{n}=u_{n}\);

  3. 3.

    Bidder n wins with b n but loses with v n : this cannot happen in our auction scheme;

  4. 4.

    Bidder n loses with both b n and v n : u n′=u n =0;

II. We consider the case that b n <v n .

  1. 1.

    Bidder n wins when bidding with either b n or v n : according to Lemma 2 and Lemma 3, we have \(u_{n}^{\prime }=p_{n}-v_{n}=u_{n}\geq 0\);

  2. 2.

    Bidder n loses with b n but wins with v n : this cannot happen in our auction scheme;

  3. 3.

    Bidder n wins with b n but loses with v n : according to Lemma 4, we can get \(u_{n}^{\prime }=p_{n}-v_{n}\leq 0=u_{n}\);

  4. 4.

    Bidder n loses with both b n and v n : \(u_{n}^{\prime }=u_{n}=0\).

Through the analysis of all the possible cases, we can conclude that the utility of the bidder n cannot be increased if b n ≠v n . Therefore, our auction scheme can ensure the bidding truthfulness.

6 Simulations

In this section, we conduct simulations to examine the performance of our proposed online auction mechanism. We assume that the bidding price of each candidate relay follows the normal distribution with the expected value μ=0.5 and the variance σ=0.2. We partition the bidding price into J=20 intervals. We set the time limit for the PU to select a relay is 90 time slots, i.e., T=90. In the following, we study the impact of the parameters such as α, C, and β on the number of observation steps and the reward of the PU, respectively. In order to justify the effectiveness of our scheme, we compare ours with the ORS scheme, which has the knowledge of who is the optimal relay that has the maximal PU reward among the M candidate relays. In the ORS scheme, the PU checks the candidate relay one by one randomly and stops after checking the optimal relay. The simulations are conducted under several network scales. Particularly, the M takes the value of 20, 30, and 40, separately. Each reported result is the average of 100 independent instances.

6.1 The impact of observation duration α

In this subsection, we set the time efficiency parameter C=0.3. The private value of the packet β=0.9. Figure 3 illustrates the relationship between the number of observation steps and the observation duration α under different network scales. From Figure 3, we observe that the PU generally stops checking within six steps in our scheme. When α changes from 2 to 2.2, there is an obvious decline on the number of observation steps in our scheme. Generally speaking, a larger α results in a smaller observation steps. This is because the cost of checking a candidate relay is large when α is big. Thus, the PU intends to find a proper relay as soon as possible. On the other hand, when α is small, the PU may attempt to check more candidate relays to maximize its reward. We also find that the number of observation steps is always less than M, which indicates that the worst case of choosing the Mth relay will not happen. In addition, we can see that the number of observation steps of ORS is always higher than the corresponding one of ours. It means that our scheme needs less time for selecting a cooperative relay.

Figure 3
figure 3

Average number of observation steps α .

In Figure 4, we compare our scheme with ORS in terms of reward. Specifically, the reward of our scheme is normalized by the corresponding one of ORS, and we name this ratio as relative monetary reward. Figure 4 shows the relationship between the relative monetary reward of the PU and the observation duration. According to the results, we find that the relative reward of the PU fluctuates with the checking duration α, but with a small variance (within 4%), and that the relative monetary reward is always higher than 91.5%. Jointly considering the results in Figure 3, we can conclude that our scheme selects the relay sooner, and the monetary reward for a single transmission is slightly smaller than the one of ORS.

Figure 4
figure 4

Relative reward for one packet transmission α .

By the time that the PU finds the optimal relay in OSR, the PU in our scheme can find multiple relays and get rewards via transmitting multiple packets. Therefore, to make a fair comparison in terms of reward, we define the ratio of monetary reward as the ratio between the total reward obtained in our scheme and the reward of ORS during the same time period. The value of ratio is reported in the Figure 5. From Figure 5, we find that the ratio monetary reward is always higher than 1. The maximum value can reach 2 when α=1 and M=40. In other words, our proposed scheme can produce higher reward than OSR during the same time period.

Figure 5
figure 5

Ratio of reward during the same time period with α .

6.2 The impact of time efficiency C

In this subsection, we set the observation duration α=2, the private value of the packet β=0.9. From Figure 6, we can see that the PU can stop within six steps in our scheme, and the number of observation steps increases with the increase of C. This is because the impact of C on the reward is big when C is small. Thus, the PU intends to stop as soon as possible at a smaller C. By comparing with the ORS scheme, we can find that the number of observation steps in our scheme is always less than the corresponding one of ORS.

Figure 6
figure 6

Average number of observation steps with C .

In Figure 7, we compare our scheme with ORS scheme in terms of relative reward for one packet transmission under different C. Similarly, the results of our scheme are normalized by the corresponding one of ORS. From Figure 7, we find that the relative reward decreases with the increase of C. The reason is that the time impact on the reward function is small when C approximate 1. In general, the relative reward is larger than 85%.

Figure 7
figure 7

Relative reward for one packet transmission with C .

Similarly to the ratio results reported in Figure 5, we report the ratio of reward in Figure 8 under several C. From Figure 8, we can see that the ratio of monetary reward increases with the increase of C. The value of the ratio is always larger than 1 and can reach 3 when M=40 and C=0.9. In other words, the reward of our scheme is always higher than the one of ORS scheme during the same time period.

Figure 8
figure 8

Ratio of reward during the same time period with C .

6.3 The impact of private value β

In this subjection, we set the observation duration α=2 and time efficiency C=0.3. Figure 9 reports the relationship between the observation steps and β. From Figure 9, we can observe that the number of observation steps decreases with β at the beginning in our scheme. When β is between 0.6 and 0.7, the number of observation steps reaches the smallest value. Hereafter, the curve rises with the increase of β. When β=1, the number of observation steps reaches the largest value 5. The reason for the trend of curve is that it is hard to find a relay that makes a small bid when the value of β is small, and the PU intends to check more candidate relays to find larger reward when β is big. We also observe that with different number of candidate relays, the number of observation steps almost stay the same. By comparing with ORS, we can see that the number of observation steps in our scheme is always less than the corresponding one of ORS.

Figure 9
figure 9

Average number of observation steps with β .

In Figure 10, we plot the relative reward for one packet transmission under several β. From Figure 10, we can see that the relative reward increases with the increase of β. The reason is that a larger β can enlarge the reward. When β is greater than 0.8, the relative reward is larger than 90%.

Figure 10
figure 10

Relative reward for one packet transmission with β .

Figure 11 reports the ratio of reward time during the same time period under several β. From Figure 11, we can observe that the ratio fluctuates with β, but the value of the ratio is always higher than 1.

Figure 11
figure 11

Ratio of reward during the same time period with β .

In summary, we can conclude that the PU can always select a relay within six observations by our scheme, which is much less than the number of observation steps in OSR. Although the reward of transmitting one packet in our scheme may be slightly smaller than the corresponding one in OSR, the accumulated reward of our scheme is always much larger than the corresponding one of OSR for transmitting multiple packets in a given time period.

7 Conclusions

In this paper, we have designed an online auction-based relay selection scheme for cooperative communication in CR networks. Without checking the information from all the candidate relays, the PU can select a desirable relay within a given time limit. Our scheme achieves the property of truthfulness, which assures that all the bidders reveal their true valuations and eliminates the fear of market manipulation. Meanwhile, our scheme is individual rational, which guarantees that both the PU and relays can be benefited from cooperative communication. Extensive simulations show that our scheme enables the PU to select a proper relay in a very short time and achieves a higher accumulated reward in a given time period comparing to the OSRscheme.

References

  1. IF Akyildiz, WY Lee, MC Vuran, S Mohanty, NeXt generation/dynamic spectrum access/cognitive radio wireless networks: a survey. Comput. Networks. 50(13), 2127–2159 (2006).

    Article  MATH  Google Scholar 

  2. JN Laneman, DN Tse, GW Wornell, Cooperative diversity in wireless networks: Efficient protocols and outage behavior. IEEE Trans. Inf. Theory. 50(12), 3062–3080 (2004).

    Article  MATH  MathSciNet  Google Scholar 

  3. J Huang, Z Han, M Chiang, Poor H, Auction-based resource allocation for cooperative communications. IEEE J. Sel. Areas Commun. 26(7), 1226–1237 (2008).

    Article  Google Scholar 

  4. G Zhang, L Cong, L Zhao, K Yang, H Zhang, Competitive resource sharing based on game theory in cooperative relay networks. ETRI J. 31, 89–91 (2009).

    Article  Google Scholar 

  5. B Wang, Z Han, KR Liu, in IEEE INFOCOM. Distributed relay selection and power control for multiuser cooperative communication networks using buyer/seller game (Barcelona, 6–12 May 2007), pp. 544–552.

  6. B Caoy, JW Mark, Q Zhang, R Lu, X Lin, XS Shen, in IEEE INFOCOM. On optimal communication strategies for cooperative cognitive radio networking, (2013), pp. 1774–1782.

  7. H Roh, C Jung, W Lee, DZ Du, in IEEE ICNC. A Stackelberg game for cooperative cognitive radio network with active SUs, (2013), pp. 911–915.

  8. H Roh, C Shao, S Lee, W Lee, DZ Du, in IEEE INFOCOM DEMO. Secondary user games with spectrum leasing market in cooperative cognitive radio networks, (2013), pp. 2461–2462.

  9. P Klemperer, What really matters in auction design. J. Econ. Perspect. 16, 169–189 (2002).

    Article  Google Scholar 

  10. H Li, X Cheng, K Li, X Xing, T Jing, in IEEE INFOCOM. Utility-based cooperative spectrum sensing scheduling in cognitive radio networks, (2013), pp. 165–169.

  11. D Yang, X Fang, G Xue, HERA: An optimal relay assignment scheme for cooperative networks. IEEE J. Sel. Areas Commun. 30(2), 245–253 (2012).

    Article  Google Scholar 

  12. P Klemperer, What really matters in auction design. J. Econ. Perspect. 16, 169–189 (2002).

    Article  Google Scholar 

  13. Y Zhao, R Adve, TJ Lim, Improving amplify-and-forward relay networks: optimal power allocation versus selection. IEEE Trans. Wireless Commun. 6(8), 3114 (2007).

    Google Scholar 

  14. TS Ferguson, Optimal stopping and applications. http://www.math.ucla.edu/~tom/Stopping/Contents.html.

  15. S Kern, SD Müller, N Hansen, D Büche, J Ocenasek, P Koumoutsakos, Learning probability distributions in continuous evolutionary algorithms–a comparative review. Nat. Comput. 3, 77–112 (2004).

    Article  MATH  MathSciNet  Google Scholar 

Download references

Acknowledgements

This work is partially supported by the National Natural Science Foundation of China (Grant Nos. 61272505, 61172074, and 61371069) and the National Science Foundation of the US (CNS-1265311).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wei Cheng.

Additional information

Competing interests

The authors declare that they have no competing interests.

Rights and permissions

Open Access  This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jing, T., Zhang, F., Cheng, W. et al. Online auction-based relay selection for cooperative communication in CR networks. J Wireless Com Network 2015, 20 (2015). https://doi.org/10.1186/s13638-015-0258-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13638-015-0258-9

Keywords