Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3241539.3241552acmconferencesArticle/Chapter ViewAbstractPublication PagesmobicomConference Proceedingsconference-collections
research-article
Public Access

GPF: A GPU-based Design to Achieve ~100 μs Scheduling for 5G NR

Published: 15 October 2018 Publication History

Abstract

5G New Radio (NR) is designed to operate under a broad range of frequency bands and support new applications with ultra-low latency. To support its diverse operating conditions, a set of different OFDM numerologies has been defined in the standards body. Under this numerology, it is necessary to perform scheduling with a time resolution of ∼100 μs. This requirement poses a new challenge that does not exist in LTE and cannot be supported by any existing LTE schedulers. In this paper, we present the design of GPF -- a GPU-based proportional fair (PF) scheduler that can meet the ∼100 μs time requirement. The key ideas include decomposing the scheduling problem into a large number of small and independent sub-problems and selecting a subset of sub-problems from the most promising search space to fit into a GPU. By implementing GPF on an off-the-shelf Nvidia Quadro P6000 GPU, we show that GPF is able to achieve near-optimal performance while meeting the ∼100 $\mathrmμs time requirement. GPF represents the first successful design of a GPU-based PF scheduler that can meet the new time requirement in NR.

References

[1]
Ericsson Technology Review, "5G new radio: Designing for the future." Available: https://www.ericsson.com/en/ericsson-technology-review/ archive/2017/designing-for-the-future-the-5g-nr-physical-layer
[2]
Qualcomm, "Making 5G NR a commercial reality." Available: https://www.qualcomm.com/media/documents/files/ making-5g-nr-a-commercial-reality.pdf
[3]
Z. E. Ankarali, B. Peköz, and H. Arslan, "Flexible radio access beyond 5G: A future projection on waveform, numerology, and frame design principles,?" IEEE Access, vol. 5, pp. 18295--18309, May 2017.
[4]
3GPP TR 38.913 version 14.3.0, "Study on scenarios and requirements for next generation access technologies." Available: https://portal.3gpp.org/desktopmodules/Specifications/ SpecificationDetails.aspx'specificationId=2996
[5]
3GPP TR 38.804 version 14.0.0, "Study on New Radio access technology; Radio interface protocol aspects." Available: https: //portal.3gpp.org/desktopmodules/Specifications/SpecificationDetails. aspx'specificationId=3070
[6]
3GPP TS 38.211 version 15.0.0, "NR; Physical channels and modulation." Available: https://portal.3gpp.org/desktopmodules/Specifications/ SpecificationDetails.aspx'specificationId=3213
[7]
3GPP TS 38.214 version 15.0.0, "NR; Physical layer procedures for data." Available: https://portal.3gpp.org/desktopmodules/Specifications/ SpecificationDetails.aspx'specificationId=3216
[8]
3GPP TS 38.101--1 version 15.0.0, "NR; User Equipment (UE) radio transmission and reception; Part 1: Range 1 Standalone." Available: https://portal.3gpp.org/desktopmodules/Specifications/ SpecificationDetails.aspx'specificationId=3201
[9]
3GPP TS 38.101--2 version 15.0.0, "NR; User Equipment (UE) radio transmission and reception; Part 2: Range 2 Standalone." Available: https://portal.3gpp.org/desktopmodules/Specifications/ SpecificationDetails.aspx'specificationId=3284
[10]
3GPP TS 38.300 version 15.0.0, "NR; NR and NG-RAN overall description." Available: https://portal.3gpp.org/desktopmodules/Specifications/ SpecificationDetails.aspx'specificationId=3191
[11]
3GPP TR 38.901 version 15.0.0, "Study on channel model for frequencies from 0.5 to 100 GHz." Available: https://portal. 3gpp.org/desktopmodules/Specifications/SpecificationDetails.aspx? specificationId=3173
[12]
3GPP TR 22.891 version 14.2.0, "Feasibility study on new services and markets technology enablers; Stage 1." Available: https: //portal.3gpp.org/desktopmodules/Specifications/SpecificationDetails. aspx'specificationId=2897
[13]
3GPP TS 36.211 version 15.0.0, "Evolved Universal Terrestrial Radio Access (E-UTRA); Physical channels and modulation." Available: https://portal.3gpp.org/desktopmodules/Specifications/ SpecificationDetails.aspx'specificationId=2425
[14]
3GPP TS 36.213 version 15.0.0, "Evolved Universal Terrestrial Radio Access (E-UTRA); Physical layer procedures." Available: https://portal.3gpp.org/desktopmodules/Specifications/ SpecificationDetails.aspx'specificationId=2427
[15]
3GPP TS 36.101 version 15.1.0, "Evolved Universal Terrestrial Radio Access (E-UTRA); User Equipment (UE) radio transmission and reception." Available: https://portal.3gpp.org/desktopmodules/Specifications/ SpecificationDetails.aspx'specificationId=2411
[16]
D. Schmalstieg and T. Höllerer, Augmented Reality: Principles and Practice. Reading, MA, USA: Addison-Wesley, 2016.
[17]
G. Burdea, and P. Coiffet, Virtual Reality Technology, 2nd ed. New York, NY, USA: Wiley, 2003.
[18]
L. Kong, M. K. Khan, F. Wu, G. Chen, and P. Zeng, "Millimeter-wave wireless communications for IoT-cloud supported autonomous vehicles: Overview, design, and challenges," IEEE Commun. Mag., vol. 55, no. 1, pp. 62--68, Jan. 2017.
[19]
T. S. Rappaport, Wireless Communications: Principles and Practice. Upper Saddle River, NJ, USA: Prentice-Hall, 1996.
[20]
S. Sesia, I. Toufik, and M. Baker, LTE-The UMTS Long Term Evolution: From Theory to Practice. New York, NY, USA: Wiley, 2009.
[21]
F. Capozzi, G. Piro, L. Grieco, G. Boggia, and P. Camarda, "Downlink packet scheduling in LTE cellular networks: Key design issues and a survey," IEEE Commun. Surveys Tutorials, vol. 15, no. 2, pp. 678--700, 2013.
[22]
O. Grondalen, A. Zanella, K. Mahmood, M. Carpin, J. Rasool, and O. Osterbo, "Scheduling policies in time and frequency domains for LTE downlink channel: a performance comparison," IEEE Trans. Veh. Technol., vol. 66, no. 4, pp. 3345--3360, Apr. 2017.
[23]
A. Stolyar, "On the asymptotic optimality of the gradient scheduling algorithm for multi-user throughput allocation," Operations Research, vol. 53, pp. 12--25, 2005.
[24]
D. Tse, "Multiuser diversity in wireless networks: smart scheduling, dumb antennas and epidemic communication," in IMA Workshop on Wireless Networks, 2001. Available: https://web.stanford.edu/~dntse/papers/ ima810.pdf
[25]
R. Kwan, C. Leung, and J. Zhang, "Proportional fair multiuser scheduling in LTE," IEEE Signal Process. Lett., vol. 16, pp. 461--464, June 2009.
[26]
S. B. Lee, S. Choudhury, A. Khoshnevis, S. Xu, and S. Lu, "Downlink MIMO with frequency-domain packet scheduling for 3GPP LTE," in Proc. IEEE INFOCOM, pp. 1269--1277, Apr. 2009, Rio de Janeiro, Brazil.
[27]
H. Zhang, N. Prasad, and S. Rangarajan, "MIMO downlink scheduling in LTE systems," in Proc. IEEE INFOCOM, pp. 2936--2940, Mar. 2012, Orlando, FL, USA.
[28]
H. S. Liao, P. Y. Chen, and W. T. Chen, "An efficient downlink radio resource allocation with carrier aggregation in LTE-Advanced networks," IEEE Trans. Mobile Computing, vol. 13, no. 10, pp. 2229--2239, Oct. 2014.
[29]
S. Han, K. Jang, K. Park, and S. Moon, "PacketShader: a GPUaccelerated software router," in Proc. ACM SIGCOMM, pp. 195--206, Aug. 2010, New Delhi, India.
[30]
F. Fusco, M. Vlachos, X. Dimitropoulos, and L. Deri, "Indexing million of packets per second using GPUs," in Proc. ACM SIGCOMM-IMC, pp. 327- 332, Oct. 2013, Barcelona, Spain.
[31]
M. Varvello, R. Laufer, F. Zhang, and T. V. Lakshman, "Multilayer packet classification with graphics processing units," IEEE Trans. Networking, vol. 24, no. 5, pp. 2728--2741, Oct. 2016.
[32]
S. Roger, C. Ramiro, A. Gonzalez, V. Almenar, and A. M. Vidal, "Fully parallel GPU implementation of a fixed-complexity soft-output MIMO detector," IEEE Trans. Veh. Technol., vol. 61, no. 8, pp. 3796--3800, Oct. 2012.
[33]
Y. Zhao and F. Lau, "Implementation of decoders for LDPC block codes and LDPC convolutional codes based on GPUs," IEEE Trans. Parallel Distrib. Syst., vol. 25, no. 3, pp. 663--672, Mar. 2014.
[34]
A. Li, R. G. Maunder, B. M. Al-Hashimi, and L. Hanzo, "Implementation of a fully-parallel turbo decoder on a general-purpose graphics processing unit," IEEE Access, vol. 4, pp. 5624--5639, Jun. 2016.
[35]
H.D. Sherali andW. P.Adams, A Reformulation-Linearization Technique for Solving Discrete and Continuous Nonconvex Problems, Chapter 8. Norwell, MA, USA: Kluwer Academic Publishers, 1999.
[36]
Y. T. Hou, Y. Shi, and H. D. Sherali, Applied Optimization Methods for Wireless Networks, Chapter 6. Cambridge, U.K.: Cambridge Univ. Press, 2014.
[37]
G. L. Nemhauser and L. A. Wolsey, Integer and Combinatorial Optimization. New York, NY, USA: John Wiley & Sons, 1999.
[38]
D. P. Bertsekas, Dynamic Programming and Optimal Control. Belmont, MA, USA: Athena Scientific, 1995.
[39]
E. G. Talbi, Metaheuristics: From Design to Implementation. Hoboken, NJ, USA: Wiley, 2009.
[40]
M. R. Zargham, Computer Architecture: Single and Parallel Systems. Upper Saddle River, NJ, USA: Prentice-Hall, 1996.
[41]
IBM ILOG CPLEX Optimizer, software available at http://www01.ibm. com/software/integration/optimization/cplex-optimizer
[42]
Nvidia, "Data sheet: Quadro P6000." Available: https://images.nvidia.com/content/pdf/quadro/data-sheets/ 192152-NV-DS-Quadro-P6000-US-12Sept-NV-FNL-WEB.pdf
[43]
Nvidia, "CUDA C programming guide v9.1." Available: http://docs. nvidia.com/cuda/cuda-c-programming-guide/index.html
[44]
Nvidia, "Optimizing parallel reduction in CUDA." Available: https://developer.download.nvidia.com/compute/cuda/1.1-Beta/x86_ website/projects/reduction/doc/reduction.pdf
[45]
W. Yang, G. Durisi, and E. Riegler, "On the capacity of large-MIMO block-fading channels," IEEE J. Sel. Areas Commun., vol. 31, no. 2, pp. 117- 132, Feb. 2013.
[46]
Nvidia, "Nvidia Tesla P100 -- The most advanced data center accelerator ever built." Available: https://images.nvidia.com/content/pdf/tesla/ whitepaper/pascal-architecture-whitepaper.pdf
[47]
F. Zhang, J. Zhai, B. He, S. Zhang, and W. Chen, "Understanding corunning behaviors on integrated CPU/GPU architectures," IEEE Trans. Parallel Distrib. Syst., vol. 28, no. 3, pp. 905--918, Mar. 2017.
[48]
M. Daga, M. Nutter, and M. Meswani, "Efficient breadth-first search on a heterogeneous processor," in Proc. IEEE Int. Conf. Big Data, pp. 373--382, Oct. 2014, Washington DC, USA.
[49]
Intel, "The compute architecture of Intel Processor Graphics Gen7.5." Available: https://software.intel.com/sites/default/files/managed/4f/e0/ Compute_Architecture_of_Intel_Processor_Graphics_Gen7dot5_Aug4_ 2014.pdf

Cited By

View all
  • (2024)CloudRIC: Open Radio Access Network (O-RAN) Virtualization with Shared Heterogeneous ComputingProceedings of the 30th Annual International Conference on Mobile Computing and Networking10.1145/3636534.3649381(558-572)Online publication date: 29-May-2024
  • (2024)Modeling and Optimization of Channel Allocation for PAL and GAA Users in the CBRS BandIEEE Transactions on Cognitive Communications and Networking10.1109/TCCN.2023.331841210:1(1-20)Online publication date: Feb-2024
  • (2024) O-M 3 : Real-Time Multi-Cell MIMO Scheduling in 5G O-RAN IEEE Journal on Selected Areas in Communications10.1109/JSAC.2023.333616442:2(339-355)Online publication date: Feb-2024
  • Show More Cited By

Index Terms

  1. GPF: A GPU-based Design to Achieve ~100 μs Scheduling for 5G NR

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MobiCom '18: Proceedings of the 24th Annual International Conference on Mobile Computing and Networking
    October 2018
    884 pages
    ISBN:9781450359030
    DOI:10.1145/3241539
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 October 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. 5g nr
    2. gpu
    3. optimization
    4. real-time
    5. resource scheduling

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    MobiCom '18
    Sponsor:

    Acceptance Rates

    MobiCom '18 Paper Acceptance Rate 42 of 187 submissions, 22%;
    Overall Acceptance Rate 440 of 2,972 submissions, 15%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)356
    • Downloads (Last 6 weeks)34
    Reflects downloads up to 13 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)CloudRIC: Open Radio Access Network (O-RAN) Virtualization with Shared Heterogeneous ComputingProceedings of the 30th Annual International Conference on Mobile Computing and Networking10.1145/3636534.3649381(558-572)Online publication date: 29-May-2024
    • (2024)Modeling and Optimization of Channel Allocation for PAL and GAA Users in the CBRS BandIEEE Transactions on Cognitive Communications and Networking10.1109/TCCN.2023.331841210:1(1-20)Online publication date: Feb-2024
    • (2024) O-M 3 : Real-Time Multi-Cell MIMO Scheduling in 5G O-RAN IEEE Journal on Selected Areas in Communications10.1109/JSAC.2023.333616442:2(339-355)Online publication date: Feb-2024
    • (2024)Providing UE-level QoS Support by Joint Scheduling and Orchestration for 5G vRANIEEE INFOCOM 2024 - IEEE Conference on Computer Communications10.1109/INFOCOM52122.2024.10621408(51-60)Online publication date: 20-May-2024
    • (2024)YinYangRAN: Resource Multiplexing in GPU-Accelerated Virtualized RANsIEEE INFOCOM 2024 - IEEE Conference on Computer Communications10.1109/INFOCOM52122.2024.10621380(721-730)Online publication date: 20-May-2024
    • (2024)Scalable and Efficient URLLC Resource Allocation Based on Offline Graph Reinforcement Learning2024 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)10.1109/ICAIIC60209.2024.10463212(348-353)Online publication date: 19-Feb-2024
    • (2023)Turbo-HB: A Sub-Millisecond Hybrid Beamforming Design for 5G mmWave SystemsIEEE Transactions on Mobile Computing10.1109/TMC.2022.315248022:7(4332-4346)Online publication date: 1-Jul-2023
    • (2023)Achieving Real-Time Spectrum Sharing in 5G Underlay Coexistence With Channel UncertaintyIEEE Transactions on Mobile Computing10.1109/TMC.2021.312094522:4(1922-1937)Online publication date: 1-Apr-2023
    • (2023)Towards an Efficient SIMD Virtual Radio Access Network (vRAN) and Edge Cloud SystemIEEE Transactions on Cloud Computing10.1109/TCC.2023.3275576(1-12)Online publication date: 2023
    • (2022)OutRANProceedings of the 18th International Conference on emerging Networking EXperiments and Technologies10.1145/3555050.3569122(369-385)Online publication date: 30-Nov-2022
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    ePub

    View this article in ePub.

    ePub

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media