Online Learning Methods for Border Patrol Resource Allocation

Richard Klíma^17,18,
Christopher Kiekintveld¹⁸ &
Viliam Lisý¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNSC,volume 8840))

Included in the following conference series:

International Conference on Decision and Game Theory for Security

2027 Accesses

Abstract

We introduce a model for border security resource allocation with repeated interactions between attackers and defenders. The defender must learn the optimal resource allocation strategy based on historical apprehension data, balancing exploration and exploitation in the policy. We experiment with several solution methods for this online learning problem including UCB, sliding-window UCB, and EXP3. We test the learning methods against several different classes of attackers including attacker with randomly varying strategies and attackers who react adversarially to the defender’s strategy. We present experimental data to identify the optimal parameter settings for these algorithms and compare the algorithms against the different types of attackers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Combining Online Learning and Equilibrium Computation in Security Games

Online Versus Offline Reinforcement Learning for False Target Control Against Known Threat

Online Learning Methods for Controlling Dynamic Cyber Deception Strategies

References

2012–2016 border patrol strategic plan. U.S. Customs and Border Protection (2012)
Google Scholar
Auer, P.: Using confidence bounds for exploitation-exploration trade-offs. The Journal of Machine Learning Research 3, 397–422 (2003)
MathSciNet MATH Google Scholar
Auer, P., Cesa-Bianchi, N., Freund, Y., Schapire, R.E.: The non-stochastic multi-armed bandit problem. SIAM Journal on Computing 32(1) (2001)
Google Scholar
Fudenberg, D., Levine, D.K.: The Theory of Learning in Games. The MIT Press (1998)
Google Scholar
Garivier, A., Moulines, E.: On upper-confidence bound policies for non-stationary bandit problems. Technical report (2008)
Google Scholar
Kiekintveld, C., Jain, M., Tsai, J., Pita, J., Ordonez, F., Tambe, M.: Computing optimal randomized resource allocations for massive security games. In: AAMAS 2009 (2009)
Google Scholar
Pita, J., Jain, M., Western, C., Portway, C., Tambe, M., Ordonez, F., Kraus, S., Parachuri, P.: Depoloyed ARMOR protection: The application of a game-theoretic model for security at the Los Angeles International Airport. In: AAMAS 2008 (Industry Track) (2008)
Google Scholar
Pita, J., Tambe, M., Kiekintveld, C., Cullen, S., Steigerwald, E.: GUARDS - game theoretic security allocation on a national scale. In: AAMAS 2011 (Industry Track) (2011)
Google Scholar
Predd, J., Willis, H., Setodji, C., Stelzner, C.: Using pattern analysis and systematic randomness to allocate U.S. border security resources (2012)
Google Scholar
Shieh, E., An, B., Yang, R., Tambe, M., Baldwin, C., Direnzo, J., Meyer, G., Baldwin, C.W., Maule, B.J., Meyer, G.R.: PROTECT: A Deployed Game Theoretic System to Protect the Ports of the United States. In: AAMAS (2012)
Google Scholar
Tambe, M.: Security and Game Theory: Algorithms, Deployed Systems, Lessons Learned. Cambridge University Press (2011)
Google Scholar
Tsai, J., Rathi, S., Kiekintveld, C., Ordóñez, F., Tambe, M.: IRIS - A tools for strategic security allocation in transportation networks. In: AAMAS 2009 (Industry Track) (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, FEE, Czech Technical University in Prague, Prague, Czeck Republic
Richard Klíma & Viliam Lisý
Computer Science Department, University of Texas at El Paso, EI Paso, TX, USA
Richard Klíma & Christopher Kiekintveld

Authors

Richard Klíma
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Kiekintveld
View author publications
You can also search for this author in PubMed Google Scholar
Viliam Lisý
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Electrical Engineering Department, Network Securitiy Lab, University of Washington, Box 352500, 98195-2055, Seattle, WA, USA
Radha Poovendran
Department of Electrical and Computer Engineering, Virginia Tech, Whittmore Hall 302, 1185 Perry Street, 24061, Blacksburg, VA, USA
Walid Saad

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Klíma, R., Kiekintveld, C., Lisý, V. (2014). Online Learning Methods for Border Patrol Resource Allocation. In: Poovendran, R., Saad, W. (eds) Decision and Game Theory for Security. GameSec 2014. Lecture Notes in Computer Science, vol 8840. Springer, Cham. https://doi.org/10.1007/978-3-319-12601-2_20

Download citation

DOI: https://doi.org/10.1007/978-3-319-12601-2_20
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12600-5
Online ISBN: 978-3-319-12601-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics