research-article

Deep value of information estimators for collaborative human-machine information gathering

Authors: Nicholas Sweet, Soumik Sarkar

ICCPS '16: Proceedings of the 7th International Conference on Cyber-Physical Systems

Article No.: 3, Pages 1 - 10

Published: 11 April 2016

Abstract

Effective human-machine collaboration can significantly improve many learning and planning strategies for information gathering by fusing 'hard' data from machine sensors with 'soft' data from human sensors. However, gathering the most informative data from human sensors without overloading them with tasks remains a critical technical challenge. In this context, Value of Information (VOI) is a crucial decision-theoretic metric for scheduling interaction with human sensors. We present a new deep-learning-based VOI estimation framework that can be used to schedule collaborative human-machine sensing with efficient online inference and minimal hand-tuning of the policy. Supervised learning is used to train deep convolutional neural networks (CNNs) to extract hierarchical features from 'images' of belief spaces obtained via data fusion. These features can be associated with soft data query choices to reliably compute VOI for human interaction. The CNN framework is described in detail, and its performance is compared with that of a feature-based POMDP scheduling policy. The practical feasibility of our method is also demonstrated on a mobile robotic search problem with language-based semantic human sensor inputs.
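To make the notion of VOI for a human query concrete, the following is a minimal sketch that scores yes/no questions by expected entropy reduction over a discrete belief. This is one common information-theoretic reading of VOI and is an assumption here, since the abstract does not state the exact definition used in the paper. The 4-cell belief, the two example queries, and all function names are hypothetical. A quantity of this kind is what a learned estimator could be trained to approximate from belief 'images'.

```python
import math


def entropy(belief):
    """Shannon entropy (bits) of a discrete belief given as a list of probabilities."""
    return -sum(p * math.log2(p) for p in belief if p > 0.0)


def posterior(belief, likelihood):
    """Bayes update; likelihood[i] = P(answer | target in cell i).

    Returns (posterior belief, probability of observing this answer).
    """
    joint = [p * l for p, l in zip(belief, likelihood)]
    evidence = sum(joint)
    if evidence == 0.0:
        # Answer cannot occur under this belief; it contributes nothing.
        return list(belief), 0.0
    return [j / evidence for j in joint], evidence


def query_voi(belief, p_yes_given_cell):
    """Expected entropy reduction (assumed VOI) from asking one yes/no question."""
    p_no_given_cell = [1.0 - p for p in p_yes_given_cell]
    expected_posterior_entropy = 0.0
    for lik in (p_yes_given_cell, p_no_given_cell):
        post, p_answer = posterior(belief, lik)
        expected_posterior_entropy += p_answer * entropy(post)
    return entropy(belief) - expected_posterior_entropy


# Hypothetical uniform belief over four candidate target cells.
belief = [0.25, 0.25, 0.25, 0.25]

# Hypothetical queries, each given as P(human answers "yes" | target in cell i).
queries = {
    "left_half": [1.0, 1.0, 0.0, 0.0],      # perfectly reliable spatial question
    "uninformative": [0.5, 0.5, 0.5, 0.5],  # answer is independent of the target
}

voi = {name: query_voi(belief, lik) for name, lik in queries.items()}
best_query = max(voi, key=voi.get)
print(voi, best_query)
```

In this toy setting the scheduler would simply ask the query with the largest score; a realistic human sensor model would replace the hand-written likelihoods.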

Cited By

K. Lore, A. Akintayo, and S. Sarkar. LLNet. Pattern Recognition, 61(C):650--662, 2017. Online publication date: 1 January 2017.
https://dl.acm.org/doi/10.1016/j.patcog.2016.06.008

Published In

ICCPS '16: Proceedings of the 7th International Conference on Cyber-Physical Systems

April 2016

291 pages

General Chairs: Xenofon Koutsoukos (Vanderbilt University), Ian M. Mitchell (University of British Columbia, Canada)

Sponsors

SIGBED: ACM Special Interest Group on Embedded Systems

Publisher

IEEE Press

Conference

ICCPS '16: ACM/IEEE 7th International Conference on Cyber-Physical Systems

April 11 - 14, 2016

Vienna, Austria

Acceptance Rates

Overall Acceptance Rate 25 of 91 submissions, 27%
