Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/2984464.2984467acmconferencesArticle/Chapter ViewAbstractPublication PagesiccpsConference Proceedingsconference-collections
research-article

Deep value of information estimators for collaborative human-machine information gathering

Published: 11 April 2016 Publication History

Abstract

Effective human-machine collaboration can significantly improve many learning and planning strategies for information gathering via fusion of 'hard' and 'soft' data originating from machine and human sensors, respectively. However, gathering the most informative data from human sensors without task overloading remains a critical technical challenge. In this context, Value of Information (VOI) is a crucial decision-theoretic metric for scheduling interaction with human sensors. We present a new Deep Learning based VOI estimation framework that can be used to schedule collaborative human-machine sensing with efficient online inference and minimal policy hand-tuning. Supervised learning is used to train deep convolutional neural networks (CNNs) to extract hierarchical features from 'images' of belief spaces obtained via data fusion. These features can be associated with soft data query choices to reliably compute VOI for human interaction. The CNN framework is described in detail, and a performance comparison to a feature-based POMDP scheduling policy is provided. The practical feasibility of our method is also demonstrated on a mobile robotic search problem with language-based semantic human sensor inputs.

References

[1]
N. Ahmed, M. Campbell, D. Casbeer, Y. Cao, and D. Kingston. Fully bayesian learning and spatial reasoning with flexible human sensor networks. In Proceedings of the ACM/IEEE Sixth Int'l Conf. on Cyber-Physical Systems, pages 80--89. ACM, 2015.
[2]
N. Ahmed, E. Sample, and M. Campbell. Bayesian multicategorical soft data fusion for human robot collaboration. IEEE Trans. on Robotics, 29:189--206, 2013.
[3]
F. Bourgault, A. Chokshi, J. Wang, D. Shah, J. Schoenberg, R. Iyer, F. Cedano, and M. Campbell. Scalable Bayesian human robot cooperation in mobile sensor networks. In Int'l Conf. on Intelligent Robots and Sys., pages 2342--2349, 2008.
[4]
M. F. Huber, T. Bailey, H. Durrant-Whyte, and U. D. Hanebeck. On entropy approximation for Gaussian mixture random vectors. In 2008 IEEE Int'l Conf. on Multisensor Fusion and Integration, pages 181--188, Aug. 2008.
[5]
T. Kaupp, B. Douillard, F. Ramos, A. Makarenko, and B. Upcroft. Shared environment representation for a human robot team performing information fusion. Journal of Field Robotics, 24(11):911--942, 2007.
[6]
T. Kaupp, A. Makarenko, and H. Durrant-Whyte. Human robot communication for collaborative decision making: A probabilistic approach. Robotics and Autonomous Systems, 58(5):444--456, May 2010.
[7]
K. Kavukcuoglu, P. Sermanet, Y.-L. Boureau, K. Gregor, M. Mathieu, and Y. LeCun. Imagenet classification with deep convolutional neural networks. Neural Information Processing Systems, 2010.
[8]
D. Kingston. Intruder Tracking Using UAV Teams and Ground Sensor Networks. In German Aviation and Aerospace Congress (DLRK 2012), Berlin, Germany, 2012. German Society for Aeronautics and Astronautics (DGLR).
[9]
A. Krause and C. E. Guestrin. Near-optimal nonmyopic value of information in graphical models. In Proceedings of the 21st Conference on Uncertainty in Artificial Intelligence, 2005.
[10]
V. Krishnamurthy and D. V. Djonin. Structured threshold policies for dynamic sensor scheduling: A partially observed markov decision process approach. IEEE Transactions on Signal Processing, 55(10):4938--4957, Oct. 2007.
[11]
S. Levine, C. Finn, T. Darrell, and P. Abbeel. End-to-end training of deep visuomotor policies. arXiv preprint arXiv:1504.00702, 2015.
[12]
Q. Liu and A. Ihler. Belief Propagation for Structured Decision Making. In Proceedings of Uncertainty in Artificial Intelligence (UAI), 2012.
[13]
V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, et al. Human-level control through deep reinforcement learning. Nature, 518(7540):529--533, 02 2015.
[14]
B. Park, A. Johannson, and D. Nicholson. Crowdsourcing soft data for improved urban situation assessment. In Int'l Conf. on Information Fusion (FUSION), pages 669--675. IEEE, 2013.
[15]
N. Roy, G. Gordon, and S. Thrun. Finding Approximate POMDP Solutions Through Belief Compression. Journal of Machine Learning Research, 23:1--40, 2005.
[16]
D. Silver and J. Veness. Monte-Carlo planning in large POMDPs. In Advances in Neural Information Processing Systems, pages 1--9, 2010.
[17]
S. Thrun, W. Burgard, and D. Fox. Probabilistic Robotics. MIT Press, Cambridge, MA, 2005.
[18]
T. Zhang, G. Kahn, S. Levine, and P. Abbeel. Learning deep control policies for autonomous aerial vehicles with mpc-guided policy search. arXiv preprint arXiv:1509.06791, 2015.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ICCPS '16: Proceedings of the 7th International Conference on Cyber-Physical Systems
April 2016
291 pages

Sponsors

Publisher

IEEE Press

Publication History

Published: 11 April 2016

Check for updates

Qualifiers

  • Research-article

Conference

ICCPS '16
Sponsor:

Acceptance Rates

Overall Acceptance Rate 25 of 91 submissions, 27%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Dec 2024

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media