Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Sparse Representation-Based Semi-Supervised Regression for People Counting

Published: 01 August 2017 Publication History

Abstract

Label imbalance and the insufficiency of labeled training samples are major obstacles in most methods for counting people in images or videos. In this work, a sparse representation-based semi-supervised regression method is proposed to count people in images with limited data. The basic idea is to predict the unlabeled training data, select reliable samples to expand the labeled training set, and retrain the regression model. In the algorithm, the initial regression model, which is learned from the labeled training data, is used to predict the number of people in the unlabeled training dataset. Then, the unlabeled training samples are regarded as an over-complete dictionary. Each feature of the labeled training data can be expressed as a sparse linear approximation of the unlabeled data. In turn, the labels of the labeled training data can be estimated based on a sparse reconstruction in feature space. The label confidence in labeling an unlabeled sample is estimated by calculating the reconstruction error. The training set is updated by selecting unlabeled samples with minimal reconstruction errors, and the regression model is retrained on the new training set. A co-training style method is applied during the training process. The experimental results demonstrate that the proposed method has a low mean square error and mean absolute error compared with those of state-of-the-art people-counting benchmarks.

References

[1]
S. Bai and X. Bai. 2016. Sparse contextual activation for efficient visual re-ranking. IEEE Transactions on Image Processing 25, 3, 1056--1069.
[2]
A. B. Chan and N. Vasconcelos. 2012. Counting people with low-level features and Bayesian regression. IEEE Transactions on Image Processing 21, 4, 2160--2177.
[3]
C. L. Chen et al. 2013. From semi-supervised to transfer counting of crowds. In Proceedings of the IEEE International Conference on Computer Vision. 2256--2263.
[4]
K. Chen et al. 2012. Feature mining for localised crowd counting. In Proceedings of the British Machine Vision Conference. 1--11.
[5]
W. J. Chen et al. 2014. Laplacian smooth twin support vector machine for semi-supervised classification. International Journal of Machine Learning and Cybernetics 5, 3, 459--468.
[6]
Y. Cong et al. 2009. Flow mosaicking: Real-time pedestrian counting without scene-specific learning. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 1093--1100.
[7]
H. Foroughi et al. 2015. Robust people counting using sparse representation and random projection. Pattern Recognition 48, 10, 3038--3052.
[8]
C. Gao et al. 2016. People-flow counting in complex environments by combining depth and color information. Multimedia Tools and Applications 75, 15, 1--17.
[9]
X. Guo and K. Uehara. 2015. Graph-based semi-supervised regression and its extensions. International Journal of Advanced Computer Science 8 Applications 6, 6.
[10]
Y. L. Hou and G. K. H. Pang. 2011. People counting and human detection in a challenging situation. IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans 41, 1, 24--33.
[11]
C. L. Huang et al. 2011. People counting using ellipse detection and forward/backward tracing. In Proceedings of the 2011 1st Asian Conference on Pattern Recognition (ACPR). 505--509.
[12]
S. Karaman et al. 2014. Leveraging local neighborhood topology for large scale person re-identification. Pattern Recognition 47, 12, 3767--3778.
[13]
L. Maddalena et al. 2014. People counting by learning their appearance in a multi-view camera environment. Pattern Recognition Letters 36, 125--134.
[14]
D. Merad et al. 2010. Fast people counting using head detection from skeleton graph. In Proceedings of the 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS). 233--240.
[15]
S. Mukherjee et al. 2015. Unique people count from monocular videos. The Visual Computer 31, 10, 1405--1417.
[16]
A. Oliva and A. Torralba. 2001. Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of Computer Vision 42, 3, 145--175.
[17]
C. Raghavachari et al. 2015. A comparative study of vision based human detection techniques in people counting applications. Procedia Computer Science 58, 461--469.
[18]
R. Raina et al. 2007. Self-taught learning: Transfer learning from unlabeled data. In Proceedings of the International Conference on Machine Learning. 759--766.
[19]
D. Ryan et al. 2015. An evaluation of crowd counting methods, features and regression models. Computer Vision 8 Image Understanding 130, C, 1--17.
[20]
B. Tan et al. 2011. Semi-supervised elastic net for pedestrian counting. Pattern Recognition 44, 10-11, 2297--2304.
[21]
J. Tang et al. 2011. Image annotation by kNN-sparse graph-based label propagation over noisily tagged web images. ACM Transactions on Intelligent Systems 8 Technology 2, 2, 14.
[22]
J. Wang et al. 2014. Semi-supervised learning via geodesic weighted sparse representation. IEICE Transactions on Information 8 Systems E97.D, 6, 1673--1676.
[23]
W. Xia et al. 2015. Semisupervised pedestrian counting with temporal and spatial consistencies. IEEE Transactions on Intelligent Transportation Systems 16, 4, 1--11.
[24]
A. Y. Yang et al. 2010. Fast ℓ1-minimization algorithms and an application in robust face recognition: A review. Technical Report, EECS Department, University of California, Berkeley: 1849--1852.
[25]
G. Yu et al. 2015. Semi-supervised classification based on subspace sparse representation. Knowledge and Information Systems 43, 1, 81--101.
[26]
C. Zeng and H. Ma. 2010. Robust head-shoulder detection by PCA-based multilevel HOG-LBP detector for people counting. In Proceedings of the International Conference on Pattern Recognition. 2069--2072.
[27]
Z. H. Zhou and M. Li. 2007. Semi-supervised regression with co-training style algorithms. IEEE Transactions on Knowledge 8 Data Engineering 19, 11, 1479--1493.

Cited By

View all
  • (2024)Transductive classification via patch alignmentAI Communications10.3233/AIC-22017937:1(37-51)Online publication date: 21-Mar-2024
  • (2024)Fusion-Embedding Siamese Network for Light Field Salient Object DetectionIEEE Transactions on Multimedia10.1109/TMM.2023.327493326(984-994)Online publication date: 1-Jan-2024
  • (2024)Light Field Salient Object Detection With Sparse Views via Complementary and Discriminative Interaction NetworkIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.329060034:2(1070-1085)Online publication date: 1-Feb-2024
  • Show More Cited By

Index Terms

  1. Sparse Representation-Based Semi-Supervised Regression for People Counting

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Multimedia Computing, Communications, and Applications
    ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 13, Issue 4
    November 2017
    362 pages
    ISSN:1551-6857
    EISSN:1551-6865
    DOI:10.1145/3129737
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 August 2017
    Accepted: 01 May 2017
    Revised: 01 May 2017
    Received: 01 January 2017
    Published in TOMM Volume 13, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Counting people
    2. reconstruction error
    3. semi-supervised regression
    4. sparse reconstruction
    5. sparse representation

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    • National Science Foundation of China
    • Natural Science Foundation of Fujian Province of China
    • Education and scientific research projects of young andmiddle-aged teachers in Fujian Province
    • Pilot Project of Fujian Province of China
    • Promotion Program for Young and Middle-aged Teachers in Science and Technology Research of Huaqiao University

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)12
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 24 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Transductive classification via patch alignmentAI Communications10.3233/AIC-22017937:1(37-51)Online publication date: 21-Mar-2024
    • (2024)Fusion-Embedding Siamese Network for Light Field Salient Object DetectionIEEE Transactions on Multimedia10.1109/TMM.2023.327493326(984-994)Online publication date: 1-Jan-2024
    • (2024)Light Field Salient Object Detection With Sparse Views via Complementary and Discriminative Interaction NetworkIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.329060034:2(1070-1085)Online publication date: 1-Feb-2024
    • (2024)Analysis of Advanced Technology Integrated Interviews2024 8th International Conference on Inventive Systems and Control (ICISC)10.1109/ICISC62624.2024.00107(613-617)Online publication date: 29-Jul-2024
    • (2023)Detection of Moving Object Using Superpixel Fusion NetworkACM Transactions on Multimedia Computing, Communications, and Applications10.1145/357999819:5(1-15)Online publication date: 16-Mar-2023
    • (2023)A Thorough Benchmark and a New Model for Light Field Saliency DetectionIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.323541545:7(8003-8019)Online publication date: 1-Jul-2023
    • (2023)LFTransNet: Light Field Salient Object Detection via a Learnable Weight DescriptorIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.328146533:12(7764-7773)Online publication date: 1-Dec-2023
    • (2023)A Tutorial on Immersive Video Delivery: From Omnidirectional Video to HolographyIEEE Communications Surveys & Tutorials10.1109/COMST.2023.326325225:2(1336-1375)Online publication date: 1-Apr-2023
    • (2022)Semi-supervised pre-processing for learning-based traceability framework on real-world software projectsProceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering10.1145/3540250.3549151(570-582)Online publication date: 7-Nov-2022
    • (2022)LFBCNet: Light Field Boundary-aware and Cascaded Interaction Network for Salient Object DetectionProceedings of the 30th ACM International Conference on Multimedia10.1145/3503161.3548275(3430-3439)Online publication date: 10-Oct-2022
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media