Abstract
In this paper, we propose a novel classification model for the multiple instance data, which aims to maximize the number of positive instances ranked before the top-ranked negative instances. This method belongs to a recently emerged performance, named as Pos@Top. Our proposed classification model has a convolutional structure that is composed by four layers, i.e., the convolutional layer, the activation layer, the max-pooling layer and the full connection layer. In this paper, we propose an algorithm to learn the convolutional filters and the full connection weights to maximize the Pos@Top measure over the training set. Also, we try to minimize the rank of the filter matrix to explore the low-dimensional space of the instances in conjunction with the classification results. The rank minimization is conducted by the nuclear norm minimization of the filter matrix. In addition, we develop an iterative algorithm to solve the corresponding problem. We test our method on several benchmark datasets. The experimental results show the superiority of our method compared with other state-of-the-art Pos@Top maximization methods.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, Corrado GS, Davis A, Dean J, Devin M et al (2015) Tensorflow: large-scale machine learning on heterogeneous systems, 2015. Software available from tensorflow.org 1
Agarwal S (2011) The infinite push: a new support vector ranking algorithm that directly optimizes accuracy at the absolute top of the list. In: Proceedings of the 11th SIAM international conference on data mining, SDM 2011, pp 839–850
Al Madi, NS, Khan JI (2016) Measuring learning performance and cognitive activity during multimodal comprehension. In: 2016 7th international conference on information and communication systems (ICICS). IEEE, pp 50–55
Bewley A, Upcroft B (2016) From imagenet to mining: adapting visual object detection with minimal supervision. Springer Tracts Adv Robot 113:501–514
Boyd S, Cortes C, Mohri M, Radovanovic A (2012) Accuracy at the top. Adv Neural Inf Process Syst 2:953–961
Chen S, Wang H, Xu F, Jin YQ (2016) Target classification using the deep convolutional networks for SAR images. IEEE Trans Geosci Remote Sens 54(8):4806–4817
Ding M, Fan G (2013) Multi-layer joint gait-pose manifold for human motion modeling. In: 2013 10th IEEE international conference and workshops on automatic face and gesture recognition (FG), pp 1–8
Ding M, Fan G (2015) Generalized sum of Gaussians for real-time human pose tracking from a single depth sensor. In: 2015 IEEE winter conference on applications of computer vision, pp 47–54
Ding M, Fan G (2015) Multilayer joint gait-pose manifolds for human gait motion modeling. IEEE Trans Cybern 45(11):2413–2424
Ding M, Fan G (2016) Articulated and generalized Gaussian kernel correlation for human pose estimation. IEEE Trans Image Process 25(2):776–789
Fan J, Liang RZ (2016) Stochastic learning of multi-instance dictionary for earth mover’s distance based histogram comparison. Neural Comput Appl 1–11. arXiv:1609.00817v1
Goadrich M, Oliphant L, Shavlik J (2006) Gleaner: creating ensembles of first-order clauses to improve recall–precision curves. Mach Learn 64(1–3):231–261
Gong C, Tao D, Maybank SJ, Liu W, Kang G, Yang J (2016) Multi-modal curriculum learning for semi-supervised image classification. IEEE Trans Image Process 25(7):3249–3260
Griffin G, Holub A, Perona P (2007) Caltech-256 object category dataset. Technical report 7694, California Institute of Technology
Halligan S, Altman DG, Mallett S (2015) Disadvantages of using the area under the receiver operating characteristic curve to assess imaging tests: a discussion and proposal for an alternative approach. Eur Radiol 25(4):932–939
Hammami N, Bedda M, Farah N (2012) Spoken Arabic digits recognition using MFCC based on GMM. In: 2012 IEEE conference on sustainable utilization and development in engineering and technology (STUDENT). IEEE, pp 160–163
Harris G, Panangadan A, Prasanna VK (2015) Learning of performance measures from crowd-sourced data with application to ranking of investments. In: Pacific–Asia conference on knowledge discovery and data mining. Springer, pp 538–549
He K, Zhang X, Ren S, Sun J (2016) Delving deep into rectifiers: surpassing human-level performance on imagenet classification. In: Proceedings of the IEEE international conference on computer vision, 11–18 December 2015, pp 1026–1034
Hendrickx I, Kim SN, Kozareva Z, Nakov P, Ó Séaghdha, D, Padó, S, Pennacchiotti, M, Romano, L, Szpakowicz, S (2009) Semeval-2010 task 8: multi-way classification of semantic relations between pairs of nominals. In: Proceedings of the workshop on semantic evaluations: recent achievements and future directions. Association for Computational Linguistics, pp 94–99
Hentschel C, Wiradarma T, Sack H (2015) If we did not have imagenet: comparison of fisher encodings and convolutional neural networks on limited training data. Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics), vol 9475. Springer, Switzerland, pp 400–409
Jain S, Kashyap R, Kuo TT, Bhargava S, Lin G, Hsu CN (2016) Weakly supervised learning of biomedical information extraction from curated data. BMC Bioinform 17(1):1
Joachims T (2005) A support vector method for multivariate performance measures. In: Proceedings of the 22nd international conference on machine learning. ACM, pp 377–384
Li N, Jin R, Zhou ZH (2014) Top rank optimization in linear time. Adv Neural Inf Process Syst 2:1502–1510
Liang RZ, Shi L, Wang H, Meng J, Wang JJY, Sun Q, Gu, Y (2016) Optimizing top precision performance measure of content-based image retrieval by learning similarity function. In: 2016 23st International conference on pattern recognition (ICPR). IEEE
Liang RZ, Xie W, Li W, Wang H, Wang JJY, Taylor L (2016) A novel transfer learning method based on common space mapping and weighted domain matching. In: 2016 IEEE 28th international conference on tools with artificial intelligence (ICTAI)
Lin F, Wang J, Zhang N, Xiahou J, McDonald N (2016) Multi-kernel learning for multivariate performance measures optimization. Neural Comput Appl 1–13. arXiv:1508.06264v1
Lu S, Lu H, Kolarik WJ (2001) Multivariate performance reliability prediction in real-time. Reliab Eng Syst Saf 72(1):39–45
Madsen ME, Konge L, Nørgaard LN, Tabor A, Ringsted C, Klemmensen Å, Ottesen B, Tolsgaard MG (2014) Assessment of performance measures and learning curves for use of a virtual-reality ultrasound simulator in transvaginal ultrasound examination. Ultrasound Obstet Gynecol 44(6):693–699
Mao Q, Tsang IWH (2013) A feature selection method for multivariate performance measures. IEEE Trans Pattern Anal Mach Intell 35(9):2051–2063
Martinel N, Piciarelli C, Micheloni C (2016) A supervised extreme learning committee for food recognition. Comput Vis Image Underst 148:67–86
Meyen A, Sooriyarachchi M (2016) Simulation study of a novel method for comparing more than two independent receiver operating characteristic (ROC) curves based on the area under the curves (AUCS). J Natl Sci Found Sri Lanka 43(4):357–367
Patel M, Agius S, Wilkinson J, Patel L, Baker P (2016) Value of supervised learning events in predicting doctors in difficulty. Med Educ 50(7):746–756
Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, Berg A, Fei-Fei L (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252
Shih SM, Wu WH, Hsieh HN (2016) A non-inferiority test for diagnostic accuracy in the absence of the golden standard test based on the paired partial areas under receiver operating characteristic curves. J Appl Stat 43(3):550–562
Xu S, Xu L, Zhan Z, Ye K, Han K, Born F (2014) Method and system for resilient and adaptive detection of malicious websites. US Patent WO2013184653 A1
Sofotasios PC, Fikadu MK, Ho-Van K, Valkama M, Karagiannidis GK (2014) The area under a receiver operating characteristic curve over enriched multipath fading conditions. In: 2014 IEEE global communications conference. IEEE, pp 3490–3495
Sun F, Guo J, Lan Y, Xu J, Cheng X (2015) Learning word representations by jointly modeling syntagmatic and paradigmatic relations. In: Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing, Beijing, China, 26–31 July 2015, pp 136–145
Takeda A, Kanamori T (2014) Using financial risk measures for analyzing generalization performance of machine learning models. Neural Netw 57:29–38
Tang B, Liu X, Lei J, Song M, Tao D, Sun S, Dong F (2016) Deepchart: combining deep convolutional networks and deep belief networks in chart classification. Signal Process 124:156–161
Wang CY, Peng DY, Xu L, Yi XS (2007) Gradual gray-watermark embedding algorithm in the wavelet domain. J Comput Appl 6:025
Wang J, Wang H, Zhou Y, McDonald N (2015) Multiple kernel multivariate performance learning using cutting plane algorithm. In: 2015 IEEE international conference on systems, man, and cybernetics (SMC). IEEE, pp 1870–1875
Wang JJY, Tsang IWH, Gao X (2016) Optimizing multivariate performance measures from multi-view data. In: Thirtieth AAAI conference on artificial intelligence, pp 2152—2158
Wang L, Scott K, Xu L, Clausi D (2016) Sea ice concentration estimation during melt from dual-pol SAR scenes using deep convolutional neural networks: a case study. IEEE Trans Geosci Remote Sens 54(8):4524–4533
Xu L, Zhan Z, Xu S, Ye K (2013) Cross-layer detection of malicious websites. In: Proceedings of the third ACM conference on data and application security and privacy. ACM, pp 141–152
Xu L, Zhan Z, Xu S, Ye K (2014) An evasion and counter-evasion study in malicious websites detection. In: 2014 IEEE conference on communications and network security (CNS). IEEE, pp 265–273
Yang W, Jin L, Tao D, Xie Z, Feng Z (2016) Dropsample: a new training method to enhance deep convolutional neural networks for large-scale unconstrained handwritten chinese character recognition. Pattern Recogn 58:190–203
Zahedi M, Sorkhi A (2013) Improving text classification performance using PCA and recall–precision criteria. Arab J Sci Eng 38(8):2095–2102
Zhang P, Su W (2012) Statistical inference on recall, precision and average precision under random selection. In: Proceedings—2012 9th international conference on fuzzy systems and knowledge discovery, FSKD 2012, pp 1348–1352
Zokaei N, Burnett Heyes S, Gorgoraptis N, Budhdeo S, Husain M (2015) Working memory recall precision is a more sensitive index than span. J Neuropsychol 9(2):319–329
Acknowledgements
The Foundation of modern educational technology research of Jiangsu Province (No. 2015-R-42631) and The University Natural Science Foundation of Jiangsu Province (14KJD520003).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflicts of interests.
Rights and permissions
About this article
Cite this article
Li, Q., Zhou, X., Gu, A. et al. Nuclear norm regularized convolutional Max Pos@Top machine. Neural Comput & Applic 30, 463–472 (2018). https://doi.org/10.1007/s00521-016-2680-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-016-2680-2