Abstract
Feature selection (FS) represents an important task in classification. Hadith represents an example in which we can apply FS on it. Hadiths are the second major source of Islam after the Quran. Thousands of Hadiths are available in Islam, and these Hadiths are grouped into a number of classes. In the literature, there are many studies conducted for Hadiths classification. Sine Cosine Algorithm (SCA) is a new metaheuristic optimization algorithm. SCA algorithm is mainly based on exploring the search space using sine and cosine mathematical formulas to find the optimal solution. However, SCA, like other Optimization Algorithm (OA), suffers from the problem of local optima and solution diversity. In this paper, to overcome SCA problems and use it for the FS problem, two major improvements were introduced to the standard SCA algorithm. The first improvement includes the use of singer chaotic map within SCA to improve solutions diversity. The second improvement includes the use of the Simulated Annealing (SA) algorithm as a local search operator within SCA to improve its exploitation. In addition, the Gini Index (GI) is used to filter the resulted selected features to reduce the number of features to be explored by SCA. Furthermore, three new Hadith datasets were created. To evaluate the proposed Improved SCA (ISCA), the new three Hadiths datasets were used in our experiments. Furthermore, to confirm the generality of ISCA, we also applied it on 14 benchmark datasets from the UCI repository. The ISCA results were compared with the original SCA and the state-of-the-art algorithms such as Particle Swarm Optimization (PSO), Genetic Algorithm (GA), Grasshopper Optimization Algorithm (GOA), and the most recent optimization algorithm, Harris Hawks Optimizer (HHO). The obtained results confirm the clear outperformance of ISCA in comparison with other optimization algorithms and Hadith classification baseline works. From the obtained results, it is inferred that ISCA can simultaneously improve the classification accuracy while it selects the most informative features.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
AbuZeina D, Al-Anzi FS (2018) Employing fisher discriminant analysis for Arabic text classification. Comput Electr Eng 66:474–486
Afianto MF, Al-Faraby S et al. (2018) Text categorization on hadith Sahih Al-Bukhari using randomforest. In: Journal of physics: conference series, vol 971. IOP Publishing, 012037
Afshar-Nadjafi B, Yazdani M, Majlesi M (2017) A hybrid of Tabu search and simulated annealing algorithms for preemptive project scheduling problem. In: Benferhat S, Tabia K, Ali M (eds) Advances in artificial intelligence: from theory to practice. IEA/AIE, vol 10350. Lecture Notes in Computer Science. Springer, Cham
Al-Anzi FS, AbuZeina D (2018) Beyond vector space model for hierarchical Arabic text classification: a Markov chain approach. Inf Process Manage 54(1):105–115
Al-Kabi MN, Al-Sinjilawi SI (2007) A comparative study of the efficiency of different measures to classify Arabic text. Univ Sharjah J Pure Appl Sci 4(2):13–26
Al-Kabi MN, Kanaan G, Al-Shalabi R, Al-Sinjilawi SI, Al-Mustafa RS (2005) Al-Hadith text classifier. J Appl Sci 5(3):584–587
Alkhatib M (2010) Classification of Al-Hadith Al-Shareef using data mining algorithm. In: European, mediterranean and middle eastern conference on information systems. EMCIS2010, Abu Dhabi, UAE, pp 1–23
Al Faraby S, Riviera E, Jasin R (2018) Classification of hadith into positive suggestion, negative suggestion, and information Classification of hadith into positive suggestion, negative suggestion, and information. J Phys Conf Ser 971(012046):1–8
El-Halees AM (2008) A comparative study on Arabic text classification. Egypt Comput Sci J 30(2)
Aljarah I, Al-Zoubi AM, Faris H et al (2018) Simultaneous feature selection and support vector machine optimization using the grasshopper optimization algorithm. Cogn Comput 10:478–495
Arora S, Anand P (2019) Binary butterfly optimization approaches for feature selection. Expert Syst Appl 116:147–160
Azmi R, Pishgoo B, Norozi N, Koohzadi M, Baesi F (2010) A hybrid GA and SA algorithms for feature selection in recognition of hand-printed Farsi characters. In: 2010 IEEE international conference on intelligent computing and intelligent systems, pp 384–387
Bahassine S, Madani A, Al-Sarem M, Kissi M (2018) Feature selection using an improved Chi-square for Arabic text classification. J King Saud Univ Comput Inf Sci 32:225–231
Belazzoug M, Touahria M, Nouioua F, Brahimi M (2019) An improved sine cosine algorithm to select features for text categorization. J King Saud Univ-Comput Inf Sci 32:454–464
Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and Regression Trees, The Wadsworth Statistics and Probability Series, Wadsworth International Group, Belmont California (pp. 356)
Eesa AS, Orman Z, Brifcani AMA (2015) A novel feature-selection approach based on the cuttlefish optimization algorithm for intrusion detection systems. Expert Syst Appl 42(5):2670–2679
Elaziz MA, Oliva D, Xiong S (2017) An improved opposition-based sine cosine algorithm for global optimization. Expert Syst Appl 90:484–500
Elaziz Mohamed EA, Ewees AA, Oliva D, Duan P, Xiong S (2017) A hybrid method of sine cosine algorithm and differential evolution for feature selection. In: International conference on neural information processing, Springer, Berlin, pp 145–155
Elgamal ZM, Yasin NBM, Tubishat M, Alswaitti M, Mirjalili S (2020) An improved harris hawks optimization algorithm with simulated annealing for feature selection in the medical field. IEEE Access 8:186638–186652
Emary E, Zawbaa HM, Grosan C, Hassenian AE (2015) Feature subset selection approach by gray-wolf optimization. In: Afro-European conference for industrial advancement. Springer, Cham, pp 1–13
Feng Y, Yang J, Wu C, Lu M, Zhao X-J (2018) Solving 0–1 knapsack problems by chaotic monarch butterfly optimization algorithm with Gaussian mutation. Memetic Comput 10(2):135–150
Feng Z-K, Liu S, Niu W-J, Li B-J, Wang W-C, Luo B, Miao S-M (2020) A modified sine cosine algorithm for accurate global optimization of numerical functions and multiple hydropower reservoirs operation. Knowl Based Syst 208:106461
Gu S, Cheng R, Jin Y (2018) Feature selection for high-dimensional classification using a competitive swarm optimizer. Soft Comput 22(3):811–822
Guo W-Y, Wang Y, Dai F, Xu P (2020) Improved sine cosine algorithm combined with optimal neighborhood and quadratic interpolation strategy. Eng Appl Artif Intell 94:103779
Gupta S, Deep K (2019) Improved sine cosine algorithm with crossover scheme for global optimization. Knowl-Based Syst 165:374–406
Gupta S, Deep K (2020) Hybrid sine cosine artificial bee colony algorithm for global optimization and image segmentation. Neural Comput Appl 32(13):9521–9543
Hammouri AI, Mafarja M, Al-Betar MA, Awadallah MA, Abu-Doush I (2020) An improved Dragonfly Algorithm for feature selection. Knowl Based Syst 203:106131
Hans R, Kaur H (2020) Hybrid binary Sine Cosine Algorithm and Ant Lion Optimization (SCALO) approaches for feature selection problem. Int J Comput Mater Sci Eng 9(01):1950021
Harrag F, El-Qawasmah E (2009) Neural network for Arabic text classification. In: Second international conference on the applications of digital information and web technologies. IEEE, London, UK, pp 778–783
Harrag F, El-Qawasmah E, Al-Salman AMS (2011) Stemming as a feature reduction technique for Arabic Text Categorization. In: 2011 10th international symposium on programming and systems, pp 128–133
Hu P, Pan J-S, Chu S-C (2020) Improved binary grey wolf optimizer and its application for feature selection. Knowl Based Syst 195:105746
Javidrad F, Nazari M (2017) A new hybrid particle swarm and simulated annealing stochastic optimization method. Appl Soft Comput 60:634–654
Jurgens H, Peitgen H-O, Saupe D (1992) Chaos and fractals: new frontiers of science. New Springer-Verlag, New York
Kaveh A, Javadi S (2019) Chaos-based firefly algorithms for optimization of cyclically large-size braced steel domes with multiple frequency constraints. Comput Struct 214:28–39
Kirkpatrick S, Gelatt CD, Vecchi MP (1983) Optimization by simulated annealing. Science 220(4598):671–680
Lan S, Fan W, Liu T, Yang S (2019) A hybrid SCA-VNS meta-heuristic based on Iterated Hungarian algorithm for physicians and medical staff scheduling problem in outpatient department of large hospitals with multiple branches. Appl Soft Comput 85:105813
Dua D, Graff C (2017) UCI machine learning repository. Retrieved from http://archive.ics.uci.edu/ml
Lu Y, Liang M, Ye Z, Cao L (2015) Improved particle swarm optimization algorithm and its application in text feature selection. Appl Soft Comput 35:629–636
Mafarja M, Mirjalili S (2018) Whale optimization approaches for wrapper feature selection. Appl Soft Comput 62:441–453
Mafarja MM, Mirjalili S (2017) Hybrid whale optimization algorithm with simulated annealing for feature selection. Neurocomputing 260:302–312
Meedeniya D, Perera A (2009) Evaluation of partition-based text clustering techniques to categorize Indic language documents. In: 2009 IEE international advance computing conference, pp 1497–1500. IEEE
Meedeniya DA, Perera AS (2008) A comparative study on data representation to categorize text documents. In: Proceedings of the 20th international conference on software engineering and knowledge engineering (SEKE’08), pp 371–374
Mirjalili S (2016) SCA: a sine cosine algorithm for solving optimization problems. Knowl-Based Syst 96:120–133
Mirjalili S, Gandomi AH (2017) Chaotic gravitational constants for the gravitational search algorithm. Appl Soft Comput 53:407–419
Najeeb MM (2014) Towards innovative system for Hadith Isnad processing. Int J Comput Trends Technol 18(6):257–259
Najib SRM, Rahman NA, Ismail NK, Nor ZM, Alias MN, Alias N (2017) Comparative study of machine learning approach on malay translated Hadith text classification based on Sanad. In: 8th international conference on mechanical and manufacturing engineering 2017 (ICME’17) MATEC Web Conf, vol 135, pp 1–9
Oliva D, El Aziz MA, Hassanien AE (2017) Parameter estimation of photovoltaic cells using an improved chaotic whale optimization algorithm. Appl Energy 200:141–154
Potthuri S, Shankar T, Rajesh A (2016) Lifetime improvement in wireless sensor networks using hybrid differential evolution and simulated annealing (DESA). Ain Shams Eng J 9:655–663
Raileanu LE, Stoffel K (2004) Theoretical comparison between the gini index and information gain criteria. Ann Math Artif Intell 41(1):77–93
Ramteke SP, Gurjar AA, Deshmukh DS (2019) A novel weighted SVM classifier based on SCA for handwritten marathi character recognition. IETE J Res. https://doi.org/10.1080/03772063.2019.1623093
Riahi V, Kazemi M (2018) A new hybrid ant colony algorithm for scheduling of no-wait flowshop. Oper Res Int Journal 18(1):55–74
Saloot MA, Idris N, Mahmud R, Ja’afar R, Thorleuchter D, Gani A (2016) Hadith data mining and classification: a comparative analysis. Artif Intell Rev 46(1):113–128
Sayed GI, Hassanien AE, Azar AT (2019) Feature selection via a novel chaotic crow search algorithm. Neural Comput Appl 31(1):171–188
Sayed GI, Tharwat A, Hassanien AE (2019) Chaotic dragonfly algorithm: an improved metaheuristic algorithm for feature selection. Appl Intell 49(1):188–205
Sharma N, Kaur A, Sharma H, Sharma A, Bansal JC (2019) Chaotic Spider Monkey Optimization Algorithm with Enhanced Learning Soft Computing for Problem Solving (pp. 149–161): Springer.
Sihwail R, Omar K, Ariffin KAZ, Tubishat M (2020) Improved harris hawks optimization using elite opposition-based learning and novel search mechanism for feature selection. IEEE Access 8:121127–121145
Tawhid MA, Savsani P (2019) Discrete sine-cosine algorithm (DSCA) with local search for solving traveling salesman problem. Arab J Sci Eng 44(4):3669–3679
Tayal A, Singh SP (2018) Integrating big data analytic and hybrid firefly-chaotic simulated annealing approach for facility layout problem. Ann Oper Res 270(1–2):489–514
Tubishat M, Abushariah MAM, Idris N et al (2019) Improved whale optimization algorithm for feature selection in Arabic sentiment analysis. Appl Intell 49:1688–1707. https://doi.org/10.1007/s10489-018-1334-8
Tubishat M, Alswaitti M, Mirjalili S, Al-Garadi MA, Rana TA (2020) Dynamic butterfly optimization algorithm for feature selection. IEEE Access 8:194303–194314
Tubishat M, Idris N, Shuib L, Abushariah MA, Mirjalili S (2020) Improved Salp Swarm Algorithm based on opposition based learning and novel local search algorithm for feature selection. Expert Syst Appl 145:113122
Tubishat M, Ja’afar S, Alswaitti M, Mirjalili S, Idris N, Ismail MA, Omar MS (2020) Dynamic Salp swarm algorithm for feature selection. Expert Syst Appl 164:113873
Wan M, Chen X, Zhan T, Xu C, Yang G, Zhou H (2021) Sparse fuzzy two-dimensional discriminant local preserving projection (SF2DDLPP) for robust image feature extraction. Inf Sci 563:1–15
Wan M, Yang G, Sun C, Liu M (2019) Sparse two-dimensional discriminant locality-preserving projection (S2DDLPP) for feature extraction. Soft Comput 23(14):5511–5518
Wang M, Chen H, Yang B, Zhao X, Hu L, Cai Z, Huang H, Tong C (2017) Toward an optimal kernel extreme learning machine using a chaotic moth-flame optimization strategy with applications in medical diagnoses. Neurocomputing 267:69–84
Wolpert DH, Macready WG (1997) No free lunch theorems for optimization. IEEE Trans Evol Comput 1(1):67–82
Xiang J, Han X, Duan F, Qiang Y, Xiong X, Lan Y, Chai H (2015) A novel hybrid system for feature selection based on an improved gravitational search algorithm and k-NN method. Appl Soft Comput 31:293–307
Yan C, Ma J, Luo H, Patel A (2019) Hybrid binary coral reefs optimization algorithm with simulated annealing for feature selection in high-dimensional biomedical datasets. Chemom Intell Lab Syst 184:102–111
Zawbaa HM, Emary E, Parv B, Sharawi M (2016) Feature selection approach based on moth-flame optimization algorithm. In: 2016 IEEE congress on evolutionary computation (CEC), pp 4612–4617. https://doi.org/10.1109/CEC.2016.7744378
Zhang X, Feng T (2018) Chaotic bean optimization algorithm. Soft Comput 22(1):67–77
Zhao Y, Zou F, Chen D (2019) A discrete sine cosine algorithm for community detection. In: International conference on intelligent computing. Springer, pp 35–44
Acknowledgements
This study was funded by the University of Malaya (Grant No: UMRG RP043C-17HNE).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Tubishat, M., Ja’afar, S., Idris, N. et al. Improved sine cosine algorithm with simulated annealing and singer chaotic map for Hadith classification. Neural Comput & Applic 34, 1385–1406 (2022). https://doi.org/10.1007/s00521-021-06448-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-021-06448-y