Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3071178.3071217acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article

A hybrid method for feature construction and selection to improve wind-damage prediction in the forestry sector

Published: 01 July 2017 Publication History

Abstract

Catastrophic damage to forests resulting from major storms has resulted in serious timber and financial losses within the sector across Europe in the recent past. Developing risk assessment methods is thus one of the keys to finding forest management strategies to reduce future damage. Previous approaches to predicting damage to individual trees have used mechanistic models of wind-flow or logistical regression with mixed results. We propose a novel filter-based Genetic Programming method for constructing a large set of new features which are ranked using the Hellinger distance metric which is insensitive to skew in the data. A wrapper-based feature-selection method that uses a random forest classifier is then applied predict damage to individual trees. Using data collected from two forests within South-West France, we demonstrate significantly improved classification results using the new features, and in comparison to previously published results. The feature-selection method retains a small set of relevant variables consisting only of newly constructed features whose components provide insights that can inform forest management policies.

References

[1]
M. Ahmed, S. and Zhang, L. Peng, and B. Xue. 2014. Multiple feature construction for effective biomarker identification and classification using genetic programming. In Proceedings of the 2014 Annual Conference on Genetic and Evolutionary Computation. ACM, 249--256.
[2]
A. Albrecht, M. Hanewinkel, J. Bauhus, and U. Kohnle. 2012. How does silviculture affect storm damage in forests of south-western Germany? Results from empirical modeling based on long-term observations. European Journal of Forest Research 131, 1 (2012), 229--247.
[3]
K.J. Archer and R.V. Kimes. 2008. Empirical characterization of random forest variable importance measures. Computational Statistics & Data Analysis 52, 4 (2008), 2249--2260.
[4]
G.S. Biging and M. Dobbertin. 1995. Evaluation of competition indices in individual tree growth models. Forest Science 41, 2 (1995), 360--377.
[5]
Leo Breiman. 2001. Random forests. Machine learning 45, 1 (2001), 5--32.
[6]
D.A. Cieslak, R.T. Hoens, N. V. Chawla, and W. P. Kegelmeyer. 2012. Hellinger distance decision trees are robust and skew-insensitive. Data Mining and Knowledge Discovery 24, 1 (2012), 136--158.
[7]
A. Colin, C. Meredieu, T. Labbe, and T. Belouard. 2010. Etude retrospective et mise a' jour de la ressource en pin maritime du massif des Landes de Gascogne apres la tempete Klaus du 24 janvier 2009. Technical Report IFN n2010-CER-2-077. Institut National de l'Information Geographique et Forestiere.
[8]
V. Cucchi, C. Meredieu, A. Stokes, S. Berthier, D. Bert, M. Najar, A. Denis, and R. Lastennet. 2004. Root anchorage of inner and edge trees in stands of Maritime pine (Pinus pinasterAit.) growing in different podzolic soil conditions. Trees 18, 4 (2004), 460--466.
[9]
Commission des affaires economiques. 2009. Les consequences de la tempete du 24 janvier 2009 das le Sud-Quest. http://www.assemblee-nationale.fr/13/rap-info/i1836.asp. (2009). {Online; accessed 23-01-2017}.
[10]
M.A. Dorning, J.W. Smith, D.A. Shoemaker, and R.K. Meentemeyer. 2015. Changing decisions in a changing landscape: How might forest owners in an urbanizing region respond to emerging bioenergy markets? Land Use Policy 49 (2015), 1 -- 10.
[11]
R. Genuer, J.-M Poggi, and C. Tuleau-Malot. 2015. VSURF: An R package for variable selection using random forests. The R Journal 7, 2 (2015), 19--33.
[12]
H. Guo and A. Nandi. 2006. Breast cancer diagnosis using genetic programming generated feature. Pattern Recognition 39, 5 (2006), 980--987.
[13]
K. Kamimura, B. Gardiner, S. Dupont, D. Guyon, and C. Meredieu. 2016. Mechanistic and statistical approaches to predicting wind damage to individual maritime pine (Pinus pinaster) trees in forests. Canadian Journal of Forest Research 46 (2016), 88--100.
[14]
John R. Koza. 1992. Genetic Programming: On the Programming of Computers by Means of Natural Selection. MIT Press, Cambridge, MA, USA.
[15]
K. Krawiec. 2002. Genetic programming-based construction of features for machine learning and knowledge discovery tasks. Genetic Programming and Evolvable Machines 3, 4 (2002), 329--343.
[16]
Y. Lin and B. Bhanu. 2005. Evolutionary feature synthesis for object recognition. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) 35, 2 (2005), 156--171.
[17]
M. Muharram and G.D. Smith. 2005. Evolutionary constructive induction. IEEE transactions on knowledge and data engineering 17, 11 (2005), 1518--1528.
[18]
K. Neshatian, M. Zhang, and P. Andreae. 2012. A filter approach to multiple feature construction for symbolic learning classifiers using genetic programming. IEEE Transactions on Evolutionary Computation 16, 5 (2012), 645--661.
[19]
G. Patterson and M. Zhang. 2007. Fitness functions in genetic programming for classification with unbalanced data. In Australasian Joint Conference on Artificial Intelligence. Springer, 769--775.
[20]
M. Smith and L. Bull. 2005. Genetic programming with a genetic algorithm for feature construction and selection. Genetic Programming and Evolvable Machines 6, 3 (2005), 265--281.
[21]
B. Tran, B. Xue, and M. Zhang. 2016. Genetic programming for feature construction and selection in classification on high-dimensional data. Memetic Computing 8, 1 (2016), 3--15.

Cited By

View all
  • (2023)Improving spatial predictions of Eucalypt plantation growth by combining interpretable machine learning with the 3-PG modelFrontiers in Forests and Global Change10.3389/ffgc.2023.11810496Online publication date: 20-Jul-2023
  • (2023)Evolutionary Machine Learning in Environmental ScienceHandbook of Evolutionary Machine Learning10.1007/978-981-99-3814-8_19(563-590)Online publication date: 2-Nov-2023
  • (2022)A Robust Feature Construction for Fish Classification Using Grey Wolf OptimizerCybernetics and Information Technologies10.2478/cait-2022-004522:4(152-166)Online publication date: 10-Nov-2022
  • Show More Cited By

Index Terms

  1. A hybrid method for feature construction and selection to improve wind-damage prediction in the forestry sector

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      GECCO '17: Proceedings of the Genetic and Evolutionary Computation Conference
      July 2017
      1427 pages
      ISBN:9781450349208
      DOI:10.1145/3071178
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 01 July 2017

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. feature-construction
      2. forestry
      3. machine-learning

      Qualifiers

      • Research-article

      Conference

      GECCO '17
      Sponsor:

      Acceptance Rates

      GECCO '17 Paper Acceptance Rate 178 of 462 submissions, 39%;
      Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)9
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 03 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Improving spatial predictions of Eucalypt plantation growth by combining interpretable machine learning with the 3-PG modelFrontiers in Forests and Global Change10.3389/ffgc.2023.11810496Online publication date: 20-Jul-2023
      • (2023)Evolutionary Machine Learning in Environmental ScienceHandbook of Evolutionary Machine Learning10.1007/978-981-99-3814-8_19(563-590)Online publication date: 2-Nov-2023
      • (2022)A Robust Feature Construction for Fish Classification Using Grey Wolf OptimizerCybernetics and Information Technologies10.2478/cait-2022-004522:4(152-166)Online publication date: 10-Nov-2022
      • (2022)Feature selection by genetic algorithm in nonlinear taper modelCanadian Journal of Forest Research10.1139/cjfr-2021-026552:5(769-779)Online publication date: May-2022
      • (2022)Advanced Scientific Methods and Tools in Sustainable Forest Management: A Synergetic PerspectiveForest Dynamics and Conservation10.1007/978-981-19-0071-6_14(279-309)Online publication date: 17-May-2022
      • (2021)Genetic Programming for Evolving a Front of Interpretable Models for Data VisualizationIEEE Transactions on Cybernetics10.1109/TCYB.2020.297019851:11(5468-5482)Online publication date: Nov-2021
      • (2021)Feature Creation Towards the Detection of Non-control-Flow Hijacking AttacksArtificial Neural Networks and Machine Learning – ICANN 202110.1007/978-3-030-86362-3_13(153-164)Online publication date: 7-Sep-2021
      • (2021)Mining Feature Relationships in DataGenetic Programming10.1007/978-3-030-72812-0_16(247-262)Online publication date: 25-Mar-2021
      • (2020)Exploring Problem State Transformations to Enhance Hyper-heuristics for the Job-Shop Scheduling Problem2020 IEEE Congress on Evolutionary Computation (CEC)10.1109/CEC48606.2020.9185709(1-8)Online publication date: Jul-2020
      • (2020)A filter-based feature construction and feature selection approach for classification using Genetic ProgrammingKnowledge-Based Systems10.1016/j.knosys.2020.105806(105806)Online publication date: Mar-2020
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media