Abstract
Rule induction can achieve orders of magnitude reduction in the volume of data descriptions. For example, we applied a commercial tool (IXL™) to a 1,819 record tropical storm database, yielding 161 rules. However, human comprehension of the discovered results may require further reduction. We present a rule refinement strategy, partly implemented in a Prolog program, that operationalizes “interestingness” into performance, simplicity, novelty, and significance. Applying the strategy to the induced rulebase yielded 10 “genuinely interesting” rules.
Cite this article
Major, J.A., Mangano, J.J. Selecting among rules induced from a hurricane database. J Intell Inf Syst 4, 39–52 (1995). https://doi.org/10.1007/BF00962821
DOI: https://doi.org/10.1007/BF00962821