Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

A relatedness analysis of government regulations using domain knowledge and structural organization

Published: 01 December 2006 Publication History

Abstract

The complexity and diversity of government regulations make understanding and retrieval of regulations a non-trivial task. One of the issues is the existence of multiple sources of regulations and interpretive guides with differences in format, terminology and context. This paper describes a comparative analysis scheme developed to help retrieval of related provisions from different regulatory documents. Specifically, the goal is to identify the most strongly related provisions between regulations. The relatedness analysis makes use of not only traditional term match but also a combination of feature matches, and not only content comparison but also structural analysis.
Regulations are first compared based on conceptual information as well as domain knowledge through feature matching. Regulations also possess specific organizational structures, such as a tree hierarchy of provisions and heavy referencing between provisions. These structures represent useful information in locating related provisions, and are therefore exploited in the comparison of regulations for completeness. System performance is evaluated by comparing a similarity ranking produced by users with the machine-predicted ranking. Ranking produced by the relatedness analysis system shows a reduction in error compared to that of Latent Semantic Indexing. Various pairs of regulations are compared and the results are analyzed along with observations based on different feature usages. An example of an e-rulemaking scenario is shown to demonstrate capabilities and limitations of the prototype relatedness analysis system.

References

[1]
Al-Kofahi, K., Tyrrell, A., Vachher, A., & Jackson, P. (2001). A machine learning approach to prior case retrieval. In Proceedings of the 8th International Conference on Artificial Intelligence and Law (ICAIL 2001) (pp. 88–93). St. Louis, Missouri.
[2]
Americans with Disabilities Act (ADA) Accessibility Guidelines for Buildings and Facilities (1999). US Architectural and Transportation Barriers Compliance Board (Access Board). Washington, DC.
[3]
Attar R. and Fraenkel AS. Local feedback in full-text retrieval systems Journal of the ACM 1977 24 3 397-417
[4]
Baeza-Yates, R., & Ribeiro-Neto, B. (1999). Modern information retrieval. New York, NY: ACM Press.
[5]
Balmer, D. C. (2003). Trends and issues in platform lift. In Proceedings ofSpace Requirements for Wheeled Mobility Workshop. Buffalo, NY.
[6]
Baru C., Gupta A., Papakonstantinou Y., Hollebeek R., and Featherman D. Putting government information at citizens' fingertips EnVision 2000 16 3 8-9
[7]
Bench-Capon, T. J. M. (1991). Knowledge based systems and legal applications. San Diego, CA: Academic Press Professional, Inc.
[8]
Bender, D. (2004). 2003 Data protection survey: Cross-border transfer of personal data in 22 major jurisdictions. In Proceedings ofthe 3rd Annual Law Firm C.I.O. Forum 2004 (pp. 95–122). San Francisco, CA.
[9]
Berman D. H. and Hafner C. D. The potential of artificial intelligence to help solve the crisis in our legal system Communications of the ACM 1989 32 8 928-938
[10]
Berry, M. W., & Browne, M. (1999). Understanding search engines: mathematical modeling and text retrieval. Society for Industrial and Applied Mathematics (SIAM). Philadelphia, PA.
[11]
Bollacker, K. D., Lawrence, S., & Giles, C. L. (1998). CiteSeer: An autonomous web agent for automatic retrieval and identification of interesting publications. In Proceedings ofthe 2nd International Conference on Autonomous Agents (pp. 116–123). Minneapolis, MN.
[12]
Branting, L. K. (1991). Reasoning with portions of precedents. In Proceedings ofthe 3rd International Conference on Artificial Intelligence and Law (ICAIL 1991) (pp. 145–154). Oxford, England.
[13]
Branting L. K. Building explanations from rules and structured cases International Journal of Man-Machine Studies 1991 34 6 797-837
[14]
Brin, S., & Page, L. (1998). The anatomy of a large-scale hypertextual web search engine. In Proceedings ofthe 7th International World Wide Web Conference (pp. 107–117). Brisbane, Australia.
[15]
British Standard 8300 (2001). British Standards Institution (BSI), London, UK.
[16]
Brüninghaus, S., & Ashley, KD. (2001). Improving the representation of legal case texts with information extraction methods. In Proceedings ofthe 8th International Conference on Artificial Intelligence and Law (ICAIL 2001) (pp. 42–51). St. Louis, Missouri.
[17]
Calado P., Ribeiro-Neto B., Ziviani N., Moura E., and Silva I. Local versus global link information in the web ACM Transactions on Information Systems (TOIS) 2003 21 1 42-63
[18]
California Building Code (CBC) (1998). California building standards commission. Sacramento, CA.
[19]
Code of Federal Regulations (CFR) (2002). Title 40, Parts 141–143, US Environmental Protection Agency. Washington, DC.
[20]
Coglianese, C. (2003). E-Rulemaking: Information Technology and Regulatory Policy. Technical Report, Regulatory Policy Program, Kennedy School of Government. Cambridge, MA: Harvard University.
[21]
Coglianese C. Information technology and regulatory policy Social Science Computer Review 2004 22 1 85-91
[22]
Crouch, C. J., & Yang, B. (1992). Experiments in automatic statistical thesaurus construction. In Proceedings ofthe 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 77–88). Copenhagen, Denmark.
[23]
Crouch, R., Condoravdi, C., Stolle, R., King, T., de Paiva, V., Everett, J., & Bobrow, D, (2002). Scalability of redundancy detection in focused document collections. In Proceedings ofthe 1st International Workshop on Scalable Natural Language Understanding (ScaNaLU-2002). Heidelberg, Germany.
[24]
Daniels, J. J., & Rissland, E. L. (1997). What you saw is what you want: Using cases to seed information retrieval. In Proceedings ofthe 2nd International Conference on Case-Based Reasoning (ICCBR-97) (pp. 325–336). Providence, RI.
[25]
Deerwester S., Dumais S. T, Furnas G. W, Landauer T. K., and Harshman R. Indexing by latent semantic analysis Journal of the American Society of Information Science 1990 41 6 391-407
[26]
Draft Guidelines for Accessible Public Rights-of-Way (2002). US architectural and transportation barriers compliance board (access board). Washington, DC.
[27]
Dumais S. T. Improving the retrieval of information from external sources Behavior Research Methods, Instruments, and Computers 1991 23 2 229-236
[28]
Everett J. O., Bobrow D. G., Stolle R., Crouch R., de Paiva V., Condoravdi C., Berg M. V. D., and Polanyi L. Making ontologies work for resolving redundancies across documents Communications of the ACM 2002 45 2 55-60
[29]
Gardner, A. (1984). An artificial intelligence approach to legal reasoning. Ph.D. Thesis, Computer Science. Stanford, CA: Stanford University.
[30]
Garfield, E. (1995). New international professional society signals the maturing of scientometrics and informetrics. The Scientist,9(16).
[31]
Gibbens, M. P. (2000). CalDAG 2000: California disabled accessibility guidebook. Canoga Park, CA: Builder's Book.
[32]
Gibson, D., Kleinberg, J., & Raghavan, P. (1998). Inferring web communities from link topology. In Proceedings ofthe 9th ACM Conference on Hypertext and Hypermedia (pp. 225–234). Pittsburgh, PA.
[33]
Golub, G. H., & Van Loan, C. F. (1983). Matrix computations. Baltimore, MD: The Johns Hopkins University Press.
[34]
Gurrin, C., & Smeaton, A. F. (1999). A connectivity analysis approach to increasing precision in retrieval from hyperlinked documents. In Proceedings ofText REtrieval Conference (TREC). Gaithersburg, MD.
[35]
Hofmann, T. (1999). Probabilistic latent semantic indexing. In Proceedings ofthe 22nd Annual ACM Conference on Research and Development in Information Retrieval (pp. 50–57). Berkeley, California.
[36]
Ide, E. (1971). New experiments in relevance feedback. In G. Salton (Eds.), The SMART retrieval system-experiments in automatic document processing. Englewood Cliffs, NJ: Prentice Hall Inc.
[37]
Kerrigan, S. (2003). A software infrastructure for regulatory information management and compliance assistance. Ph.D. Thesis, Department of Civil and Environmental Engineering, Stanford University, Stanford, CA.
[38]
Kerrigan, S., & Law, K. (2003). Logic-based regulation compliance-assistance. In Proceedings ofthe 9th International Conference on Artificial Intelligence and Law (ICAIL 2003) (pp. 126–135). Edinburgh, Scotland.
[39]
Kleinberg, J. (1998). Authoritative sources in a hyperlinked environment. In Proceedings ofthe 9th ACM-SIAM Symposium on Discrete Algorithms (pp. 668–677). San Francisco, CA.
[40]
Lau, G. (2004). A comparative analysis framework for semi-structured documents, with applications to government regulations, Ph.D. Thesis, Civil and Environmental Engineering, Stanford University, Stanford, CA.
[41]
Lau, G., Kerrigan, S., & Law, K. (2003). An information infrastructure for government regulations. In Proceedings ofthe 13th Workshop on Information Technology and Systems (WITS'03) (pp. 37–42). Seattle, WA.
[42]
Lau, G., Law, K., & Wiederhold, G. (2003). Similarity analysis on government regulations. In Proceedings ofthe 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 111–117). Washington, DC.
[43]
Lau, G., Law, K., & Wiederhold, G. (2003). A framework for regulation comparison with application to accessibility codes. In Proceedings ofthe National Conference on Digital Government Research (pp. 251–254). Boston, MA.
[44]
Lin, C., Hu, PJ., Chen, H., & Schroeder, J. (2003). Technology implementation management in law enforcement: COPLINK system usability and user acceptance evaluations. In Proceedings ofthe National Conference on Digital Government Research (pp. 151–154). Boston, MA.
[45]
Merkl, D., & Schweighofer, E. (1997). En route to data mining in legal text corpora: Clustering, neural computation, and international treaties. In Proceedings ofthe 8th International Workshop on Database and Expert Systems Applications (pp. 465–470). Toulouse, France.
[46]
Miller, G. A, Beckwith, R., Fellbaun, C., Gross, D., & Miller, K. (1993). Five papers on wordnet. Technical Report, Cognitive Science Laboratory, Princeton, NJ.
[47]
Moens, M.-F., Uyttendaele, C., & Dumortier, J. (1997). Abstracting of legal cases: The SALOMON experience. In Proceedings ofthe 6th International Conference on Artificial Intelligence and Law (ICAIL 1997) (pp. 114–122). Melbourne, Australia.
[48]
Osborn, J., & Sterling, L. (1999). JUSTICE: A judicial search tool using intelligent concept extraction. In Proceedings ofthe 7th International Conference on Artificial Intelligence and Law (ICAIL 1999) (pp. 173–181). Oslo, Norway.
[49]
Page, L., Brin, S., Motwani, R., & Winograd, T. (1998). The pagerank citation ranking: bringing order to the web. Stanford, CA: Technical Report, Stanford University.
[50]
Potential Drinking Water Contaminant Index (2003). US Environmental Protection Agency. Washington, DC.
[51]
Proceedings of Business Compliance One Stop Workshop (2002). Small Business Administration. Queenstown, MD.
[52]
Proceedings of the National Conference on Digital Government Research (dg.o 2001). Los Angeles, CA.
[53]
Proceedings of the National Conference on Digital Government Research (dg.o 2002). Los Angeles, CA.
[54]
Proceedings of the National Conference on Digital Government Research (dg.o 2003). Boston, MA.
[55]
Qiu, Y., & Frei, H.-P. (1993). Concept based query expansion. In Proceedings ofthe 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 160–169). Pittsburgh, PA.
[56]
Raskopf, R. L., & Bender, D. (2003). Cross-border data: Information transfer restrictions pose a global challenge. New York Law Journal.
[57]
Rissland E. L., Ashley K. D., and Loui R. P. AI and law: a fruitful synergy Artificial Intelligence 2003 150 1–2 1-15
[58]
Rissland E. L. and Skalak D. B CABARET: rule interpretation in a hybrid architecture International Journal of Man-Machine Studies 1991 34 6 839-887
[59]
Rocchio, J. J. (1971). Relevance feedback in information retrieval. In G. Salton (Eds.), The SMART Retrieval System—Experiments in Automatic Document Processing. Englewood Cliffs, NJ: Prentice Hall Inc.
[60]
Salton, G. (1971). The smart retrieval system—Experiments in automatic document processing. Englewood Cliffs, NJ: Prentice Hall.
[61]
Salton G. and Buckley C. Term-weighting approaches in automatic retrieval Information Processing and Management 1988 24 5 513-523
[62]
Salton, G., & McGill, M. (1983). Introduction to modern information retrieval. New York, NY: McGraw-Hill.
[63]
Schweighofer, E., Rauber, A., & Dittenbach, M. (2001). Automatic text representation, classification and labeling in European law. In Proceedings ofthe 8th International Conference on Artificial Intelligence and Law (ICAIL 2001) (pp. 78–87.) St. Louis, Missouri.
[64]
Sergot M. J., Sadri F., Kowalski R. A., Kriwaczek F., Hammond P., and Cory H. T. The british nationality act as a logic program Communications of the ACM 1986 29 5 370-386
[65]
Shepard's Federal Citations (1990). Colorado Springs, CO: Shepards/Mcgraw-Hill.
[66]
Silva, I., Ribeiro-Neto, B., Calado, P., Moura, E., & Ziviani, N. (2000). Link-based and content-based evidential information in a belief network model. In Proceedings ofthe 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 96–103). Athens, Greece.
[67]
Technical Standards (2001). Scottish executive. Edinburgh, Scotland, UK.
[68]
Thompson, P. (2001). Automatic categorization of case law. In Proceedings ofthe 8th International Conference on Artificial Intelligence and Law (ICAIL 2001) (pp. 70–77). St. Louis, Missouri.
[69]
Uniform Federal Accessibility Standards (UFAS) (1997). US Architectural and Transportation Barriers Compliance Board (Access Board). Washington, DC.
[70]
Xu, J., & Croft, W. B. (1996). Query expansion using local and global document analysis. In Proceedings ofthe 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 4–11). Zurich, Switzerland.
[71]
Zeleznikow, J., & Hunter, D. (1994). Building intelligent legal information systems: Representation and reasoning in law. Deventer, The Netherlands: Kluwer Law and Taxation Publishers.

Cited By

View all
  • (2022)Learning by ConnectingOrganization Science10.1287/orsc.2021.152433:5(2018-2040)Online publication date: 1-Sep-2022
  • (2013)Link analysis for representing and retrieving legal informationProceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 210.1007/978-3-642-37256-8_32(380-393)Online publication date: 24-Mar-2013
  • (2012)REGNETProceedings of the 6th International Conference on Theory and Practice of Electronic Governance10.1145/2463728.2463764(175-183)Online publication date: 22-Oct-2012
  • Show More Cited By

Index Terms

  1. A relatedness analysis of government regulations using domain knowledge and structural organization
            Index terms have been assigned to the content through auto-classification.

            Recommendations

            Comments

            Please enable JavaScript to view thecomments powered by Disqus.

            Information & Contributors

            Information

            Published In

            cover image Information Retrieval
            Information Retrieval  Volume 9, Issue 6
            Dec 2006
            121 pages

            Publisher

            Kluwer Academic Publishers

            United States

            Publication History

            Published: 01 December 2006
            Accepted: 17 January 2006
            Received: 30 May 2005

            Author Tags

            1. Relatedness analysis
            2. E-rulemaking
            3. Structural analysis
            4. Feature matching

            Qualifiers

            • Research-article

            Contributors

            Other Metrics

            Bibliometrics & Citations

            Bibliometrics

            Article Metrics

            • Downloads (Last 12 months)0
            • Downloads (Last 6 weeks)0
            Reflects downloads up to 30 Sep 2024

            Other Metrics

            Citations

            Cited By

            View all
            • (2022)Learning by ConnectingOrganization Science10.1287/orsc.2021.152433:5(2018-2040)Online publication date: 1-Sep-2022
            • (2013)Link analysis for representing and retrieving legal informationProceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 210.1007/978-3-642-37256-8_32(380-393)Online publication date: 24-Mar-2013
            • (2012)REGNETProceedings of the 6th International Conference on Theory and Practice of Electronic Governance10.1145/2463728.2463764(175-183)Online publication date: 22-Oct-2012
            • (2008)Exploratory text mining of ocean law to measure overlapping agency and jurisdictional authorityProceedings of the 2008 international conference on Digital government research10.5555/1367832.1367844(53-62)Online publication date: 18-May-2008
            • (2007)Towards a compliance support framework for global software companiesProceedings of the 11th IASTED International Conference on Software Engineering and Applications10.5555/1647636.1647643(31-36)Online publication date: 6-Nov-2007
            • (2006)Locating related regulations using a comparative analysis approachProceedings of the 2006 international conference on Digital government research10.1145/1146598.1146662(229-238)Online publication date: 21-May-2006

            View Options

            View options

            Get Access

            Login options

            Media

            Figures

            Other

            Tables

            Share

            Share

            Share this Publication link

            Share on social media