Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3491101.3503575acmconferencesArticle/Chapter ViewAbstractPublication PageschiConference Proceedingsconference-collections
extended-abstract

Informing Housing Policy through Web Automation: Lessons for Designing Programming Tools for Domain Experts

Published: 28 April 2022 Publication History

Abstract

Housing costs have risen dramatically in the past decade, surpassing their pre-Recession levels, but the data that housing researchers and policymakers rely on to understand these dynamics remain subject to important limitations in their spatiotemporal granularity or methodological transparency. While these aspects of existing public and private data sources present barriers to understanding the geography of cost and availability in markets across the United States, web data about housing opportunities provide an important alternative—albeit one that demands technical skills that would-be data users may lack. This case study documents the experiences of a collaboration between social and computer scientists focused on using a novel programming-by-demonstration tool for web automation, Helena, to inform rental housing policy and inequalities in the United States. While this project was initially focused on collecting housing ads from a single site within the Seattle area, the capacity to scale our project to new sources and locations afforded by Helena’s human-centered design allowed a team of social scientists to progress to scraping data across the country and multiple platforms. Using this project as a case study, we discuss a.) important programming and research challenges that were encountered and b.) how Helena’s design helped us overcome these barriers to using scraped web data in basic research and policy analysis.

References

[1]
Geoff Boeing and Paul Waddell. 2017. New insights into rental housing markets across the United States: Web scraping and analyzing craigslist rental listings. Journal of Planning Education and Research 37, 4 (2017), 457–476.
[2]
Sarah Chasins and Rastislav Bodik. 2017. Skip Blocks: Reusing Execution History to Accelerate Web Scripts. Proc. ACM Program. Lang. 1, OOPSLA, Article 51 (Oct. 2017), 28 pages. https://doi.org/10.1145/3133875
[3]
Sarah Elizabeth Chasins. 2019. Democratizing Web Automation: Programming for Social Scientists and Other Domain Experts. Ph.D. Dissertation. UC Berkeley.
[4]
Sarah E. Chasins, Maria Mueller, and Rastislav Bodik. 2018. Rousillon: Scraping Distributed Hierarchical Web Data. In Proceedings of the 31st Annual ACM Symposium on User Interface Software and Technology (Berlin, Germany) (UIST ’18). Association for Computing Machinery, New York, NY, USA, 963–975. https://doi.org/10.1145/3242587.3242661
[5]
Raj Chetty, Nathaniel Hendren, and Lawrence F Katz. 2016. The effects of exposure to better neighborhoods on children: New evidence from the Moving to Opportunity experiment. American Economic Review 106, 4 (2016), 855–902.
[6]
Jennie E Christensen. 2020. The Demise of the CFAA in Data Scraping Cases. Notre Dame JL Ethics & Pub. Pol’y 34 (2020), 529.
[7]
PostGIS Project Steering Committee. 2021. PostGIS - Spatial and Geographic objects for PostgreSQL. Website.
[8]
Ana Costa, Victoria Sass, Ian Kennedy, Roshni Roy, Rebecca J Walter, Arthur Acolin, Kyle Crowder, Chris Hess, Alex Ramiller, and Sarah Chasins. 2021. Toward a Cross-Platform Framework. Cityscape 23, 2 (2021), 327–340.
[9]
Kyle Crowder, Erin Carll, Jerald Herting, and Chris Hess. 2018. Seattle Rental Housing Study.
[10]
Kyle Crowder, Matthew Hall, and Stewart E Tolnay. 2011. Neighborhood immigration and native out-migration. American sociological review 76, 1 (2011), 25–47.
[11]
Marcia Fernald 2021. The State of the Nation’s Housing. Joint Center for Housing Studies of Harvard University (2021).
[12]
Joint Center for Housing Studies of Harvard University. 2021. The Rent Eats First: Rental Housing Unaffordability in the US. (2021).
[13]
John I Gilderbloom, Lin Ye, Matthew J Hanka, and Kareem M Usher. 2009. Intercity rent differentials in the US housing market 2000: Understanding rent variations as a sociological phenomenon. Journal of urban affairs 31, 4 (2009), 409–430.
[14]
Chris Hess, Arthur Acolin, Rebecca Walter, Ian Kennedy, Sarah Chasins, and Kyle Crowder. 2021. Searching for housing in the digital age: Neighborhood representation on internet rental housing platforms across space, platform, and metropolitan segregation. Environment and Planning A: Economy and Space (2021), 0308518X211034177.
[15]
Chris Hess, Gregg Colburn, Kyle Crowder, and Ryan Allen. 2020. Racial disparity in exposure to housing cost burden in the United States: 1980–2017. Housing Studies (2020), 1–21.
[16]
Christian Hess, Rebecca J Walter, Arthur Acolin, and Sarah Chasins. 2019. Comparing Small Area Fair Market Rents with Other Rental Measures across Diverse Housing Markets. Cityscape (Washington, DC) 21, 3 (2019), 159.
[17]
iOpus. 2013. Browser Scripting, Data Extraction and Web Testing by iMacros. http://www.iopus.com/imacros/.
[18]
Ian Kennedy, Chris Hess, Amandalynne Paullada, and Sarah Chasins. 2021. Racialized Discourse in Seattle Rental Ad Texts. Social Forces 99, 4 (2021), 1432–1456.
[19]
Maria Krysan and Kyle Crowder. 2017. Cycle of segregation: Social processes and residential stratification. Russell Sage Foundation.
[20]
Tessa Lau. 2009. Why programming-by-demonstration systems fail: Lessons learned for usable ai. AI Magazine 30, 4 (2009), 65–65.
[21]
David Lazer and Jason Radford. 2017. Data ex machina: introduction to big data. Annual Review of Sociology 43 (2017), 19–39.
[22]
Vu Le and Sumit Gulwani. 2014. FlashExtract: A Framework for Data Extraction by Examples. In Proceedings of the 35th ACM SIGPLAN Conference on Programming Language Design and Implementation (Edinburgh, United Kingdom) (PLDI ’14). ACM, New York, NY, USA, 542–553. https://doi.org/10.1145/2594291.2594333
[23]
Gilly Leshed, Eben M. Haber, Tara Matthews, and Tessa Lau. 2008. CoScripter: automating & sharing how-to knowledge in the enterprise. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Florence, Italy) (CHI ’08). ACM, New York, NY, USA, 1719–1728. https://doi.org/10.1145/1357054.1357323
[24]
Henry Lieberman. 2001. Your wish is my command: Programming by example. Morgan Kaufmann.
[25]
Henry Lieberman, Fabio Paternò, Markus Klann, and Volker Wulf. 2006. End-user development: An emerging paradigm. In End user development. Springer, 1–8.
[26]
Wil Lieberman-Cribbin, Stephanie Tuminello, Raja M Flores, and Emanuela Taioli. 2020. Disparities in COVID-19 testing and positivity in New York City. American journal of preventive medicine 59, 3 (2020), 326–332.
[27]
James Lin, Jeffrey Wong, Jeffrey Nichols, Allen Cypher, and Tessa A. Lau. 2009. End-user programming of mashups with Vegemite. In Proceedings of the 14th international conference on Intelligent user interfaces (Sanibel Island, Florida, USA) (IUI ’09). ACM, New York, NY, USA, 97–106. https://doi.org/10.1145/1502650.1502667
[28]
Ann Owens. 2019. Building inequality: Housing segregation and income segregation. Sociological Science 6(2019), 497.
[29]
R Core Team. 2013. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org/
[30]
CA: Environmental Systems Research Institute Redlands. 2011. ArcGIS Desktop: Release 10.
[31]
Daniel Schneider and Kristen Harknett. 2019. Consequences of routine work-schedule instability for worker health and well-being. American Sociological Review 84, 1 (2019), 82–114.
[32]
Wei Wang, David Rothschild, Sharad Goel, and Andrew Gelman. 2015. Forecasting elections with non-representative polls. International Journal of Forecasting 31, 3 (2015), 980–991.

Cited By

View all
  • (2023)Co-Designing for Transparency: Lessons from Building a Document Organization Tool in the Criminal Justice DomainProceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency10.1145/3593013.3594093(1463-1478)Online publication date: 12-Jun-2023
  • (2023)MIWA: Mixed-Initiative Web Automation for Better User Control and ConfidenceProceedings of the 36th Annual ACM Symposium on User Interface Software and Technology10.1145/3586183.3606720(1-15)Online publication date: 29-Oct-2023
  • (2022)Segmented Information, Segregated Outcomes: Housing Affordability and Neighborhood Representation on a Voucher-Focused Online Housing Platform and Three Mainstream AlternativesHousing Policy Debate10.1080/10511482.2022.213354833:6(1511-1535)Online publication date: Nov-2022

Index Terms

  1. Informing Housing Policy through Web Automation: Lessons for Designing Programming Tools for Domain Experts
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CHI EA '22: Extended Abstracts of the 2022 CHI Conference on Human Factors in Computing Systems
      April 2022
      3066 pages
      ISBN:9781450391566
      DOI:10.1145/3491101
      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 28 April 2022

      Check for updates

      Author Tags

      1. applied sociology
      2. housing policy
      3. web automation

      Qualifiers

      • Extended-abstract
      • Research
      • Refereed limited

      Funding Sources

      Conference

      CHI '22
      Sponsor:
      CHI '22: CHI Conference on Human Factors in Computing Systems
      April 29 - May 5, 2022
      LA, New Orleans, USA

      Acceptance Rates

      Overall Acceptance Rate 6,164 of 23,696 submissions, 26%

      Upcoming Conference

      CHI '25
      CHI Conference on Human Factors in Computing Systems
      April 26 - May 1, 2025
      Yokohama , Japan

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)35
      • Downloads (Last 6 weeks)6
      Reflects downloads up to 12 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Co-Designing for Transparency: Lessons from Building a Document Organization Tool in the Criminal Justice DomainProceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency10.1145/3593013.3594093(1463-1478)Online publication date: 12-Jun-2023
      • (2023)MIWA: Mixed-Initiative Web Automation for Better User Control and ConfidenceProceedings of the 36th Annual ACM Symposium on User Interface Software and Technology10.1145/3586183.3606720(1-15)Online publication date: 29-Oct-2023
      • (2022)Segmented Information, Segregated Outcomes: Housing Affordability and Neighborhood Representation on a Voucher-Focused Online Housing Platform and Three Mainstream AlternativesHousing Policy Debate10.1080/10511482.2022.213354833:6(1511-1535)Online publication date: Nov-2022

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media