Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Modeling Interference for Individual Treatment Effect Estimation from Networked Observational Data

Published: 09 December 2023 Publication History

Abstract

Estimating individual treatment effect (ITE) from observational data has attracted great interest in recent years, which plays a crucial role in decision-making across many high-impact domains such as economics, medicine, and e-commerce. Most existing studies of ITE estimation assume that different units at play are independent and do not influence each other. However, many social science experiments have shown that there often exist different levels of interactions between units in observational data, especially in a networked environment. As a result, the treatment assignment of one unit can affect the outcome of other units connected to it in the network, which is referred to as the interference or spillover effect. In this article, we study an important problem of ITE estimation from networked observational data by modeling the interference between different units and provide a principled framework to support such study. Methodologically, we propose a novel framework, SPNet, that first captures the influence of hidden confounders with the aid of graph convolutional network and then models the interference by introducing an environment summary variable and developing a masked attention mechanism. Experimental evaluations on several semi-synthetic datasets based on real-world networks corroborate the superiority of our proposed framework over state-of-the-art individual treatment effect estimation methods.

References

[1]
Peter M. Aronow, Cyrus Samii, et al. 2017. Estimating average causal effects under general interference, with application to a social network experiment. Ann. Appl. Stat. 11, 4 (2017), 1912–1947.
[2]
Onur Atan, James Jordon, and Mihaela Van der Schaar. 2018. Deep-treat: Learning optimal personalized treatments from observational data using neural networks. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32.
[3]
Usaid Awan, Marco Morucci, Vittorio Orlandi, Sudeepa Roy, Cynthia Rudin, and Alexander Volfovsky. 2020. Almost-matching-exactly for treatment effect estimation under network interference. In International Conference on Artificial Intelligence and Statistics. PMLR, 3252–3262.
[4]
Guillaume Basse and Avi Feller. 2018. Analyzing two-stage experiments in the presence of interference. J. Am. Statist. Assoc. 113, 521 (2018), 41–55.
[5]
Howard Bauchner, Robert Vinci, Sharon Bak, Colleen Pearson, and Michael J. Corwin. 1996. Parents and procedures: A randomized controlled trial. Pediatrics 98, 5 (1996), 861–867.
[6]
David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent dirichlet allocation. The Journal of Machine Learning Research 3, Jan (2003), 993–1022.
[7]
Leo Breiman. 2001. Random forests. Mach. Learn. 45, 1 (2001), 5–32.
[8]
Shiyu Chang, Wei Han, Jiliang Tang, Guo-Jun Qi, Charu C. Aggarwal, and Thomas S. Huang. 2015. Heterogeneous network embedding via deep architectures. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’15). 119–128.
[9]
Zhixuan Chu, Stephen L. Rathbun, and Sheng Li. 2021. Graph infomax adversarial learning for treatment effect estimation with networked observational data. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 176–184.
[10]
Michaël Defferrard, Xavier Bresson, and Pierre Vandergheynst. 2016. Convolutional neural networks on graphs with fast localized spectral filtering. Advances in Neural Information Processing Systems 29, (2016), 3837–3845.
[11]
Johannes A. N. Dorresteijn, Frank L. J. Visseren, Paul M. Ridker, Annemarie M. J. Wassink, Nina P. Paynter, Ewout W. Steyerberg, Yolanda van der Graaf, and Nancy R. Cook. 2011. Estimating treatment effects for individual patients based on the results of randomised clinical trials. Br. Med. J. 343 (2011).
[12]
Zahra Fatemi and Elena Zheleva. 2020. Minimizing interference and selection bias in network experiment design. In Proceedings of the International AAAI Conference on Web and Social Media, Vol. 14. 176–186.
[13]
Zahra Fatemi and Elena Zheleva. 2020. Network experiment design for estimating direct treatment effects. In Proceedings of the KDD Workshop on Mining and Learning with Graphs (MLG’20), Vol. 8.
[14]
Amy Finkelstein, Annetta Zhou, Sarah Taubman, and Joseph Doyle. 2020. Health care hotspotting—a randomized, controlled trial. New Engl. J. Med. 382, 2 (2020), 152–162.
[15]
Laura Forastiere, Edoardo M. Airoldi, and Fabrizia Mealli. 2021. Identification and estimation of treatment and interference effects in observational studies on networks. J. Am. Statist. Assoc. 116, 534 (2021), 901–918.
[16]
Ruocheng Guo, Lu Cheng, Jundong Li, P. Richard Hahn, and Huan Liu. 2020. A survey of learning causality with data: Problems and methods. ACM Comput. Surv. 53, 4 (2020), 1–37.
[17]
Ruocheng Guo, Jundong Li, Yichuan Li, K. Selçuk Candan, Adrienne Raglin, and Huan Liu. 2021. Ignite: A minimax game toward learning individual treatment effects from networked observational data. In Proceedings of the 29th International Conference on International Joint Conferences on Artificial Intelligence. 4534–4540.
[18]
Ruocheng Guo, Jundong Li, and Huan Liu. 2020. Counterfactual evaluation of treatment assignment functions with networked observational data. In Proceedings of the SIAM International Conference on Data Mining. SIAM, 271–279.
[19]
Ruocheng Guo, Jundong Li, and Huan Liu. 2020. Learning individual causal effects from networked observational data. In Proceedings of the 13th International Conference on Web Search and Data Mining. 232–240.
[20]
Jennifer L. Hill. 2011. Bayesian nonparametric modeling for causal inference. J. Comput. Graph. Statist. 20, 1 (2011), 217–240.
[21]
Guanglei Hong. 2015. Causality in a Social World: Moderation, Mediation and Spill-over. John Wiley&Sons.
[22]
Qiang Huang, Tingyu Xia, Huiyan Sun, Makoto Yamada, and Yi Chang. 2020. Unsupervised nonlinear feature selection from high-dimensional signed networks. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 4182–4189.
[23]
Michael G. Hudgens and M. Elizabeth Halloran. 2008. Toward causal inference with interference. J. Am. Statist. Assoc. 103, 482 (2008), 832–842.
[24]
Guido W. Imbens and Donald B. Rubin. 2015. Causal Inference in Statistics, Social, and Biomedical Sciences. Cambridge University Press.
[25]
Fredrik Johansson, Uri Shalit, and David Sontag. 2016. Learning representations for counterfactual inference. In International Conference on Machine Learning, PMLR, 3020–3029.
[26]
Nathan Kallus, Xiaojie Mao, and Angela Zhou. 2019. Interval estimation of individual-level causal effects under unobserved confounding. In The 22nd International Conference on Artificial Intelligence and Statistics, PMLR, 2281–2290.
[27]
Dongyeop Kang, Waleed Ammar, Bhavana Dalvi, Madeleine van Zuylen, Sebastian Kohlmeier, Eduard Hovy, and Roy Schwartz. 2018. A Dataset of Peer Reviews (PeerRead): Collection, Insights and NLP Applications. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), Association for Computational Linguistics, New Orleans, Louisiana, 1647–1661.
[28]
Diederik P. Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In Proceedings of the 3rd International Conference on Learning Representations (ICLR’15), Yoshua Bengio and Yann LeCun (Eds.).
[29]
Diederik P. Kingma and Max Welling. 2014. Auto-encoding variational bayes. In Prceedings of the 2nd International Conference on Learning Representations (ICLR’14), Yoshua Bengio and Yann LeCun (Eds.).
[30]
Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In 5th International Conference on Learning Representations (ICLR’17), Toulon, France, April 24-26, 2017, Conference Track Proceedings.
[31]
Lan Liu and Michael G. Hudgens. 2014. Large sample randomization inference of causal effects in the presence of interference. J. Am. Statist. Assoc. 109, 505 (2014), 288–301.
[32]
Christos Louizos, Uri Shalit, Joris M. Mooij, David Sontag, Richard Zemel, and Max Welling. 2017. Causal effect inference with deep latent-variable models. Advances in Neural Information Processing Systems 30, (2017), 6446–6456.
[33]
Jing Ma, Ruocheng Guo, Chen Chen, Aidong Zhang, and Jundong Li. 2021. Deconfounding with networked observational data in a dynamic environment. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining. 166–174.
[34]
Yunpu Ma and Volker Tresp. 2021. Causal inference under networked interference and intervention policy enhancement. In International Conference on Artificial Intelligence and Statistics. PMLR, 3700–3708.
[35]
Yunpu Ma, Yuyi Wang, and Volker Tresp. 2020. Causal inference under networked interference. arXiv:2002.08506. Retrieved from https://arxiv.org/abs/2002.08506
[36]
Elizabeth L. Ogburn, Oleg Sofrygin, Ivan Diaz, and Mark J. Van der Laan. 2022. Causal inference for social network data. Journal of the American Statistical Association (2022), 1–15.
[37]
Zhaonan Qu, Ruoxuan Xiong, Jizhou Liu, and Guido Imbens. 2021. Efficient treatment effect estimation in observational studies under heterogeneous partial interference. arXiv:2107.12420. Retrieved from https://arxiv.org/abs/2107.12420
[38]
Vineeth Rakesh, Ruocheng Guo, Raha Moraffah, Nitin Agarwal, and Huan Liu. 2018. Linked causal variational autoencoder for inferring paired spillover effects. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 1679–1682.
[39]
Donald B. Rubin. 1974. Estimating causal effects of treatments in randomized and nonrandomized studies. J. Educ. Psychol. 66, 5 (1974), 688.
[40]
Donald B. Rubin. 2005. Causal inference using potential outcomes: Design, modeling, decisions. J. Am. Statist. Assoc. 100, 469 (2005), 322–331.
[41]
Usman Shahid and Elena Zheleva. 2019. Counterfactual learning in networks: an empirical study of model dependence. In Beyond Curve Fitting: Causation, Counterfactuals, and Imagination-based AI. AAAI Spring Symposium (AAAI-WHY 2019), Standford, CA., Association for the Advancement of Artificial Intelligence.
[42]
Uri Shalit, Fredrik D. Johansson, and David Sontag. 2017. Estimating individual treatment effect: generalization bounds and algorithms. In International Conference on Machine Learning. 3076–3085.
[43]
Cosma Rohilla Shalizi and Edward McFowland III. 2023. Estimating causal peer influence in homophilous social networks by inferring latent locations. Journal of the American Statistical Association 118, 541 (2023), 707–718.
[44]
Cosma Rohilla Shalizi and Andrew C. Thomas. 2011. Homophily and contagion are generically confounded in observational social network studies. Sociol. Methods Res. 40, 2 (2011), 211–239.
[45]
Chuan Shi, Yitong Li, Jiawei Zhang, Yizhou Sun, and S Yu Philip. 2016. A survey of heterogeneous information network analysis. IEEE Trans. Knowl. Data Eng. 29, 1 (2016), 17–37.
[46]
Jiliang Tang, Yi Chang, Charu Aggarwal, and Huan Liu. 2016. A survey of signed network mining in social media. Comput. Surv. 49, 3 (2016), 1–37.
[47]
Lei Tang and Huan Liu. 2011. Leveraging social media networks for classification. Data Min. Knowl. Discov. 23, 3 (2011), 447–478.
[48]
Panos Toulis, Alexander Volfovsky, and Edoardo M. Airoldi. 2018. Propensity score methodology in the presence of network entanglement between treatments. arXiv:1801.07310. Retrieved from https://arxiv.org/abs/1801.07310
[49]
Victor Veitch, Yixin Wang, and David Blei. 2019. Using embeddings to correct for unobserved confounding in networks. Adv. Neural Inf. Process. Syst. 32 (2019).
[50]
Stefan Wager and Susan Athey. 2018. Estimation and inference of heterogeneous treatment effects using random forests. J. Am. Statist. Assoc. 113, 523 (2018), 1228–1242.
[51]
Tianyu Wang, Marco Morucci, M Usaid Awan, Yameng Liu, Sudeepa Roy, Cynthia Rudin, and Alexander Volfovsky. 2021. Flame: A fast large-scale almost matching exactly approach to causal inference. J. Mach. Learn. Res. 22, 1 (2021), 1477–1517.
[52]
Christopher Winship and Stephen L. Morgan. 1999. The estimation of causal effects from observational data. Annu. Rev. Sociol. 25, 1 (1999), 659–706.
[53]
Liuyi Yao, Zhixuan Chu, Sheng Li, Yaliang Li, Jing Gao, and Aidong Zhang. 2021. A survey on causal inference. ACM Transactions on Knowledge Discovery from Data (TKDD) 15, 5 (2021), 1–46.
[54]
Liuyi Yao, Sheng Li, Yaliang Li, Mengdi Huai, Jing Gao, and Aidong Zhang. 2018. Representation learning for treatment effect estimation from observational data. Adv. Neural Inf. Process. Syst. 31 (2018).
[55]
Bo-Heng Zhang, Bing-Hui Yang, and Zhao-You Tang. 2004. Randomized controlled trial of screening for hepatocellular carcinoma. J. Cancer Res. Clin. Oncol. 130, 7 (2004), 417–422.

Index Terms

  1. Modeling Interference for Individual Treatment Effect Estimation from Networked Observational Data

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Knowledge Discovery from Data
    ACM Transactions on Knowledge Discovery from Data  Volume 18, Issue 3
    April 2024
    663 pages
    EISSN:1556-472X
    DOI:10.1145/3613567
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 09 December 2023
    Online AM: 18 October 2023
    Accepted: 26 September 2023
    Revised: 29 March 2023
    Received: 12 September 2022
    Published in TKDD Volume 18, Issue 3

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Causal inference
    2. ITE estimation
    3. network interference

    Qualifiers

    • Research-article

    Funding Sources

    • National Natural Science Foundation of China

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 504
      Total Downloads
    • Downloads (Last 12 months)425
    • Downloads (Last 6 weeks)31
    Reflects downloads up to 22 Nov 2024

    Other Metrics

    Citations

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media