Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3638584.3638602acmotherconferencesArticle/Chapter ViewAbstractPublication PagescsaiConference Proceedingsconference-collections
research-article

User entity alignment method based on cross-attribute knowledge association

Published: 14 March 2024 Publication History

Abstract

User entity alignment is the core technology of associating multisource user identities and constructing user portraits, which is of great significance in cyberspace security, personalized service recommendation, social network, data mining, and other fields. It is difficult to accurately align user entities based on common attributes when the common attributes of multisource user data are sparse. Aiming at the above problem, we propose a user entity alignment method based on cross-attribute knowledge association. Firstly, the attribute values in the user information are linked to the corresponding entities in a knowledge graph, and the representation vector of each attribute value is obtained by embedding the subgraph of the knowledge graph. With the help of knowledge graph, the knowledge association between attribute values is embedded into the attribute vectors. At the same time, to accurately measure the attribute weight, the attribute identification degree is calculated by the distribution of attribute values. Finally, the user representation vector is generated by weighted cumulative attribute value vectors, and the similarity between user vectors is calculated to judge whether two users refer to the same person entity. Experimental results demonstrate that, the accuracy, recall, and F1 score of the proposed method are not less than 0.87 on the person entity dataset with sparse attributes. Compared with existing typical methods based on common attributes and methods based on knowledge graph embedding, the accuracy, recall, and F1 score are 12%, 7% and 10% higher than the comparative algorithm respectively.

References

[1]
Zafarani R, Liu H. "Connecting users across social media sites: a behavioral-modeling approach," Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Chicago: ACM Press, 2013: 41-49.
[2]
Zhang Y, Tang J, Yang Z "Cosnet: connecting heterogeneous social networks with local and global consistency," Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Sydney: ACM Press, 2015: 1485-1494.
[3]
Korula N, Lattanzi S. "An efficient reconciliation algorithm for social networks," Proceedings of the VLDB Endowment, 2014, 7(5): 377-388.
[4]
Lu C T, Shuai H H, Yu P S. "Identifying your customers in social networks," Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management. Shanghai: ACM Press, 2014:391-400.
[5]
Zafarani R, Liu H. "Connecting Corresponding Identities across Communities," ICWSM. 2009, 9: 354–357.
[6]
Iofciu T, Fankhauser P, Abel F "Identifying Users Across Social Tagging Systems," In International Conference on Weblogs and Social Media, Barcelona, Catalonia, Spain, July. 2010
[7]
Ikeda M, Ono S, Sato I "Person Name Disambiguation on the Web by Two-Stage Clustering," In 2nd Web People Search Evaluation Workshop (WePS 2009), 18th WWW Conference.
[8]
Motoyama M, Varghese G. "I seek you: searching and matching individuals in social networks," In Eleventh International Workshop on Web Information and Data Management. 2009: 67–75.
[9]
Jain P, Kumaraguru P, Joshi A. "identifying users across multiple online social networks," 2013: 1259–1268.
[10]
Raad E, Chbeir R, Dipanda A. "User Profile Matching in Social Networks," In International Conference on Network-Based Information Systems. 2010: 297–304.
[11]
Ma J, Qiao Y, Hu G "Balancing User Profile and Social Network Structure for Anchor Link Inferring across Multiple Online Social Networks," IEEE Access. 2017, PP (99): 1-1.
[12]
Wei Y C, Lin M S et al. "Name Disambiguation in Person Information Mining," IEEE/WIC/ACM International Conference on Web, 2008:378-381
[13]
Wang X, Sun A, Kardes H, "Probabilistic estimates of attribute statistics and match likelihood for people entity resolution," IEEE International Conference on Big Data. IEEE, 2015.
[14]
Guan S, Jin X, Jia Y "Self-learning and embedding based entity alignment," IEEE International Conference on Big Knowledge. 2017.
[15]
Qiu Q, Luo J, Yin M. "Person Name Disambiguation by distinguishing the Importance of features based on Topological Distance," Current Trends in Computer Science and Mechanical Automation Vol.1. De Gruyter Open Poland, 2018, pp. 342-351.
[16]
Zhang Y, Wang L, Li X "Social Identity Link Across Incomplete Social In-formation Sources Using Anchor Link Expansion," In PAKDD 2016. 2016:395–408.
[17]
Goga O, Loiseau P, Sommer R "On the Reliability of Profile Matching Across Large Online Social Networks," In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2015: 1799–1808.
[18]
Zhang Y, Tang J, Yang Z "Cosnet: connecting heterogeneous social networks with local and global consistency," In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2015: 1485–1494.
[19]
Zhang Q, Sun Z, Hu W, "Multi-view Knowledge Graph Embedding for Entity Alignment," arXiv preprint arXiv:1906.02390, 2019.
[20]
Suchanek F M, Abiteboul S, Senellart P. "PARIS: probabilistic alignment of relations, instances, and schema," Proceedings of the Vldb Endowment, 2011, 5(3):157-168.
[21]
Lee S, Hwang S. "ARIA: asymmetry resistant instance alignment," Twenty-Eighth AAAI Conference on Artificial Intelligence. 2014.
[22]
Zhuang Y, Li G L, Feng J H. "Review of Knowledge base entity alignment techniques," Journal of Computer Research and Development, 2016, a53(1):165-192.
[23]
Fan F, Li Z, Wang Y. "Cohesion based attribute value matching," International Congress on Image & Signal Processing. 2018.
[24]
Wang X, Sun A, Kardes H "Probabilistic estimates of attribute statistics and match likelihood for people entity resolution," IEEE International Conference on Big Data. IEEE, 2015
[25]
Li J, Wang Z, Zhang X "Large scale instance matching via multiple indexes and candidate selection," Knowledge-Based Systems, 2013, 50(Complete):112-120.
[26]
Wick M, Singh S, Mccallum A. "A discriminative hierarchical model for fast coreference at large scale," Proc Acl, 2012:379-388
[27]
Guan S, Jin X, Jia Y "Self-learning and embedding based entity alignment," IEEE International Conference on Big Knowledge. 2017.
[28]
Wu G, Ying H, Hu X. "Entity linking: an issue to extract corresponding entity with knowledge base," IEEE Access, 2018, 6(99):1-1.
[29]
McNeill N, Kardes H, Borthwick A. "Dynamic record blocking: efficient linking of massive databases in mapreduce," Proceedings of the 10th International Workshop on Quality in Databases (QDB). 2012.
[30]
Xu C "Research on Markov logic networks," Journal of software, 2011, 22 (8): 1699-1713
[31]
Wang X, Liu K, He S "Multi-Source Knowledge Bases Entity Alignment by Leveraging Semantic Tags," Journal of computer science, 2017 (03): 169-179
[32]
Gao S, Xing Z, Ma Y "Enhancing knowledge sharing in stack overflow via automatic external web resources linking," International Conference on Engineering of Complex Computer Systems. 2018.
[33]
Lacoste-Julien S, Palla K, Davies A "Sigma: Simple greedy matching for aligning large knowledge bases," Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2013: 572-580.
[34]
Christen P. "Development and user experiences of an open source data cleaning, deduplication and record linkage system," ACM SIGKDD Explorations Newsletter, 2009, 11(1): 39-48
[35]
Zhao D, Ning Z, Ming X "An Improved User Identification Method Across Social Networks Via Tagging Behaviors," 2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI). IEEE, 2018.
[36]
Tang R, Miao Z, Jiang S "Interlayer link prediction in multiplex social networks based on multiple types of consistency between embedding vectors," IEEE Transactions on Cybernetics, 2021.
[37]
Majeed A, Lee S. "Anonymization techniques for privacy preserving data publishing: A comprehensive survey," IEEE access, 2020, 9: 8512-8545.
[38]
Tang R, Jiang S, Chen X "Interlayer link prediction in multiplex social networks: an iterative degree penalty algorithm," Knowledge-Based Systems, 2020, 194: 105598.
[39]
Milne D, Witten I H. "Learning to link with wikipedia,"Proceedings of the 17th ACM conference on Information and knowledge management. ACM, 2008: 509-518.
[40]
Shen W, Wang J, Han J. "Entity linking with a knowledge base: issues, techniques, and solutions," IEEE Transactions on Knowledge and Data Engineering, 2015, 27(2):443-460.
[41]
Zhang C, Miao Z, Xiao H "Knowledge graph embedding for hyper-relational data," Tsinghua Science & Technology, 2017, 22(2):185-197.
[42]
Bordes A, Usunier N, Garcia-Duran A "Translating embeddings for modeling multi-relational data," Advances in neural information processing systems. 2013: 2787-2795.

Index Terms

  1. User entity alignment method based on cross-attribute knowledge association

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    CSAI '23: Proceedings of the 2023 7th International Conference on Computer Science and Artificial Intelligence
    December 2023
    563 pages
    ISBN:9798400708688
    DOI:10.1145/3638584
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 14 March 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. attribute embedding
    2. attribute identification degree
    3. entity alignment
    4. knowledge association
    5. knowledge graph

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Funding Sources

    Conference

    CSAI 2023

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 25
      Total Downloads
    • Downloads (Last 12 months)25
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 11 Feb 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media