Li X, Jiang C and Wang P. Matching Tabular Data to Knowledge Graph Based on Multi-level Scoring Filters for Table Entity Disambiguation. Web and Big Data. (227-242).
Paton N, Chen J and Wu Z.
(2023). Dataset Discovery and Exploration: A Survey. ACM Computing Surveys. 56:4. (1-37). Online publication date: 30-Apr-2024.
Zhong L, Wu J, Li Q, Peng H and Wu X.
(2023). A Comprehensive Survey on Automatic Knowledge Graph Construction. ACM Computing Surveys. 56:4. (1-62). Online publication date: 30-Apr-2024.
Berenguer A, Mazón J and Tomás D.
(2024). Word embeddings for retrieving tabular data from research publications. Machine Language. 113:4. (2227-2248). Online publication date: 1-Apr-2024.
Zecchini L, Bleifuß T, Simonini G, Bergamaschi S and Naumann F.
(2024). Determining the Largest Overlap between Tables. Proceedings of the ACM on Management of Data. 2:1. (1-26). Online publication date: 12-Mar-2024.
Fernandez R, Elmore A, Franklin M, Krishnan S and Tan C.
(2023). How Large Language Models Will Disrupt Data Management. Proceedings of the VLDB Endowment. 16:11. (3302-3309). Online publication date: 1-Jul-2023.
Fan G, Wang J, Li Y and Miller R. Table Discovery in Data Lakes: State-of-the-art and Future Directions. Companion of the 2023 International Conference on Management of Data. (69-75).
Fan G, Wang J, Li Y, Zhang D and Miller R.
(2023). Semantics-Aware Dataset Discovery from Data Lakes with Contextualized Column-Based Representation Learning. Proceedings of the VLDB Endowment. 16:7. (1726-1739). Online publication date: 1-Mar-2023.
Khatiwada A, Shraga R, Gatterbauer W and Miller R.
(2022). Integrating Data Lake Tables. Proceedings of the VLDB Endowment. 16:4. (932-945). Online publication date: 1-Dec-2022.
Asudeh A and Nargesian F.
(2022). Towards distribution-aware query answering in data markets. Proceedings of the VLDB Endowment. 15:11. (3137-3144). Online publication date: 1-Jul-2022.
Trabelsi M, Chen Z, Zhang S, Davison B and Heflin J. StruBERT: Structure-aware BERT for Table Search and Matching. Proceedings of the ACM Web Conference 2022. (442-451).
Amsterdamer Y and Cohen M. Automated Selection of Multiple Datasets for Extension by Integration. Proceedings of the 30th ACM International Conference on Information & Knowledge Management. (27-36).
Zhang S and Balog K.
(2021). Semantic Table Retrieval Using Keyword and Table Queries. ACM Transactions on the Web. 15:3. (1-33). Online publication date: 31-Aug-2021.
Chen Z, Zhang S and Davison B. WTR: A Test Collection for Web Table Retrieval. Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. (2514-2520).
Wang F, Sun K, Chen M, Pujara J and Szekely P. Retrieving Complex Tables with Multi-Granular Graph Representation Learning. Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. (1472-1482).
Kruit B, Boncz P and Urbani J. Extracting N-ary Facts from Wikipedia Table Clusters. Proceedings of the 29th ACM International Conference on Information & Knowledge Management. (655-664).
Chen Z, Trabelsi M, Heflin J, Xu Y and Davison B. Table Search Using a Deep Contextualized Language Model. Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. (589-598).
Zhang S and Balog K.
(2020). Web Table Extraction, Retrieval, and Augmentation. ACM Transactions on Intelligent Systems and Technology. 11:2. (1-35). Online publication date: 30-Apr-2020.
Nargesian F, Zhu E, Miller R, Pu K and Arocena P.
(2019). Data lake management. Proceedings of the VLDB Endowment. 12:12. (1986-1989). Online publication date: 1-Aug-2019.
Huang S, Liu J, Korn F, Wang X, Wu Y, Markowitz D and Yu C. Contextual Fact Ranking and Its Applications in Table Synthesis and Compression. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. (285-293).
Zhang L, Zhang S and Balog K. Table2Vec. Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval. (1029-1032).
Lehmberg O and Bizer C. Synthesizing N-ary Relations from Web Tables. Proceedings of the 9th International Conference on Web Intelligence, Mining and Semantics. (1-12).
Wang P, Shea R, Wang J and Wu E. Progressive Deep Web Crawling Through Keyword Queries For Data Enrichment. Proceedings of the 2019 International Conference on Management of Data. (229-246).
Zhang S, Abdul Zada V and Balog K. SmartTable. The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. (1297-1300).
Zhang S and Balog K. On-the-fly Table Generation. The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. (595-604).
Nargesian F, Zhu E, Pu K and Miller R.
(2018). Table union search on open data. Proceedings of the VLDB Endowment. 11:7. (813-825). Online publication date: 1-Mar-2018.
(2018). Emerging trend of big data analytics in bioinformatics. International Journal of Bioinformatics Research and Applications. 14:1-2. (144-205). Online publication date: 1-Jan-2018.
Lehmberg O and Bizer C.
(2017). Stitching web tables for improving matching quality. Proceedings of the VLDB Endowment. 10:11. (1502-1513). Online publication date: 1-Aug-2017.
Ritze D, Lehmberg O, Oulabi Y and Bizer C. Profiling the Potential of Web Tables for Augmenting Cross-domain Knowledge Bases. Proceedings of the 25th International Conference on World Wide Web. (251-261).
Hu C, Li Y, Cheng X and Liu Z.
(2016). A Virtual Dataspaces Model for large-scale materials scientific data access. Future Generation Computer Systems. 54:C. (456-468). Online publication date: 1-Jan-2016.
Liu D, Ma L and Liu X. Research on Adaptive Wrapper in Deep Web Data Extraction. Proceedings of the Second International Conference on Internet of Vehicles - Safe and Intelligent Mobility - Volume 9502. (409-423).
Ristoski P, Bizer C and Paulheim H.
(2015). Mining the Web of Linked Data with RapidMiner. Web Semantics: Science, Services and Agents on the World Wide Web. 35:P3. (142-151). Online publication date: 1-Dec-2015.
Lehmberg O, Ritze D, Ristoski P, Meusel R, Paulheim H and Bizer C.
(2015). The Mannheim Search Join Engine. Web Semantics: Science, Services and Agents on the World Wide Web. 35:P3. (159-166). Online publication date: 1-Dec-2015.
Jamil H and Jagadish H. A Structured Query Model for the Deep Relational Web. Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. (1679-1682).
Jagadish H, Qian L and Nandi A.
(2015). Organic databases. International Journal of Computational Science and Engineering. 11:3. (270-283). Online publication date: 1-Oct-2015.
Wang H, Liu A, Wang J, Ziebart B, Yu C and Shen W. Context Retrieval for Web Tables. Proceedings of the 2015 International Conference on The Theory of Information Retrieval. (251-260).
Qiu D, Barbosa L, Dong X, Shen Y and Srivastava D.
(2015). Dexter. Proceedings of the VLDB Endowment. 8:13. (2194-2205). Online publication date: 1-Sep-2015.
Eberius J, Thiele M, Braunschweig K and Lehner W. DrillBeyond. Proceedings of the 27th International Conference on Scientific and Statistical Database Management. (1-12).
Eberius J, Thiele M, Braunschweig K and Lehner W. Top-k entity augmentation using consistent set covering. Proceedings of the 27th International Conference on Scientific and Statistical Database Management. (1-12).
Daniel F. Live, Personal Data Integration Through UI-Oriented Computing. Proceedings of the 15th International Conference on Engineering the Web in the Big Data Era - Volume 9114. (479-497).
Braunschweig K, Thiele M, Eberius J and Lehner W. Column-specific context extraction for web tables. Proceedings of the 30th Annual ACM Symposium on Applied Computing. (1072-1077).
Jamil H.
(2015). Improving integration effectiveness of ID mapping based biological record linkage. IEEE/ACM Transactions on Computational Biology and Bioinformatics. 12:2. (473-486). Online publication date: 1-Mar-2015.
Lan R, Adelfio M and Samet H. Spatio-temporal disease tracking using news articles. Proceedings of the Third ACM SIGSPATIAL International Workshop on the Use of GIS in Public Health. (31-38).
Piccinini H, Casanova M, Leme L and Furtado A.
(2014). Publishing deep web geographic data. Geoinformatica. 18:4. (769-792). Online publication date: 1-Oct-2014.
Sarawagi S and Chakrabarti S. Open-domain quantity queries on web tables. Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. (711-720).
Chen Z and Cafarella M. Integrating spreadsheet data via accurate and low-effort extraction. Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. (1126-1135).
Qiu D and Luce L. Extraction and integration of web sources with humans and domain knowledge. Proceedings of the 23rd International Conference on World Wide Web. (1295-1298).
Mehmood R and Maurer H. Towards the integration of images on the Web. Proceedings of International Conference on Information Integration and Web-based Applications & Services. (580-584).
Weninger T, Johnston T and Han J.
(2013). The parallel path framework for entity discovery on the web. ACM Transactions on the Web. 7:3. (1-29). Online publication date: 1-Sep-2013.
Janga P and Davis K. Tabular Web Data. Proceedings of the 15th International Conference on Data Warehousing and Knowledge Discovery - Volume 8057. (26-33).
Ling X, Halevy A, Wu F and Yu C. Synthesizing union tables from the web. Proceedings of the Twenty-Third international joint conference on Artificial Intelligence. (2677-2683).
Liu D, Wang X, Yan Z and Li Q. Robust web data extraction. Proceedings of the 2012 international conference on Web Information Systems and Mining. (497-509).
Naffakhi N and Faiz R. Using Bayesian networks theory for aggregated search to XML retrieval. Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics. (1-4).
Nakashole N and Weikum G. Real-time population of knowledge bases. Proceedings of the Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction. (41-45).
Pimplikar R and Sarawagi S.
(2012). Answering table queries on the web using column keywords. Proceedings of the VLDB Endowment. 5:10. (908-919). Online publication date: 1-Jun-2012.
De Meo P, Ferrara E, Fiumara G and Ricciardello A.
(2012). A novel measure of edge centrality in social networks. Knowledge-Based Systems. 30. (136-150). Online publication date: 1-Jun-2012.
Qian L, Cafarella M and Jagadish H. Sample-driven schema mapping. Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data. (73-84).
Howe B, Cole G, Khoussainova N and Battle L. Automatic example queries for ad hoc databases. Proceedings of the 2011 ACM SIGMOD International Conference on Management of data. (1319-1322).
Elmeleegy H, Madhavan J and Halevy A.
(2011). Harvesting relational tables from lists on the web. The VLDB Journal — The International Journal on Very Large Data Bases. 20:2. (209-226). Online publication date: 1-Apr-2011.
Michelson M, Macskassy S, Minton S and Getoor L. Materializing multi-relational databases from the web using taxonomic queries. Proceedings of the fourth ACM international conference on Web search and data mining. (355-364).
Cafarella M, Halevy A and Madhavan J.
(2011). Structured data on the web. Communications of the ACM. 54:2. (72-79). Online publication date: 1-Feb-2011.
Jamil H. A secured collaborative model for data integration in life sciences. Transactions on large-scale data- and knowledge-centered systems IV. (158-187).
Blanco L, Bronzi M, Crescenzi V, Merialdo P and Papotti P. Redundancy-driven web data extraction and integration. Procceedings of the 13th International Workshop on the Web and Databases. (1-6).
Weikum G and Theobald M. From information to knowledge. Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. (65-76).
Lin C, Zhao B, Weninger T, Han J and Liu B. Entity relation discovery from web tables and links. Proceedings of the 19th international conference on World wide web. (1145-1146).
Yin X, Tan W, Li X and Tu Y. Automatic extraction of clickable structured web contents for name entity queries. Proceedings of the 19th international conference on World wide web. (991-1000).