default search action
Valter Crescenzi
Person information
- affiliation: Roma Tre University, Rome, Italy
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2022
- [c41]Roger Voyat, Valter Crescenzi, Paolo Merialdo:
OpenTRIAGE: Entity Linkage for Detail Webpages. SEBD 2022: 1-12 - 2021
- [j12]Valerio Cetorelli, Paolo Atzeni, Valter Crescenzi, Franco Milicchio:
The Smallest Extraction Problem. Proc. VLDB Endow. 14(11): 2445-2458 (2021) - [c40]Valerio Cetorelli, Valter Crescenzi, Paolo Merialdo, Roger Voyat:
NOAH: Creating Data Integration Pipelines over Continuously Extracted Web Data. EDBT/ICDT Workshops 2021 - [i2]Valter Crescenzi, Andrea De Angelis, Donatella Firmani, Maurizio Mazzei, Paolo Merialdo, Federico Piai, Divesh Srivastava:
Alaska: A Flexible Benchmark for Data Integration Tasks. CoRR abs/2101.11259 (2021) - 2020
- [e2]Federico Piai, Donatella Firmani, Valter Crescenzi, Andrea De Angelis, Xin Luna Dong, Maurizio Mazzei, Paolo Merialdo, Divesh Srivastava:
Proceedings of the 2nd International Workshop on Challenges and Experiences from Data Integration to Knowledge Graphs co-located with 46th International Conference on Very Large Data Bases, DI2KG@VLDB 2020, Tokyo, Japan, August 31, 2020. CEUR Workshop Proceedings 2726, CEUR-WS.org 2020 [contents]
2010 – 2019
- 2019
- [j11]Valter Crescenzi, Paolo Merialdo, Disheng Qiu:
Hybrid Crowd-Machine Wrapper Inference. ACM Trans. Knowl. Discov. Data 13(5): 51:1-51:43 (2019) - [c39]Jinsong Guo, Valter Crescenzi, Tim Furche, Giovanni Grasso, Georg Gottlob:
RED: Redundancy-Driven Data Extraction from Result Pages? WWW 2019: 605-615 - [e1]Donatella Firmani, Valter Crescenzi, Andrea De Angelis, Xin Luna Dong, Maurizio Mazzei, Paolo Merialdo, Divesh Srivastava:
Proceedings of the 1st International Workshop on Challenges and Experiences from Data Integration to Knowledge Graphs co-located with the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD 2019), Anchorage, Alaska, August 5, 2019. CEUR Workshop Proceedings 2512, CEUR-WS.org 2019 [contents] - 2018
- [j10]Luciano Barbosa, Valter Crescenzi, Xin Luna Dong, Paolo Merialdo, Federico Piai, Disheng Qiu, Yanyan Shen, Divesh Srivastava:
Big Data Integration for Product Specifications. IEEE Data Eng. Bull. 41(2): 71-81 (2018) - [c38]Luciano Barbosa, Valter Crescenzi, Xin Luna Dong, Paolo Merialdo, Federico Piai, Disheng Qiu, Yanyan Shen, Divesh Srivastava:
Lessons Learned and Research Agenda for Big Data Integration of Product Specifications. SEBD 2018 - [c37]Letizia Tanca, Paolo Atzeni, Davide Azzalini, Ilaria Bartolini, Luca Cabibbo, Luca Calderoni, Paolo Ciaccia, Valter Crescenzi, Juan Carlos De Martin, Selina Fenoglietto, Donatella Firmani, Sergio Greco, Francesco Isgrò, Dario Maio, Davide Martinenghi, Maristella Matera, Paolo Merialdo, Cristian Molinaro, Marco Patella, Roberto Prevete, Elisa Quintarelli, Antonio Santangelo, Andrea Tagarelli, Guglielmo Tamburrini, Riccardo Torlone:
Ethics-aware Data Governance (Vision Paper). SEBD 2018: 49 - [c36]Disheng Qiu, Luciano Barbosa, Valter Crescenzi, Paolo Merialdo, Divesh Srivastava:
Big Data Linkage for Product Specification Pages. SIGMOD Conference 2018: 67-81 - 2017
- [j9]Valter Crescenzi, Alvaro A. A. Fernandes, Paolo Merialdo, Norman W. Paton:
Crowdsourcing for data management. Knowl. Inf. Syst. 53(1): 1-41 (2017) - 2015
- [j8]Valter Crescenzi, Paolo Merialdo, Disheng Qiu:
Crowdsourcing large scale wrapper inference. Distributed Parallel Databases 33(1): 95-122 (2015) - [j7]Tim Weninger, Rodrigo Palácios, Valter Crescenzi, Thomas Gottron, Paolo Merialdo:
Web Content Extraction: a MetaAnalysis of its Past and Thoughts on its Future. SIGKDD Explor. 17(2): 17-23 (2015) - [i1]Tim Weninger, Rodrigo Palácios, Valter Crescenzi, Thomas Gottron, Paolo Merialdo:
Web Content Extraction - a Meta-Analysis of its Past and Thoughts on its Future. CoRR abs/1508.04066 (2015) - 2014
- [c35]Lorenz Bühmann, Ricardo Usbeck, Axel-Cyrille Ngonga Ngomo, Muhammad Saleem, Andreas Both, Valter Crescenzi, Paolo Merialdo, Disheng Qiu:
Web-Scale Extension of RDF Knowledge Bases from Templated Websites. ISWC (1) 2014: 66-81 - 2013
- [j6]Mirko Bronzi, Valter Crescenzi, Paolo Merialdo, Paolo Papotti:
Extraction and Integration of Partially Overlapping Web Sources. Proc. VLDB Endow. 6(10): 805-816 (2013) - [c34]Valter Crescenzi, Paolo Merialdo, Disheng Qiu:
Wrapper Generation Supervised by a Noisy Crowd. DBCrowd 2013: 8-13 - [c33]Valter Crescenzi, Paolo Merialdo, Disheng Qiu:
A framework for learning web wrappers from the crowd. WWW 2013: 261-272 - [c32]Valter Crescenzi, Paolo Merialdo, Disheng Qiu:
ALFRED: crowd assisted data extraction. WWW (Companion Volume) 2013: 297-300 - 2012
- [c31]Rolando Creo, Valter Crescenzi, Disheng Qiu, Paolo Merialdo:
Minimizing the Costs of the Training Data for Learning Web Wrappers. VLDS 2012: 35-40 - [p1]Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti:
Web Data Reconciliation: Models and Experiences. SeCO Book 2012: 1-15 - 2011
- [c30]Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti:
Contextual Data Extraction and Instance-Based Integration. VLDS 2011: 23-29 - [c29]Mirko Bronzi, Valter Crescenzi, Paolo Merialdo, Paolo Papotti:
Wrapper Generation for Overlapping Web Sources. Web Intelligence 2011: 32-35 - [c28]Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti:
Characterizing the uncertainty of web data: models and experiences. WebQuality@WWW 2011: 1-8 - [c27]Lorenzo Blanco, Mirko Bronzi, Valter Crescenzi, Paolo Merialdo, Paolo Papotti:
Automatically building probabilistic databases from the web. WWW (Companion Volume) 2011: 185-188 - 2010
- [c26]Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti:
Probabilistic Models to Reconcile Complex Data from Inaccurate Data Sources. CAiSE 2010: 83-97 - [c25]Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti:
Probabilistic Reconciliation of Records from Inaccurate Web Sources (Extended Abstract). SEBD 2010: 390-397 - [c24]Paolo Papotti, Valter Crescenzi, Paolo Merialdo, Mirko Bronzi, Lorenzo Blanco:
Redundancy-Driven Web Data Extraction and Integration. WebDB 2010 - [c23]Lorenzo Blanco, Mirko Bronzi, Valter Crescenzi, Paolo Merialdo, Paolo Papotti:
Exploiting information redundancy to wring out structured data from the web. WWW 2010: 1063-1064
2000 – 2009
- 2009
- [c22]Lorenzo Blanco, Mirko Bronzi, Valter Crescenzi, Paolo Merialdo, Paolo Papotti:
Data Extraction and Integration from Imprecise Web Sources. SEBD 2009: 229-236 - 2008
- [j5]Valter Crescenzi, Paolo Merialdo:
Wrapper Inference for Ambiguous Web Pages. Appl. Artif. Intell. 22(1&2): 21-52 (2008) - [j4]Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo:
Structure and Semantics of Data-IntensiveWeb Pages: An Experimental Study on their Relationships. J. Univers. Comput. Sci. 14(11): 1877-1892 (2008) - [c21]Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti:
Flint: Google-basing the Web. EDBT 2008: 720-724 - [c20]Claudio Bertoli, Valter Crescenzi, Paolo Merialdo:
Crawling programs for wrapper-based applications. IRI 2008: 160-165 - [c19]Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti:
Searching Entities on the Web by Sample. SEBD 2008: 406-413 - [c18]Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti:
Supporting the automatic construction of entity aware search engines. WIDM 2008: 149-156 - 2006
- [c17]Valter Crescenzi, Paolo Merialdo:
Efficient Techniques for Effective Wrapper Induction. ICDE Workshops 2006: 47 - 2005
- [j3]Valter Crescenzi, Paolo Merialdo, Paolo Missier:
Clustering Web pages based on their structure. Data Knowl. Eng. 54(3): 279-299 (2005) - [c16]Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo:
Harvesting Structurally Similar Pages. SEBD 2005: 109-116 - [c15]Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo:
Efficiently Locating Collections of Web Pages to Wrap. WEBIST 2005: 247-254 - 2004
- [j2]Valter Crescenzi, Giansalvatore Mecca:
Automatic information extraction from large websites. J. ACM 51(5): 731-779 (2004) - [c14]Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo:
Improving the expressiveness of ROADRUNNER. SEBD 2004: 62-69 - [c13]Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo, Paolo Missier:
An Automatic Data Grabber for Large Web Sites. VLDB 2004: 1321-1324 - 2003
- [c12]Luigi Arlotta, Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo:
Automatic annotation of data extracted from large web sites. SEBD 2003: 359-366 - [c11]Luigi Arlotta, Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo:
Automatic annotation of data extracted from large Web sites. WebDB 2003: 7-12 - [c10]Valter Crescenzi, Paolo Merialdo, Paolo Missier:
Fine-grain web site structure discovery. WIDM 2003: 15-22 - 2002
- [c9]Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo:
Wrapping-oriented classification of web pages. SAC 2002: 1108-1112 - [c8]Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo:
Back to Gold's Age: Bridging the Gap Between Traditional Grammar Inference and Web Information Extraction. SEBD 2002: 87-94 - [c7]Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo:
RoadRunner: automatic data extraction from data-intensive web sites. SIGMOD Conference 2002: 624 - 2001
- [c6]Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo:
Automatic Web Information Extraction in the ROADRUNNER System. ER (Workshops) 2001: 264-277 - [c5]Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo:
The RoadRunner Web Data Extraction System. SEBD 2001: 281-288 - [c4]Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo:
RoadRunner: Towards Automatic Data Extraction from Large Web Sites. VLDB 2001: 109-118 - 2000
- [c3]Giansalvatore Mecca, Paolo Merialdo, Paolo Atzeni, Valter Crescenzi:
Experiences in XML data management. SEBD 2000: 109-119
1990 – 1999
- 1999
- [c2]Giansalvatore Mecca, Paolo Merialdo, Paolo Atzeni, Valter Crescenzi:
The ARANEUS Guide to Web-Site Development. SEBD 1999: 167-177 - [c1]Giansalvatore Mecca, Paolo Merialdo, Paolo Atzeni, Valter Crescenzi:
The (Short) Araneus Guide to Web-Site Development. WebDB (Informal Proceedings) 1999: 13-18 - 1998
- [j1]Valter Crescenzi, Giansalvatore Mecca:
Grammars Have Exceptions. Inf. Syst. 23(8): 539-565 (1998)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-04-24 22:51 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint