- research-article, September 2024
Beyond The Page-Break: Towards Better Tools for Remediation of Born-Digital Documents
HT '24: Proceedings of the 35th ACM Conference on Hypertext and Social Media, Pages 70–77
https://doi.org/10.1145/3648188.3678215
A legacy of print is that much of our process and tooling is predicated on using text in paginated form, such as was required for (paper) printed media. Increasingly, digitally-created (‘born-digital’) documents will never be used non-digitally and yet ...
- Article, July 2024
Integrating Mathematical Data and Resources: Advancements in zbMATH Open for Enhanced Mathematical Research Accessibility and Reproducibility
Abstract: We report the ongoing efforts of swMATH, an integral part of zbMATH Open, to collect precise referencing software metadata. zbMATH Open is emerging as a unified platform offering a spectrum of mathematical resources, including mathematical ...
- research-article, July 2024
Browsing and Searching Metadata of TREC
SIGIR '24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, Pages 313–323
https://doi.org/10.1145/3626772.3657873
Information Retrieval (IR) research is deeply rooted in experimentation and evaluation, and the Text REtrieval Conference (TREC) has been playing a central role in making that possible since its inauguration in 1992. TREC's mission centers around ...
- research-article, June 2024
Acoustic Classification of Guitar Tunings with Deep Learning
DLfM '24: Proceedings of the 11th International Conference on Digital Libraries for Musicology, Pages 6–14
https://doi.org/10.1145/3660570.3660574
A guitar tuning is the allocation of pitches to the open strings of the guitar. A wide variety of guitar tunings are featured in genres such as blues, classical, folk, and rock. Standard tuning provides a convenient placing of intervals and a manageable ...
- research-article, August 2024
Smart Emergency Alerting System: A Machine Learning Approach
CCCAI '24: Proceedings of the 2024 2nd International Conference on Communications, Computing and Artificial Intelligence, Pages 8–15
https://doi.org/10.1145/3676581.3676583
Abstract: Recently, Saudi Arabia has hosted significant sports and technology events. Saudi Arabia has also successfully secured the bid to host Expo 2030 and declared its intention to host the FIFA World Cup in 2034. These crowds pertain to the elderly ...
- research-article, August 2024
FAIR enough: a Vision for Research Objects in Empirical Software Engineering Studies
WSESE '24: Proceedings of the 1st IEEE/ACM International Workshop on Methodological Issues with Empirical Studies in Software Engineering, Pages 64–67
https://doi.org/10.1145/3643664.3648201
In recent years, the software engineering research community has been fostering Open Science through several initiatives. Although the transparency fostered in Open Science can address some of the concerns related to appropriate study design and data ...
- research-article, July 2024
AndroZoo: A Retrospective with a Glimpse into the Future
MSR '24: Proceedings of the 21st International Conference on Mining Software Repositories, Pages 389–393
https://doi.org/10.1145/3643991.3644863
In 2016, we released AndroZoo, a continuously expanding dataset of Android applications that aggregates apps from various sources, including the official Google Play app market. As of today, AndroZoo contains approximately 24 million APK files, making ...
- research-article, April 2024
Exif2Vec: A Framework to Ascertain Untrustworthy Crowdsourced Images Using Metadata
ACM Transactions on the Web (TWEB), Volume 18, Issue 3, Article No.: 31, Pages 1–27
https://doi.org/10.1145/3645094
In the context of social media, the integrity of images is often dubious. To tackle this challenge, we introduce Exif2Vec, a novel framework specifically designed to discover modifications in social media images. The proposed framework leverages an image’...
- poster, May 2024
Designing Metadata for the Use of Artificial Intelligence in Academia
SAC '24: Proceedings of the 39th ACM/SIGAPP Symposium on Applied Computing, Pages 1662–1664
https://doi.org/10.1145/3605098.3636201
Academic writing is one of the most important tasks in Academia. The pressure to "publish or perish" drives researchers to use all the tools available to try to improve their papers and their impact. Generative artificial intelligence (AI) that can ...
- research-article, March 2024
Discovering Functional Dependencies through Hitting Set Enumeration
Proceedings of the ACM on Management of Data (PACMMOD), Volume 2, Issue 1, Article No.: 43, Pages 1–24
https://doi.org/10.1145/3639298
Functional dependencies (FDs) are among the most important integrity constraints in databases. They serve to normalize datasets and thus resolve redundancies, they contribute to query optimization, and they are frequently used to guide data cleaning ...
- research-article, June 2024
Towards a More Generic and Elastic Metadata Management Model in a Data Lake Environment
ICMLC '24: Proceedings of the 2024 16th International Conference on Machine Learning and Computing, Pages 44–51
https://doi.org/10.1145/3651671.3651773
The evolution of the vast amount of heterogeneous data sources is leading to the emergence of several new concepts. One of the best-known concepts that is emerging as a new and trending topic in the big data space is the data lake. This is a central ...
- research-article, January 2024
On the Impact of Showing Evidence from Peers in Crowdsourced Truthfulness Assessments
ACM Transactions on Information Systems (TOIS), Volume 42, Issue 3, Article No.: 87, Pages 1–26
https://doi.org/10.1145/3637872
Misinformation has been rapidly spreading online. The common approach to dealing with it is deploying expert fact-checkers who follow forensic processes to identify the veracity of statements. Unfortunately, such an approach does not scale well. To deal ...
- research-article, January 2024
(Re?)Building trust in research integrity
Information Services and Use (INSU), Volume 44, Issue 1, Pages 27–30
https://doi.org/10.3233/ISU-230200
This article is based on a session with the same title from the 2023 APE (Academic Publishing in Europe) conference, in which the authors discussed the challenges and opportunities for the scholarly communications community to improve trust in the ...
- research-article, November 2023
Positioning Paradata: A Conceptual Frame for AI Processual Documentation in Archives and Recordkeeping Contexts
Journal on Computing and Cultural Heritage (JOCCH), Volume 16, Issue 4, Article No.: 75, Pages 1–19
https://doi.org/10.1145/3594728
The emergence of sophisticated Artificial Intelligence (AI) and machine learning tools poses a challenge to archives and records professionals, who are accustomed to understanding and documenting the activities of human agents rather than the often-opaque ...
- research-article, November 2023
An Ontological Approach for Unlocking the Colonial Archive
- Gustavo Candela,
- Javier Pereda,
- Dolores Sáez,
- Pilar Escobar,
- Alexander Sánchez,
- Andrés Villa Torres,
- Albert A. Palacios,
- Kelly McDonough,
- Patricia Murrieta-Flores
Journal on Computing and Cultural Heritage (JOCCH), Volume 16, Issue 4, Article No.: 74, Pages 1–18
https://doi.org/10.1145/3594727
Cultural Heritage institutions have been exploring new ways of making available their catalogues in digital format. Recently, new approaches have emerged as methods to reuse and make available the contents for computational purposes. This work introduces ...
- short-paper, December 2023
Towards Open-Source Maps Metadata
SIGSPATIAL '23: Proceedings of the 31st ACM International Conference on Advances in Geographic Information Systems, Article No.: 31, Pages 1–4
https://doi.org/10.1145/3589132.3625576
This paper envisions having an open-source web portal for detailed worldwide road network maps with rich metadata. This would be major advancement from current portals that only have road networks without important metadata, including traffic-related ...
- research-article, November 2023
Xfast: Extreme File Attribute Stat Acceleration for Lustre
- Yingjin Qian,
- Wen Cheng,
- Lingfang Zeng,
- Xi Li,
- Marc-André Vef,
- Andreas Dilger,
- Siyao Lai,
- Shuichi Ihara,
- Yong Fan,
- André Brinkmann
SC '23: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, Article No.: 96, Pages 1–12
https://doi.org/10.1145/3581784.3607080
Directory tree walks on parallel file systems are costly operations frequently required by many storage management tasks. Even listing the contents of a single directory can take minutes to hours for huge directories, as the tree walk performance of ...
- research-article, October 2023
Common Voice and accent choice: data contributors self-describe their spoken accents in diverse ways
EAAMO '23: Proceedings of the 3rd ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization, Article No.: 35, Pages 1–10
https://doi.org/10.1145/3617694.3623258
The use of machine learning (ML)-powered speech technologies has increased significantly in recent years [40, 56, 72]. The datasets used for training speech models often represent demographic features of the speaker – such as gender, age, and accent. ...
- Article, October 2023
Metadata Improves Segmentation Through Multitasking Elicitation
Domain Adaptation and Representation Transfer, Pages 147–155
https://doi.org/10.1007/978-3-031-45857-6_15
Abstract: Metainformation is a common companion to biomedical images. However, this potentially powerful additional source of signal from image acquisition has had limited use in deep learning methods, for semantic segmentation in particular. Here, we ...
- short-paper, October 2023
Collection Space Navigator: An Interactive Visualization Interface for Multidimensional Datasets
VINCI '23: Proceedings of the 16th International Symposium on Visual Information Communication and Interaction, Article No.: 24, Pages 1–5
https://doi.org/10.1145/3615522.3615546
We introduce the Collection Space Navigator (CSN), a browser-based visualization tool to explore, research, and curate large collections of visual digital artifacts that are associated with multidimensional data, such as vector embeddings or tables of ...