- research-article, February 2025
Reliable Text-to-SQL with Adaptive Abstention
Proceedings of the ACM on Management of Data (PACMMOD), Volume 3, Issue 1, Article No.: 69, Pages 1–30, https://doi.org/10.1145/3709719
Large language models (LLMs) have revolutionized natural language interfaces for databases, particularly in text-to-SQL conversion. However, current approaches often generate unreliable outputs when faced with ambiguity or insufficient context.
We ...
- research-article, February 2025
Two efficient iteration methods for solving the absolute value equations
Applied Numerical Mathematics (APNM), Volume 208, Issue PB, Pages 148–159, https://doi.org/10.1016/j.apnum.2024.10.009
Abstract: Two efficient iteration methods are proposed for solving the absolute value equation which are the accelerated generalized SOR-like (AGSOR-like) iteration method and the preconditioned generalized SOR-like (PGSOR-like) iteration method. We prove ...
- research-article, November 2024
ODIN: Object Density Aware Index for C$k$NN Queries Over Moving Objects on Road Networks
IEEE Transactions on Knowledge and Data Engineering (IEEECS_TKDE), Volume 36, Issue 11, Pages 6758–6772, https://doi.org/10.1109/TKDE.2023.3344662
We study the problem of processing continuous $k$ nearest neighbor ...
- research-article, October 2024
Fuzzy Harsanyi solutions for fuzzy level structure games with multi weight systems
Discrete Applied Mathematics (DAMA), Volume 356, Issue C, Pages 117–132, https://doi.org/10.1016/j.dam.2024.05.020
Abstract: In real life, the relationship between social or economic environment and resource constraints may impose restrictions on the formation of coalitions. More and more scholars have recognized the limitations of (crisp) cooperative games and propose ...
- research-article, October 2024
DAMOCRO: A Data Migration Framework Using Online Classification and Reordering
CIKM '24: Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, Pages 4546–4553, https://doi.org/10.1145/3627673.3680097
This paper introduces DAMOCRO, a data migration framework using online classification and tuple reordering to improve throughput and decrease the costs of data migration. The DAMOCRO workflow consists of four ...
- research-article, August 2024
Predictive breast cancer diagnosis using ensemble fuzzy model
Abstract: Breast cancer continues to be a major global health challenge, necessitating reliable diagnostic methods for early detection and improved patient outcomes. This study introduces a novel ensemble fuzzy model for predictive breast cancer diagnosis, ...
Highlights
- A new approach integrates deep-learning classifiers with fuzzy logic for improved decision-making.
- The ensemble includes Inception-V4, Inception-ResNet, and Inception V3/V4 + BN, enhancing accuracy.
- Fuzzy logic allows adaptive ...
- research-article, July 2024
A Distributed Solution for Efficient K Shortest Paths Computation Over Dynamic Road Networks
IEEE Transactions on Knowledge and Data Engineering (IEEECS_TKDE), Volume 36, Issue 7, Pages 2759–2773, https://doi.org/10.1109/TKDE.2023.3346377
The problem of identifying the k-shortest paths (KSPs for short) in a dynamic road network is essential to many location-based services. Road networks are dynamic in the sense that the weights of the edges in the corresponding graph ...
Optimizing Video Queries with Declarative Clues
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 11, Pages 3256–3268, https://doi.org/10.14778/3681954.3681998
Video Database Management Systems (VDBMS) leverage advancements in computer vision and deep learning for efficient video data analysis and retrieval. This paper introduces the concept of user-specified Clues, allowing users to incorporate domain-specific ...
- opinion, July 2024
Special Issue on Data Economy and Data Marketplaces
IEEE Internet Computing (IEEECS_INTERNET), Volume 28, Issue 4, Pages 5–6, https://doi.org/10.1109/MIC.2024.3415950
This special issue of IEEE Internet Computing explores the transformative area of data marketplaces within the data economy, presenting three rigorously selected articles that address crucial topics, such as data valuation, privacy preservation, and trust ...
- research-article, May 2024
Data Acquisition for Improving Model Confidence
Proceedings of the ACM on Management of Data (PACMMOD), Volume 2, Issue 3, Article No.: 131, Pages 1–25, https://doi.org/10.1145/3654934
In recent years, there has been a growing recognition that high-quality training data is crucial for the performance of machine learning models. This awareness has catalyzed both research endeavors and industrial initiatives dedicated to data acquisition ...
- research-article, November 2023
Fuzzy cooperative game with intersecting priori coalition: A generalized configuration value
Abstract: A fuzzy game with a configuration structure (i.e., a fuzzy configuration structure game) is introduced, which not only allows the fuzzy participation of players but also admits the intersection of fuzzy priori coalitions. A kind of ...
- rapid-communication, November 2023
Coalition structure value considering the outside alignment option of priori coalition
Operations Research Letters (OPERRL), Volume 51, Issue 6, Pages 659–665, https://doi.org/10.1016/j.orl.2023.10.014
Abstract: In a cooperative game with a coalition structure (i.e., coalition structure game), a coalition structure value (CS-value) is obtained on the assumption that the coalition structure has been or can be formed. The necessity analysis of coalition ...
- research-article, September 2023
ML4DM '23: The Third Workshop on the Emerging Applications of Machine Learning in Modern Data Management
CASCON '23: Proceedings of the 33rd Annual International Conference on Computer Science and Software Engineering, Pages 251–253
Machine Learning (ML) has gained prominence across various fields, including data management. Rule-based components are substituted by ML-driven counterparts that extract rules from experience. The prevalence of statistical methods is waning as ...
- research-article, September 2023
Optimizing Data Migration Using Online Clustering
CASCON '23: Proceedings of the 33rd Annual International Conference on Computer Science and Software Engineering, Pages 173–178
Data migration refers to the transfer of data from one location to another, for instance, from a local database to a cloud server or from one cloud to another. To minimize business disruption during this process, it is essential to ensure that data ...
- research-article, August 2023
A fuzzy value for the fuzzy TU-game based on the relative group marginal contribution of coalition with restricted size
Abstract: Considering the group marginal contribution of coalitions (with size no larger than k), we study the cooperative game (transferable utility game (TU-game)) with a fuzzy payoff (i.e., fuzzy TU-game). In order to reflect the impact of the coalition'...
- research-article, August 2023
Data and AI Model Markets: Opportunities for Data and Model Sharing, Discovery, and Integration
Proceedings of the VLDB Endowment (PVLDB), Volume 16, Issue 12, Pages 3872–3873, https://doi.org/10.14778/3611540.3611573
The markets for data and AI models are rapidly emerging and increasingly significant in the realm and the practices of data science and artificial intelligence. These markets are being studied from diverse perspectives, such as e-commerce, economics, ...
- research-article, June 2023
OM3: An Ordered Multi-level Min-Max Representation for Interactive Progressive Visualization of Time Series
Proceedings of the ACM on Management of Data (PACMMOD), Volume 1, Issue 2, Article No.: 145, Pages 1–24, https://doi.org/10.1145/3589290
We present a novel multi-level representation of time series called OM3 that facilitates efficient interactive progressive visualization of large data stored in a database and supports various interactions such as resizing, panning, zooming, and visual ...
- research-article, May 2023
dbET: Execution Time Distribution-based Plan Selection
Proceedings of the ACM on Management of Data (PACMMOD), Volume 1, Issue 1, Article No.: 31, Pages 1–26, https://doi.org/10.1145/3588711
While selecting the execution plan for a given query based on a single estimated cost is a generally-adopted strategy, it is usually error-prone and fails to comprehensively profile the plan performance. In this work, we complement existing plan ...
- research-article, April 2023
TME: Tree-guided Multi-task Embedding Learning towards Semantic Venue Annotation
ACM Transactions on Information Systems (TOIS), Volume 41, Issue 4, Article No.: 112, Pages 1–24, https://doi.org/10.1145/3582553
The prevalence of location-based services has generated a deluge of check-ins, enabling the task of human mobility understanding. Among the various types of information associated with the check-in venues, categories (e.g., Bar and Museum) are vital to ...
- research-article, March 2023
Self-Training With Double Selectors for Low-Resource Named Entity Recognition
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), Volume 31, Pages 1265–1275, https://doi.org/10.1109/TASLP.2023.3250828
Named Entity Recognition (NER) is fundamental to multiple downstream natural language processing (NLP) tasks, but most advanced NER methods heavily rely on massive labeled data with high cost. In this paper, we explore the effectiveness of self-training ...