IEEECS_TKDE: Vol 25, No 6

Volume 25, Issue 6, June 2013

Publisher:

IEEE Educational Activities Department
445 Hoes Lane, P.O. Box 1331, Piscataway, NJ
United States

ISSN: 1041-4347

article

A Survival Modeling Approach to Biomedical Search Result Diversification Using Wikipedia

Pages 1201–1212, https://doi.org/10.1109/TKDE.2012.24

In this paper, we propose a survival modeling approach to promoting ranking diversity for biomedical information retrieval. The proposed approach is concerned with finding relevant documents that can deliver more distinct aspects of a query. First, two ...

article

Centroid-Based Actionable 3D Subspace Clustering

Pages 1213–1226, https://doi.org/10.1109/TKDE.2012.37

Actionable 3D subspace clustering from real-world continuous-valued 3D (i.e., object-attribute-context) data promises tangible benefits such as discovery of biologically significant protein residues and profitable stocks, but existing algorithms are ...

article

Constrained Text Coclustering with Supervised and Unsupervised Constraints

Pages 1227–1239, https://doi.org/10.1109/TKDE.2012.45

In this paper, we propose a novel constrained coclustering method to achieve two goals. First, we combine information-theoretic coclustering and constrained clustering to improve clustering performance. Second, we adopt both supervised and unsupervised ...

article

Crowdsourced Trace Similarity with Smartphones

Pages 1240–1253, https://doi.org/10.1109/TKDE.2012.55

Smartphones are nowadays equipped with a number of sensors, such as WiFi, GPS, accelerometers, etc. This capability allows smartphone users to easily engage in crowdsourced computing services, which contribute to the solution of complex problems in a ...

article

Customized Policies for Handling Partial Information in Relational Databases

Pages 1254–1271, https://doi.org/10.1109/TKDE.2012.91

Most real-world databases have at least some missing data. Today, users of such databases are “on their own” in terms of how they manage this incompleteness. In this paper, we propose the general concept of partial information policy (PIP) operator to ...

article

Decision Trees for Mining Data Streams Based on the McDiarmid's Bound

Pages 1272–1279, https://doi.org/10.1109/TKDE.2012.66

In mining data streams, the most popular tool is the Hoeffding tree algorithm. It uses Hoeffding's bound to determine the smallest number of examples needed at a node to select a splitting attribute. In the literature the same Hoeffding's bound was ...

article

Discovering Characterizations of the Behavior of Anomalous Subpopulations

Pages 1280–1292, https://doi.org/10.1109/TKDE.2012.58

We consider the problem of discovering attributes, or properties, accounting for the a priori stated abnormality of a group of anomalous individuals (the outliers) with respect to an overall given population (the inliers). To this aim, we introduce the ...

article

FoCUS: Learning to Crawl Web Forums

Pages 1293–1306, https://doi.org/10.1109/TKDE.2012.56

In this paper, we present Forum Crawler Under Supervision (FoCUS), a supervised web-scale forum crawler. The goal of FoCUS is to crawl relevant forum content from the web with minimal overhead. Forum threads contain information content that is the ...

article

Improving Word Similarity by Augmenting PMI with Estimates of Word Polysemy

Pages 1307–1322, https://doi.org/10.1109/TKDE.2012.30

Pointwise mutual information (PMI) is a widely used word similarity measure, but it lacks a clear explanation of how it works. We explore how PMI differs from distributional similarity, and we introduce a novel metric, $({\rm PMI}_{max})$, that augments ...

article

Incentive Compatible Privacy-Preserving Data Analysis

Pages 1323–1335, https://doi.org/10.1109/TKDE.2012.61

In many cases, competing parties who have private data may collaboratively conduct privacy-preserving distributed data analysis (PPDA) tasks to learn beneficial data models or analysis results. Most often, the competing parties have different ...

article

Nonnegative Matrix Factorization: A Comprehensive Review

Pages 1336–1353, https://doi.org/10.1109/TKDE.2012.51

Nonnegative Matrix Factorization (NMF), a relatively novel paradigm for dimensionality reduction, has been in the ascendant since its inception. It incorporates the nonnegativity constraint and thus obtains the parts-based representation as well as ...

article

On Identifying Critical Nuggets of Information during Classification Tasks

Pages 1354–1367, https://doi.org/10.1109/TKDE.2012.112

In large databases, there may exist critical nuggets—small collections of records or instances that contain domain-specific important information. This information can be used for future decision making such as labeling of critical, unlabeled data ...

article

Radio Database Compression for Accurate Energy-Efficient Localization in Fingerprinting Systems

Pages 1368–1379, https://doi.org/10.1109/TKDE.2011.241

Location fingerprinting is a positioning method that exploits the already existing infrastructures such as cellular networks or WLANs. Regarding the recent demand for energy efficient networks and the emergence of issues like green networking, we ...

article

Semi-Supervised Nonlinear Hashing Using Bootstrap Sequential Projection Learning

Pages 1380–1393, https://doi.org/10.1109/TKDE.2012.76

In this paper, we study the effective semi-supervised hashing method under the framework of regularized learning-based hashing. A nonlinear hash function is introduced to capture the underlying relationship among data points. Thus, the dimensionality of ...

article

Spatial Approximate String Search

Pages 1394–1409, https://doi.org/10.1109/TKDE.2012.48

This work deals with the approximate string search in large spatial databases. Specifically, we investigate range queries augmented with a string similarity search predicate in both Euclidean space and road networks. We dub this query the spatial ...

article

SVStream: A Support Vector-Based Algorithm for Clustering Data Streams

Pages 1410–1424, https://doi.org/10.1109/TKDE.2011.263

In this paper, we propose a novel data stream clustering algorithm, termed SVStream, which is based on support vector domain description and support vector clustering. In the proposed algorithm, the data elements of a stream are mapped into a kernel ...

article

The Move-Split-Merge Metric for Time Series

Pages 1425–1438, https://doi.org/10.1109/TKDE.2012.88

A novel metric for time series, called Move-Split-Merge (MSM), is proposed. This metric uses as building blocks three fundamental operations: Move, Split, and Merge, which can be applied in sequence to transform any time series into any other time ...

article

A User-Friendly Patent Search Paradigm

Pages 1439–1443, https://doi.org/10.1109/TKDE.2012.63

As an important operation for finding existing relevant patents and validating a new patent application, patent search has attracted considerable attention recently. However, many users have limited knowledge about the underlying patents, and they have ...

article

IEEE Open Access Publishing

Page 1444, https://doi.org/10.1109/TKDE.2013.63

IEEE Transactions on Knowledge and Data Engineering
