IEEE Transactions on Knowledge and Data Engineering

IEEE Transactions on Knowledge and Data Engineering (TKDE) is an archival journal published monthly designed to inform researchers, developers, managers, strategic planners, users, and others interested in state-of-the-art and state-of-the-practice activities in the knowledge and data engineering area. Read the full scope of TKDE


Expand your horizons with Colloquium, a monthly survey of abstracts from all CS transactions! Replaces OnlinePlus in January 2017.


From the May 2018 issue

Diagnosing and Minimizing Semantic Drift in Iterative Bootstrapping Extraction

By Zhixu Li, Ying He, Binbin Gu, An Liu, Hongsong Li, Haixun Wang, and Xiaofang Zhou

Featured Article Semantic drift is a common problem in iterative information extraction. Previous approaches for minimizing semantic drift may incur substantial loss in recall. We observe that most semantic drifts are introduced by a small number of questionable extractions in the earlier rounds of iterations. These extractions subsequently introduce a large number of questionable results, which lead to the semantic drift phenomenon. We call these questionable extractions Drifting Points (DPs). If erroneous extractions are the “symptoms” of semantic drift, then DPs are the “causes” of semantic drift. In this paper, we propose a method to minimize semantic drift by identifying the DPs and removing the effect introduced by the DPs. We use isA (concept-instance) extraction as an example to describe our approach in cleaning information extraction errors caused by semantic drift, but we perform experiments on different relation extraction processes on three large real data extraction collections. The experimental results show that our DP cleaning method enables us to clean around 90 percent incorrect instances or patterns with about 90 percent precision, which outperforms the previous approaches we compare with.

download PDF View the PDF of this article      csdl View this issue in the digital library


Editorials and Announcements

Announcements

  • TKDE now offers authors access to Code Ocean. Code Ocean is a cloud-based executable research platform that allows authors to share their algorithms in an effort to make the world’s scientific code more open and reproducible. Learn more or sign up for free.
  • We are pleased to announce that Xuemin Lin, a Scientia Professor in the School of Computer Science and Engineering at the University of New South Wales, Australia, has been named the new Editor-in-Chief of the IEEE Transactions on Knowledge and Data Engineering starting in 2017.

Editorials


Guest Editorials


Reviewers List


Annual Index


Access recently published TKDE articles

RSS Subscribe to the RSS feed of recently published TKDE content

mail icon Sign up for e-mail notifications through IEEE Xplore Content Alerts

preprints icon View TKDE preprints in the Computer Society Digital Library

Computing Now