Narrow your search
Listing 1 - 10 of 28 << page
of 3
>>
Sort by
RIDE-SDMA 2005: 15th International Workshop on Research Issues in Data Engineering (03-07 April 2005/Tokyo, Japan)
Author:
ISBN: 0769523900 1538603349 Year: 2005 Publisher: [Place of publication not identified] IEEE Computer Society Press

Loading...
Export citation

Choose an application

Bookmark

Abstract

Keywords

Data mining : concepts and techniques
Authors: ---
ISBN: 1558609016 9781558609013 Year: 2006 Publisher: Amsterdam Boston : San Francisco : Elsevier Morgan Kaufmann,

Data mining
Authors: ---
ISBN: 1282665863 9786612665868 0080475582 9780080475585 9781558609013 1558609016 Year: 2006 Publisher: Amsterdam Boston San Francisco, CA Elsevier Morgan Kaufmann

Loading...
Export citation

Choose an application

Bookmark

Abstract

Our ability to generate and collect data has been increasing rapidly. Not only are all of our business, scientific, and government transactions now computerized, but the widespread use of digital cameras, publication tools, and bar codes also generate data. On the collection side, scanned text and image platforms, satellite remote sensing systems, and the World Wide Web have flooded us with a tremendous amount of data. This explosive growth has generated an even more urgent need for new techniques and automated tools that can help us transform this data into useful information and knowledge.


Book
Mining Structures of Factual Knowledge from Text : An Effort-Light Approach
Authors: ---
ISBN: 3031019121 3031001079 3031007840 Year: 2018 Publisher: Cham : Springer International Publishing : Imprint: Springer,

Loading...
Export citation

Choose an application

Bookmark

Abstract

The real-world data, though massive, is largely unstructured, in the form of natural-language text. It is challenging but highly desirable to mine structures from massive text data, without extensive human annotation and labeling. In this book, we investigate the principles and methodologies of mining structures of factual knowledge (e.g., entities and their relationships) from massive, unstructured text corpora. Departing from many existing structure extraction methods that have heavy reliance on human annotated data for model training, our effort-light approach leverages human-curated facts stored in external knowledge bases as distant supervision and exploits rich data redundancy in large text corpora for context understanding. This effort-light mining approach leads to a series of new principles and powerful methodologies for structuring text corpora, including (1) entity recognition, typing and synonym discovery, (2) entity relation extraction, and (3) open-domain attribute-value mining and information extraction. This book introduces this new research frontier and points out some promising research directions.


Book
Multidimensional Mining of Massive Text Data
Authors: ---
ISBN: 3031019148 3031001095 3031007867 Year: 2019 Publisher: Cham : Springer International Publishing : Imprint: Springer,

Loading...
Export citation

Choose an application

Bookmark

Abstract

Unstructured text, as one of the most important data forms, plays a crucial role in data-driven decision making in domains ranging from social networking and information retrieval to scientific research and healthcare informatics. In many emerging applications, people's information need from text data is becoming multidimensional—they demand useful insights along multiple aspects from a text corpus. However, acquiring such multidimensional knowledge from massive text data remains a challenging task. This book presents data mining techniques that turn unstructured text data into multidimensional knowledge. We investigate two core questions. (1) How does one identify task-relevant text data with declarative queries in multiple dimensions? (2) How does one distill knowledge from text data in a multidimensional space? To address the above questions, we develop a text cube framework. First, we develop a cube construction module that organizes unstructured data into a cube structure, by discovering latent multidimensional and multi-granular structure from the unstructured text corpus and allocating documents into the structure. Second, we develop a cube exploitation module that models multiple dimensions in the cube space, thereby distilling from user-selected data multidimensional knowledge. Together, these two modules constitute an integrated pipeline: leveraging the cube structure, users can perform multidimensional, multigranular data selection with declarative queries; and with cube exploitation algorithms, users can extract multidimensional patterns from the selected data for decision making. The proposed framework has two distinctive advantages when turning text data into multidimensional knowledge: flexibility and label-efficiency. First, it enables acquiring multidimensional knowledge flexibly, as the cube structure allows users to easily identify task-relevant data along multiple dimensions at varied granularities and further distill multidimensional knowledge. Second, the algorithms for cube construction and exploitation require little supervision; this makes the framework appealing for many applications where labeled data are expensive to obtain.


Book
Automated taxonomy discovery and exploration
Authors: ---
ISBN: 3031114051 3031114043 Year: 2022 Publisher: Cham, Switzerland : Springer,

Loading...
Export citation

Choose an application

Bookmark

Abstract


Book
Mining heterogeneous information networks
Authors: ---
ISBN: 9781608458813 9781608458806 Year: 2012 Publisher: San Rafael, Calif. (1537 Fourth Street, San Rafael, CA 94901 USA) Morgan & Claypool

Loading...
Export citation

Choose an application

Bookmark

Abstract

Keywords


Book
Mining latent entity structures
Authors: ---
ISBN: 9781627056618 9781627056601 Year: 2015 Publisher: San Rafael, California (1537 Fourth Street, San Rafael, CA 94901 USA) Morgan & Claypool

Loading...
Export citation

Choose an application

Bookmark

Abstract

Keywords


Book
Mining structures of factual knowledge from text
Authors: ---
ISBN: 9781681733937 9781681733944 9781681733920 Year: 2018 Publisher: [San Rafael, California] Morgan & Claypool

Loading...
Export citation

Choose an application

Bookmark

Abstract

Keywords

Listing 1 - 10 of 28 << page
of 3
>>
Sort by