000 - LEADER |
fixed length control field |
06362nam a2200769 i 4500 |
001 - CONTROL NUMBER |
control field |
7084069 |
003 - CONTROL NUMBER IDENTIFIER |
control field |
IEEE |
005 - DATE AND TIME OF LATEST TRANSACTION |
control field |
20200413152917.0 |
006 - FIXED-LENGTH DATA ELEMENTS--ADDITIONAL MATERIAL CHARACTERISTICS |
fixed length control field |
m eo d |
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION |
fixed length control field |
cr cn |||m|||a |
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION |
fixed length control field |
150426s2015 caua foab 000 0 eng d |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER |
International Standard Book Number |
9781627056618 |
Qualifying information |
ebook |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER |
Canceled/invalid ISBN |
9781627056601 |
Qualifying information |
print |
024 7# - OTHER STANDARD IDENTIFIER |
Standard number or code |
10.2200/S00625ED1V01Y201502DMK010 |
Source of number or code |
doi |
035 ## - SYSTEM CONTROL NUMBER |
System control number |
(CaBNVSL)swl00404858 |
035 ## - SYSTEM CONTROL NUMBER |
System control number |
(OCoLC)908031780 |
040 ## - CATALOGING SOURCE |
Original cataloging agency |
CaBNVSL |
Language of cataloging |
eng |
Description conventions |
rda |
Transcribing agency |
CaBNVSL |
Modifying agency |
CaBNVSL |
050 #4 - LIBRARY OF CONGRESS CALL NUMBER |
Classification number |
QA76.9.D343 |
Item number |
W255 2015 |
082 04 - DEWEY DECIMAL CLASSIFICATION NUMBER |
Classification number |
006.312 |
Edition number |
23 |
100 1# - MAIN ENTRY--PERSONAL NAME |
Personal name |
Wang, Chi, |
Relator term |
author. |
245 10 - TITLE STATEMENT |
Title |
Mining latent entity structures / |
Statement of responsibility, etc. |
Chi Wang, Jiawei Han. |
264 #1 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE |
Place of production, publication, distribution, manufacture |
San Rafael, California (1537 Fourth Street, San Rafael, CA 94901 USA) : |
Name of producer, publisher, distributor, manufacturer |
Morgan & Claypool, |
Date of production, publication, distribution, manufacture, or copyright notice |
2015. |
300 ## - PHYSICAL DESCRIPTION |
Extent |
1 PDF (xi, 147 pages) : |
Other physical details |
illustrations. |
336 ## - CONTENT TYPE |
Content type term |
text |
Source |
rdacontent |
337 ## - MEDIA TYPE |
Media type term |
electronic |
Source |
isbdmedia |
338 ## - CARRIER TYPE |
Carrier type term |
online resource |
Source |
rdacarrier |
490 1# - SERIES STATEMENT |
Series statement |
Synthesis lectures on data mining and knowledge discovery, |
International Standard Serial Number |
2151-0075 ; |
Volume/sequential designation |
# 10 |
538 ## - SYSTEM DETAILS NOTE |
System details note |
Mode of access: World Wide Web. |
538 ## - SYSTEM DETAILS NOTE |
System details note |
System requirements: Adobe Acrobat Reader. |
500 ## - GENERAL NOTE |
General note |
Part of: Synthesis digital library of engineering and computer science. |
504 ## - BIBLIOGRAPHY, ETC. NOTE |
Bibliography, etc. note |
Includes bibliographical references (pages 141-145). |
505 0# - FORMATTED CONTENTS NOTE |
Formatted contents note |
1. Introduction -- 1.1 Motivation -- 1.2 Data model: a text-rich heterogeneous information network model -- 1.3 Latent entity structure -- 1.4 The mining framework -- 1.4.1 Hierarchical topic and community discovery -- 1.4.2 Topical phrase mining -- 1.4.3 Entity topical role analysis -- 1.4.4 Entity relationship mining -- |
505 8# - FORMATTED CONTENTS NOTE |
Formatted contents note |
2. Hierarchical topic and community discovery -- 2.1 Generative model for text or homogeneous networks -- 2.2 Generative model for heterogeneous network -- 2.2.1 The basic model -- 2.2.2 Learning link-type weights -- 2.2.3 Shape of hierarchy -- 2.3 Empirical analysis -- 2.3.1 Efficacy of subtopic discovery -- 2.3.2 Topical hierarchy quality -- 2.3.3 Case study -- |
505 8# - FORMATTED CONTENTS NOTE |
Formatted contents note |
3. Topical phrase mining -- 3.1 Criteria of good phrases and topical phrases -- 3.2 KERT: mining phrases in short, content-representative text -- 3.2.1 Phrase quality -- 3.2.2 Topical phrase quality -- 3.3 ToPMine: mining phrases in general text -- 3.3.1 Frequent phrase mining -- 3.3.2 Segmentation and phrase filtering -- 3.3.3 Topical phrase ranking -- 3.4 Empirical analysis -- 3.4.1 The impact of the four criteria -- 3.4.2 Comparison of mining methods -- 3.4.3 Scalability -- |
505 8# - FORMATTED CONTENTS NOTE |
Formatted contents note |
4. Entity topical role analysis -- 4.1 Role of given entities -- 4.1.1 Entity specific phrase ranking -- 4.1.2 Distribution over subtopics -- 4.1.3 Case study -- 4.2 Entities of given roles -- |
505 8# - FORMATTED CONTENTS NOTE |
Formatted contents note |
5. Mining entity relations -- 5.1 Unsupervised hierarchical relation mining -- 5.1.1 Notations -- 5.1.2 Assumptions and framework -- 5.1.3 Stage 1: preprocessing -- 5.1.4 Stage 2: TPFG model -- 5.1.5 Model inference -- 5.1.6 Empirical analysis -- 5.2 Supervised hierarchical relation mining -- 5.2.1 Conditional random field for hierarchical relationship -- 5.2.2 Potential function design -- 5.2.3 Model inference and learning -- 5.2.4 Empirical analysis -- 5.3 Semi-supervised co-profiling -- 5.3.1 Observations -- 5.3.2 Model -- 5.3.3 Inference algorithm -- 5.3.4 Empirical analysis -- |
505 8# - FORMATTED CONTENTS NOTE |
Formatted contents note |
6. Scalable and robust topic discovery -- 6.1 Latent Dirichlet allocation with topic tree -- 6.2 The STROD algorithm -- 6.2.1 Moment-based inference -- 6.2.2 Scalability improvement -- 6.2.3 Hyperparameter learning -- 6.3 Empirical analysis -- 6.3.1 Scalability -- 6.3.2 Robustness -- 6.3.3 Interpretability -- |
505 8# - FORMATTED CONTENTS NOTE |
Formatted contents note |
7. Application and research frontier -- 7.1 Application -- 7.1.1 Online analytical processing of information networks -- 7.1.2 Social influence and viral marketing -- 7.1.3 Relevance targeting -- 7.2 Research frontier -- |
505 8# - FORMATTED CONTENTS NOTE |
Formatted contents note |
Bibliography -- Authors' biographies. |
506 1# - RESTRICTIONS ON ACCESS NOTE |
Terms governing access |
Abstract freely available; full-text restricted to subscribers or individual document purchasers. |
510 0# - CITATION/REFERENCES NOTE |
Name of source |
Compendex |
510 0# - CITATION/REFERENCES NOTE |
Name of source |
INSPEC |
510 0# - CITATION/REFERENCES NOTE |
Name of source |
Google Scholar |
510 0# - CITATION/REFERENCES NOTE |
Name of source |
Google Book Search |
520 3# - SUMMARY, ETC. |
Summary, etc. |
The 'big data' era is characterized by an explosion of information in the form of digital data collections, ranging from scientific knowledge, to social media, news, and everyone's daily life. Examples of such collections include scientific publications, enterprise logs, news articles, social media, and general web pages. Valuable knowledge about multi-typed entities is often hidden in the unstructured or loosely structured, interconnected data. Mining latent structures around entities uncovers hidden knowledge such as implicit topics, phrases, entity roles and relationships. In this monograph, we investigate the principles and methodologies of mining latent entity structures from massive unstructured and interconnected data. We propose a text-rich information network model for modeling data in many different domains. This leads to a series of new principles and powerful methodologies for mining latent structures, including (1) latent topical hierarchy, (2) quality topical phrases, (3) entity roles in hierarchical topical communities, and (4) entity relations. This book also introduces applications enabled by the mined structures and points out some promising research directions. |
530 ## - ADDITIONAL PHYSICAL FORM AVAILABLE NOTE |
Additional physical form available note |
Also available in print. |
588 ## - SOURCE OF DESCRIPTION NOTE |
Source of description note |
Title from PDF title page (viewed on April 26, 2015). |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name entry element |
Data mining. |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name entry element |
Latent structure analysis. |
653 ## - INDEX TERM--UNCONTROLLED |
Uncontrolled term |
information networks |
653 ## - INDEX TERM--UNCONTROLLED |
Uncontrolled term |
text mining |
653 ## - INDEX TERM--UNCONTROLLED |
Uncontrolled term |
link analysis |
653 ## - INDEX TERM--UNCONTROLLED |
Uncontrolled term |
topic modeling |
653 ## - INDEX TERM--UNCONTROLLED |
Uncontrolled term |
phrase extraction |
653 ## - INDEX TERM--UNCONTROLLED |
Uncontrolled term |
role discovery |
653 ## - INDEX TERM--UNCONTROLLED |
Uncontrolled term |
clustering |
653 ## - INDEX TERM--UNCONTROLLED |
Uncontrolled term |
ranking |
653 ## - INDEX TERM--UNCONTROLLED |
Uncontrolled term |
relationship mining |
653 ## - INDEX TERM--UNCONTROLLED |
Uncontrolled term |
probabilistic models |
653 ## - INDEX TERM--UNCONTROLLED |
Uncontrolled term |
real-world applications |
653 ## - INDEX TERM--UNCONTROLLED |
Uncontrolled term |
efficient and scalable algorithms |
700 1# - ADDED ENTRY--PERSONAL NAME |
Personal name |
Han, Jiawei, |
Relator term |
author. |
776 08 - ADDITIONAL PHYSICAL FORM ENTRY |
Relationship information |
Print version: |
International Standard Book Number |
9781627056601 |
830 #0 - SERIES ADDED ENTRY--UNIFORM TITLE |
Uniform title |
Synthesis digital library of engineering and computer science. |
830 #0 - SERIES ADDED ENTRY--UNIFORM TITLE |
Uniform title |
Synthesis lectures on data mining and knowledge discovery ; |
Volume/sequential designation |
# 10. |
International Standard Serial Number |
2151-0075 |
856 42 - ELECTRONIC LOCATION AND ACCESS |
Materials specified |
Abstract with links to resource |
Uniform Resource Identifier |
http://ieeexplore.ieee.org/servlet/opac?bknumber=7084069 |