MARC View

000			05714nam a22007091i 4500
001			8444554
003			IEEE
005			20200413152927.0
006			m eo d
007			cr cn \|\|\|m\|\|\|a
008			180829s2018 caua foab 000 0 eng d
020			_a9781681734217 _qebook
020			_z9781681734224 _qhardcover
020			_z9781681734200 _qpaperback
024	7		_a10.2200/S00870ED1V01Y201807DTM050 _2doi
035			_a(CaBNVSL)swl000408652
035			_a(OCoLC)1050334078
040			_aCaBNVSL _beng _erda _cCaBNVSL _dCaBNVSL
050		4	_aQA76.9.D3 _bG266 2018
082	0	4	_a005.7565 _223
100	1		_aGao, Yunjun, _eauthor.
245	1	0	_aQuery processing over incomplete databases / _cYunjun Gao, Xiaoye Miao.
264		1	_a[San Rafael, California] : _bMorgan & Claypool, _c2018.
300			_a1 PDF (xv, 106 pages) : _billustrations.
336			_atext _2rdacontent
337			_aelectronic _2isbdmedia
338			_aonline resource _2rdacarrier
490	1		_aSynthesis lectures on data management, _x2153-5426 ; _v# 50
538			_aMode of access: World Wide Web.
538			_aSystem requirements: Adobe Acrobat Reader.
500			_aPart of: Synthesis digital library of engineering and computer science.
504			_aIncludes bibliographical references (pages 87-103).
505	0		_a1. Introduction -- 1.1 Applications of incomplete data management -- 1.2 Overview of incomplete databases -- 1.2.1 Indexing incomplete databases -- 1.2.2 Querying incomplete databases -- 1.2.3 Incomplete database management systems -- 1.3 Challenges of querying incomplete databases -- 1.4 Organization --
505	8		_a2. Handling incomplete data methods -- 2.1 Method taxonomy -- 2.2 Overview of imputation methods -- 2.2.1 Statistical imputation -- 2.2.2 Machine learning-based imputation -- 2.2.3 Modern imputation methods --
505	8		_a3. Query semantics on incomplete data -- 3.1 K-nearest neighbor search on incomplete data -- 3.1.1 Background -- 3.1.2 Problem definition -- 3.2 Skyline queries on incomplete data -- 3.2.1 Background -- 3.2.2 Problem definition -- 3.3 Top-k dominating queries on incomplete data -- 3.3.1 Background -- 3.3.2 Problem definition --
505	8		_a4. Advanced techniques -- 4.1 Index structures -- 4.1.1 Lab index for k-nearest neighbor search on incomplete data -- 4.1.2 Histogram index for k-nearest neighbor search on incomplete data -- 4.1.3 Bitmap index for top-k dominating queries on incomplete data -- 4.2 Pruning heuristics -- 4.2.1 Alpha value pruning for k-nearest neighbor search on incomplete data -- 4.2.2 Histogram-based pruning for k-nearest neighbor search on incomplete data -- 4.2.3 Local skyband pruning for top-k dominating queries on incomplete data -- 4.2.4 Upper bound score pruning for top-k dominating queries on incomplete data -- 4.2.5 Bitmap pruning for top-k dominating queries on incomplete data -- 4.3 Crowdsourcing techniques -- 4.3.1 Crowdsourcing framework for skyline queries on incomplete data -- 4.3.2 C-table construction -- 4.3.3 Probability computation -- 4.3.4 Crowd task selection --
505	8		_a5. Conclusions -- Bibliography -- Authors' biographies.
506			_aAbstract freely available; full-text restricted to subscribers or individual document purchasers.
510	0		_aCompendex
510	0		_aINSPEC
510	0		_aGoogle scholar
510	0		_aGoogle book search
520	3		_aIncomplete data is part of life and almost all areas of scientific studies. Users tend to skip certain fields when they fill out online forms; participants choose to ignore sensitive questions on surveys; sensors fail, resulting in the loss of certain readings; publicly viewable satellite map services have missing data in many mobile applications; and in privacy-preserving applications, the data is incomplete deliberately in order to preserve the sensitivity of some attribute values. Query processing is a fundamental problem in computer science, and is useful in a variety of applications. In this book, we mostly focus on the query processing over incomplete databases, which involves finding a set of qualified objects from a specified incomplete dataset in order to support a wide spectrum of real-life applications. We first elaborate the three general kinds of methods of handling incomplete data, including (i) discarding the data with missing values, (ii) imputation for the missing values, and (iii) just depending on the observed data values. For the third method type, we introduce the semantics of k-nearest neighbor (kNN) search, skyline query, and top-k dominating query on incomplete data, respectively. In terms of the three representative queries over incomplete data, we investigate some advanced techniques to process incomplete data queries, including indexing, pruning as well as crowdsourcing techniques.
530			_aAlso available in print.
588			_aTitle from PDF title page (viewed on August 29, 2018).
650		0	_aQuerying (Computer science)
650		0	_aDatabase searching.
650		0	_aMissing observations (Statistics)
653			_aquery processing
653			_aincomplete data
653			_amissing data
653			_asimilarity search
653			_ak-nearest neighbor search
653			_askyline query
653			_atop-k dominating query
653			_acrowdsourcing
700	1		_aMiao, Xiaoye, _eauthor.
776	0	8	_iPrint version: _z9781681734200 _z9781681734224
830		0	_aSynthesis digital library of engineering and computer science.
830		0	_aSynthesis lectures on data management ; _v# 50. _x2153-5426
856	4	2	_3Abstract with links to resource _uhttps://ieeexplore.ieee.org/servlet/opac?bknumber=8444554
999			_c562315 _d562315