000 | 05714nam a22007091i 4500 | ||
---|---|---|---|
001 | 8444554 | ||
003 | IEEE | ||
005 | 20200413152927.0 | ||
006 | m eo d | ||
007 | cr cn |||m|||a | ||
008 | 180829s2018 caua foab 000 0 eng d | ||
020 |
_a9781681734217 _qebook |
||
020 |
_z9781681734224 _qhardcover |
||
020 |
_z9781681734200 _qpaperback |
||
024 | 7 |
_a10.2200/S00870ED1V01Y201807DTM050 _2doi |
|
035 | _a(CaBNVSL)swl000408652 | ||
035 | _a(OCoLC)1050334078 | ||
040 |
_aCaBNVSL _beng _erda _cCaBNVSL _dCaBNVSL |
||
050 | 4 |
_aQA76.9.D3 _bG266 2018 |
|
082 | 0 | 4 |
_a005.7565 _223 |
100 | 1 |
_aGao, Yunjun, _eauthor. |
|
245 | 1 | 0 |
_aQuery processing over incomplete databases / _cYunjun Gao, Xiaoye Miao. |
264 | 1 |
_a[San Rafael, California] : _bMorgan & Claypool, _c2018. |
|
300 |
_a1 PDF (xv, 106 pages) : _billustrations. |
||
336 |
_atext _2rdacontent |
||
337 |
_aelectronic _2isbdmedia |
||
338 |
_aonline resource _2rdacarrier |
||
490 | 1 |
_aSynthesis lectures on data management, _x2153-5426 ; _v# 50 |
|
538 | _aMode of access: World Wide Web. | ||
538 | _aSystem requirements: Adobe Acrobat Reader. | ||
500 | _aPart of: Synthesis digital library of engineering and computer science. | ||
504 | _aIncludes bibliographical references (pages 87-103). | ||
505 | 0 | _a1. Introduction -- 1.1 Applications of incomplete data management -- 1.2 Overview of incomplete databases -- 1.2.1 Indexing incomplete databases -- 1.2.2 Querying incomplete databases -- 1.2.3 Incomplete database management systems -- 1.3 Challenges of querying incomplete databases -- 1.4 Organization -- | |
505 | 8 | _a2. Handling incomplete data methods -- 2.1 Method taxonomy -- 2.2 Overview of imputation methods -- 2.2.1 Statistical imputation -- 2.2.2 Machine learning-based imputation -- 2.2.3 Modern imputation methods -- | |
505 | 8 | _a3. Query semantics on incomplete data -- 3.1 K-nearest neighbor search on incomplete data -- 3.1.1 Background -- 3.1.2 Problem definition -- 3.2 Skyline queries on incomplete data -- 3.2.1 Background -- 3.2.2 Problem definition -- 3.3 Top-k dominating queries on incomplete data -- 3.3.1 Background -- 3.3.2 Problem definition -- | |
505 | 8 | _a4. Advanced techniques -- 4.1 Index structures -- 4.1.1 Lab index for k-nearest neighbor search on incomplete data -- 4.1.2 Histogram index for k-nearest neighbor search on incomplete data -- 4.1.3 Bitmap index for top-k dominating queries on incomplete data -- 4.2 Pruning heuristics -- 4.2.1 Alpha value pruning for k-nearest neighbor search on incomplete data -- 4.2.2 Histogram-based pruning for k-nearest neighbor search on incomplete data -- 4.2.3 Local skyband pruning for top-k dominating queries on incomplete data -- 4.2.4 Upper bound score pruning for top-k dominating queries on incomplete data -- 4.2.5 Bitmap pruning for top-k dominating queries on incomplete data -- 4.3 Crowdsourcing techniques -- 4.3.1 Crowdsourcing framework for skyline queries on incomplete data -- 4.3.2 C-table construction -- 4.3.3 Probability computation -- 4.3.4 Crowd task selection -- | |
505 | 8 | _a5. Conclusions -- Bibliography -- Authors' biographies. | |
506 | _aAbstract freely available; full-text restricted to subscribers or individual document purchasers. | ||
510 | 0 | _aCompendex | |
510 | 0 | _aINSPEC | |
510 | 0 | _aGoogle scholar | |
510 | 0 | _aGoogle book search | |
520 | 3 | _aIncomplete data is part of life and almost all areas of scientific studies. Users tend to skip certain fields when they fill out online forms; participants choose to ignore sensitive questions on surveys; sensors fail, resulting in the loss of certain readings; publicly viewable satellite map services have missing data in many mobile applications; and in privacy-preserving applications, the data is incomplete deliberately in order to preserve the sensitivity of some attribute values. Query processing is a fundamental problem in computer science, and is useful in a variety of applications. In this book, we mostly focus on the query processing over incomplete databases, which involves finding a set of qualified objects from a specified incomplete dataset in order to support a wide spectrum of real-life applications. We first elaborate the three general kinds of methods of handling incomplete data, including (i) discarding the data with missing values, (ii) imputation for the missing values, and (iii) just depending on the observed data values. For the third method type, we introduce the semantics of k-nearest neighbor (kNN) search, skyline query, and top-k dominating query on incomplete data, respectively. In terms of the three representative queries over incomplete data, we investigate some advanced techniques to process incomplete data queries, including indexing, pruning as well as crowdsourcing techniques. | |
530 | _aAlso available in print. | ||
588 | _aTitle from PDF title page (viewed on August 29, 2018). | ||
650 | 0 | _aQuerying (Computer science) | |
650 | 0 | _aDatabase searching. | |
650 | 0 | _aMissing observations (Statistics) | |
653 | _aquery processing | ||
653 | _aincomplete data | ||
653 | _amissing data | ||
653 | _asimilarity search | ||
653 | _ak-nearest neighbor search | ||
653 | _askyline query | ||
653 | _atop-k dominating query | ||
653 | _acrowdsourcing | ||
700 | 1 |
_aMiao, Xiaoye, _eauthor. |
|
776 | 0 | 8 |
_iPrint version: _z9781681734200 _z9781681734224 |
830 | 0 | _aSynthesis digital library of engineering and computer science. | |
830 | 0 |
_aSynthesis lectures on data management ; _v# 50. _x2153-5426 |
|
856 | 4 | 2 |
_3Abstract with links to resource _uhttps://ieeexplore.ieee.org/servlet/opac?bknumber=8444554 |
999 |
_c562315 _d562315 |