000 05714nam a22007091i 4500
001 8444554
003 IEEE
005 20200413152927.0
006 m eo d
007 cr cn |||m|||a
008 180829s2018 caua foab 000 0 eng d
020 _a9781681734217
_qebook
020 _z9781681734224
_qhardcover
020 _z9781681734200
_qpaperback
024 7 _a10.2200/S00870ED1V01Y201807DTM050
_2doi
035 _a(CaBNVSL)swl000408652
035 _a(OCoLC)1050334078
040 _aCaBNVSL
_beng
_erda
_cCaBNVSL
_dCaBNVSL
050 4 _aQA76.9.D3
_bG266 2018
082 0 4 _a005.7565
_223
100 1 _aGao, Yunjun,
_eauthor.
245 1 0 _aQuery processing over incomplete databases /
_cYunjun Gao, Xiaoye Miao.
264 1 _a[San Rafael, California] :
_bMorgan & Claypool,
_c2018.
300 _a1 PDF (xv, 106 pages) :
_billustrations.
336 _atext
_2rdacontent
337 _aelectronic
_2isbdmedia
338 _aonline resource
_2rdacarrier
490 1 _aSynthesis lectures on data management,
_x2153-5426 ;
_v# 50
538 _aMode of access: World Wide Web.
538 _aSystem requirements: Adobe Acrobat Reader.
500 _aPart of: Synthesis digital library of engineering and computer science.
504 _aIncludes bibliographical references (pages 87-103).
505 0 _a1. Introduction -- 1.1 Applications of incomplete data management -- 1.2 Overview of incomplete databases -- 1.2.1 Indexing incomplete databases -- 1.2.2 Querying incomplete databases -- 1.2.3 Incomplete database management systems -- 1.3 Challenges of querying incomplete databases -- 1.4 Organization --
505 8 _a2. Handling incomplete data methods -- 2.1 Method taxonomy -- 2.2 Overview of imputation methods -- 2.2.1 Statistical imputation -- 2.2.2 Machine learning-based imputation -- 2.2.3 Modern imputation methods --
505 8 _a3. Query semantics on incomplete data -- 3.1 K-nearest neighbor search on incomplete data -- 3.1.1 Background -- 3.1.2 Problem definition -- 3.2 Skyline queries on incomplete data -- 3.2.1 Background -- 3.2.2 Problem definition -- 3.3 Top-k dominating queries on incomplete data -- 3.3.1 Background -- 3.3.2 Problem definition --
505 8 _a4. Advanced techniques -- 4.1 Index structures -- 4.1.1 Lab index for k-nearest neighbor search on incomplete data -- 4.1.2 Histogram index for k-nearest neighbor search on incomplete data -- 4.1.3 Bitmap index for top-k dominating queries on incomplete data -- 4.2 Pruning heuristics -- 4.2.1 Alpha value pruning for k-nearest neighbor search on incomplete data -- 4.2.2 Histogram-based pruning for k-nearest neighbor search on incomplete data -- 4.2.3 Local skyband pruning for top-k dominating queries on incomplete data -- 4.2.4 Upper bound score pruning for top-k dominating queries on incomplete data -- 4.2.5 Bitmap pruning for top-k dominating queries on incomplete data -- 4.3 Crowdsourcing techniques -- 4.3.1 Crowdsourcing framework for skyline queries on incomplete data -- 4.3.2 C-table construction -- 4.3.3 Probability computation -- 4.3.4 Crowd task selection --
505 8 _a5. Conclusions -- Bibliography -- Authors' biographies.
506 _aAbstract freely available; full-text restricted to subscribers or individual document purchasers.
510 0 _aCompendex
510 0 _aINSPEC
510 0 _aGoogle scholar
510 0 _aGoogle book search
520 3 _aIncomplete data is part of life and almost all areas of scientific studies. Users tend to skip certain fields when they fill out online forms; participants choose to ignore sensitive questions on surveys; sensors fail, resulting in the loss of certain readings; publicly viewable satellite map services have missing data in many mobile applications; and in privacy-preserving applications, the data is incomplete deliberately in order to preserve the sensitivity of some attribute values. Query processing is a fundamental problem in computer science, and is useful in a variety of applications. In this book, we mostly focus on the query processing over incomplete databases, which involves finding a set of qualified objects from a specified incomplete dataset in order to support a wide spectrum of real-life applications. We first elaborate the three general kinds of methods of handling incomplete data, including (i) discarding the data with missing values, (ii) imputation for the missing values, and (iii) just depending on the observed data values. For the third method type, we introduce the semantics of k-nearest neighbor (kNN) search, skyline query, and top-k dominating query on incomplete data, respectively. In terms of the three representative queries over incomplete data, we investigate some advanced techniques to process incomplete data queries, including indexing, pruning as well as crowdsourcing techniques.
530 _aAlso available in print.
588 _aTitle from PDF title page (viewed on August 29, 2018).
650 0 _aQuerying (Computer science)
650 0 _aDatabase searching.
650 0 _aMissing observations (Statistics)
653 _aquery processing
653 _aincomplete data
653 _amissing data
653 _asimilarity search
653 _ak-nearest neighbor search
653 _askyline query
653 _atop-k dominating query
653 _acrowdsourcing
700 1 _aMiao, Xiaoye,
_eauthor.
776 0 8 _iPrint version:
_z9781681734200
_z9781681734224
830 0 _aSynthesis digital library of engineering and computer science.
830 0 _aSynthesis lectures on data management ;
_v# 50.
_x2153-5426
856 4 2 _3Abstract with links to resource
_uhttps://ieeexplore.ieee.org/servlet/opac?bknumber=8444554
999 _c562315
_d562315