Normal view MARC view ISBD view

Multi-armed bandits : (Record no. 562450)

000 -LEADER
fixed length control field	05156nam a2200685 i 4500
001 - CONTROL NUMBER
control field	8910671
003 - CONTROL NUMBER IDENTIFIER
control field	IEEE
005 - DATE AND TIME OF LATEST TRANSACTION
control field	20200413152934.0
006 - FIXED-LENGTH DATA ELEMENTS--ADDITIONAL MATERIAL CHARACTERISTICS
fixed length control field	m eo d
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION
fixed length control field	cr bn \|\|\|m\|\|\|a
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field	191127s2020 caua fob 000 0 eng d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number	9781627058711
Qualifying information	electronic
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
Canceled/invalid ISBN	9781681736372
Qualifying information	hardcover
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
Canceled/invalid ISBN	9781627056380
Qualifying information	paperback
024 7# - OTHER STANDARD IDENTIFIER
Standard number or code	10.2200/S00941ED2V01Y201907CNT022
Source of number or code	doi
035 ## - SYSTEM CONTROL NUMBER
System control number	(CaBNVSL)thg00979755
035 ## - SYSTEM CONTROL NUMBER
System control number	(OCoLC)1129092706
040 ## - CATALOGING SOURCE
Original cataloging agency	CaBNVSL
Language of cataloging	eng
Description conventions	rda
Transcribing agency	CaBNVSL
Modifying agency	CaBNVSL
050 #4 - LIBRARY OF CONGRESS CALL NUMBER
Classification number	Q325.5
Item number	.Z536 2020eb
082 04 - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number	006.3/1
Edition number	23
100 1# - MAIN ENTRY--PERSONAL NAME
Personal name	Zhao, Qing
Titles and words associated with a name	(Ph.D. in electrical engineering),
Relator term	author.
245 10 - TITLE STATEMENT
Title	Multi-armed bandits :
Remainder of title	theory and applications to online learning in networks /
Statement of responsibility, etc.	Qing Zhao.
264 #1 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE
Place of production, publication, distribution, manufacture	[San Rafael, California] :
Name of producer, publisher, distributor, manufacturer	Morgan & Claypool,
Date of production, publication, distribution, manufacture, or copyright notice	[2020]
300 ## - PHYSICAL DESCRIPTION
Extent	1 PDF (xviii, 147 pages) :
Other physical details	illustrations.
336 ## - CONTENT TYPE
Content type term	text
Source	rdacontent
337 ## - MEDIA TYPE
Media type term	electronic
Source	isbdmedia
338 ## - CARRIER TYPE
Carrier type term	online resource
Source	rdacarrier
490 1# - SERIES STATEMENT
Series statement	Synthesis lectures on communication networks ,
International Standard Serial Number	1935-4193 ;
Volume/sequential designation	#22
538 ## - SYSTEM DETAILS NOTE
System details note	Mode of access: World Wide Web.
538 ## - SYSTEM DETAILS NOTE
System details note	System requirements: Adobe Acrobat Reader.
500 ## - GENERAL NOTE
General note	Part of: Synthesis digital library of engineering and computer science.
504 ## - BIBLIOGRAPHY, ETC. NOTE
Bibliography, etc. note	Includes bibliographical references (pages 127-145).
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note	1. Introduction -- 1.1. Multi-armed bandit problems -- 1.2. An essential conflict : exploration vs. Exploitation -- 1.3. Two formulations : Bayesian and frequentist -- 1.4. Notation
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note	2. Bayesian bandit model and Gittins index -- 2.1. Markov decision processes -- 2.2. The Bayesian bandit model -- 2.3. Gittins index -- 2.4. Optimality of the Gittins index policy -- 2.5. Computing Gittins index -- 2.6. Semi-Markov bandit processes
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note	3. Variants of the Bayesian bandit model -- 3.1. Necessary assumptions for the index theorem -- 3.2. Variations in the action space -- 3.3. Variations in the system dynamics -- 3.4. Variations in the reward structure -- 3.5. Variations in performance measure
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note	4. Frequentist bandit model -- 4.1. Basic formulations and regret measures -- 4.2. Lower bounds on regret -- 4.3. Online learning algorithms -- 4.4. Connections between Bayesian and frequentist bandit models
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note	5. Variants of the frequentist bandit model -- 5.1. Variations in the reward model -- 5.2. Variations in the action space -- 5.3. Variations in the observation model -- 5.4. Variations in the performance measure -- 5.5. Learning in context : bandits with side information -- 5.6. Learning under competition : bandits with multiple players
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note	6. Application examples -- 6.1. Communication and computer networks -- 6.2. Social-economic networks.
506 ## - RESTRICTIONS ON ACCESS NOTE
Terms governing access	Abstract freely available; full-text restricted to subscribers or individual document purchasers.
510 0# - CITATION/REFERENCES NOTE
Name of source	Compendex
510 0# - CITATION/REFERENCES NOTE
Name of source	INSPEC
510 0# - CITATION/REFERENCES NOTE
Name of source	Google scholar
510 0# - CITATION/REFERENCES NOTE
Name of source	Google book search
520 ## - SUMMARY, ETC.
Summary, etc.	Multi-armed bandit problems pertain to optimal sequential decision making and learning in unknown environments. Since the first bandit problem posed by Thompson in 1933 for the application of clinical trials, bandit problems have enjoyed lasting attention from multiple research communities and have found a wide range of applications across diverse domains. This book covers classic results and recent development on both Bayesian and frequentist bandit problems. We start in Chapter 1 with a brief overview on the history of bandit problems, contrasting the two schools--Bayesian and frequentist--of approaches and highlighting foundational results and key applications. Chapters 2 and 4 cover, respectively, the canonical Bayesian and frequentist bandit models. In Chapters 3 and 5, we discuss major variants of the canonical bandit models that lead to new directions, bring in new techniques, and broaden the applications of this classical problem. In Chapter 6, we present several representative application examples in communication networks and social-economic systems, aiming to illuminate the connections between the Bayesian and the frequentist formulations of bandit problems and how structural results pertaining to one may be leveraged to obtain solutions under the other.
530 ## - ADDITIONAL PHYSICAL FORM AVAILABLE NOTE
Additional physical form available note	Also available in print.
588 ## - SOURCE OF DESCRIPTION NOTE
Source of description note	Title from PDF title page (viewed on November 27, 2019).
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element	Machine learning.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element	Reinforcement learning.
653 ## - INDEX TERM--UNCONTROLLED
Uncontrolled term	multi-armed bandit
653 ## - INDEX TERM--UNCONTROLLED
Uncontrolled term	machine learning
653 ## - INDEX TERM--UNCONTROLLED
Uncontrolled term	online learning
653 ## - INDEX TERM--UNCONTROLLED
Uncontrolled term	reinforcement learning
653 ## - INDEX TERM--UNCONTROLLED
Uncontrolled term	Markov decision processes
655 #0 - INDEX TERM--GENRE/FORM
Genre/form data or focus term	Electronic books.
776 08 - ADDITIONAL PHYSICAL FORM ENTRY
Relationship information	Print version:
International Standard Book Number	9781627056380
--	9781681736372
830 #0 - SERIES ADDED ENTRY--UNIFORM TITLE
Uniform title	Synthesis digital library of engineering and computer science.
830 #0 - SERIES ADDED ENTRY--UNIFORM TITLE
Uniform title	Synthesis lectures on communication networks ;
Volume/sequential designation	#22.
856 40 - ELECTRONIC LOCATION AND ACCESS
Materials specified	Abstract with links to full text
Uniform Resource Identifier	https://doi.org/10.2200/S00941ED2V01Y201907CNT022
856 42 - ELECTRONIC LOCATION AND ACCESS
Materials specified	Abstract with links to resource
Uniform Resource Identifier	https://ieeexplore.ieee.org/servlet/opac?bknumber=8910671

Holdings
Withdrawn status	Lost status	Damaged status	Not for loan	Permanent Location	Current Location	Date acquired	Barcode	Date last seen	Price effective from	Koha item type
				PK Kelkar Library, IIT Kanpur	PK Kelkar Library, IIT Kanpur	2020-04-13	EBKE950	2020-04-13	2020-04-13	E books

PKK Library

Welcome to P K Kelkar Library, Online Public Access Catalogue (OPAC)

Multi-armed bandits : (Record no. 562450)