Welcome to P K Kelkar Library, Online Public Access Catalogue (OPAC)

Speech Separation by Humans and Machines (Record no. 507075)

000 -LEADER
fixed length control field 04962nam a22004815i 4500
001 - CONTROL NUMBER
control field 978-0-387-22794-8
003 - CONTROL NUMBER IDENTIFIER
control field DE-He213
005 - DATE AND TIME OF LATEST TRANSACTION
control field 20161121231011.0
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION
fixed length control field cr nn 008mamaa
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 100301s2005 xxu| s |||| 0|eng d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9780387227948
-- 978-0-387-22794-8
024 7# - OTHER STANDARD IDENTIFIER
Standard number or code 10.1007/b99695
Source of number or code doi
050 #4 - LIBRARY OF CONGRESS CALL NUMBER
Classification number TK5102.9
050 #4 - LIBRARY OF CONGRESS CALL NUMBER
Classification number TA1637-1638
050 #4 - LIBRARY OF CONGRESS CALL NUMBER
Classification number TK7882.S65
072 #7 - SUBJECT CATEGORY CODE
Subject category code TTBM
Source bicssc
072 #7 - SUBJECT CATEGORY CODE
Subject category code UYS
Source bicssc
072 #7 - SUBJECT CATEGORY CODE
Subject category code TEC008000
Source bisacsh
072 #7 - SUBJECT CATEGORY CODE
Subject category code COM073000
Source bisacsh
082 04 - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number 621.382
Edition number 23
245 10 - TITLE STATEMENT
Title Speech Separation by Humans and Machines
Medium [electronic resource] /
Statement of responsibility, etc. edited by Pierre Divenyi.
264 #1 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE
Place of production, publication, distribution, manufacture Boston, MA :
Name of producer, publisher, distributor, manufacturer Springer US,
Date of production, publication, distribution, manufacture, or copyright notice 2005.
300 ## - PHYSICAL DESCRIPTION
Extent XXIV, 319 p.
Other physical details online resource.
336 ## - CONTENT TYPE
Content type term text
Content type code txt
Source rdacontent
337 ## - MEDIA TYPE
Media type term computer
Media type code c
Source rdamedia
338 ## - CARRIER TYPE
Carrier type term online resource
Carrier type code cr
Source rdacarrier
347 ## - DIGITAL FILE CHARACTERISTICS
File type text file
Encoding format PDF
Source rda
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Speech Segregation: Problems and Perspectives -- Auditory Scene Analysis -- Speech separation -- Recurrent Timing Nets for F0-based Speaker Separation -- Blind Source Separation Using Graphical Models -- Speech Recognizer Based Maximum Likelihood Beamforming -- Exploiting Redundancy to Construct Listening Systems -- Automatic Speech Processing by Inference in Generative Models -- Signal Separation Motivated by Human Auditory Perception: Applications to Automatic Speech Recognition -- Speech Segregation Using an Event-synchronous Auditory Image and STRAIGHT -- Underlying Principles of a High-quality Speech Manipulation System STRAIGHT and Its Application to Speech Segregation -- On Ideal Binary Mask As the Computational Goal of Auditory Scene Analysis -- The History and Future of CASA -- Techniques for Robust Speech Recognition in Noisy and Reverberant Conditions -- Source Separation, Localization, and Comprehension in Humans, Machines, and Human-machine Systems -- The Cancellation Principle in Acoustic Scene Analysis -- Informational and Energetic Masking Effects in Multitalker Speech Perception -- Masking the Feature Information In Multi-stream Speech-analogue Displays -- Interplay Between Visual and Audio Scene Analysis -- Evaluating Speech Separation Systems -- Making Sense of Everyday Speech: a Glimpsing Account.
520 ## - SUMMARY, ETC.
Summary, etc. The "cocktail-party effect" - the ability to focus on one voice in a sea of noises - is a highly sophisticated skill that is usually effortless to listeners but largely impossible for machines. Investigating and unraveling this capacity spans numerous fields including psychology, physiology, engineering, and computer science. All these perspectives are brought together in this volume which, for the first time, provides a comprehensive and authoritative discussion of our understanding of how humans separate speech, and the state of the art in approaching these abilities with machines. This material is drawn from an October 2003 workshop, sponsored by the National Science Foundation, on speech separation. Leading authorities from around the world were invited to present their perspectives and discuss the points of contact to other perspectives. The result is a clear and uniform overview of this problem, and a primer in what is emerging as an important, active and successful area for the development of new techniques and applications. Chapters include historical and current summaries of relevant research in behavioral science, neuroscience and engineering, along with more in-depth descriptions of several of the most exciting current research projects and techniques, including the latest experimental results illuminating how listeners organize the mixtures of sound they hear, and the most powerful and successful signal processing and machine learning techniques for the separation of real-world recordings of sound mixtures by one or more microphones. There is no comparable collection that seeks to bring together the underlying experimental science and the wide variety of technical approaches to give an integrated picture of the problem and solutions to speech separation. Those specializing in speech science, hearing science, neuroscience, or computer science and engineers working on applications such as automatic speech recognition, cochlear implants, hands-free telephones, sound recording, multimedia indexing and retrieval will find Speech Separation by Humans and Machines a useful and inspiring read.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element Engineering.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element User interfaces (Computer systems).
650 14 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element Engineering.
650 24 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element Signal, Image and Speech Processing.
650 24 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element User Interfaces and Human Computer Interaction.
650 24 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element Engineering, general.
700 1# - ADDED ENTRY--PERSONAL NAME
Personal name Divenyi, Pierre.
Relator term editor.
710 2# - ADDED ENTRY--CORPORATE NAME
Corporate name or jurisdiction name as entry element SpringerLink (Online service)
773 0# - HOST ITEM ENTRY
Title Springer eBooks
776 08 - ADDITIONAL PHYSICAL FORM ENTRY
Relationship information Printed edition:
International Standard Book Number 9781402080012
856 40 - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier http://dx.doi.org/10.1007/b99695
912 ## -
-- ZDB-2-ENG
Holdings
Withdrawn status Lost status Damaged status Not for loan Permanent Location Current Location Date acquired Barcode Date last seen Price effective from Koha item type
        PK Kelkar Library, IIT Kanpur PK Kelkar Library, IIT Kanpur 2016-11-21 EBK7362 2016-11-21 2016-11-21 E books

Powered by Koha