Welcome to P K Kelkar Library, Online Public Access Catalogue (OPAC)

Syntax-based statistical machine translation / (Record no. 562222)

000 -LEADER
fixed length control field 07280nam a2200733 i 4500
001 - CONTROL NUMBER
control field 7555397
003 - CONTROL NUMBER IDENTIFIER
control field IEEE
005 - DATE AND TIME OF LATEST TRANSACTION
control field 20200413152922.0
006 - FIXED-LENGTH DATA ELEMENTS--ADDITIONAL MATERIAL CHARACTERISTICS
fixed length control field m eo d
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION
fixed length control field cr cn |||m|||a
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 160816s2016 caua foab 001 0 eng d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9781627055024
Qualifying information ebook
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
Canceled/invalid ISBN 9781627059008
Qualifying information print
024 7# - OTHER STANDARD IDENTIFIER
Standard number or code 10.2200/S00716ED1V04Y201604HLT033
Source of number or code doi
035 ## - SYSTEM CONTROL NUMBER
System control number (CaBNVSL)swl00406781
035 ## - SYSTEM CONTROL NUMBER
System control number (OCoLC)956738395
040 ## - CATALOGING SOURCE
Original cataloging agency CaBNVSL
Language of cataloging eng
Description conventions rda
Transcribing agency CaBNVSL
Modifying agency CaBNVSL
050 #4 - LIBRARY OF CONGRESS CALL NUMBER
Classification number P308
Item number .W557 2016
082 04 - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number 418.020285
Edition number 23
100 1# - MAIN ENTRY--PERSONAL NAME
Personal name Williams, Philip.,
Relator term author.
245 10 - TITLE STATEMENT
Title Syntax-based statistical machine translation /
Statement of responsibility, etc. Philip Williams, Rico Sennrich, Matt Post, Philipp Koehn.
264 #1 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE
Place of production, publication, distribution, manufacture [San Rafael, California] :
Name of producer, publisher, distributor, manufacturer Morgan & Claypool,
Date of production, publication, distribution, manufacture, or copyright notice 2016.
300 ## - PHYSICAL DESCRIPTION
Extent 1 PDF (xvii, 190 pages) :
Other physical details illustrations.
336 ## - CONTENT TYPE
Content type term text
Source rdacontent
337 ## - MEDIA TYPE
Media type term electronic
Source isbdmedia
338 ## - CARRIER TYPE
Carrier type term online resource
Source rdacarrier
490 1# - SERIES STATEMENT
Series statement Synthesis lectures on human language technologies,
International Standard Serial Number 1947-4059 ;
Volume/sequential designation # 33
538 ## - SYSTEM DETAILS NOTE
System details note Mode of access: World Wide Web.
538 ## - SYSTEM DETAILS NOTE
System details note System requirements: Adobe Acrobat Reader.
500 ## - GENERAL NOTE
General note Part of: Synthesis digital library of engineering and computer science.
504 ## - BIBLIOGRAPHY, ETC. NOTE
Bibliography, etc. note Includes bibliographical references (pages 159-175) and index.
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note 1. Models -- 1.1 Syntactic translation units -- 1.1.1 Phrases -- 1.1.2 Phrases with gaps -- 1.1.3 Phrases with labels -- 1.1.4 Phrases with internal tree structure -- 1.2 Grammar formalisms -- 1.2.1 Context-free grammar -- 1.2.2 Synchronous context-free grammar -- 1.2.3 Synchronous tree-substitution grammar -- 1.2.4 Probabilistic and weighted grammars -- 1.3 Statistical models -- 1.3.1 Generative models -- 1.3.2 Discriminative models -- 1.4 A classification of syntax-based models -- 1.4.1 String-to-string -- 1.4.2 String-to-tree -- 1.4.3 Tree-to-string -- 1.4.4 Tree-to-tree -- 1.5 A brief history of syntax-based SMT --
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note 2. Learning from parallel text -- 2.1 Preliminaries -- 2.2 Hierarchical phrase-based grammar -- 2.2.1 Rule extraction -- 2.2.2 Features -- 2.3 Syntax-augmented grammar -- 2.3.1 Rule extraction -- 2.3.2 Extraction heuristics -- 2.3.3 Features -- 2.4 GHKM -- 2.4.1 Identifying frontier nodes -- 2.4.2 Extracting minimal rules -- 2.4.3 Unaligned source words -- 2.4.4 Composed rules -- 2.4.5 Features -- 2.5 A comparison -- 2.6 Summary --
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note 3. Decoding I: preliminaries -- 3.1 Hypergraphs, forests, and derivations -- 3.1.1 Basic definitions -- 3.1.2 Parse forests -- 3.1.3 Translation forests -- 3.1.4 Derivations -- 3.1.5 Weighted derivations -- 3.2 Algorithms on hypergraphs -- 3.2.1 The topological sort algorithm -- 3.2.2 The Viterbi max-derivation algorithm -- 3.2.3 The CYK max-derivation algorithm -- 3.2.4 The eager and lazy k-best algorithms -- 3.3 Historical notes and further reading --
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note 4. Decoding II: tree decoding -- 4.1 Decoding with local features -- 4.1.1 A basic decoding algorithm -- 4.1.2 Hyperedge bundling -- 4.2 State splitting -- 4.2.1 Adding a bigram language model feature -- 4.2.2 The state-split hypergraph -- 4.2.3 Complexity -- 4.3 Beam search -- 4.3.1 The beam -- 4.3.2 Rest cost estimation -- 4.3.3 Monotonicity redux -- 4.3.4 Exhaustive beam filling -- 4.3.5 Cube pruning -- 4.3.6 Cube growing -- 4.3.7 State refinement -- 4.4 Efficient tree parsing -- 4.5 Tree-to-tree decoding -- 4.6 Historical notes and further reading --
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note 5. Decoding III: string decoding -- 5.1 Basic beam search -- 5.1.1 Parse forest complexity -- 5.2 Faster beam search -- 5.2.1 Constrained width parsing -- 5.2.2 Per-subspan beam search -- 5.3 Handling non-binary grammars -- 5.3.1 Binarization -- 5.3.2 Alternatives to binarization -- 5.4 Interim summary -- 5.5 Parsing algorithms -- 5.5.1 The CYK+ algorithm -- 5.5.2 Trie-based grammar storage -- 5.5.3 The recursive CYK+ algorithm -- 5.6 STSG and distinct-category SCFG -- 5.6.1 STSG -- 5.6.2 Distinct-category SCFG -- 5.7 Historical notes and further reading --
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note 6. Selected topics -- 6.1 Transformations on trees -- 6.1.1 Tree restructuring -- 6.1.2 Tree re-labeling -- 6.1.3 Fuzzy syntax -- 6.1.4 Forest-based approaches -- 6.1.5 Beyond context-free models -- 6.2 Dependency structure -- 6.2.1 Dependency treelet translation -- 6.2.2 String-to-dependency SMT -- 6.3 Improving grammaticality -- 6.3.1 Agreement -- 6.3.2 Subcategorization -- 6.3.3 Morphological structure in synchronous grammars -- 6.3.4 Syntactic language models -- 6.4 Evaluation metrics --
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note 7. Closing remarks -- 7.1 Which approach is best? -- 7.2 What's next? --
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note A. Open-source tools -- Bibliography -- Authors' biographies -- Author index -- Index.
506 1# - RESTRICTIONS ON ACCESS NOTE
Terms governing access Abstract freely available; full-text restricted to subscribers or individual document purchasers.
510 0# - CITATION/REFERENCES NOTE
Name of source Compendex
510 0# - CITATION/REFERENCES NOTE
Name of source INSPEC
510 0# - CITATION/REFERENCES NOTE
Name of source Google scholar
510 0# - CITATION/REFERENCES NOTE
Name of source Google book search
520 3# - SUMMARY, ETC.
Summary, etc. This unique book provides a comprehensive introduction to the most popular syntax-based statistical machine translation models, filling a gap in the current literature for researchers and developers in human language technologies. While phrase-based models have previously dominated the field, syntax-based approaches have proved a popular alternative, as they elegantly solve many of the shortcomings of phrase-based models. The heart of this book is a detailed introduction to decoding for syntax-based models. The book begins with an overview of synchronous-context free grammar (SCFG) and synchronous tree-substitution grammar (STSG) along with their associated statistical models. It also describes how three popular instantiations (Hiero, SAMT, and GHKM) are learned from parallel corpora. It introduces and details hypergraphs and associated general algorithms, as well as algorithms for decoding with both tree and string input. Special attention is given to efficiency, including search approximations such as beam search and cube pruning, data structures, and parsing algorithms. The book consistently highlights the strengths (and limitations) of syntax-based approaches, including their ability to generalize phrase-based translation units, their modeling of specific linguistic phenomena, and their function of structuring the search space.
530 ## - ADDITIONAL PHYSICAL FORM AVAILABLE NOTE
Additional physical form available note Also available in print.
588 ## - SOURCE OF DESCRIPTION NOTE
Source of description note Title from PDF title page (viewed on August 16, 2016).
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element Machine translating.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name entry element Translating and interpreting
General subdivision Data processing.
653 ## - INDEX TERM--UNCONTROLLED
Uncontrolled term statistical machine translation
653 ## - INDEX TERM--UNCONTROLLED
Uncontrolled term syntax
653 ## - INDEX TERM--UNCONTROLLED
Uncontrolled term synchronous grammar formalisms
653 ## - INDEX TERM--UNCONTROLLED
Uncontrolled term natural language processing
653 ## - INDEX TERM--UNCONTROLLED
Uncontrolled term computational linguistics
653 ## - INDEX TERM--UNCONTROLLED
Uncontrolled term machine learning
653 ## - INDEX TERM--UNCONTROLLED
Uncontrolled term statistical modeling
700 1# - ADDED ENTRY--PERSONAL NAME
Personal name Sennrich, Rico.,
Relator term author.
700 1# - ADDED ENTRY--PERSONAL NAME
Personal name Post, Matt.,
Relator term author.
700 1# - ADDED ENTRY--PERSONAL NAME
Personal name Koehn, Philipp.,
Relator term author.
776 08 - ADDITIONAL PHYSICAL FORM ENTRY
Relationship information Print version:
International Standard Book Number 9781627059008
830 #0 - SERIES ADDED ENTRY--UNIFORM TITLE
Uniform title Synthesis digital library of engineering and computer science.
830 #0 - SERIES ADDED ENTRY--UNIFORM TITLE
Uniform title Synthesis lectures on human language technologies ;
Volume/sequential designation # 33.
International Standard Serial Number 1947-4059
856 42 - ELECTRONIC LOCATION AND ACCESS
Materials specified Abstract with links to resource
Uniform Resource Identifier http://ieeexplore.ieee.org/servlet/opac?bknumber=7555397
Holdings
Withdrawn status Lost status Damaged status Not for loan Permanent Location Current Location Date acquired Barcode Date last seen Price effective from Koha item type
        PK Kelkar Library, IIT Kanpur PK Kelkar Library, IIT Kanpur 2020-04-13 EBKE722 2020-04-13 2020-04-13 E books

Powered by Koha