000 | 07280nam a2200733 i 4500 | ||
---|---|---|---|
001 | 7555397 | ||
003 | IEEE | ||
005 | 20200413152922.0 | ||
006 | m eo d | ||
007 | cr cn |||m|||a | ||
008 | 160816s2016 caua foab 001 0 eng d | ||
020 |
_a9781627055024 _qebook |
||
020 |
_z9781627059008 _qprint |
||
024 | 7 |
_a10.2200/S00716ED1V04Y201604HLT033 _2doi |
|
035 | _a(CaBNVSL)swl00406781 | ||
035 | _a(OCoLC)956738395 | ||
040 |
_aCaBNVSL _beng _erda _cCaBNVSL _dCaBNVSL |
||
050 | 4 |
_aP308 _b.W557 2016 |
|
082 | 0 | 4 |
_a418.020285 _223 |
100 | 1 |
_aWilliams, Philip., _eauthor. |
|
245 | 1 | 0 |
_aSyntax-based statistical machine translation / _cPhilip Williams, Rico Sennrich, Matt Post, Philipp Koehn. |
264 | 1 |
_a[San Rafael, California] : _bMorgan & Claypool, _c2016. |
|
300 |
_a1 PDF (xvii, 190 pages) : _billustrations. |
||
336 |
_atext _2rdacontent |
||
337 |
_aelectronic _2isbdmedia |
||
338 |
_aonline resource _2rdacarrier |
||
490 | 1 |
_aSynthesis lectures on human language technologies, _x1947-4059 ; _v# 33 |
|
538 | _aMode of access: World Wide Web. | ||
538 | _aSystem requirements: Adobe Acrobat Reader. | ||
500 | _aPart of: Synthesis digital library of engineering and computer science. | ||
504 | _aIncludes bibliographical references (pages 159-175) and index. | ||
505 | 0 | _a1. Models -- 1.1 Syntactic translation units -- 1.1.1 Phrases -- 1.1.2 Phrases with gaps -- 1.1.3 Phrases with labels -- 1.1.4 Phrases with internal tree structure -- 1.2 Grammar formalisms -- 1.2.1 Context-free grammar -- 1.2.2 Synchronous context-free grammar -- 1.2.3 Synchronous tree-substitution grammar -- 1.2.4 Probabilistic and weighted grammars -- 1.3 Statistical models -- 1.3.1 Generative models -- 1.3.2 Discriminative models -- 1.4 A classification of syntax-based models -- 1.4.1 String-to-string -- 1.4.2 String-to-tree -- 1.4.3 Tree-to-string -- 1.4.4 Tree-to-tree -- 1.5 A brief history of syntax-based SMT -- | |
505 | 8 | _a2. Learning from parallel text -- 2.1 Preliminaries -- 2.2 Hierarchical phrase-based grammar -- 2.2.1 Rule extraction -- 2.2.2 Features -- 2.3 Syntax-augmented grammar -- 2.3.1 Rule extraction -- 2.3.2 Extraction heuristics -- 2.3.3 Features -- 2.4 GHKM -- 2.4.1 Identifying frontier nodes -- 2.4.2 Extracting minimal rules -- 2.4.3 Unaligned source words -- 2.4.4 Composed rules -- 2.4.5 Features -- 2.5 A comparison -- 2.6 Summary -- | |
505 | 8 | _a3. Decoding I: preliminaries -- 3.1 Hypergraphs, forests, and derivations -- 3.1.1 Basic definitions -- 3.1.2 Parse forests -- 3.1.3 Translation forests -- 3.1.4 Derivations -- 3.1.5 Weighted derivations -- 3.2 Algorithms on hypergraphs -- 3.2.1 The topological sort algorithm -- 3.2.2 The Viterbi max-derivation algorithm -- 3.2.3 The CYK max-derivation algorithm -- 3.2.4 The eager and lazy k-best algorithms -- 3.3 Historical notes and further reading -- | |
505 | 8 | _a4. Decoding II: tree decoding -- 4.1 Decoding with local features -- 4.1.1 A basic decoding algorithm -- 4.1.2 Hyperedge bundling -- 4.2 State splitting -- 4.2.1 Adding a bigram language model feature -- 4.2.2 The state-split hypergraph -- 4.2.3 Complexity -- 4.3 Beam search -- 4.3.1 The beam -- 4.3.2 Rest cost estimation -- 4.3.3 Monotonicity redux -- 4.3.4 Exhaustive beam filling -- 4.3.5 Cube pruning -- 4.3.6 Cube growing -- 4.3.7 State refinement -- 4.4 Efficient tree parsing -- 4.5 Tree-to-tree decoding -- 4.6 Historical notes and further reading -- | |
505 | 8 | _a5. Decoding III: string decoding -- 5.1 Basic beam search -- 5.1.1 Parse forest complexity -- 5.2 Faster beam search -- 5.2.1 Constrained width parsing -- 5.2.2 Per-subspan beam search -- 5.3 Handling non-binary grammars -- 5.3.1 Binarization -- 5.3.2 Alternatives to binarization -- 5.4 Interim summary -- 5.5 Parsing algorithms -- 5.5.1 The CYK+ algorithm -- 5.5.2 Trie-based grammar storage -- 5.5.3 The recursive CYK+ algorithm -- 5.6 STSG and distinct-category SCFG -- 5.6.1 STSG -- 5.6.2 Distinct-category SCFG -- 5.7 Historical notes and further reading -- | |
505 | 8 | _a6. Selected topics -- 6.1 Transformations on trees -- 6.1.1 Tree restructuring -- 6.1.2 Tree re-labeling -- 6.1.3 Fuzzy syntax -- 6.1.4 Forest-based approaches -- 6.1.5 Beyond context-free models -- 6.2 Dependency structure -- 6.2.1 Dependency treelet translation -- 6.2.2 String-to-dependency SMT -- 6.3 Improving grammaticality -- 6.3.1 Agreement -- 6.3.2 Subcategorization -- 6.3.3 Morphological structure in synchronous grammars -- 6.3.4 Syntactic language models -- 6.4 Evaluation metrics -- | |
505 | 8 | _a7. Closing remarks -- 7.1 Which approach is best? -- 7.2 What's next? -- | |
505 | 8 | _aA. Open-source tools -- Bibliography -- Authors' biographies -- Author index -- Index. | |
506 | 1 | _aAbstract freely available; full-text restricted to subscribers or individual document purchasers. | |
510 | 0 | _aCompendex | |
510 | 0 | _aINSPEC | |
510 | 0 | _aGoogle scholar | |
510 | 0 | _aGoogle book search | |
520 | 3 | _aThis unique book provides a comprehensive introduction to the most popular syntax-based statistical machine translation models, filling a gap in the current literature for researchers and developers in human language technologies. While phrase-based models have previously dominated the field, syntax-based approaches have proved a popular alternative, as they elegantly solve many of the shortcomings of phrase-based models. The heart of this book is a detailed introduction to decoding for syntax-based models. The book begins with an overview of synchronous-context free grammar (SCFG) and synchronous tree-substitution grammar (STSG) along with their associated statistical models. It also describes how three popular instantiations (Hiero, SAMT, and GHKM) are learned from parallel corpora. It introduces and details hypergraphs and associated general algorithms, as well as algorithms for decoding with both tree and string input. Special attention is given to efficiency, including search approximations such as beam search and cube pruning, data structures, and parsing algorithms. The book consistently highlights the strengths (and limitations) of syntax-based approaches, including their ability to generalize phrase-based translation units, their modeling of specific linguistic phenomena, and their function of structuring the search space. | |
530 | _aAlso available in print. | ||
588 | _aTitle from PDF title page (viewed on August 16, 2016). | ||
650 | 0 | _aMachine translating. | |
650 | 0 |
_aTranslating and interpreting _xData processing. |
|
653 | _astatistical machine translation | ||
653 | _asyntax | ||
653 | _asynchronous grammar formalisms | ||
653 | _anatural language processing | ||
653 | _acomputational linguistics | ||
653 | _amachine learning | ||
653 | _astatistical modeling | ||
700 | 1 |
_aSennrich, Rico., _eauthor. |
|
700 | 1 |
_aPost, Matt., _eauthor. |
|
700 | 1 |
_aKoehn, Philipp., _eauthor. |
|
776 | 0 | 8 |
_iPrint version: _z9781627059008 |
830 | 0 | _aSynthesis digital library of engineering and computer science. | |
830 | 0 |
_aSynthesis lectures on human language technologies ; _v# 33. _x1947-4059 |
|
856 | 4 | 2 |
_3Abstract with links to resource _uhttp://ieeexplore.ieee.org/servlet/opac?bknumber=7555397 |
999 |
_c562222 _d562222 |