Cover image for Computational approaches to morphology and syntax
Title:
Computational approaches to morphology and syntax
Personal Author:
Series:
Oxford surveys in syntax and morphology
Publication Information:
New York, NY. : OUP, 2007
Physical Description:
xx, 316 p. : ill. ; 26 cm.
ISBN:
9780199274772

Available:*

Library
Item Barcode
Call Number
Material Type
Item Category 1
Status
Searching...
30000010184048 P98 R63 2007 Open Access Book Book
Searching...

On Order

Summary

Summary

The book will appeal to scholars and advanced students of morphology, syntax, computational linguistics and natural language processing (NLP). It provides a critical and practical guide to computational techniques for handling morphological and syntactic phenomena, showing how these techniques have been used and modified in practice. The authors discuss the nature and uses of syntactic parsers and examine the problems and opportunities of parsing algorithms for finite-state, context-free and various context-sensitive grammars. They relate approaches for describing syntax and morphology to formal mechanisms and algorithms, and present well-motivated approaches for augmenting grammars with weights or probabilities.


Author Notes

Brian Roark is Assistant Professor in the Department of Computer Science Electrical Engineering and the Center for Spoken Language Understanding at Oregon Health Science University. He has published papers in Computer Speech and Language, Speech Communication, Natural Language Engineering, and Computational Linguistics.
Richard Sproat is Professor of Linguistics and Electrical and Computer Engineering at the University of Illinois at Urbana-Champaign and also holds an appointment at the Beckman Institute for Advanced Science and Technology. His most recent book is A Computational Theory of Writing Systems (CUP, 2000).


Table of Contents

General prefacep. ix
Prefacep. x
List of Figuresp. xii
List of Tablesp. xv
Abbreviationsp. xvii
1 Introduction and Preliminariesp. 1
1.1 Introductionp. 1
1.2 Finite-State Automata and Transducersp. 2
1.3 Weights and Probabilitiesp. 8
1.4 Weighted Finite-State Automata and Transducersp. 9
1.5 A Synopsis of Algorithmic Issuesp. 13
1.6 Computational Approaches to Morphology and Syntaxp. 16
Part I Computational Approaches to Morphology
2 The Formal Characterization of Morphological Operationsp. 23
2.1 Introductionp. 24
2.2 Syntagmatic Variationp. 27
2.2.1 Simple Concatenationp. 27
2.2.2 Interlude: Prosodic Circumscriptionp. 29
2.2.3 Prosodically Governed Concatenationp. 31
2.2.4 Phonological Changes Induced by Affixationp. 35
2.2.5 Subsegmental Morphologyp. 36
2.2.6 Subtractive Morphologyp. 37
2.2.7 Extrametrical Infixationp. 39
2.2.8 Positively Circumscribed Infixationp. 40
2.2.9 Root-and-Pattern Morphologyp. 41
2.2.10 Morphomic Componentsp. 46
2.3 Paradigmatic Variationp. 49
2.4 The Remaining Problem: Reduplicationp. 53
2.5 Summaryp. 61
3 The Relevance of Computational Issues for Morphological Theoryp. 62
3.1 Introduction: Realizational versus Incremental Morphologyp. 62
3.2 Stump's Theoryp. 66
3.3 Computational Implementation of Fragmentsp. 67
3.3.1 Stem Alternations in Sanskritp. 68
3.3.2 Position Classes in Swahilip. 73
3.3.3 Double Plurals in Bretonp. 79
3.4 Equivalence of Inferential-Realizational and Lexical-Incremental Approaches: A Formal Analysisp. 83
3.5 Conclusionsp. 85
Appendix 3A Lextoolsp. 86
Appendix 3B XFST Implementation of Sanskritp. 95
4 A Brief History of Computational Morphologyp. 100
4.1 Introductionp. 100
4.2 The KIMMO Two-Level Morphological Analyzerp. 102
4.2.1 KIMMO Basicsp. 103
4.2.2 FST Intersectionp. 105
4.2.3 Koskenniemi's Rule Typesp. 109
4.2.4 Koskenniemi's System as a Historical Accidentp. 110
4.3 Summaryp. 113
5 Machine Learning of Morphologyp. 116
5.1 Introductionp. 116
5.2 Goldsmith, 2001p. 119
5.2.1 Candidate Generationp. 121
5.2.2 Candidate Evaluationp. 122
5.3 Schone and Jurafsky, 2001p. 124
5.4 Yarowsky and Wicentowski, 2001p. 129
5.5 Discussionp. 132
Part II Computational Approaches to Syntax
6 Finite-state Approaches to Syntaxp. 139
6.1 N-gram Modelsp. 139
6.1.1 Backgroundp. 139
6.1.2 Basic Approachp. 141
6.1.3 Smoothingp. 143
6.1.4 Encodingp. 148
6.1.5 Factored Language Modelsp. 150
6.2 Class-based Language Modelsp. 151
6.2.1 Forward Algorithmp. 154
6.3 Part-of-Speech Taggingp. 159
6.3.1 Viterbi Algorithmp. 160
6.3.2 Efficient N-best Viterbi Decodingp. 162
6.3.3 Forward-backward Algorithmp. 164
6.3.4 Forward-backward Decodingp. 168
6.3.5 Log-linear Modelsp. 170
6.4 NP Chunking and Shallow Parsingp. 173
6.5 Summaryp. 174
7 Basic Context-free Approaches to Syntaxp. 176
7.1 Grammars, Derivations and Treesp. 176
7.2 Deterministic Parsing Algorithmsp. 180
7.2.1 Shift-reduce Parsingp. 181
7.2.2 Pushdown Automatap. 182
7.2.3 Top-down and Left-corner Parsingp. 184
7.3 Non-deterministic Parsing Algorithmsp. 189
7.3.1 Re-analysis and Beam-searchp. 191
7.3.2 CYK Parsingp. 193
7.3.3 Earley Parsingp. 201
7.3.4 Inside-outside Algorithmp. 203
7.3.5 Labeled Recall Parsingp. 206
7.4 Summaryp. 208
8 Enriched Context-free Approaches to Syntaxp. 209
8.1 Stochastic CFG-based Parsingp. 209
8.1.1 Treebanks and PCFGsp. 210
8.1.2 Lexicalized Context-free Grammarsp. 221
8.1.3 Collins Parserp. 226
8.1.4 Charniak Parserp. 230
8.2 Dependency Parsingp. 234
8.3 PCFG-based Language Modelsp. 238
8.4 Unsupervised Grammar Inductionp. 240
8.5 Finite-state Approximationsp. 244
8.6 Summaryp. 246
9 Context-sensitive Approaches to Syntaxp. 248
9.1 Unification Grammars and Parsingp. 248
9.2 Lexicalized Grammar Formalisms and Parsingp. 257
9.2.1 Tree-adjoining Grammarsp. 258
9.2.2 Combinatory Categorial Grammarsp. 265
9.2.3 Other Mildly Context-sensitive Approachesp. 270
9.2.4 Finite-state and Context-free Approximationsp. 271
9.3 Parse Selectionp. 273
9.3.1 Stochastic Unification Grammarsp. 273
9.3.2 Data-oriented Parsingp. 275
9.3.3 Context-free Parser Re-rankingp. 277
9.4 Transduction Grammarsp. 279
9.5 Summaryp. 283
Referencesp. 285
Name Indexp. 307
Language Indexp. 312
Indexp. 313