The PARC 700 Dependency Bank

The PARC 700 Dependency Bank consists of 700 sentences randomly extracted from section 23 of the UPenn Wall Street Journal treebank, parsed with our LFG grammar, and given gold-standard annotations of grammatical dependency relations through manual correction and extension. The average sentence length is 19.8 words, and the average number of relation triples per sentence is 65.4. The corpus is freely available for research and evaluation purposes. Please contact us directly if you intend to use the corpus for commercial applications.
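
As a rough illustration of how the two averages quoted above are defined, the sketch below computes them over a tiny in-memory stand-in for the bank. The data layout, the relation names, and the `corpus_averages` helper are all assumptions made for this example; they do not reproduce the corpus's actual file format or annotation scheme.

```python
from statistics import mean

# Hypothetical toy data, NOT taken from the corpus: each sentence is a token
# list plus a list of (relation, head, dependent) triples.
toy_bank = [
    {
        "tokens": ["John", "sleeps"],
        "triples": [("subj", "sleep", "John"), ("tense", "sleep", "pres")],
    },
    {
        "tokens": ["Mary", "saw", "the", "dog"],
        "triples": [
            ("subj", "see", "Mary"),
            ("obj", "see", "dog"),
            ("tense", "see", "past"),
            ("det_form", "dog", "the"),
        ],
    },
]


def corpus_averages(bank):
    """Return (average tokens per sentence, average triples per sentence)."""
    avg_len = mean(len(sentence["tokens"]) for sentence in bank)
    avg_triples = mean(len(sentence["triples"]) for sentence in bank)
    return avg_len, avg_triples


avg_len, avg_triples = corpus_averages(toy_bank)
print(f"average sentence length: {avg_len}")
print(f"average relation triples: {avg_triples}")
```

Applied to the real annotations, the same per-sentence counts would be averaged over all 700 sentences, once the files have been read into a comparable structure.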

We would like to thank Ted Briscoe, Mick Burke, Aoife Cahill, John Carroll, Rebecca Watson, and Tomas By for corrections to the original release.




For more information and references, visit the NLTT page or contact Tracy Holloway King.
Last modified: , Tracy Holloway King