▽What’s New! What’s Free Archive ●12/02 14:26 Classic Corpora in LDC’s Catalog: Penn Treebank The〓LDC Catalog〓features classic corpora responsible for critical advances in human language technology that continue to influence researchers. Among them are the Penn Treebank releases, Treebank-2 (LDC96T7)〓and〓Treebank-3 (LDC99T42).〓 The Penn Treebank project (1989-1996) produced seven million words tagged for part-of-speech, three million words of