Thesaurus Linguae Graecae

The Thesaurus Linguae Graecae (TLG) is a research center at the University of California, Irvine. The TLG was founded in 1972 by Marianne McDonald (a graduate student at the time and now a professor of theater and classics at the University of California, San Diego) with the goal to create a comprehensive digital collection of all surviving texts written in Greek from antiquity to the present era. Since 1972, the TLG has collected and digitized most surviving literary texts written in Greek from Homer to the fall of Constantinople in 1453 CE, and beyond. Theodore Brunner (1934–2007) directed the project from 1972 until his retirement from the University of California in 1998. Maria Pantelia, also a classics professor at UC Irvine, succeeded Theodore Brunner in 1998, and has been directing the TLG since. TLG's name is shared with its online database, the full title of which is Thesaurus Linguae Graecae: A Digital Library of Greek Literature (the TLG, in italics, for short).^[1]

The challenge of this huge undertaking was originally met with the help of several classicists and technology experts but primarily thanks to the efforts of David W. Packard and his team who created the Ibycus system, the hardware and software originally used to proofread and search the corpus. Packard also developed Beta code, a character and formatting encoding convention used to encode Polytonic Greek. The collection was originally circulated on CD-ROM. The first CD-ROM was released in 1985, and was the first compact disc that did not contain music. Subsequent versions were released in 1988 and in 1992, thanks to technical support provided by Packard.

By the late 1990s, it became obvious that the old Ibycus technology was outdated. Under the direction of Professor Maria Pantelia, a number of new projects were undertaken, including the massive migration out of Ibycus, the development of a new system to digitize, proofread, and manage the textual collection, a new CD-ROM (TLG E), released in 1999, and eventually the move of the corpus to the web environment in 2001. At the same time, the TLG started working with the Unicode Technical Committee to include all characters needed to encode and display Greek in the Unicode standard. The corpus continues to be expanded significantly to include Byzantine, medieval, and eventually modern Greek texts. More recent projects include the lemmatization of the Greek corpus (2006) – a substantial undertaking, given the highly inflected nature of Greek and the complexity of the corpus, covering more than two millennia of literary development – and the Online Liddell–Scott–Jones Greek–English Lexicon (commonly referred to as the LSJ), released in February 2011.

Since 2001, the TLG corpus has been searchable online by members of subscribing institutions, which number close to 1500 worldwide. All bibliographical information and a subset of the texts are available to the general public.

The number of Greek words in the corpus amounts to 110 million,^[2] while the number of unique wordforms amount to 1.6 million and the number of unique lemmata to 250,000.^[3]

References

^ "TLG – Home". Thesaurus Linguae Graecae: A Digital Library of Greek Literature. University of California, Irvine: Thesaurus Linguae Graecae. 2017. Retrieved December 10, 2017.
^ "About". Thesaurus Linguae Graecae: A Digital Library of Greek Literature. University of California, Irvine: Thesaurus Linguae Graecae. 2017. Retrieved December 10, 2017.
^ "Statistics" (PDF). Thesaurus Linguae Graecae: A Digital Library of Greek Literature. University of California, Irvine: Thesaurus Linguae Graecae. 2017. Retrieved December 10, 2017.

External links

[1] "TLG – Home". Thesaurus Linguae Graecae: A Digital Library of Greek Literature. University of California, Irvine: Thesaurus Linguae Graecae. 2017. Retrieved December 10, 2017.

[2] "About". Thesaurus Linguae Graecae: A Digital Library of Greek Literature. University of California, Irvine: Thesaurus Linguae Graecae. 2017. Retrieved December 10, 2017.

[3] "Statistics" (PDF). Thesaurus Linguae Graecae: A Digital Library of Greek Literature. University of California, Irvine: Thesaurus Linguae Graecae. 2017. Retrieved December 10, 2017.

[1]

[2]

[3]

v t e Corpus linguistics
Text corpora, English	American National Corpus Bank of English Bergen Corpus of London Teenage Language British National Corpus Brown Corpus Buckeye Corpus Cambridge English Corpus Corpus of Contemporary American English Enron Corpus EnTenTen International Corpus of English Lancaster-Oslo-Bergen Corpus Oxford English Corpus PropBank Spoken English Corpus Switchboard Telephone Speech Corpus TIMIT VerbNet Wellington Corpus of Spoken New Zealand English
Text corpora, non-English	Bijankhan Corpus CHILDES CorCenCC National Corpus of Contemporary Welsh Croatian Language Corpus Croatian National Corpus Czech National Corpus Europarl Corpus German Reference Corpus Hamshahri Corpus National Corpus of Polish Neo-Assyrian Text Corpus Project Persian Speech Corpus Quranic Arabic Corpus Russian National Corpus Somali Corpus Scottish Corpus of Texts and Speech Slovenian National Corpus TalkBank Tatoeba Tehran Monolingual Corpus Tekstaro de Esperanto TenTen Corpus Family Thesaurus Linguae Graecae
Organizations	BNC consortium COBUILD Sketch Engine

v t e University of California, Irvine
Overview	Academics Athletics Campus People
Schools	Arts Biological Sciences Business Education Engineering Humanities Information and Computer Sciences Law Medicine Physical Sciences Social Ecology Social Sciences
Research	Beckman Laser Institute Burns Piñon Ridge Reserve Calit2 Center for the Neurobiology of Learning and Memory Institute of Transportation Studies Steele Burnand Anza-Borrego Desert Research Center Thesaurus Linguae Graecae UC Center for Hydrologic Modeling UC Humanities Research Institute
Student life	Activities and traditions KUCI Muslim Student Union
History	Timeline Irvine 11 controversy pro-Palestinian campus occupation
Athletics	Baseball Men's basketball Women's basketball Men's soccer Men's volleyball
Campus	Anteater Ballpark Anteater Recreation Center Anteater Stadium Arboretum Barclay Theatre Bren Events Center Crawford Hall UCI Medical Center Irvine–Newport under construction UCI Medical Center off campus New Swan Theater seasonal Student housing Middle Earth University Hills

Authority control databases
International	ISNI VIAF 2
National	Germany United States Australia Norway Vatican Israel
Academics	CiNii
People	Trove

See also

References

External links