Anda belum login :: 24 Apr 2025 06:02 WIB
Home
|
Logon
Hidden
»
Administration
»
Collection Detail
Detail
TEI Analytics: Converting Documents Into A TEI Format For Cross-Collection Text Analysis
Oleh:
Zillig, Brian L. Pytlik
Jenis:
Article from Journal - e-Journal
Dalam koleksi:
Literary and Linguistic Computing vol. 24 no. 2 (Jun. 2009)
,
page 187-192.
Fulltext:
Vol 24, 2, p 187-192.pdf
(64.39KB)
Isi artikel
For the purposes of large-scale analysis of XML/SGML files, converting humanities texts into a common form of markup represents a technical challenge. The MONK (Metadata Offer New Knowledge) Project has developed both a common format, TEI Analytics (a TEI subset designed to facilitate interoperability of text archives) and a command-line tool, Abbot, that performs the conversion. Abbot relies upon a new technique, schema harvesting, developed by the author to convert text documents into TEI-A. This article has two aims: first, to describe the TEI-A format itself and, second, to outline the methods used to convert files. More generally, it is hoped that the techniques described will lead to greater interoperability of text documents for text analysis in a wider context.
Opini Anda
Klik untuk menuliskan opini Anda tentang koleksi ini!
Kembali
Process time: 0.03125 second(s)