Academia.eduAcademia.edu

Machine Learning and the Future of Philology: A Case Study

2023, TAPA

Abstract

This paper argues that machine learning (ML) has a role to play in the future of philology, understood here as a discipline concerned with preserving and elucidating the global archive of premodern texts. We offer one initialcase study in order to outline broader possibilities for the field. The argument is in four parts. First, we offer a brief introduction to the history of classical philology, focusing on the development of three technologies: writing, printing, and digitizing. We evaluate their impact and emphasize some elements of continuity in philological practice. Second, we describe Logion, an ML model we are currently developing to support various philological tasks, such as making conjectures to fill lacunae, identifying scribal errors, and proposing emendations.In part three, we present some of the results achieved to date in editing the work of the Byzantine author Michael Psellos. Finally, we build on the specific study presented (part three), as well as our more general considerations on philology (part one) and ML (part two), in order to shed light on current challenges and future opportunities for the global archive of premodern texts.

Key takeaways

  • keywords: Philology, Machine Learning, Artificial Intelligence, Logion, Michael Psellos, Digital Humanities, Diagnostic Conjecture, Emendation, Machine-Human Collaboration premodern texts have been preserved in manuscripts which feature gaps caused by material deterioration and errors, omissions, and additions introduced by scribes.
  • Partly as a result of this culture of competitive scholarship, printed critical editions started to combine the publication of ancient texts with the resources needed to evaluate the philological work carried out on them.
  • Meanwhile, we emphasize here that the goal is not to supplement Psellos with machine-generated conjectures, but rather to have Logion support philologists in making their own supplements by offering suggestions.
  • Our point is that, in terms of method, using Logion as a matter of course on all lacunae, rather than relying on human thought processes alone to determine what query to run in the TLG, seems an improvement on current philological practice.
  • The way we have trained Logion so far, by insisting on philological respect for the original, is something that we learned from a tradition that extends back to the ancient scholars working in the Museum.