This research was funded in part by the UK AHRC, grant number AH/W010682/1. For
the purpose of Open Access, the authors have applied a CC BY public copyright
licence to any Author Accepted Manuscript (AAM) version arising from this
submission.
Open scholarship: epigraphic corpora in the digital
age
John Bodel (Brown University), Jonathan Prag (University of Oxford), Charlotte
Roueché (King’s College London). 1
This paper began life as two sessions entitled ‘the future of corpora’ and ‘digital
epigraphy’. For practical reasons, this became two linked papers co-authored by
three speakers. In composing those papers it became increasingly clear to us that
the two topics are intricately linked: not because we believe that any future corpus
must necessarily be digital, but rather because the possibilities created by the rapid
advances in digital technologies expose an increasing lack of clarity about the nature
and purpose of both the ‘corpus’ and the individual ‘edition’. At the same time, the
possibilities and the challenges of digital methods bring to the fore a wide set of
issues concerning not only research methods but also research culture (it is no
coincidence that open scholarship has emerged as a contemporary issue alongside
digital scholarship).
The difficulties encountered by the earliest digital epigraphic projects have been well
rehearsed: we do not attempt a history of the development of digital epigraphy,
which is well documented. 2 The technical incapacities and incompatibilities that
initially hindered full cooperation and sharing of information among even willing
partners have been almost entirely overcome, although the scars from that
experimental period of rapid development may well persist. Rather, it is our shared
view that the largest obstacle still facing us as a discipline today is not technical but
cultural: our academic culture has not caught up to advancements in technology that
make possible the dissemination and sharing of information with unprecedented
ease. Ever-advancing technological capabilities have opened up uncharted territory
in the field of epigraphic research. If we are to embrace these developments because they enable progress in our research, which is our raison d’être - we will
need to adapt. Retaining to some extent the structure of our original presentations,
the first part of this paper considers the interrelated questions of what purpose a
corpus serves, what type of publication is best suited to achieving it, and what,
increasingly problematised in the rapidly evolving world of digital publication, we
1
Authors are listed in alphabetical order and share equal responsibility for the text. This work was
supported by the UK AHRC grant number AH/W010682/1 (the FAIR Epigraphy project).
2 See e.g. Rossi 2021, Elliott 2014, Bodel 2012, the papers in Feraudi-Gruénais 2010, Cayless et al.
2009; and for recent work and challenges, the collections of De Santis and Rossi 2018 or Velásquez
and Espinosa 2021.
Published as: Bodel, J., Prag, J.R.W. and Roueché, C. (2024). In Pierre Fröhlich & Milagros Navarro
Cabellero (eds.), L’épigraphie au XXIe siècle. Actes du XVIe Congrès International d’Épigraphie
Grecque et Latine, Bordeaux, 29 août-02 septembre 2022, Bordeaux, Ausonius, 91-117.
mean by an edition. We attempt to ground our ideas and concerns in a discussion of
the evolution of the Inscriptions of Roman Tripolitania: throughout our writing of this
paper we have had the remarkable figure of the late Joyce Reynolds in our thoughts:
she was deeply committed to collaboration in the epigraphic community, hosting her
own International Congress in 1967 before welcoming the development of AIEGL to
take on the task. It was this commitment that led her to embrace the possibilities
brought to epigraphic publication by digital developments (with a healthy scepticism
of technology for its own sake). In the second part of the paper we develop some of
these ideas, with a particular focus on some key challenges that are highlighted by
the application of digital methods: sustainability, the fundamental importance of
agreed standards, and the need for open and accessible data.
Epigraphic corpora
When August Boeckh in 1815 presented to the board of the Berlin Academy the
proposal he had drafted on behalf of the Philological-Historical Class of that year for
the publication of a Thesaurus Inscriptionum of all the inscriptions of antiquity “up to
the beginning of the medieval period”, beginning with Greek and Latin (a practical
constraint we should surely now move beyond), the core elements of how a
geographically arranged corpus might be constituted and on what principles editions
should be based had been largely worked out. 3 Three hundred years earlier Cyriac
of Ancona compiled the first large-scale epigraphic corpus by accurately copying and
drawing the monumental inscriptions he encountered in his travels in Italy, Sicily,
Dalmatia and throughout the eastern Mediterranean. By the end of the fifteenth
century the principle that editions should be based on autopsy and, where possible,
published with drawings or at least diplomatic transcriptions had been well
established, and other basic subsidiary tools, such as indices of names, became
common with the advancement of printed editions over the following century. 4 But an
enterprise on the scale that Boeckh and his colleagues proposed required something
more: “The principal aim of a Royal Academy of Sciences,” he wrote, “must be to
undertake projects and to complete work which no one person could achieve, in part
because his abilities alone would not be sufficient, in part because it would require
an expense which a private individual would not dare to incur”. 5 The plan was to
begin with Greek inscriptions, since Latin texts were more fully covered in existing
3
Boeckh’s handwritten proposal is reproduced and transcribed in Hallof 2012, 2-7.
Buonocore 2014 provides a masterful, if cursory, overview of the rich manuscript tradition. The
three editions of Greek and Latin inscriptions published by Fra Giocondo of Verona in the last quarter
of the 15th century were influential and set a high bar: praeter quae vidi quaeque accurate exscripsi in
hoc volumen nihil congessi (“I have gathered into this volume nothing except what I have seen and
what I have carefully transcribed”, quoted at Buonocore 2014, 30).
5 Hallof 2012, 2-3, “Der Hauptzweck einer königl. Akademie der Wissenschaften muß dieser seyn,
Unternehmungen zu machen und Arbeiten zu liefern, welche kein Einzelner leisten kann, theils weil
seine Kräfte denselben nicht gewachsen sind, theils weil ein Aufwand dazu erfordert wird, welchen
kein Privatmann zu machen wagen wird.” Translation from Hallof 2009, 29, adapted. Boeckh went on
to point out that only ‘a great prejudice’ (ein großes Vorurtheil) would deny to the fields of philology
and history the same funding already bestowed upon the fields of mathematics and physics.
4
publications, a task that could be completed, it was thought, in four volumes to be
produced over four years. When the fourth and final volume of the Corpus
Inscriptionum Graecarum (CIG) did finally appear forty-four years later, a year before
Boeckh’s death in 1860 (the indices would not appear until 1877), it was clear
already that a second edition and supplement were needed.
Epigraphists today, whether working in traditional ways or toiling in the digital
trenches, can take comfort in three lessons that Boeckh’s experience with CIG
teaches us: in order to achieve success in any large-scale epigraphic project, longterm funding and collaboration are essential; such projects, if undertaken
conscientiously, take more time than anticipated; and, no matter how professionally
executed, they will inevitably require revision and updating. The successor project to
CIG, the current Inscriptiones Graecae (IG), like the Corpus Inscriptionum Latinarum
(CIL) initiated by Mommsen in 1853, was conceived through the recognised need in
the nineteenth century to establish and maintain “a comprehensive and critical
source-collection, presented geographically and by inscription-type, and one which is
continually extended and updated”. 6 Two core points might be highlighted: one is the
ambition to be comprehensive, combined with the assumption that there neither was
nor would be any other comparably foolhardy project (whether conceived of as rival
or fellow traveller); the other is the form and content of such a corpus.
Naturally, these corpora were constructed according to the best methods of the
nineteenth century. The challenge of staying up to date has been constantly
recognised, but hardly solved, at least outside of the internal archives of individual
projects, the ephemeral Ephemeris epigraphica: Corporis inscriptionum Latinarum
supplementum, or the early creation of L’Année Épigraphique (AE) and the
Supplementum Epigraphicum Graecum (SEG). Other than the increased inclusion of
photographs or drawings, more standardised basic metadata, and ever richer
indices, the corpus volumes have not significantly changed over time. 7 As Bodel put
it when addressing the Oxford congress in 2007: “Innovations introduced into the
entries in CIL … have taken the medium of the codex … nearly as far as it can go.” 8
A concordance is needed to navigate between the volumes of CIL, 9 while IG, always
intended to cover only the Greek inscriptions of Europe, now by design is destined to
be incomplete. The basic question then, must be, is this still the best way to produce
6
Quoting from the ambitions of CIL: “It is the only systematic and critical edition which unites all the
epigraphic sources from the former Roman empire (though only a selection of Christian inscriptions
are included). As a comprehensive and critical source-collection, presented geographically and by
inscription-type, and one which is continually extended and updated, it is an indispensable tool in the
study of the ancient world.” (Schmidt 2007, 19).
7 Certain editorial practices have changed: e.g., IG and CIL no longer attempt to reproduce the
appearance of the carved letters, as they originally did; some practices, such as (in IG) the recording
of ‘fasti’ of literary sources related to particular communities to accompany the epigraphic texts, have
come and gone; the use of Latin persists; see Schmidt 2007, 25-27 with 16 and Hallof 2009, 32-45.
8 Bodel 2012, 280; the critique of IG by Merkelbach 1998 is noteworthy.
9 Fassbender 2003.
corpora, according to the methods of the nineteenth century, and not the methods
now available to us in the twenty-first?
Günther Klaffenbach, director of IG from 1953 to 1972, recognized this limitation but
saw it as inevitable. In response to the question “what are the users of a corpus
looking for?”, he listed criteria we would all recognize as essential for a critical
edition: information on the inscribed object (for IG, stone, itself a limitation that
should perhaps be more open to question) and script, findspot and current location,
a reliable text, a date, and “the minimum information necessary to understand the
inscription”. None of this has changed; users of a corpus are looking for the same
things today. But Klaffenbach went on: “For more than that, however, there is really
no space, unless the acceptable length is to be exceeded, which would reduce the
utility of the Corpus”. 10 By acceptable length, of course, he meant for a folio volume,
and ‘utility’ is here defined with reference to practicality. There was little else to
imagine in Klaffenbach’s day. In 1967, he predicted “with confidence” that “the more
the field diversifies over time, the greater the need will be … for a few, large,
comprehensive editions, to which every scholar of the ancient world can turn without
lengthy searches and inquiries”. 11 In 1967 that was a reasonable expectation. Two
years later the western world looked very different culturally, and digital technologies
had begun to make their way into the world of epigraphy. 12
Nowadays there is little reason to limit our idea of what a corpus should do by the
technological capabilities of the nineteenth century, and no one today would argue
that the best way to make inscriptions accessible to “every scholar of the ancient
world” is through the medium of large folio volumes housed in major research
libraries (and priced beyond the reach of individuals). The structure of an entry in the
standard corpora is a beautiful thing—orderly, meticulous, concise, clean (in the
sense that it is laid out visually in a way that facilitates comprehension)—and there is
no reason to lose any of this utility or appeal; indeed all these features could be
reproduced. 13 But militant epigraphists have always pushed the technological
capabilities of their day in order to acquire more accurate readings and to make
inscriptions more legible to those not able to view them directly: drawings, squeezes,
10
Hallof 2009, 42-43, “Für mehr ist aber auch wirklich kein Platz da, wenn nicht der zulässige
Umfang überstiegen und damit die Übersichtlichkeit des Corpus beeinträchtigt werden soll - und wenn
der Bearbeiter auch wirklich fertig werden soll.” English transl. adapted.
11 Hallof 2009, 24, “Je größer aber im Laufe der Zeit die Zersplitterung werden wird, desto größer
wird (das glaube ich getrost behaupten zu können) wieder das Bedürfnis nach wenigen großen,
zusammenfassenden Editionen werden, an die sich jeder Vertreter der Altertumswissenschaft ohne
langes Suchen und Fragen wenden kann.” English translation on p. 27.
12 Leaving aside the social and political upheavals of the day, Packard 1968 offered the first
computer-based concordance of Livy, and work had begun on Jory and Moore 1974–1975: the IT
revolution had begun (cf. Bodel 2012)..
13 Although it is worth bearing in mind the degree to which the CIL and IG entries, written still in Latin,
are increasingly inaccessible, such that handbooks to epigraphy provide guidance on accessing them,
e.g. Cooley 2012, 346-50 (cf. Merkelbach 1998, 295). For corpora entries, see Hallof 2009, 22-23;
Schmidt 2007, 16, 18-23.
print photographs, text databases, digital imaging, photogrammetry and 3-D
modeling, physical and chemical analysis have each, in their day, advanced the
science of epigraphy and the accessibility of the material. None has fallen out of use
with the arrival of each new technology, and the gains to knowledge have been
cumulative. Is there any reason to anticipate lesser gains from transferring our
accumulated knowledge from one medium of communication to another, one that
enables additional capabilities and provides added value? Digital methods, notably
but not only the possibilities of exploiting Linked Open Data (LOD), today enable us,
and should require us, to do more.
To take one example: the so-called Leiden conventions were developed by
papyrologists to meet their particular editorial needs. Subsequently adapted by
epigraphists to the editing of texts incised in stone, the system enabled the
consistent (i.e. widely comprehensible) and efficient presentation in the medium of
print of core features of an inscribed text. 14 The EpiDoc TEI standard was developed
to encode the same features in machine-readable fashion, but such encoding is
capable of capturing far more than Leiden or even an enhanced Leiden+ system
ever could, and there is no reason artificially to limit the application of such encoding
in line with the earlier limitations imposed by print. 15 To what extent should the
medium dictate the way we study and present our material?
These ever increasing possibilities beg the question, what is the purpose of the
corpus? Any answer to that question will reveal an emerging tension, between the
possibilities and desire for richer editions and the desire for comprehensive corpora
to enable research. In the world of the codex, the two were seen to be mutually
exclusive; and, in the world of the codex, the type of research that could be
conducted over a collection of many thousands of texts was limited. We return to this
theme below.
First, we examine a set of practices in which we have all probably engaged at some
point, but which none of us consciously condones. These practices have emerged
out of the existing model, based as it is upon the technology of the book,
notwithstanding its many virtues. We refer to the habit of citing inscriptions by their
entry numbers in standard corpora, or indeed in any of the supplementary series,
anthologies or thematic and specialist corpora, as if the number were a mere
descriptive identifier and did not in fact represent a specific edition made by a
particular scholar on a particular basis at a particular time. This is not simply a
question of potentially depriving an editor of credit for authorship (although this is
itself an important point). Rather it is a matter of obscuring or even confusing
14 Panciera 1984 (repr. in Panciera 2006) provides a brief assessment of why the Leiden conventions
never fully met the needs of epigraphists.
15 For EpiDoc, see Bodard 2010, Cayless et al. 2009, and https://epidoc.stoa.org/ (= Elliott et al. 20062022).
authorship, and consequently the specific reading of the text adopted, when multiple
editions of an inscription are cited as if equivalent. For example, when we cite a wellknown epitaph of the late Republican period found outside Rome as CIL I2 1221 = VI
9499 = ILS 7472 = CLE 959 = ILLRP 793, we leave entirely unclear whose edition
we have consulted and on what basis the text that we have used in our research is
constituted. 16 The reasons for listing so many references are several (including the
authorial claim to knowledge and authority), but above all the practice reflects the
fundamental challenge of a world in which the original corpus edition of an inscription
in CIL or IG, while still a reliable authoritative edition of record, is rarely the definitive
edition (CIL I2 was published in 1918), given the multiplication of subsidiary corpora,
further studies and new discoveries, with which the original corpus cannot hope to
keep up to date; hence the value of incorporating at least one citation, where
possible, of a reference in AE or SEG, particularly in the case of a publication that
provides further material, contextual information or a photograph. 17 Nonetheless, this
citation habit combining abbreviation with feigned equivalence is deeply flawed: at
best it is lazy, and fails to give credit where due; at worst, it makes it fundamentally
difficult to engage critically with the research being conducted on the text in question.
This practice was regularly criticised by Louis Robert, with his profound
understanding of the importance of identifying and assessing every editor: “Quand
l’éditeur n’est pas l’auteur de la copie, n’a pas vu la pierre ou l’estampage - c’est
régulièrement le cas dans le CIG de Boeckh et de Franz -, il faut indiquer avec soin
de qui est la copie; c’est capital pour la critique.” 18 Furthermore, such practice
increasingly conceals a crucial piece of information, namely which of these
references point to genuine ‘editions’, contributing new or differing information and
interpretation, and which ones merely reproduce a previously published text.
This situation has only deteriorated over time, firstly with the constant increase in
paper publications; and secondly, with the rise of digital collections. 19 The previous
reference could, for example, be extended to include “= EDCS-19200211 =
EDR167214 = TM574526”, and increasingly (and quite rightly), such digital identifiers
are being cited in publications alongside paper ones. In the first place, we should
note the current widespread failure to cite digital resources fully and formally in a
16
The text was transcribed and edited in CIL by Mommsen on the basis of autopsy (“descripsi”) in the
British Museum; the other editions listed here all leave implicit the source of their text, although all
note that Mommsen transcribed it from autopsy.
17 The EDR entry for this particular inscription is especially instructive (http://www.edredr.it/edr_programmi/res_complex_comune.php?do=book&id_nr=EDR167214 = Butini 2022): the
primary corpora references noted by us in the main text are supplemented by nine print and two
digital editions / resources (some of the former are reported generally as works in AE, but this
inscription is essentially not picked up for re-editions by AE, so finding more up to date information on
this particular inscription would be a challenge without Butini in EDR!).
18 Robert & Robert 1954, 13-14 in a statement of their own principles for an edition; more trenchantly
in critique of others, Robert 1959, 10-12 and 1965, 17-19.
19 We use this neutral term as a ‘catch-all’, precisely because the classification of digital resources, as
part of a revised typology of editions, remains a desideratum.
comparable fashion to traditional printed resources. 20 Each of these identifiers, if you
know where to look, will resolve as a webpage with further information about the
inscription. Most of those citing these numbers do not bother to reference either the
hosting website or the specific URL, and digital resources rarely appear in
bibliographies with full details of publication date, authorship, etc. 21 It is worth
recognising, at the same time, that most of those citing CIL or IG leave many
elements implicit, not least the date of the edition in question, as well as the identity
of the actual editor responsible for that edition. 22 However, at this point, comparison
of these three specific digital examples highlights why current epigraphic citation
practice ever more urgently needs revision.
The Epigraphik-Datenbank Clauss / Slaby (EDCS) boasts by far the largest number
of texts of Greco-Roman inscriptions of any site on the internet (well over half a
million at this point), and is an invaluable resource that is widely used. Texts are
presented as search results via html, in a way similar to that of the Packard
Humanities Institute (PHI) resource for Greek inscriptions, an aspect to which we
return below. If we consider this specific case (EDCS-19200211), certain features
can be observed: 23 the entry provides a transcription with supplements and editorial
decisions such as corrected letters marked with the editorial conventions (an
adapted Leiden system) peculiar to EDCS; a sequence of references to previous
publications; an approximate date; and basic metadata about the original location
and material of the support, the type of inscription, and persons mentioned in the
text. There is no indication as to the source of the text and metadata (as there is in
PHI) other than a listing of seven editions (or notices of editions, since AE 2006, 125
merely notes a volume which contains a new edition), nor is it immediately clear who
prepared, published, or last revised the text and when. The EDCS ‘advice’ page
notes that “The texts are based on the decision of the EDCS editors”, but this leaves
many questions unanswered. 24 Currently there is no way to check the reliability of a
text in EDCS or even to understand how it was created and where it might diverge
from previous editions without going back to an earlier edition, and there is
20
We attempt to model good practice in this paper, but doubtless with limited success.
Conventions for citing digital resources are well established: see e.g. https://apastyle.apa.org/stylegrammar-guidelines/references/examples/webpage-website-references (= American Psychological
Association 2021). The three sites in question are: http://www.manfredclauss.de/ (= Clauss et al.
n.d.); http://www.edr-edr.it/default/index.php (= Panciera et al. 1999-2023); and
https://www.trismegistos.org/ (= Depauw n.d.). In fairness to the user, many sites do not provide clear
guidance for their own citation.
22 We note that there is a clear divergence of opinion in the community at present, expressed during
the conference, as to whether an edition cited from a corpus such as CIL should in fact be attributed
to the responsible individual, or whether the individual scholar should be subordinated to the collective
enterprise (and so anonymised). It will be apparent that we subscribe to the former view.
23 https://db.edcs.eu/epigr/epi_url.php?s_sprache=en&p_edcs_id=EDCS-19200211 (= Clauss et al.
n.d.).
24 This information can be found at: https://db.edcs.eu/epigr/hinweise/hinweis-en.html. Note that due
to the use of frames on the site, following the link ‘Advice / Hinweise’ from the homepage to this
particular page will not actually expose this URL, which has to be extracted by other means, making it
even more difficult to provide accurate links to such information.
21
seemingly no way of knowing which edition was used to produce the text presented,
in the many cases such as this where more than one edition is listed. Consequently,
the increasing numbers of researchers who are citing inscriptions in their published
research simply via an EDCS number are making arguments on the basis of a text of
uncertain provenance and form - and which without a date of publication or revision
may change not only without warning, but without ability to trace that fact - and thus
falling short of the standards that we might expect of the scholarly community.
To be clear, this is not a problem with ‘digital’, or with the unique and invaluable
EDCS database alone. Quite the opposite: the problem is widespread. It is a failure
of academic practice, firstly on the part of those compiling the databases that do not
provide users with critical information about how they are constituted, and secondly
on the part of all those uncritical users (a majority, no doubt) who do not confirm the
texts they report on the basis of more reliable, or at least, creditable editions. The
fault lies on both sides, with both the producers and the users. On the side of the
producers, clear and transparent documentation of how the data is produced and
how it should be cited is essential; on the side of the users, it is imperative that we
become more critical and responsible in our use of digital tools (but also, it emerges,
in our use of published resources in general). Within the field of epigraphy, as
already suggested, we need to become more critical in our appreciation of what
constitutes an ‘edition’. If the status of a text in EDCS is unclear (is it a copy of an
existing edition? Or is it modified, and if so on what basis?), the same is not the case
when we turn to the same inscription in the Epigraphic Database Roma (EDR). 25 In
this case, a sequence of editions is reported, and a text is presented that is explicitly
attributed to certain of those editions, with the further information that it has been
checked against an image by the editor and with a limited apparatus; the author of
this digital edition and the date of publication (and revision) is noted. The project’s
‘presentazione’ page makes explicit the status of these texts as critical editions in
their own right and provides a suggested form of citation for each record, which
includes recognition of the authorship and date. 26
Lastly, we turn to the Trismegistos reference, the TM number cited above. 27 The TM
number goes to the heart of the multiple editions ‘problem’, since the TM reference
does not itself resolve as an edition of the text. Rather, the very purpose of the TM
25
http://www.edr-edr.it/edr_programmi/res_complex_comune.php?do=book&id_nr=EDR167214 (=
Butini 2022).
26 http://www.edr-edr.it/it/present_it.php. It remains the case that the vast majority of citations of EDR
in publications that we have observed do no more than quote the number, highlighting the fact that it
is we, the users, who are above all at fault.
27 www.trismegistos.org/text/574526. As a stable identifier, this is a different sort of resource when it
comes to citation (see main text). The Trismegistos project simply suggests that in discussion of the
resource one cites Depauw & Gheldof 2014 (see
https://www.trismegistos.org/about_how_to_cite.php), on the grounds that reference to web
addresses for bibliometric purposes “are often problematic” (!). For a more recent presentation and
discussion, Depauw 2018.
text number is to provide a stable identifier to identify the inscription under
discussion: consequently, what is provided is enough information, in the form of
some basic metadata and core bibliographic references (including other digital
resources), to refer unambiguously to this inscription and no other. In other words,
the TM text number constitutes a reference to the inscription in the abstract, not a
reference to a specific edition. Theoretically, therefore, in those situations where one
is citing an inscription and the specific text/edition is not important, but one wishes to
enable the reader to identify the inscription and locate editions of it, the TM number
could serve in place of the entire list of references previously cited (or, one might
imagine citing only the text edition actually employed together with the TM number,
in order to enable the user to find a wider range of other editions if they wish to do
so). 28 The TM reference thus makes particularly clear the need to be self-conscious
and explicit as to what in fact one is citing, when referencing a text. But even this is
not the whole story. Almost none of these editions make reference to the textcarrying stone’s inventory number in the British Museum. 29 While it may not be the
intention of an editor that the edition reference equates to an identifier for the textcarrier (i.e. physical object), this conflation of text with object is commonplace, at
least in the practice of users. This confusion may even have been increased by the
increased attention to a text’s material context in recent decades (a repeated theme
at recent AIEGL congresses). Although the TM number functions as an abstract
reference to a text, the exact relationship between text and object in this case too
remains opaque, because the TM number originated within the world of
papyrological conventions, where commonly all texts on a single papyrus are treated
as a single document. 30 This does not make the TM number less useful (although it
does mean that there will be occasions where a single TM number may relate to
multiple texts on a single object), but only reinforces the point that clarity of reference
is essential, and there remains a further level of distinction, between text and text-
28
Currently access to the concordance of identifiers provided by Trismegistos can only be accessed
either by subscription or through direct use of the TexRelations Matcher API, which sets a high
technical bar; consequently it remains inaccessible to most users and not fully ‘FAIR’ (on which see
below at nn.72 and 79 especially).
29 GR 1867.508.55 (the inventory number is recorded in EDR alone of the publications we have
referenced). In fact there is a full online presentation of this record by the museum (including images,
text transcription and translation), which is currently not cross-referenced by other resources:
https://www.britishmuseum.org/collection/object/G_1867-0508-55 (= The Trustees of the British
Museum 2023). The categorisation (and so title) of the inscription in the BM records as “tomb relief;
block” is typically unhelpful to anyone searching for the inscribed text. Museum databases and
catalogues seldom mention ancient inscriptions in their short titles and regularly fail to record them
altogether.
30 https://www.trismegistos.org/about_how_to_cite.php, and in particular “To determine what
constitutes a document or book or inscription (and thus should become a separate record), we have
given priority to material aspects: in principle all texts written on what was in antiquity a single writing
surface belong together and form one document receiving a single Trismegistos number, unless there
are good reasons to believe that the only (and unintended) relation between the two texts is the
writing surface itself.”
carrier, that has yet to be fully considered and integrated into many epigraphers’
conceptual models. 31
It is clear, therefore, that there needs to be an ongoing discussion about how best to
express to a reader the precise source of the words they are seeing on the page, or
upon which the argument that they are reading is based (i.e. an edition typology and
best practice in the citation of editions). An example of such a categorisation can
already be seen in the terminology employed in CIL, which includes:
● descripsi: original edition based upon autopsy
● contuli: revised edition (based upon autopsy?)
● contuli quae supersunt: revised edition based upon autopsy where only part of
the original now survives compared to at the original moment of edition.
● recognovi: minor revisions of a previous edition (without autopsy)
● recognovi et emendavi: major revisions of a previous edition (without autopsy)
● ex ectypo: revision or edition based upon a squeeze
These could obviously be extended to include, for example, revision or edition based
upon photograph (or other category of reproduction), but should also be extended to
include categories such as the simple reproduction without emendation of an existing
edition. The proposed guidance for recording observations of an inscription in the
EpiDoc guidelines offers a basis for further development. 32 Such an exercise in
categorisation applies to editions of any kind, and is essential to the sort of scholarly
rigour discussed above; but its necessity becomes ever more apparent as we start to
encode texts in digital form.
At the heart of any such discussion is the identification of the necessary categories.
How such categories should be expressed (in Latin, or any other language) is a
secondary issue. The same applies to the presentation of text (it is not the shape of
the brackets employed in Leiden or Krummrey-Panciera that matters, but the
varieties of text survival, restoration, and emendation that need to be represented),
or to bibliography (it is not the abbreviation itself that matters, but the bibliographic
edition that it represents). Digital encoding, in fact, enables such debates to be
bypassed, since the encoding can be represented in any way one chooses. As the
example of the Leiden conventions reminds us, it is in this crucial area of deciding on
the appropriate categories that the community, we, the members of AIEGL, can play
a valuable role by sharing our experience and pooling our expertise. We return
below to the question of standards, which sits at the heart of Linked Open Data
approaches.
31
This sort of conflation is accounted for within the CIDOC Conceptual Reference Model (CRM)
developed by the cultural heritage sector to enable information integration particularly within the
semantic web, which now has a proposed extension to incorporate ancient texts, called CRMtex,
developed by Francesca Murano and Achille Felicetti: see https://cidoc-crm.org/ and https://cidoccrm.org/crmtex/ModelVersion/version-1.0-0, discussed in Felicetti & Murano 2017.
32 https://epidoc.stoa.org/gl/latest/supp-histlocations.html (= Bodard et al. 2022).
At this point in our discussion, it will be useful to consider the example of one
regional corpus project that has successfully crossed the bridge from analog (that is,
print) to digital publication: this may illustrate how the move to online dissemination
can result naturally, even inevitably, from a consistent pursuit of the same primary
responsibility of any epigraphist who is confronted with editing an ancient monument,
which is to record it as fully as possible, as if she or he may be the only (or the last)
witness who will ever do so. This recognition of our task as witnesses may be dulled
in times of peace and easy travel, but it becomes clearer in times of war. When John
Ward-Perkins, an archaeologist who had served with British troops in North Africa
and Italy, was appointed Director of the British School at Rome in 1946, he
understood the critical importance of recording archaeological evidence carefully and
fully; from 1947 he visited Libya regularly, accompanied by an energetic graduate
student, Joyce Reynolds. Ward-Perkins photographed, and Reynolds transcribed as
many inscriptions as possible. Our choice of case-study is of course deliberate, not
least as a tribute to Joyce. As Roueché has already observed elsewhere, “When the
possibility of online publication arose, she embraced it – not because she had any
affection for computers, but because she grasped that this could make her work
accessible to so many more people all over the world.” 33
Reynolds and Ward-Perkins realised that their records would be imperfect, and that
a full account would depend on collaboration with other scholars: it is particularly
striking that, even in the sour aftermath of war, they worked closely with Italian
scholars who had worked in Libya before them. They also saw the need to make
their materials available as fully as possible, as soon as possible: The Inscriptions of
Roman Tripolitania was published in 1952, within the constraints of post-war
publication. 34 In 1953 Louis Robert observed the usefulness of the volume, while
also regretting its limitations. 35 This was not the only time that Robert was to criticise
corpora for being too limited: but the costs of printing and publishing would only
increase. This is a situation where the best could so easily be the enemy of the
good: the demand for ever richer commentaries, encouraged by Robert, has made
publishing inscriptions ever more expensive. In particular, he commented that there
were only 38 photographs for some 1000 texts: Robert had come to set great store
by photography. These could be consulted at the British School at Rome: how much
more secure they would be, he pointed out, if they were printed in a volume which
could then exist in multiple copies. It was the need to disseminate the photographs
which led the British School, fifty years later, to investigate the possibility of issuing
an online edition of IRT with full illustration. Who can doubt that Robert would have
welcomed this?
33
Roueché & Thompson 2023.
Reynolds & Ward-Perkins 1952.
35 Bulletin Épigraphique 1953, no.25. “Il nous parait cependant trop peu dévéloppé; on a fait un effort
intense pour ne pas dépasser les limites de grosseur du volume et de prix de revient; il y a cependant
un point à ne pas franchir.”
34
In 2009 a collaboration between the British School at Rome, the Institute for the
Study of the Ancient World (New York) and King’s College London led to the
publication of IRT2009 online. 36 This reproduced the 1952 volume in an enhanced,
digitally encoded edition, with a few items that had been published immediately
afterwards. The 2009 edition retained the IRT numbering: this reflects a reality, that
people want simple ways of referring to a text in discussion. CIL VIII was published
in 1881, so scholars have been dependent upon regional studies such as IRT to
define the corpus in the subsequent 140 years. 37 Traditionally, one major
contribution that a corpus can make, especially when providing the first edition of a
text, is to give an inscribed text an identifying number, a name that can be used to
refer to it, and that almost inevitably will be so used. One major challenge for the
community (discussed above) is to revisit how this process works, since the fact that
the number represents an actual edition tends to get lost over time. In the truly
integrated multilingual environment to which the collective corpus inscriptionum of
the ancient Mediterranean should aspire, a system of unique universal identifiers will
be essential: at the same time it remains the case that abstract identifiers such as
those generated by the Trismegistos project are more legible to machines than
humans, and there will be a balance to be struck between interoperability (among
computers, mostly taking place out of sight) and practicality (for researchers
accustomed to deducing basic contextual information, such as language or location
of an inscription, from the abbreviated title of the corpus in which it is published).
The 2009 edition did not attempt to provide new or improved commentary, except in
one important regard: Joyce Reynolds did provide English translations for almost all
the texts. In the digital format there were no constraints on space to prevent this; and
translation is an essential form of commentary, particularly in a publication which is
suddenly accessible to anyone anywhere in the world, not just to a few well-prepared
scholars. IG, under Klaus Hallof’s direction, has made freely available online
translations (mostly in German) of all the inscriptions published in the corpus
volumes since 1945, and the Attic Inscriptions Online Project (AIO) is providing
enhanced English language translations and annotations of all inscriptions
concerning ancient Athens and Attica; meanwhile, the hugely ambitious EAGLE
MediaWiki project unites translations from multiple projects and individual
contributors. 38 The other aim of the 2009 publication was to provide better location
data. This had for many years been difficult in Libya, where geographic data were
controlled for security reasons; but the arrival of freely available satellite maps, such
as Google Earth, changed the situation. The editors therefore collected the
36
Reynolds et al. 2009, available at: https://inslib.kcl.ac.uk/irt2009/.
https://irt2021.inslib.kcl.ac.uk/en/concordance/bibliography/cil.html (accessed 2023-02-09) for the
handful of inscriptions already in CIL.
38 IG translations: .http://telota.bbaw.de/ig/ (= Berlin-Brandenburgischen Akademie der
Wissenschaften n.d.). AIO: https://www.atticinscriptions.com/ (= Lambert et al. n.d.). The EAGLE
MediaWiki online at https://wiki.eagle-network.eu/wiki/Main_Page (accessed 2023-02-09), with Liuzzo
et al. 2014 and Almas et al. 2017.
37
geographic data, with links to Google Earth. Over time the accessibility of Google
Earth changed, and maps became less usable; the geodata were then used to build
a Gazetteer of heritage locations in Libya. 39
The 2009 edition was widely welcomed and used, not least in Libya; colleagues
appreciated the instant accessibility, and the many search tools. It was, however,
clear that it could be improved, and linked to the Gazetteer. The core information had
all been encoded according to agreed standards: it was therefore simple, and
inexpensive to reuse it in a new edition. The aim of IRT2021 was to add all known
Greek or Latin inscriptions that had been published since 1952. 40 This was made
possible by the diligent work of collection enterprises such as AE, SEG, and the
Epigraphic Database Heidelberg (EDH). 41 But even more important, in the spirit of
the 1952 publication, was collaboration with expert colleagues, something made
much easier in a digital environment. A first draft was made available to selected
scholars online; the editors could then add comments, improvements, translations
and above all much fuller illustration. Furthermore, it was now possible to use new
standards for connecting information: the indices provide links for standard terms to
Wikidata, or other resources, whether fully or partially digital, such as the Lexicon of
Greek Personal Names (LGPN) or the Prosopographia Imperii Romani (PIR). 42
Consequently, the process of indexing not only connects data within the project, but
links it out to other data sets, potentially without limits.
Publishing IRT2021, funded by the British Institute for Libyan and North African
Studies (formerly the Society for Libyan Studies), was relatively inexpensive, and
took less than twelve months. An immediate consequence is that it will lend itself
easily to further editions, with new readings, new images, new information, multilingual translations, and more: an international team is already discussing how to
develop the online collections of inscriptions from Libya, forming, as an initial step,
the Libyan Epigraphy Research network. 43
Publication online, therefore, has been driven by very practical considerations. A
modern witness to an inscription is required to record with care, and to publish a
considerable body of information, particularly photographs. Even without the
provision of a rich commentary to the standards of Louis Robert, the resultant
publication presents demands that a printed book cannot meet. Modern
archaeological activity in particular means that such collections need to be regularly
updated, creating further expense. But the story of IRT demonstrates that the
publication of an epigraphic corpus in digital form is an absolutely natural and
39
https://slsgazetteer.org/ (= British Institute for Libyan and North African Studies 2015)
Reynolds et al. 2021, available at: https://irt2021.inslib.kcl.ac.uk/en/..
41 EDH online at https://edh.ub.uni-heidelberg.de/ (= Heidelberg Academy of Sciences 1993-2021).
42 LGPN is mostly online and in evolution towards being a truly Linked Data resource
(https://www.lgpn.ox.ac.uk/ accessed 2023-02-09); PIR is only very partially online
(https://pir.bbaw.de/#/overview (accessed 2023-02-09).
43 https://libyanepigraphy.org/ (accessed 2023-02-09).
40
perhaps inevitable evolution. Moreover, because digital publication offers new
possibilities and has the potential to be transformative, it is challenging us to rethink
our basic methods and approaches, our communal disciplinary culture, whether that
is the simple nature of an edition, or the function of the corpus. IRT, like many other
local and regional corpora, has also been strongly driven by the hope that this will
make epigraphy accessible and relevant to more people than ever before,
particularly in the communities where many of our inscriptions have been recorded
and which bear the burden of preserving and curating them.
Open scholarship
It will be useful at this point to recall the words of Boeckh and Klaffenbach quoted at
the outset: that corpora attempt to achieve what no one person could do alone
intellectually or materially; but also that the greater the diversification of the field, the
greater the need to enable the scholar to access core and reasonably
comprehensive data without lengthy searches and inquiries. Digital work in
epigraphy, it would be fair to suggest, has for some time pushed in two seemingly
opposing directions, which themselves are reflected in the story of IRT: the
possibilities and desire for richer editions; and the desire for comprehensive corpora
to enable research.
A project such as EDCS, PHI or EAGLE exists, we would suggest, for exactly the
same reasons that the original corpora were compiled: to make as many texts as
possible accessible - and searchable - by researchers; and digital capacities make
that possible in ways that were not conceivable in the nineteenth century. The digital
projects of recent decades, however, were (and are) very much projects of their time,
not only technically but intellectually, reflecting a world in which digital corpora have
been conceived of as purely derivative of published editions, ancillary tools, rather
than curated data and original editions in their own right. It is worth noting the
secondary definition of ‘corpus’ that emerged in the twentieth century: ‘a body of
spoken or written material on which a linguistic analysis is based’, which is to say, a
text corpus for analysis. 44 Researchers increasingly want to do more with such
collections of texts (and associated information) than simply use them as tools for
searching text strings - a desire that goes to the heart of both how and why these
texts and monuments and the information about them (collectively, our ‘data’) are
published to begin with, while going beyond the original conception of these
projects. In the sciences the need for transparency and reproducibility in data-based
research is manifest, and a variety of international protocols has been developed to
support it; there is, however, nothing peculiar to the domain of the natural or life
sciences that makes these principles discipline specific. Accurate and careful
44
The definition is taken from The New Shorter Oxford English Dictionary (1993), s.v. ‘corpus’, noted
as mid-twentieth-century, and presented as a specific sub-meaning of the general concept of a corpus
as “a body or collection of writing, knowledge, etc; the whole body of a particular category of literature,
etc.” (itself going back to early eighteenth-century (i.e. enlightenment) usage).
citation and referencing of data is as important to us as to the physicists: this was
why classical scholars designed and developed the apparatus criticus in the first
place. We can and should make these protocols our own. A text database can be
recognized as a product of its time, a pioneering effort of the late 1980s that
performs an invaluable service but has not kept up with technological developments.
At the same time, as the case of IRT illustrates (and it is far from being alone), digital
corpora are now being published that, rather than being derivations of published
corpora, in fact sit parallel to them, or even supercede them.
Aggregator projects, as we might call them, such as EDCS or more recently, and
complexly, The Europeana network of Ancient Greek and Latin Epigraphy
(EAGLE), 45 reflect one response to the essential problem of the future of the corpus,
itself reflecting the concerns of Boeckh and Klaffenbach already noted: the desire of
the community to have a readily accessible, comprehensive set of texts on the one
hand; and the need to navigate a proliferation of ever richer editions on the other.
This proliferation of editions was a feature of the analog print world long before the
digital turn, whence the increasingly confused modes of reference discussed above.
It has been exacerbated as a problem for a variety of reasons, some of which are
digital, but some of which are more general, not least the very question of the
viability of the grand corpora themselves, as well as the changing national and
institutional pressures on scholars to publish in certain ways. Several of the major
attempts of the 1990s to compose large-scale corpora reflect in their evolving
division of the Latin epigraphic world among EDH, Hispania Epigraphica Online
(HEpOnl), Epigraphic Database Bari (EDB), and EDR the familiar challenge of any
grand corpus project: large scale coverage was only possible through collaboration
and division of labour, resulting in a number of separate but collaborating corpora
projects. 46 Subsequently, with the development of standards such as EpiDoc XML,
the development of increasingly rich, but smaller, more localised or thematic corpora,
exemplified on our own parts by IRT, USEP and I.Sicily, has further complicated the
digital picture, with ever greater overlap. 47 Such projects in turn collaborate to a
greater or lesser degree with the larger scale projects, but all such collaboration is
essentially ad hoc; the EAGLE project revealed, as Panciera saw, the challenges of
45
See https://www.eagle-network.eu/ (= Orlandi et al. 2013-2023) and the concluding conference,
Orlandi et al. 2017.
46 See Bodel 2012, 285-7 for a summary of this move to federation. We crudely distinguish here
between ‘aggregator’ projects, which draw texts from existing corpora or publications, and primary
‘corpora’ which ‘edit’ the texts to a greater or lesser degree, generating new ‘editions’. HEpOnl is at
http://www.eda-bea.es/ (Gómez-Pantoja n.d.), EDB is at https://www.edb.uniba.it/ (Carletti et al. 19882023).
47 Reynolds et al. 2021, online at http://irt2021.inslib.kcl.ac.uk/; Bodel 2003-2023, online at
https://usepigraphy.brown.edu/projects/usep/collections/; Prag 2017-2023, online at
http://sicily.classics.ox.ac.uk/. Any selection of projects is invidious but, acknowledging our hosts at
the conference, we highlight the long-lived PETRAE project (https://petrae.huma-num.fr/fr/projet =
Devillers et al. 2012-2018) as a valuable and evolving model at this regional level.
agreeing standards sufficient to enable interoperability. 48 In the rest of this paper, we
focus on two key areas which are fundamental to a digital future, and which have
been highlighted by constructive sceptics: questions of sustainability and
preservation on the one hand, and of the increasing demand for common standards
to enable interoperability on the other.
Sustainability is not a uniquely digital problem - every project has a limited life and
limited funds - but digital outputs do present distinctive problems of sustainability.
When Boeckh or Mommsen published a fascicle of CIG or CIL, they did not share
our current perspective that such volumes might be thought increasingly inaccessible
because existing in only a few expensive copies. More than this, however, they did
so because they had confidence those copies would still exist, in multiple copies and
be curated in libraries in fifty or one hundred years, preserving knowledge that in
some cases only existed otherwise in a single manuscript. Digital epigraphists today
can reach anyone with internet access but cannot be confident their work will be
accessible at all in fifty or even twenty-five years: and this dilemma is doubtless one
reason why digital publications, whatever their quality, continue to be treated with
caution. Digital projects need both durable storage and ongoing infrastructure
support - in other words, the equivalent of a library. Even running a static site (i.e.
one which is no longer being updated, and which does not rely on software to
generate content) on a server requires upkeep and expense, since both hardware
and basic operating software still require regular maintenance and updates (just as a
library needs maintenance, security, and staffing). 49 This is a real problem, not yet
solved; but some key points should be noted.
Firstly, there is a difference between a static publication, i.e. a fixed edition (e.g. IRT)
on the model of paper publication, and a dynamic publication, i.e. one subject to
continuously published revision and updating (e.g. I.Sicily or USEP). This dynamic
publication (i.e. one which is intended to allow for continuous revision and additions
to individual text editions within a corpus, with no fixed end-point) clearly requires the
ability to document, archive and reliably cite each distinct version (i.e. edition,
however classified). Moreover, it almost certainly requires the ability to assign
intellectual responsibility to very granular elements of a publication, since a new
version is potentially constructed with a single correction to a text or its metadata,
perhaps by a different individual (the sorts of changes traditionally united,
retrospectively, at a single point of time in the future, in the apparatus criticus of a
second edition). 50 This model, of course, reflects a very real digital solution to the
48
The observations of Panciera 2012 remain extremely pertinent; Liuzzo 2015 is a sobering overview
of the multiple challenges.
49 IRT2009 is maintained in this basic form, at https://inslib.kcl.ac.uk/irt2009/ (“IRT2009 has been
archived, and thus search and map functionality is no longer available.”)
50 The principles of version control, which extend far beyond the world of digitised inscriptions, are
useful for designating minor changes that might not rise to the level of constituting a new ‘edition’: see
the rich and wide-ranging discussion of Broyles 2020. A partial solution, whereby substantial revisions
are archived to Zenodo.org with a distinct DOI, is currently in use for I.Sicily (see Prag 2021, 185-6
fundamental challenge presented to any corpus by the need to revise and update.
However, it also highlights a further area for future community discussion - what
constitutes a major or a minor revision? The Sicilian corpus of Greek and Latin
inscriptions in IG and CIL is now well over one hundred years out of date; a new
edition once every hundred years or more should not be something we have to live
with any longer; and withholding individual revisions until a complete corpus is ready
for publication is also no longer necessary. 51
Secondly, there is a difference between data and software (this is hardly a new point
in the digital world, but it deserves emphasising): the I.Sicily and USEP websites are
never going to be sustainable indefinitely; each will require periodic upgrades in
order to remain functional (indeed the current I.Sicily site, developed in 2015/16, is
already beyond life expectancy and will need to be be rebuilt shortly). It is the
website and supporting software that generate the interfaces that make visualisation,
searching, and analysis easier; but the actual data (notably the XML files containing
the editions, and the images) can be readily stored and made available much more
indefinitely and for very little expense. Existing examples include the model data
repository of the currently static Heidelberg database (EDH), or the data deposits
from multiple projects collected by the International Digital Epigraphy Association
(IDEA) on Zenodo, or the data deposit of the constituent files of IRT2021 in the
King’s College London digital repository. 52 Any digital project should have a clear
strategy for this sort of basic preservation of its data, but this is something that
requires planning at the start, and until relatively recently was not something that
was widely recognised. 53 It is true that this sort of publication is not so easily human
readable, but to assume that the only value of publication is a traditionally human
readable publication, even in digital form, is to miss a very large part of the value of
digital publication: data that is available digitally can not only be searched and cited
by humans, it can, potentially, be searched by machines, and reused to undertake
further research.
At this point our different threads begin to come together. Multi-authored corpora
came into being in order to make as many texts as possible accessible in reliable
editions (ever more important in the face of potential loss); but also in recognition of
the limits to what individuals could realistically achieve working alone. But corpora
continue to multiply and expand in both quantity and richness of data (as the
and for an example, compare http://sicily.classics.ox.ac.uk/inscription/ISic000537 with
https://doi.org/10.5281/zenodo.4384888 and https://doi.org/10.5281/zenodo.6421667); more granular
versions could also be recovered from https://github.com/ISicily/ISicily.
51
The PETRAE project illustrates another approach, which is the parallel publication in paper and
digital format (see https://petrae.huma-num.fr/fr/publications accessed 2023-02-09) with volumes
produced from the digital corpus.
52 https://edh.ub.uni-heidelberg.de/data (= Heidelberg Academy of Sciences 1993-2021);
https://zenodo.org/communities/eagle-idea/ (accessed 2023-02-09);
https://doi.org/10.18742/c.6002275.v1 = Reynolds et al. 2022.
53 See e.g. Rios 2018 discussing the general focus of research data management plans on data but
not (until relatively recently) software.
inscriptions themselves continue to multiply). Digital tools first emerged as a means
to analyse the already unmanageable mass of data in the printed corpora, 54
highlighting an essential purpose and value of corpora, the ability to study and
analyse a large body of texts in carrying out our research. On the one hand
researchers want to ask more questions of such data; and on the other, as the data
itself becomes richer, those questions and research possibilities only increase; and
digital recording and publication make possible ever richer data and analysis.
Furthermore, if the purpose of a corpus is to enable research, it is not for the
compiler of that corpus to second-guess future research questions, only to make the
material available. Lastly, we are increasingly required by funders to make our data
openly accessible, which is an admirable ambition (if one which funders currently do
not fully resource). To return to one of our earlier ‘aggregator’ examples, EDCS
enables an enhanced level of textual searching compared to paper indexes, but the
data is not available in any readily reusable form (only as HTML in response to a
query), 55 its legal status (whether one is allowed to re-use it in any way, and, if so,
who should be credited) is unclear, 56 and the quality of the data is unknowable (for
the reasons discussed above).
It is worth considering, therefore, what becomes possible when we make the data,
even just the basic texts, available for use in digital form by our fellow researchers.
Early, highly labour-intensive examples of this, such as Richard Saller’s and Brent
Shaw’s 1984 study of “Tombstones and Roman Family Relations in the Principate”, 57
remain rare, because the data remains extremely inaccessible. More recent
examples of such papers, whether focused on linguistics or epigraphic culture,
consistently describe the basic challenges (and considerable effort) of trying to build
a suitable dataset, notwithstanding existing digital resources, and the resulting
limitations on the results. 58 It is particularly striking that linguistic analysis of
epigraphic texts lags behind parallel work on literary or papyrological texts, a
regrettable situation undoubtedly due in part to the limited availability of usable and
reliable editions. 59 A couple of contemporary examples of digital projects will help to
54
The principal example is Jory & Moore 1974-1975, who manually entered the texts of CIL VI in
order to create a simple Key Word In Context index.
55 Consequently, export or re-use of the data is only possible through the use of web-scraping (the
extraction of data from a website, often using software to extract and standardise the data for further
use: see e.g. Black 2016 for a discussion of some of the associated issues).
56 The only licencing information is the generic internet disclaimer at
https://www.disclaimer.de/disclaimer.htm?farbe=FFFFFF/000000/000000/000000 (accessed 2023-0902) and the disclaimer statement at https://db.edcs.eu/epigr/hinweise/disclaimer-en.html (accessed
2023-09-02). The PHI site presents similar problems for re-use, even if the data sources are clear
(https://inscriptions.packhum.org/ = Packard Humanities Institute n.d.).
57 Saller and Shaw 1984.
58 Adamik 2021 (linguistics); Heřmánková et al. 2021 (epigraphic culture).
59 See CLaSSES (http://classes-latin-linguistics.fileli.unipi.it/en = Marotta 2023) for a smallscale
example, or more recently and openly, CEIPoM (https://reubenjpitts.github.io/Corpus-of-theEpigraphy-of-the-Italian-Peninsula-in-the-1st-Millennium-BCE/ = Pitts 2022); contrast PapyGreek
(https://papygreek.hum.helsinki.fi/ = Henriksson and Vierros 2022) on papyrological data.
illustrate the challenges – and the potential – of both richer and more accessible data
encoding.
The LatinNow project, directed by Alex Mullen, set out to employ sociolinguistics,
epigraphy, and archaeology to write the social history of the Latinization of the northwestern provinces of the Roman Empire. In the end, a huge part of the five-year
project was devoted simply to preparing a dataset for analysis. The work would have
been impossible without the massive works of data creation and aggregation already
undertaken by a number of previous and ongoing projects, and the initial work of
unification by the EAGLE project; but the limitations still present in that data and the
work required to make it actually usable revealed the challenges that researchers
still face. The remarkable GIS dataset that has resulted was released in beta form to
the community at the time of the Bordeaux congress (August 2022). 60 The Ithaca
project, led by Yannis Assael and Thea Sommerschield, has developed a machinelearning platform to assist epigraphers with the reconstruction and geographical and
chronological attribution of Greek epigraphic texts. 61 However, this project required
several months of simple cleaning of the texts available in PHI in order to have a
basic dataset necessary to make machine-learning possible; similar work is now
underway to establish a comparable Latin text corpus from existing resources. It is
sobering to realise, three conferences and fifteen years after the optimistic hope was
expressed that we might be on the cusp of realising the possibilities of digital
analysis based upon methods such as semantic mark-up, that we still sit on that
cusp. 62 There are two sides to this continuing problem: how digital datasets are
made available (or not), and, looking back to considerations of academic practice
discussed above, how they are both created and consumed. Projects such as these
provide the merest hint of what any corpus project could, indeed should, now be
enabling, far beyond the basic searching of metadata or texts, and far beyond the
original questions or intentions of those who first created the data.
Re-use of data for further research brings to the fore the very problem with
responsibility and citation that underlies some of the concerns discussed above (as
well as the necessity of ‘clean’ or standardised data, to which we return below). An
aggregator may bring together data in a way that makes it possible for others to reuse it; but if the aggregator does not record the source(s), not only is the validity of
this data no longer (easily) verifiable, but the basic principles of intellectual property
are violated. This is not, however, a problem with aggregators alone: plenty of
existing projects make data available without providing sufficient documentation of
intellectual responsibility or methods of data creation; and in turn, any re-user,
60
LatinNow at https://latinnow.eu/ (= Mullen 2017-2023); for the GIS data, see https://gis.latinnow.eu/
and https://latinnow.eu/online-resources-2/ (accessed 2023-02-09).
61 Assael et al. 2022, with the tool available online at https://ithaca.deepmind.com/ (accessed 202302-09). The project essentially applies machine-learning to the traditional tasks that epigraphers
regularly undertake on a daily basis, using paper corpora and now often also text-string searches in
EDCS and PHI, in searching out parallels for reconstructing and contextualizing new inscriptions.
62 Bodel 2012, 282.
aggregator or direct research exercise, has a responsibility to document the source
of their data, not always observed. As we noted earlier, this is not a specifically
digital issue (and it is one that the apparatus criticus in part was intended to address)
- but, it is made more urgent by the very nature and potential value of digital data.
Established mechanisms exist for attributing and recognising authorship (whether
original authorship, or other forms of data curatorship or data encoding), starting with
the existence of digital identifiers for scholars that can be embedded in the data,
such as ORCIDs, but it will take an effort of the community to embrace them and to
do so in standardised ways. 63
These issues do not apply only to data publication for reuse, however (and let us be
clear, when we talk about ‘data publication’ we are talking primarily about the
publication of digitally encoded editions of inscriptions). They are the same issues as
make collaboration a much more realistic possibility. In recent years, not least
through the COVID-19 pandemic, we have all learned how much more interaction is
possible digitally. Digital tools of varying levels of complexity, from Dropbox to
Github, make highly collaborative corpus-building a reality. But such methods make
it even more important, especially considering the need for early career researchers
to be able to demonstrate their actual outputs and activity, that individual
contributions in a collaborative environment be properly recorded and credited.
Whether one believes that a particular edition of an inscription should be cited by its
corpus number alone, or with reference to the one or more authors of that edition, it
is essential that the question of who prepared the text, who recorded the metadata,
who compiled the bibliography, who took the photograph, should be documented:
this information has often been obscured to some extent by the nature of print
editions; but it is easier to incorporate such information in a digital environment. This
is one of several areas where it becomes apparent that using digital methods in fact
requires that we hold ourselves to higher standards than those to which we have
become accustomed; it is also notable that multiple authorship (common in the
sciences) is becoming more commonplace. The purpose of documenting the
construction of an edition in appropriate digital form is to ensure not only that the
information can be retrieved but also that the work of individuals can be identified
and catalogued; 64 and, as already noted, that information is important when it comes
not just to citation, but to data re-use.
63
ORCID stands for Open Researcher and Contributor ID, a community-built, not-for-profit, global
organisation providing a unique, persistent identifier free of charge to researchers (with now in excess
of 10 million registered users): https://orcid.org/ (accessed 2023-02-09).
64 See https://github.com/ISicily/ISicily/graphs/contributors for a basic example, from which it is
possible to drill down to each change each individual has ever made. For a different sort of example,
see https://en.wikipedia.org/wiki/Praeneste_fibula, together with
https://en.wikipedia.org/w/index.php?title=Praeneste_fibula&action=history (accessed 2023-02-09). It
is interesting to note the absence of epigraphers from this document; and the rather telling comment
from a contributor at: https://en.wikipedia.org/wiki/Talk:Praeneste_fibula#Expert_still_needed.
These themes of collaboration, sustainability, proliferation and aggregation are not
new, but digital methods make them ever more complex and important, and
challenge some of our long-standing conventions of practice, which have evolved
over the decades and centuries. The EAGLE project (2013-2016) confronted some
of these challenges by attempting openly to aggregate the data from a large number
of existing digital projects, exemplifying the principles of collaborative corpus-building
and open data, in order to make real the ambition of federation that had earlier been
embraced by a number of projects, with the encouragement of AIEGL (see above,
nn. 45-46). The fundamental challenge that the EAGLE project faced (from our
perspective) is that of trying to unify data from diverse projects retrospectively. 65
Even if data is openly available, if it is not prepared to the same standards (whether
in inscription typologies or the treatment of past editions), it can only be combined
with immense difficulty, as illustrated by the efforts required by the projects noted
above, such as LatinNow and Ithaca. The EAGLE project valiantly attempted to align
the ‘metadata’ labels (i.e. descriptive classificatory information, such as type of
inscription, type of object, type of material) used across multiple projects, not only
those involved in the EAGLE federation. 66 It thus instantiated one of the fundamental
principles of Linked Open Data, the ability to align data through standardised
terminology (and so to achieve any sort of integration or useful aggregation). It is
worth recalling that the Greek and Latin epigraphic community has a strong tradition
of evolving community standards, as being essential to communication. The primary
example, already discussed, is the general adoption and development of the Leiden
Conventions, albeit with some diversification between the Greek and Latin
epigraphic communities. Those divergences, however, highlight a fundamental point,
which is that the categories that the conventions capture are widely recognised, but
there can be autonomy in their representation (and not all are needed by all, since
some only apply in specific domains). 67 In this specific case, encoding the universally
agreed categories in EpiDoc leaves entirely open the question of how the categories
are subsequently displayed, so there is no loss of independence.
A second recent example is offered by the AIEGL proposal for standardisation of the
abbreviations for Greek epigraphic publications, to which other projects can in turn
65
The projects that joined the federation were born at different stages in the development of digital
tools and methods from the 1980s to the 2000s; consequently, the variations in format and content
among these projects are the result of circumstance, not design. We have not discussed here the
further challenge of changing formats over time, positively exemplified by the PETRAE project’s
evolution between 1986 and the present day: outlined at https://petrae.humanum.fr/fr/projet/historique (accessed 2023-02-09).
66 https://www.eagle-network.eu/resources/vocabularies/ (accessed 2023-02-09), with an overview of
the issues in Liuzzo 2015, and profound reflections on the project in Liuzzo 2018.
67 Some of the principle divergences can easily be observed on the comparative table in the EpiDoc
quick reference guide (Bodard 2021-02-01), available at:
https://github.com/EpiDoc/Source/blob/6041fccd12a309b689ec4223d52f33df7a1a8278/guidelines/ms
word/quickref.pdf.
offer alignment, as recently demonstrated by SEG. 68 The next step in that particular
case is to offer stable digital identifiers for each bibliography item, so that projects
can maintain autonomy in their presentation of the item, while still ensuring the
alignment that is the ambition behind the standardised abbreviation. 69 Both the
AIEGL abbreviation list and the community development of the EpiDoc standard
exemplify how well this can work. But at present the community lacks the detailed
standards that would enable the easy comparison of data from one project to
another: the EAGLE vocabularies are explicitly only a first draft, and it has so far
been left to small voluntary groups such as the epigraphy.info community to make
some first efforts in this direction. 70 This need for community agreed standards
extends very widely, far beyond agreement on, e.g, how to classify a funerary
inscription, and includes, as discussed above, the need to clarify categories of
edition and principles of citation. None of these issues is new or unique to the digital
world; but the possibilities opened up by digital methods make precision in these
areas ever more important if we are to advance in our research.
EAGLE also came up against one of the other fundamental challenges noted above,
sustainability: by aggregating the data from individual projects into a new database,
EAGLE immediately faced the same sustainability challenge as any other project. In
this case the problems are twofold. On the one hand, there is the need to maintain a
database and its interface. 71 On the other, there is the need for regular future
intervention in order to keep such a dataset (‘corpus’) up-to-date, a problem noted
repeatedly throughout this paper. The ability to revise and update remains one of the
essential challenges of any corpus project, and one which digital capabilities in
principle make easier, not harder. A serious challenge for any model of collaboration
or aggregation, however, is how to capture and process the many revisions
undertaken across many projects. Many mechanisms are needed to achieve this,
most of which have been touched on in the preceding discussion, from version
control, to authorial documentation, to agreed data standards, as well as open data
and interoperability: from a technical perspective all of these things are possible.
Some of the most fundamental elements required are already firmly established, with
the EpiDoc standard for text-encoding, or the existence of unique identifiers for every
inscription in the abstract (rather than denoting any particular edition), via the
Trismegistos project, which creates the possibility for machines to connect every
68
AIEGL Greek abbreviations list, Chaniotis et al. 2022, online at https://aiegl.org/grepiabbr.html;
SEG concordance, Tsolakis 2021, online at https://scholarlyeditions.brill.com/sego/abbreviations/.
69 A pilot is under development at https://github.com/FAIR-epigraphy (Asif et al. 2022-2023), with the
support of AIEGL.
70 Specifically the working groups of epigraphy.info: https://epigraphy.info/working_groups/ (accessed
2023-02-09).
71 The EAGLE database can be searched at https://www.eagle-network.eu/resources/searchinscriptions/ (accessed 2023-02-09). In the case of EAGLE, an initial solution to fund future work was
the subscription model of the IDEA association (https://www.eagle-network.eu/about/who-we-are/);
Trismegistos has adopted a similar approach (https://www.trismegistos.org/registration.php, accessed
2023-02-09), although in this case access to tools and resources is in turn limited for those who do
not subscribe.
edition of a particular text unambiguously. Arguably, our greatest challenges are
intellectual, academic, and cultural: firstly, the necessary cross-community
collaboration on standards, which enables, for example, two epigraphists to agree,
however loosely, on what might constitute a funerary inscription, or a meaningful
change (i.e. new version) to an epigraphic text; and secondly the recognition that
‘data’ should be openly accessible, ideally in line with the now widely advocated
FAIR principles: data should be Findable, Accessible, Interoperable, and Reusable. 72
We conclude with what we hope is an optimistic, rather than a pessimistic vision for
the future of the corpus. Given the ever increasing challenges of scale, proliferation,
funding and even sustainable travel, together with the fundamental opportunities for
collaboration, more detailed analysis and more rapid revision that digital methods
present, we would argue for an inversion of the existing corpus model, one that
moves away from the single monumental corpus, and instead builds directly upon
the proliferation of local and thematic corpora. 73 Fundamental to such an approach,
however, are two basic sets of principles and approaches, alluded to and implicit in
much of what has been said so far: the FAIR principles, for publication of data in
ways that are Findable, Accessible, Interoperable and Reuseable; and the
overlapping but distinct principles of Linked Open Data. 74 Essential for the success
of any of this, however, is the ability of the community to agree on standards,
whether for data categories, bibliographic and citation conventions, levels of
versioning (editorial revision), or authorial responsibility. This challenge is hardly
unique to the field of epigraphy: the frameworks for much of this are common to
many disciplines, and do not need to be redeveloped for our own; 75 but some are
discipline-specific to epigraphy, and need to be at least broadly agreed among us.
To repeat the earlier point about standards and representation, it deserves
emphasising that this does not require that any particular project actually calls a
funerary inscription a titulus sepulcralis, but it does imply that all projects need to
state somewhere in their data what categories they are using, and how those might
align with an agreed list of categories - remembering also that in a digital
72
https://www.go-fair.org/fair-principles/ (accessed 2023-02-09, and see Wilkinson et al. 2016). For a
recent experiment with archiving a TEI-EpiDoc project employing FAIR principles, see Creamer et al.
2022.
73 It is perhaps not inappropriate to quote Silvio Panciera: “One might claim that irresolvable technical
divergences no longer exist these days. More important, in my view, is the need to join together in
identifying a collective goal, which takes account of the necessity for different threads of research and
of the best route to follow in order to reach it: this with overt protocols, which will subsequently allow
the goal once reached to become a new point of departure. But on the one hand that calls for a spirit
of unity and collaboration which so far frankly has been absent, and on the other for attaining a new
point of balance between ambitions and possibilities, between excessive and over-modest
expectations, between short-term projects and future opportunities.” (Panciera 2012, 273.)
74 https://www.w3.org/DesignIssues/LinkedData.html (= Berners-Lee 2009) .
75 See in particular the examples of the Getty Art & Architecture Thesaurus
(https://www.getty.edu/research/tools/vocabularies/aat/), the Ariadne portal for archaeology
(https://portal.ariadne-infrastructure.eu/), or the Pactols thesaurus for archaeology
(https://pactols.frantiq.fr/opentheso/) (all accessed 2023-02-09).
environment texts can be presented in multiple categories simultaneously, which is
both a blessing and a challenge. That might just seem like good practice, and is no
different from stating at the outset that one is, for example, using the Leiden
conventions, or the AIEGL Greek epigraphic abbreviations. And indeed, what we
propose is essentially that, a community of best practice, equipped with the
community documentation and tools to make it (relatively) easy to do.
Adopting such an approach in fact already makes it possible, using existing and very
standard technology, to aggregate live data from multiple projects in the simplest
possible database. The FAIR Epigraphy project, directed by Marietta Horster and
Jonathan Prag, has already created a simple pilot to demonstrate this, which pulls
together a sample set of material from the available XML files of the
Roman Inscriptions of Britain (RIB), the Inscriptions of Greek Cyrenaica (IGCyr) and
I.Sicily, transforms them (a process described as ‘serialisation’) into a universal
format called Resource Description Framework (RDF) (a standard format for the
description and exchange of data on the web), and then makes this publicly available
through a simple web interface at https://inscriptiones.org/. 76 The framework (known
as an ‘ontology’) for doing this was already devised by a collaborative group of
epigraphers several years ago, building in part upon the work of the EAGLE project
and almost entirely using existing resources. 77 This took a software engineer (Imran
Asif) with no epigraphic background a very few hours to create, and could rapidly
and easily be expanded to cover any project that can align its data to existing
standards, such as the EAGLE or Getty vocabularies. 78 As the use of the
Trismegistos number illustrates, such a method also directly opens the door to
linking multiple corpora simultaneously for the same inscription and so would enable
aggregating complementary data from multiple sources for individual inscriptions. 79
The same principle, inserting e.g. Pleiades geographic identifiers, or LGPN person
identifiers, as already demonstrated by IRT (see above), opens the door to
connecting inscriptions to larger datasets about the ancient world, and searches
such as “show me all data from x place or about y person”. 80 This is the potential of
76
The FAIR Epigraphy project, funded by the AHRC and the DFG, is intended to support the
development and use of standards and best practices to realise the possibilities of ‘FAIR’ epigraphy.
The RDF pilot and more information can be found at https://github.com/FAIR-epigraphy (Asif et al.
2022-2023). On RDF see https://www.w3.org/RDF/ and, e.g.
https://www.w3schools.com/xml/xml_rdf.asp (accessed 2023-02-09).
77 See Bodard et al. 2021, also online at: https://epigraphy.info/ontologies_wg/ (the current draft in fact
created only 3 new terms, otherwise using entirely existing resources: see
https://w3id.org/epont/ontology (accessed 2023-02-09).
78 We would note that this approach has already been adopted, with enormous success, by the
numismatic community: see https://nomisma.org/ (accessed 2023-02-09).
79 Such linkage is already enabled by the Trismegistos TexRelations Matcher API (TexRelations
Matcher API Documentation, accessed 2023-02-09), for which a simple user interface developed by
the FAIR Epigraphy project is now available in beta form at https://id-resolver.inscriptiones.org/ (= Asif
& Prag 2023).
80 See https://peripleo.pelagios.org/about, employing the Pleiades geographic references
https://pleiades.stoa.org (= Bagnall et al. 2023), for an example of compilation of data from multiple
resources with reference to ancient places.
Linked Open Data. An RDF data store of this sort can on the one hand act as a longterm and static (stable) data repository of core data from multiple projects; but on the
other it can also operate dynamically, incorporating new and revised data from those
projects, either through automatic or manual updates, through a very standardised
process. Such an approach of course brings its own overheads, but they are much
lower than those of a single comprehensive corpus or a traditional database, more
responsive to change, and in the true spirit of linked data not only make core data
readily accessible and searchable, but provide access for researchers to the
individual projects that have studied specific inscriptions in much greater depth than
the aggregator can hope to capture.
Any such approach, whether that of the EAGLE project, or that of the FAIR
Epigraphy project, is, however, dependent upon community collaboration and
respect, and therein lies the challenge - not a technical challenge, but a cultural one.
Our concerns parallel those of the first epigraphic congress of 1938:
Le Congrès, rendant hommage aux fondateurs et aux directeurs actuels des
Inscriptiones Graecae et du Corpus Inscriptionum Latinarum; déplorant qu'il n'y a à
présent uniformité ni dans le format ni dans les caractères ni dans la présentation
générale des grands recueils d'inscriptions grecques et latines; émet le voeu de voir
déterminer pour l'avenir par un statut international les conditions dans lesquelles
seront publiés les futurs volumes. 81
This request illustrates how collaboration and interoperability have been core
concerns of AIEGL and a focus for congresses, from the beginning; the Association
facilitates communication between larger and smaller enterprises, allowing the wider
community to benefit from the work of both the subcommunities (e.g. epigraphy.info)
and projects (e.g. FAIR Epigraphy) currently dedicated to this challenge.The future of
the corpus, and our discipline is surely bright.
81
This printed statement was circulated (on what basis we do not know) at the 1938 congress in
Amsterdam; a copy (our source) is preserved in the papers of W.H. Buckler who attended the
conference.
Bibliography
Adamik, B. (2021) : “Romanization and Latinization of the Roman Empire in the light
of data in the Computerized Historical Linguistic Database of Latin Inscriptions of the
Imperial Age”, Journal of Latin Linguistics, 20, 1, 1‑19.
Almas, B., R. Baumann, M.-C. Beaulieu, H. Cayless, J. Cowey, P.M. Liuzzo, F.
McCourt and J. Sosin (2017) : “TIGLIO. Translations and Images of Greek and Latin
Inscriptions Online”, in : Orlandi et al., ed. 2017, 221-232.
American Psychological Association (2021) : Webpage on a website references
(https://apastyle.apa.org/style-grammar-guidelines/references/examples/webpagewebsite-references, accessed 09.02.2023).
Asif, I., P. Hermankova, M. Horster and J.R.W. Prag (2022) : The FAIR Epigraphy
Project (https://github.com/FAIR-epigraphy, accessed 09.02.2023).
Asif, I. and J.R.W. Prag (2023) : Inscriptiones Identifier Resolver (IDR) (https://idresolver.inscriptiones.org/, accessed 09.02.2023).
Assael, Y., T. Sommerschield, B. Shillingford, M. Bordbar, J. Pavlopoulos, M.
Chatzipanagiotou, I. Androutsopoulos, J. Prag and N. de Freitas (2022) : “Restoring
and attributing ancient texts using deep neural networks”, Nature, 603, 7900,
280‑283.
Bagnall, R., R. Talbert and T. Elliott, ed. (2023) : Pleiades (https://pleiades.stoa.org/,
accessed 09.02.2023).
Berlin-Brandenburgischen Akademie der Wissenschaften (n.d.) : Inscriptiones
Graecae (http://telota.bbaw.de/ig/, accessed 09.02.2023).
Berners-Lee, T. (2009) : Linked Data - Design Issues
(https://www.w3.org/DesignIssues/LinkedData.html, accessed 09.02.2023).
Black, M.L. (2016) : “The World Wide Web as Complex Data Set: Expanding the
Digital Humanities into the Twentieth Century and Beyond through Internet
Research”, IJHAC, 10, 1, 95‑109.
Bodard, G. (2010) : “EpiDoc: Epigraphic documents in XML for publication and
interchange”, in : Feraudi-Gruénais, éd. 2010, 101‑118.
Bodard, G. (2021) : Leiden and EpiDoc Quick Reference (shorter)
(https://github.com/EpiDoc/Source/blob/6041fccd12a309b689ec4223d52f33df7a1a8
278/guidelines/msword/quickref.pdf, accessed 09.02.2023).
Bodard, G., H. Cayless, C. Cenati, A. Cooley, T. Elliott, S. Evangelisti, A. Felicetti, P.
Granados, F. Grieshaber, E. Gruber, A. Hershkowitz, T. Hill, H. Kiiskinen, T. Kollatz,
A. Levivier, P. Liuzzo, F. Luciani, A. Mannocci, E. Mataix, F. Murano, O. Murphy, E.
Mylonas, J. Prag, V. Razanajao, S. Stoyanova, G. Tsolakis, C. Tupman, I.
Vagionakis, V. Vitale and F. Weise (2021) : Modeling Epigraphy with an Ontology
(https://zenodo.org/record/4639508, accessed 09.02.2023).
Bodard, G., S. Stoyanova, T. Elliott, J. Cowey and S. Vanderbilt (2022) : Dates and
locations of modern observations (https://epidoc.stoa.org/gl/latest/supphistlocations.html, accessed 09.02.2023).
Bodel, J., ed. (2003-2023) : U.S. Epigraphy Project (USEP)
(https://usepigraphy.brown.edu/projects/usep/collections/, accessed 09.02.2023).
— (2012) : “Latin Epigraphy and the IT Revolution”, in : Davies & Wilkes, ed. 2012,
275‑296.
Bolikowski, Ł., V. Casarosa, P. Goodale, N. Houssos, P. Manghi and J. Schirrwagen,
ed. (2014) : Theory and Practice of Digital Libraries -- TPDL 2013 Selected
Workshops: LCPD 2013, SUEDL 2013, DataCur 2013, Held in Valletta, Malta,
September 22-26, 2013. Revised Selected Papers, Communications in Computer
and Information Science, Cham.
British Institute for Libyan and North African Studies (2015) : Heritage Gazetteer of
Libya (https://slsgazetteer.org/, accessed 09.02.2023).
Broyles, P.A. (2020) : “Digital Editions and Version Numbering”, Digital Humanities
Quarterly, 14, 2 (http://www.digitalhumanities.org/dhq/vol/14/2/000455/000455.html).
Bruun, C. and J. Edmondson, ed. (2014) : The Oxford Handbook of Roman
Epigraphy, Oxford Handbooks, Oxford.
Buonocore, M. (2014) : “Epigraphic Research from its Inception: The Contribution of
Manuscripts”, in : Bruun & Edmondson, ed. 2014, 21‑41.
Butini, M. (2022) : EDR167214 (http://www.edredr.it/edr_programmi/res_complex_comune.php?do=book&id_nr=EDR167214,
accessed 09.02.2023).
Carletti, C., A.E. Felle, V. Ambriola, F. Pallocca and A. Vilella (1988) : Epigraphic
Database Bari. Inscriptions by Christians in Rome (3rd-8th cent. CE)
(https://www.edb.uniba.it/, accessed 09.02.2023).
Cayless, H., C. Roueché, T. Elliott and G. Bodard (2009) : “Epigraphy in 2017”,
Digital Humanities Quarterly, 3, 1
(http://www.digitalhumanities.org/dhq/vol/3/1/000030/000030.html).
Chaniotis, A., T. Corsten, D. Feissel, P.-L. Gatier, K. Hallof, M. Hatzopoulos, S.
Orlandi, R. Parker, D. Rousset and C. Schuler (2022) : Liste des abréviations des
éditions et ouvrages de référence pour l’épigraphie grecque alphabétique
(https://aiegl.org/grepiabbr.html, accessed 09.02.2023).
Clauss, M., A. Klob, W.A. Slaby and B. Woitas (n.d.) : Epigraphische Datenbank
Clauss - Slaby (EDCS) (http://www.manfredclauss.de/, accessed 09.02.2023).
Clauss, M., A. Kolb, W.A. Slaby and B. Woitas (n.d.) : EDCS-19200211
(https://db.edcs.eu/epigr/epi_url.php?s_sprache=en&p_edcs_id=EDCS-19200211,
accessed 09.02.2023).
Cooley, A.E. (2012) : The Cambridge Manual of Latin Epigraphy, Cambridge.
Corrado, E.M. (2019) : “Software Preservation: An Introduction to Issues and
Challenges”, Technical Services Quarterly, 36, 2, 177‑189.
Creamer, A., G. Lembi, E. Mylonas and M. Satlow (2022) : “Archiving a TEI Project
FAIRly”, Journal of the Text Encoding Initiative, Issue 14 (2021). Online since
08.12.2022 at: https://doi.org/10.4000/jtei.4324.
Davies, J.K. and J.J. Wilkes, ed. (2012) : Epigraphy and the historical sciences,
Proceedings of the British Academy 177, Oxford.
De Santis, A. and I. Rossi, ed. (2018) : Crossing experiences in digital epigraphy:
From practice to discipline, Warsaw, Berlin.
Depauw, M. (2018) : “Trismegistos: Optimizing Interoperability for Texts from the
Ancient World”, in : De Santis & Rossi, ed. 2018, 193-201.
Depauw, M., ed. (n.d.) : Trismegistos (www.trismegistos.org, accessed 09.02.2023).
Depauw, M. and T. Gheldof (2014) : “Trismegistos. An interdisciplinary Platform for
Ancient World Texts and Related Information”, in : Bolikowski et al., ed. 2014, 40‑52.
Devillers, O., M. Navarro Caballero, N. Prévôt and C. Bassion (2012-2018) :
PETRAE. Programme d’Enregistrement, Traitement et Reconnaissance
Automatique en Épigraphie (https://petrae.huma-num.fr/fr/projet, accessed
09.02.2023).
Elliott, T. (2014) : “Epigraphy and Digital Resources”, in : Bruun & Edmondson, ed.
2014, 78‑86.
Elliott, T., G. Bodard and H. Cayless (2006) : EpiDoc: Epigraphic Documents in TEI
XML (https://epidoc.stoa.org/, accessed 09.02.2023).
Etienne, R., ed. (1984) : Epigraphie hispanique: problèmes de méthode et d’édition.,
Publications du Centre Pierre Paris 10, Talence, France : Paris.
Fassbender, A. (2003) : Index numerorum: ein Findbuch zum Corpus inscriptionum
Latinarum, Corpus inscriptionum Latinarum. Auctarium ; series nova, vol. primum,
Berlin.
Felicetti, A. and F. Murano (2017) : “Scripta manent: a CIDOC CRM semiotic reading
of ancient texts”, International Journal on Digital Libraries, 18, 4, 263‑270.
Feraudi-Gruénais, F., ed. (2010) : Latin on stone: epigraphic research and electronic
archives, Roman studies, Lanham.
Gómez-Pantoja, J. ed. (n.d.) : Hispania Epigraphica Online (HEpOnl) (http://edabea.es/, accessed 30.08.22).
Hallof, K. (2009) : Inscriptiones Graecae, editio altera, Berlin.
— (2012) : Inscriptiones graecae: imagines epigraphicorum = Epigraphikerbildnisse,
Berlin.
Heidelberg Academy of Sciences (1993-2021) : Epigraphic Database Heidelberg
(EDH) (https://edh.ub.uni-heidelberg.de/, accessed 09.02.2023).
Henriksson, E. and M. Vierros (2022) : PapyGreek
(https://papygreek.hum.helsinki.fi/, accessed 09.02.2023).
Heřmánková, P., V. Kaše and A. Sobotková (2021) : “Inscriptions as data: digital
epigraphy in macro-historical perspective”, Journal of Digital Humanities, 1
(http://doi.org/10.1515/JDH-2021-1004?locatt=label:IDHFULL).
Jory, E.J. and D.W. Moore (1974) : Corpus Inscriptionum Latinarum. Vol. VII, pars
vii. Indices vocabulorum, fascs. 1-6, Berlin.
Lambert, S., P. Low, P. Liddel and C. De Lisle, ed. (n.d.) : Attic Inscriptions Online
(AIO) (https://www.atticinscriptions.com/, accessed 09.02.2023).
Liuzzo, P.M. (2015) : “EAGLE and EUROPEANA: Architecture Problems for
Aggregation and Harmonization”, Proceedings of the Symposium on Cultural
Heritage Markup. Balisage Series on Markup Technologies, 16
(http://doi.org/10.4242/BalisageVol16.Liuzzo01).
Liuzzo, P.M. (2018) : “EAGLE Continued: IDEA. The International Digital Epigraphy
Association”, in : De Santis & Rossi, ed. 2018, 216-230.
Liuzzo, P.M., A. Zanni, L. Martinelli, L. Losa et P. De Nicolao (2014) : “The EAGLE
Mediawiki. A fully collaborative database for academics, data engineers and the
general public”, in : Orlandi et al., ed. 2014, 187-200.
Lynch, C. (2014) : “The Next Generation of Challenges in the Curation of Scholarly
Data”, in : J.M. Ray, ed. 2014, 395‑408.
Marotta, G. (2023) : CLaSSES. Corpus for Latin Sociolinguistic Studies on
Epigraphic textS (http://classes-latin-linguistics.fileli.unipi.it/en, accessed
09.02.2023).
Merkelbach, R. (1998) : “Nochmals Inscriptiones Graecae”, Zeitschrift für
Papyrologie und Epigraphik, 122, 293‑299.
Mullen, A., ed. (2017-2023) : LatinNow. Latinization of the north-western Roman
provinces: sociolinguistics, epigraphy and archaeology (https://latinnow.eu/,
accessed 09.02.2023).
Orlandi, S., ed. (2013-2023) : The Europeana network of Ancient Greek and Latin
Epigraphy (EAGLE) (https://www.eagle-network.eu/, accessed 09.02.2023).
Orlandi, S., V. Casarosa, R. Santucci and P.M. Liuzzo, ed. (2014) : Information
Technologies for Epigraphy and Cultural Heritage, Rome. (http://www.eaglenetwork.eu/wp-content/uploads/2015/01/Paris-Conference-Proceedings.pdf).
Orlandi, S., R. Santucci, F. Mambrini and P.M. Liuzzo, ed. (2017) : Digital and
Traditional Epigraphy in Context. Proceedings of the EAGLE 2016 International
Conference, Rome. (https://doi.org/10.13133/978-88-9377-021-7).
Packard, D.W. (1968) : A concordance to Livy, Cambridge, Mass.
Packard Humanities Institute (n.d.) : Searchable Greek Inscriptions
(https://inscriptions.packhum.org/, accessed 09.02.2023).
Panciera, S. (1984) : “I Segni diacritici: riflessioni e proposte”, in : Etienne, ed. 1984,
372‑379.
— (2006) : Epigrafi, epigrafia, epigrafisti: scritti vari editi e inediti (1956-2005) con
note complementari e indici, Vetera 16, Roma.
— (2012) : “Epigraphy and Informatics”, in : Davies & Wilkes, ed. 2012, 271‑273.
Panciera, S., G. Camodeca, G. Cecconi, S. Orlandi, L. Fabriani and S. Evangelisti,
ed. (1999-2023) : Epigraphic Database Roma (EDR) (http://www.edr-edr.it/,
accessed 09-02-2023).
Pitts, R. (2022) : Corpus of the Epigraphy of the Italian Peninsula in the 1st
Millennium BCE (CEIPoM) (https://reubenjpitts.github.io/Corpus-of-the-Epigraphy-ofthe-Italian-Peninsula-in-the-1st-Millennium-BCE/, accessed 09-02-2023).
Prag, J.R.W., ed. (2017-2023) : Inscriptions of Sicily (I.Sicily)
(http://sicily.classics.ox.ac.uk/, accessed 09.02.2023).
Prag, J.R.W. (2021) : “I.Sicily and Crossreads: a digital epigraphic corpus for ancient
Sicily”, in : Prescott et al., ed. 2021, 181‑192.
Prescott, C., A. Karivieri, P.B. Campbell, K. Göransson et S. Tusa, ed. (2021) :
Trinacria, “an island outside time”: international archaeology in Sicily, Oxford.
Ray, J.M. ed. (2014) : Research data management: practical strategies for
information professionals, Charleston insights in library, archival, and information
sciences, West Lafayette, Indiana.
Reynolds, J.M. and J.B. Ward-Perkins (1952) : The Inscriptions of Roman
Tripolitania, Roma ; London.
Reynolds, J.M., J.B. Ward-Perkins, G. Bodard and C. Roueché (2009) : Inscriptions
of Roman Tripolitania (IRT2009) (https://inslib.kcl.ac.uk/irt2009/, accessed
09.02.2023).
Reynolds, J.M., C. Roueché, G. Bodard and C. Barron (2021) : Inscriptions of
Roman Tripolitania (IRT2021) (https://irt2021.inslib.kcl.ac.uk/en/, accessed
09.02.2023).
Reynolds, J.M., C. Roueche, G. Bodard, I. Vagionakis, C. Barron, F. Bigi, C. DobiasLalou, P. Kenrick, R.M. Kerr, M. Mackensen, and I. Tantillo (2022) : Inscriptions of
Roman Tripolitania 2021. (https://doi.org/10.18742/C.6002275.V1).
Rios, F. (2018) : “Incorporating Software Curation into Research Data Management
Services: Lessons Learned”, International Journal of Digital Curation, 13, 1, 235‑247.
Robert, L. (1959) : “Griechische Versinschriften hrsg. von Peek. 1. Grabepigramme”,
Gnomon, 31, 1‑30.
Robert, L. (1965) : “Compte rendu du volume VIII des Monumenta Asiae Minoris
Antiqua”, Hellenica: recueil d’épigraphie de numismatique et d’antiquités grecques,
13, 9‑23.
Robert, L. and J. Robert (1954) : La Carie: histoire et géographie historique avec le
recueil des inscriptions antiques, Paris.
Rossi, I. (2020) : “Qualitative and quantitative approaches in digital epigraphy”,
Archeologia e Calcolatori, 31, 2, 145-156.
Roueché, C. and D. Thompson (2023) : “Joyce Reynolds (1918-2022)”, CUCD
Bulletin, 52.
Saller, R.P. and B.D. Shaw (1984) : “Tombstones and Roman Family Relations in
the Principate: Civilians, Soldiers and Slaves”, Journal of Roman Studies, 74,
124‑156.
Schmidt, M.G. (2007) : Corpus inscriptionum Latinarum, 2. durchgesehene und
aktualisierte Auflage. Berlin.
Sverkos, E. and Tsolakis, G. (2021) : Supplementum Epigraphicum Graecum:
consolidated concordances for volumes XLVI-LX (1996-2010), Leiden.
The Trustees of the British Museum (2023) : Tomb relief; block
(https://www.britishmuseum.org/collection/object/G_1867-0508-55, accessed 09-022023).
Tsolakis, G. (2021) : “Epigraphic abbreviations”, in : Sverkos & Tsolakis, ed. 2021,
XV‑CXXVI.
Velázquez, I. and D. Espinosa, ed. (2021) : Epigraphy in the digital age:
opportunities and challenges in the recording, analysis and dissemination of
inscriptions, Archaeopress archaeology, Oxford.
Wilkinson, M.D., M. Dumontier, I.J. Aalbersberg, G. Appleton, B. Mons, et al. (2016) :
“The FAIR Guiding Principles for scientific data management and stewardship”,
Scientific Data, 3, 1, 1‑9.