Neven Jovanović (firstname.lastname@example.org) Florilegia: Big Textual Data Workshop, July 10-11, 2017
This page: croala.ffzg.unizg.hr/croala-florilegia/
How? (At random, precisely, topically, comparatively)
Languages of culture used by Croatians in the past:
Latin, Italian, German
Bibliographic research for Latin: 1263 authors, 4871 works in the period 976-1984
Currently in CroALa: 5.7 million words, 467 documents, bibliographic data.
First edition 2009
Our experience of the text is constrained
by our knowledge and
by print (book),
by software (PhiloLogic),
by a markup model system (TEI XML).
(Marko Marulić, Repertorium, Problemata Aristotelis)
A list of persons in Croatian Latin school drama
Counting Latin authors in Croatia and in Tyrol:
Croatica et Tyrolensia
Looking for Lucretius in CroALa (forthcoming)
In Corpus corporum, because of XML
In Neulateinische Wortliste as a source
In the struggles with material agency [our] plans and goals too are at stake and liable to revision. And thus the intentional character of human agency has a further aspect of temporal emergence, being reconfigured itself in the real time of practice, as well as a further aspect of intertwining with material agency, being reciprocally redefined with the contours of material agency in tuning.
Andrew Pickering, The Mangle of Practice: Time, Agency, and Science (1995)
What had started as an ad hoc collection of TEI-encoded texts with time grew into a forest; the forest needed a map, and there was a thing that could be developed into a map; we needed specific parts of the forest, and there were things that could be used to extract parts of it; we realized that similar plants are growing in other forests, and there were things that could be used to compare plants.
What had started as a collection of texts turned into something that has to be "manipulated" — accessed, retrieved — at the atomic and molecular level
(read: words, phrases, sentences, passages).
(Read: meanings, grammar, things.)