Text as Graph (TAG)
What is it?
The Text As Graph (TAG) data model is a directed property hypergraph for modeling text, markup (roughly comparable to XML elements), and annotations (roughly comparable to what XML attributes would be like if they could contain markup, including attributes on attributes).
News
New release: Alexandria 2.4 (October 2020)
New features:
- The
export-xml
now has an option to choose a ‘leading’ layer. When a TAGML document has multiple layers,export-xml
uses Trojan Horse Milestones on the layers. Previously it used these on all layers. In this version you can indicate a layer to define the XML hierarchy: markup in this leading layer will be output as regular XML open/close tags, for all other layers the Trojan Horse Milestones will be used. - When a view is active, changes to its definition file cannot be committed.
init
now also creates a sparql directory.
New/Changed commands for the command-line app
- export-xml (new option:
--leadinginglayer
/-l
)
New release: LSP-tagml 1.0.1 (May 2020)
LSP-tagml is a plugin for Sublime Text 3 that installs the tagml-language-server.
New release: tagml-language-server 0.1 (May 2020)
The tagml-language-server is an implementation of the Language Server Protocol for TAGML. When linked with a compatible editor, it can be used to provide syntax help in the editor.
New release: Alexandria 2.3 (September 2019)
Bugfixes:
- It is now possible, when a view other than the default view is active, to commit new and changed view definitions, and new tagml source files. Committing changes to tagml source files of existing documents is still only possible when the default view is active.
- Running
alexandria init
in your home directory is not allowed and will fail with an error message. alexandria status
will now only search one level deep in watched directories for files/directories that may be added.- When committing a view definition which is valid json, but does not contain at least one of the required fields
includeLayers
,excludeLayers
,includeMarkup
,excludeMarkup
, alexandria would silently accept this, producing an invalid view. This has been fixed: committing this view definition will not fail with an error message.
New release: Alexandria 2.2 (July 2019)
New/Changed commands for the command-line app
- query
Bugfixes:
- After a revert, the reverted file is now no longer shown as modified.
- It is now possible to run alexandria commands from any directory, provided one of its parent directories has been initialized.
New release: Alexandria 2.1 (December 2018)
New features:
New/Changed commands for the command-line app
- about
- add
- commit
- export-dot
- export-svg
- export-png
- export-xml
Bugfixes:
- The first markup is now always the root markup for the default layer, even if new layers are defined on that markup.
- This means that this first markup tag must correspond with the last closing markup tag, and suspending/resuming of this markup is not allowed.
Upcoming presentation
Elli Bleeker, Bram Buitendijk, and Ronald Haentjens Dekker. “Between Flexibility and Universality: Combining TAGML with XML to Improve the Modeling of Cultural Heritage Texts”. Short paper at the online workshop Computational Humanities Research, November 18-20, 2020.
Implementations
Alexandria
Alexandria is a prototype implementation of TAG.
TAGML
TAGML (Text-As-Graph Markup Language) is a syntax for the TAG model.
HyperCollate
HyperCollate is a prototype collation engine that is able to handle intradocumentary variation (i.e. variation within one document), in addition to finding the differences between witnesses. This advanced form of collation is possible because HyperCollate looks not only at the text of a document, but also at its markup.
Presentations and Publications
-
Haentjens Dekker, Ronald, Bram Buitendijk, and Elli Bleeker. 2020. “Parsing a markup language that supports overlap and discontinuity”. Short paper presented online at DocEng: the 20th ACM Symposium on Document Engineering, September 29-October 1, 2020.
- Bleeker, Elli. “What Has Been Left Out: the Perks and Pitfalls of XML-aware Collation”. Keynote lecture at the online workshop Introduction to automatic collation at the University of Lausanne (UNIL), September 24-25, 2020.
- Bleeker, Elli, Bram Buitendijk, and Ronald Haentjens Dekker. 2020. “Marking Up Microrevisions with Major Implications: Nonlinear Text in TAG”. Paper presented at Balisage: The Markup Conference 2020, online, July 27 - 31, 2020. In Proceedings of Balisage: The Markup Conference 2020. Balisage Series on Markup Technologies, vol. 25 (2020). doi:
https://doi.org/10.4242/BalisageVol25.Bleeker01
. - Bleeker, Elli, Bram Buitendijk, and Ronald Haentjens Dekker. 2019. “Uniting Scholarly Knowledge: The (recent) Past and (near) Future of Textual Scholarship”. Paper presented at the European Society of Textual Scholarship (ESTS) conference, November 28, 2019.
-
Bleeker, Elli. 2019. “Different Ways of Looking: Text Models and Markup for Digital Scholarly Editing”. Invited talk at the Colloquium for Digital Cultural Heritage at the Cologne Centre for eHumanities, November 13, 2019. Slides at request.
- Bleeker, Elli, Bram Buitendijk, and Ronald Haentjens Dekker. 2019. “Between Freedom and Formalisation: A Hypergraph Model for Representing the Nature of Text”. Paper presented at the TEI conference 2019, September 16-21, 2019, University of Graz (Austria).
-
Bleeker, Elli, Bram Buitendijk, Ronald Haentjens Dekker, and Astrid Kulsdom. 2019. Workshop TAG and Alexandria at the DH Benelux conference in Liège, Belgium on September 11, 2019. Slides available.
-
Haentjens Dekker, Ronald. 2019. Invited talk at the Workshop on Scholarly Digital Editions, Graph Data-Models and Semantic Web Technologies, Université de Lausanne, 3 June 2019.
-
Bleeker, Elli, Bram Buitendijk, Ronald Haentjens Dekker. 2019. “From Graveyard to Graph: Visualisation of Collation in a Digital Paradigm”. In: International Journal of Digital Humanities, vol. 1, pp. 141-163. doi:
https://doi.org/10.1007/s42803-019-00012-w
. -
Bleeker, Elli. 2018. “Adressing Ancient Promises: Text Modeling and Alexandria”. Invited talk at the DH-Kolloquium of the Berlin Brandenburgische Akademie der Wissenschaften, 2 November 2018. Slides here.
- Haentjens Dekker, Ronald, Elli Bleeker, Bram Buitendijk, Astrid Kulsdom and David J. Birnbaum. 2018. “TAGML: A markup language of many dimensions.” Presented at Balisage: The Markup Conference 2018, Washington, DC, July 31 - August 3, 2018. In Proceedings of Balisage: The Markup Conference 2018. Balisage Series on Markup Technologies, vol. 21 (2018).
doi:https://doi.org/10.4242 BalisageVol21.HaentjensDekker01.
-
Bleeker, Elli, Bram Buitendijk, Ronald Haentjens Dekker, and Astrid Kulsdom. 2018. “Perspectives on Text. Synthesising Textual Knowledge”. Presented at the international Computational Methods for Literary-Historical Textual Scholarship conference at De Montfort University, July 4th 2018.
- Bleeker, Elli, Bram Buitendijk, Ronald Haentjens Dekker, and Astrid Kulsdom. 2018. “Including XML markup in the automated collation of literary text.” Presented at XML Prague 2018, Prague, February 8–10, 2018. In XML Prague 2018. Conference proceedings. Prague: University of Economics, 77–95.
- Haentjens Dekker, Ronald, and David J. Birnbaum. 2017. “It’s more than just overlap: Text As Graph.” Presented at Balisage: The Markup Conference 2017, Washington, DC, August 1-4, 2017. In Proceedings of Balisage: The Markup Conference 2017. Balisage Series on Markup Technologies, vol. 19 (2017). doi:
10.4242/BalisageVol19.Dekker01
.- Paper: https://www.balisage.net/Proceedings/vol19/html/Dekker01/BalisageVol19-Dekker01.html
- Visualization: “Hunting of the Snark” (mp4, 20G)