Thursday, April 2, 2026

T'OMIM: Tanakh Observable Matches of Intertextual Mimesis

David Smiley has archived his T'OMIM: Tanakh Observable Matches of Intertextual Mimesis database on Zenodo. He provided the following description via Agade:

_______________________________________

A new dataset is now available that may be of interest to anyone working on inner-biblical allusion, synoptic parallels, or intertextuality more broadly. T'OMIM (תאומים, "twins" in Hebrew) is an open-access collection of labeled parallel passages in the Hebrew Bible, hosted on Zenodo.

Scholars have been cataloging parallelism and intertextual relationships since antiquity. But none of that accumulated work has existed until now in a structured, machine-readable format. T'OMIM was built to fill that gap.

The dataset pairs two corpora of known parallels. The first contains 554 narrative verse pairs drawn from the Chronicles synoptic tradition. The second contains 256 poetic half-verse pairs identified in the biblical parallelism literature. Both corpora are available at two levels of granularity: verse-level paired texts with source citations, and word-level tokens that carry the full ETCBC morphological annotation (part of speech, verbal stem, gender, number, person, lexeme, English gloss, and syntactic structure). Every word in every parallel passage is fully parsed.

For those working computationally, the word-level data can feed directly into natural language processing workflows. For scholars approaching these texts without a programming background, the verse-level files are structured as simple tabular data and can be opened in Excel or any spreadsheet application. Each row is a pair of passages, with columns for the source reference, the text, and the scholarly citation from which the parallel was drawn.

No comments:

Post a Comment