OpenGNT/fileDescription.md

10 KiB
Raw Blame History

Two Major Files:

Two files are described on this page:

OpenGNT_BASE_TEXT.zip:

Usage:

  • unzip the zip file to unpack "OpenGNT_version3.csv"
  • open "OpenGNT_version3.csv" with a text editor
  • locate columns of data, separated from one another with a [TAB] character.

1st Column - OGNT Sort Number

This column contains sort numbers of all words.
These sort numbers are also important bridges for mapping key features in file OpenGNT_keyedFeatures.csv.zip.

2nd Column - BGBsort

These are original sort numbers of Berean Greek Bible, BGB, available in Berean translation table.
These numbers are important bridges to map associated data of Berean Greek Bible.

3rd Column - BookChapterVerse

Book = Book number, ranging from 40 to 66, representing books from Matthew to the book of Revelation.
Chpter = Chapter number
Verse = Verse number

4th Column - OGNTuOGNTalexemesnrmac

OGNTu = Greek word of OGNT in unaccented form
OGNTa = Greek word of OGNT in accented form
lexeme = Greek word of OGNT in lexical form
sn = Extended Strong's number, according to conventions of TBESG - Tyndale Brief lexicon of Extended Strongs for Greek
rmac = Robinson's Morphological Analysis Codes, morphological analysis combining James Tauber's work in TANTT and data in Berean translation table

4th Column - transSBLmodernGreek

transSBL = transliteration according to SBL's conventions
modernGreek = modern Greek pronunciation

5th Column - TBESGBIBBLBBSB

TBESG = Tyndale House's glosses, taken from TBESG (context-insensitive)
BIB = translation from Berean Interlinear Bible (context-sensitive)
BLB = translation from Berean Literal Bible (context-sensitive)
BSB = translation from Berean Study Bible (context-sensitive)

6th Column - PMpWordPMfWord

PMpWord = punctuation mark(s) preceding the main word
PMfWord = punctuation mark(s) following the main word

7th Column - NoteMvarMlexemeMsnMrmac

Note = Notes on a specific word
(3 Types:
'' means Greek word, which are not in original Berean Greek data, 3 words adapted from Byzantine text, 2 words adapted from BHP;
'' means the main word is different from NA28;
'' means the main word is identical to the corresponding word in NA28, with minor orthographical difference)
Mvar = Greek variant in accented form, taken from TANTT database, applied only where '' or '' appear in 'Note' on the same row.
Mlexeme = lexical form of the Greek variant, Mvar, applied only where '' or '' appear in 'Note' on the same row.
Msn = Extended Strong's number of the Greek variant, Mvar, applied only where '' or '' appear in 'Note' on the same row.
Mrmac = Robinson's Morphological Analysis Codes of the Greek variant, Mvar,, applied only where '' or '' appear in 'Note' on the same row.

OpenGNT_keyedFeatures.csv.zip:

Usage:

  • unzip the zip file to unpack "OpenGNT_keyedFeatures.csv"
  • open "OpenGNT_keyedFeatures.csv" with a text editor
  • locate columns of data, separated from one another with a [TAB] character.

1st Column - Features Sort Number This sort number is used to sort word order mapped in GNT features.

2nd Column - mapIDV2 This is a set of mapping ID, used to map resources like Levinsohn GNT discourse features.

3rd Column - mapIDV1 This is a old set of mapping ID, used to map an early version of TANTT's data.

4th Column - bookchapterverse

  1. Book number
  2. Chapter number
  3. Verse number

  • 4th Column - OGNT_KEYOpenTextWord_KEY
  1. OGNT_KEY - It is same as the main sort number in file OpenGNT_BASE_TEXT.zip; this number is used as a mapping id in this file, to map the base text of OGNT to various GNT features.
  2. OpenTextWordID - Base Word IDs for for mapping OpenText.org Linguisitc Annotation of the Greek New Testament's data
    (Remarks: OpenText's GNT annotations places shorter ending of Mark 16 at the end of Mark 16:8 whereas OpenGNT places it at the end of Mark 16:20)

  • 5th Column - Mapping to Levinsohn GNTDF's Data:
    LevinsohnWordIDnoteMarkernoteMarkerNoClauseclauseotQuotationreportedSpeechembeddedReportedSpeech
  1. LevinsohnWordID - Word IDs for mapping Levinsohn's GNT Discourse Features
    Full mapping is available in the file OGNT_FullMapping_Levinsohn.csv.zip.
    (Remarks: Levinsohn's GNT Discourse Features places shorter ending of Mark 16 at the end of Mark 16:8 whereas OpenGNT places it at the end of Mark 16:20)
  2. noteMarker - Note marker, mapped to notes of Levinsohn's GNT Discourse Features
  3. noteMarkerNoClause - Note marker, mapped to notes of Levinsohn's GNT Discourse Features [without clauses]
  4. clause - Clause markers, according to Levinsohn's GNT Discourse Features
  5. otQuotation - Old Testament Quotations, according to Levinsohn's GNT Discourse Features [ means "beginning of an OT quotation"; * means a word within an OT quotation; means "end of an OT quotation"; the slot is empty where it is not applicable.
  6. reportedSpeech - Reported speech, according to Levinsohn's GNT Discourse Features [ means "beginning of a reported speech"; * means a word within a reported speech; means "end of a reported speech"; the slot is empty where it is not applicable.
  7. embeddedReportedSpeech - Embedded reported speech, according to Levinsohn's GNT Discourse Features [ means "beginning of an embedded reported speech"; * means a word within an embedded reported speech; means "end of an embedded reported speech"; the slot is empty where it is not applicable.

  • 6th Column - Lexical Entries & Morphology:
    lexemeBDAGentryEDNTentryMounceEntrymorphologyCodemorphologyDescriptionextendedStrongNumberGoodrickKohlenbergerNumbersLN-LouwNidaNumbers
  1. lexeme - lexeme
  2. BDAGentry - BDAG catchwords
  3. EDNTentry - EDNT catchwords
  4. MounceEntry - Entry words of Mounce's Concise Greek-English dictionary
  5. morphologyCode - Robinson's Morphological Analysis Codes [RMAC]
  6. morphologyDescription - description on morphology
  7. extendedStrongNumber - Tyndale House's extended Strong's number
  8. GoodrickKohlenbergerNumbers - Goodrick-Kohlenberger numbers; compatible with Mounce's Concise Greek-English dictionary
  9. LouwNidaNumbers - Louw-Nida numbers

  • 7th Column - Gloss & Translation:
    MounceGlossTyndaleHouseGlossOpenGNTGlossNET2Words
  1. MounceGloss - English glosses (Context-insensitive) -
    English glosses selected from Mounce's Concise Greek-English dictionary
  2. TyndaleHouseGloss - English glosses (Context-insensitive) -
    Generated from glosses of TBESG, produced by Tyndale House, Cambridge UK
  3. OpenGNTGloss - English glosses (Context-sensitive) -
    A full set of context-sensitive glosses for OpenGNT, worked out by Eliran Wong [initial data are drawn from "TyndaleHouseGloss" mentioned above; every gloss will be checked against its context; on-going updates are gradually integrated HERE; please check regularly]
  4. NET2Words - Words of The NET Bible® verse text (no Notes; 2nd Edition), mapped to OGNT [1st draft uploaded; subject to on-going revision]

    Enhanced features are gradually integrated in this file.

  • 8th Column - Textual Variants:
    editionMarker1editionMarker2editionsvariants
  1. editionMarker1 - a type of marker for details of editions, used in applications, e.g. BibleBento Plus
  2. editionMarker2 - a type of marker for details of editions, used in applications, e.g. e-Sword
  3. editions - GNT editions having the same spelling as the main word of OpenGNT. There may be variation in accentuation or capitalisation, though. [B=Byzantine, I=NIV Greek, N=NA27, M=NA28 where words are different from NA27, R=Textus Receptus, S=SBLGNT, T=Tregelles's GNT, W=Westcott-Hort, H=Tydale House GNT]
  4. variants - variant(s), if any

  • The last column - WordInHTML:
    This last column provide words of OGNT in html format, with taggings on extended Strong's numbers, morphology, ot quotation [ot.../ot], reported speech [rs.../rs], embedded reported speech [ers.../ers], textual variant marker, Levinsohn's clause division & note marker, if applicable.

    Remarks:
  • Lines / Entries starting with the following numbers are created for mapping purpose only (mapping resouces based on NA27, e.g. Levinsohn Discource Features):
    122580, 122586, 122796, 123928, 123948, 124712, 125108, 125238, 127544, 127800, 128058, 128061.