OpenGNT/fileDescription.md

Two Major Files:

Two files are described on this page: OpenGNT_BASE_TEXT.zip and OpenGNT_keyedFeatures.csv.zip.

OpenGNT_BASE_TEXT.zip:

Usage:

  • unzip the zip file to unpack "OpenGNT_version3.csv"
  • open "OpenGNT_version3.csv" with a text editor
  • locate columns of data, separated from one another with a [TAB] character.
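The same steps can be scripted instead of using a text editor. The following is a minimal Python sketch, not part of the OpenGNT project: `split_row` and `iter_rows` are hypothetical helper names, and the UTF-8 encoding is an assumption, since the encoding is not stated on this page.

```python
import io
import zipfile


def split_row(line):
    """Split one line of the file into its TAB-separated columns."""
    return line.rstrip("\r\n").split("\t")


def iter_rows(zip_path, member="OpenGNT_version3.csv", encoding="utf-8"):
    """Yield each line of `member` inside the zip as a list of columns.

    The UTF-8 encoding is an assumption; adjust it if decoding fails.
    """
    with zipfile.ZipFile(zip_path) as archive:
        with archive.open(member) as raw:
            for line in io.TextIOWrapper(raw, encoding=encoding):
                yield split_row(line)
```

A typical call would be `for columns in iter_rows("OpenGNT_BASE_TEXT.zip"): ...`, where `columns[0]` is the 1st column described below.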

1st Column - OGNT Sort Number

This column contains the sort number of each word.
These sort numbers also serve as the keys for mapping each word to its features in the file OpenGNT_keyedFeatures.csv.zip.

2nd Column - BGBsort

These are the original sort numbers of the Berean Greek Bible (BGB), as given in the Berean translation table.
They serve as keys for mapping data associated with the Berean Greek Bible.

3rd Column - Book, Chapter, Verse

  1. Book = Book number, ranging from 40 to 66, representing the books from Matthew to Revelation.
  2. Chapter = Chapter number
  3. Verse = Verse number
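Because the book numbers run consecutively from 40 (Matthew) to 66 (Revelation), a book number can be turned into a name with a simple offset. This is a small illustrative sketch; `NT_BOOKS` and `book_name` are hypothetical names, and the English book names follow the conventional New Testament order rather than anything stored in the file.

```python
# The 27 New Testament books in their conventional order, so that
# index 0 corresponds to book number 40 (Matthew).
NT_BOOKS = [
    "Matthew", "Mark", "Luke", "John", "Acts", "Romans",
    "1 Corinthians", "2 Corinthians", "Galatians", "Ephesians",
    "Philippians", "Colossians", "1 Thessalonians", "2 Thessalonians",
    "1 Timothy", "2 Timothy", "Titus", "Philemon", "Hebrews", "James",
    "1 Peter", "2 Peter", "1 John", "2 John", "3 John", "Jude",
    "Revelation",
]


def book_name(book_number):
    """Return the English name for a book number in the range 40-66."""
    if not 40 <= book_number <= 66:
        raise ValueError(f"not a New Testament book number: {book_number}")
    return NT_BOOKS[book_number - 40]
```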

4th Column - OGNTu, OGNTa, lexeme, sn, rmac

  1. OGNTu = Greek word of OGNT in unaccented form
  2. OGNTa = Greek word of OGNT in accented form
  3. lexeme = Greek word of OGNT in lexical form
  4. sn = Extended Strong's number, according to conventions of TBESG - Tyndale Brief lexicon of Extended Strongs for Greek
  5. rmac = Robinson's Morphological Analysis Codes, morphological analysis combining James Tauber's work in TANTT and data in Berean translation table
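Several columns, starting with this one, pack more than one sub-field into a single TAB-separated cell. The delimiter between sub-fields is not shown on this page, so the sketch below treats it as a parameter: the default separator `｜` and the wrapping brackets `〔〕` are assumptions to be checked against an actual line of the file. `split_composite` and `WORD_FIELDS` are hypothetical names.

```python
def split_composite(cell, names, sep="｜", wrappers="〔〕"):
    """Map the sub-fields packed into one cell onto `names`.

    `sep` and `wrappers` are assumptions about the file's layout,
    not something this page specifies; adjust them to the real data.
    """
    parts = cell.strip(wrappers).split(sep)
    if len(parts) != len(names):
        raise ValueError(f"expected {len(names)} sub-fields, got {len(parts)}")
    return dict(zip(names, parts))


# Sub-field names of the 4th column, in the order listed above.
WORD_FIELDS = ["OGNTu", "OGNTa", "lexeme", "sn", "rmac"]
```

With a row already split on TAB characters, `split_composite(columns[3], WORD_FIELDS)` would return a dictionary keyed by the sub-field names. The length check is there so that a wrong delimiter fails loudly instead of silently misaligning fields.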

5th Column - transSBL, modernGreek

  1. transSBL = transliteration according to SBL's conventions
  2. modernGreek = modern Greek pronunciation

6th Column - TBESG, BIB, BLB, BSB

  1. TBESG = Tyndale House's glosses, taken from TBESG (context-insensitive)
  2. BIB = translation from Berean Interlinear Bible (context-sensitive)
  3. BLB = translation from Berean Literal Bible (context-sensitive)
  4. BSB = translation from Berean Study Bible (context-sensitive)

7th Column - PMpWord, PMfWord

  1. PMpWord = punctuation mark(s) preceding the main word
  2. PMfWord = punctuation mark(s) following the main word

8th Column - Note, Mvar, Mlexeme, Msn, Mrmac

  1. Note = Notes on a specific word
    (3 types:
    '' means a Greek word that is not in the original Berean Greek data: 3 words adapted from the Byzantine text and 2 words adapted from BHP;
    '' means the main word is different from NA28;
    '' means the main word is identical to the corresponding word in NA28, apart from a minor orthographical difference)
  2. Mvar = Greek variant in accented form, taken from the TANTT database; applies only where '' or '' appears in 'Note' on the same row.
  3. Mlexeme = lexical form of the Greek variant, Mvar; applies only where '' or '' appears in 'Note' on the same row.
  4. Msn = Extended Strong's number of the Greek variant, Mvar; applies only where '' or '' appears in 'Note' on the same row.
  5. Mrmac = Robinson's Morphological Analysis Codes of the Greek variant, Mvar; applies only where '' or '' appears in 'Note' on the same row.

OpenGNT_keyedFeatures.csv.zip:

Usage:

  • unzip the zip file to unpack "OpenGNT_keyedFeatures.csv"
  • open "OpenGNT_keyedFeatures.csv" with a text editor
  • locate columns of data, separated from one another with a [TAB] character.

1st Column - Features Sort Number
This sort number orders the words as they are mapped to GNT features.

2nd Column - mapIDV2
This is a set of mapping IDs, used to map resources such as Levinsohn's GNT discourse features.

3rd Column - mapIDV1
This is an older set of mapping IDs, used to map an early version of TANTT's data.

4th Column - book, chapter, verse

  1. Book number
  2. Chapter number
  3. Verse number

5th Column - OGNT_KEY, OpenTextWord_KEY

  1. OGNT_KEY - The same as the main sort number in the file OpenGNT_BASE_TEXT.zip; it is used as a mapping ID in this file, to map the base text of OGNT to various GNT features.
  2. OpenTextWordID - Base word IDs for mapping the data of OpenText.org's Linguistic Annotation of the Greek New Testament
    (Remarks: OpenText's GNT annotations place the shorter ending of Mark 16 at the end of Mark 16:8, whereas OpenGNT places it at the end of Mark 16:20)
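Since OGNT_KEY matches the sort number in the 1st column of the base text, the two files can be joined on that value. The sketch below is one possible way to do it, assuming both files have already been read into lists of columns. `index_by_sort_number` and `join_features` are hypothetical names, and extracting OGNT_KEY from the 5th column is left to a caller-supplied function because the packing of that column is not specified here.

```python
def index_by_sort_number(base_rows):
    """Index base-text rows by their 1st column (the OGNT sort number)."""
    return {row[0]: row for row in base_rows}


def join_features(feature_rows, base_index, key_of):
    """Pair each feature row with its base-text row.

    `key_of` extracts OGNT_KEY from a feature row; how that value is
    packed inside the 5th column is left to the caller.
    Feature rows without a match are paired with None.
    """
    return [(base_index.get(key_of(row)), row) for row in feature_rows]
```

Keeping unmatched feature rows, paired with `None`, makes it easy to see which entries have no counterpart in the base text instead of silently dropping them.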

6th Column - Mapping to Levinsohn GNTDF's Data:
LevinsohnWordID, noteMarker, noteMarkerNoClause, clause, otQuotation, reportedSpeech, embeddedReportedSpeech

  1. LevinsohnWordID - Word IDs for mapping Levinsohn's GNT Discourse Features
    Full mapping is available in the file OGNT_FullMapping_Levinsohn.csv.zip.
    (Remarks: Levinsohn's GNT Discourse Features places the shorter ending of Mark 16 at the end of Mark 16:8, whereas OpenGNT places it at the end of Mark 16:20)
  2. noteMarker - Note marker, mapped to notes of Levinsohn's GNT Discourse Features
  3. noteMarkerNoClause - Note marker, mapped to notes of Levinsohn's GNT Discourse Features [without clauses]
  4. clause - Clause markers, according to Levinsohn's GNT Discourse Features
  5. otQuotation - Old Testament Quotations, according to Levinsohn's GNT Discourse Features [ means "beginning of an OT quotation"; * means a word within an OT quotation; means "end of an OT quotation"; the slot is empty where it is not applicable.
  6. reportedSpeech - Reported speech, according to Levinsohn's GNT Discourse Features [ means "beginning of a reported speech"; * means a word within a reported speech; means "end of a reported speech"; the slot is empty where it is not applicable.
  7. embeddedReportedSpeech - Embedded reported speech, according to Levinsohn's GNT Discourse Features [ means "beginning of an embedded reported speech"; * means a word within an embedded reported speech; means "end of an embedded reported speech"; the slot is empty where it is not applicable.
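Because the otQuotation, reportedSpeech and embeddedReportedSpeech slots are empty wherever they do not apply, a run of consecutive non-empty slots marks one quoted or reported stretch of text. The sketch below relies only on that empty/non-empty distinction and not on the specific begin and end markers; `marked_spans` is a hypothetical helper name.

```python
from itertools import groupby


def marked_spans(slots):
    """Return (start, end) index pairs for runs of non-empty slots.

    `slots` is the sequence of otQuotation (or reportedSpeech) values
    in word order; `end` is exclusive. Adjacent spans with no empty
    slot between them are merged, which is a limitation of this sketch.
    """
    spans = []
    position = 0
    for is_marked, run in groupby(slots, key=bool):
        length = len(list(run))
        if is_marked:
            spans.append((position, position + length))
        position += length
    return spans
```

Where two quotations follow each other directly, the begin and end markers described above would be needed to separate them.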

7th Column - Lexical Entries & Morphology:
lexeme, BDAGentry, EDNTentry, MounceEntry, morphologyCode, morphologyDescription, extendedStrongNumber, GoodrickKohlenbergerNumbers, LN-LouwNidaNumbers

  1. lexeme - lexeme
  2. BDAGentry - BDAG catchwords
  3. EDNTentry - EDNT catchwords
  4. MounceEntry - Entry words of Mounce's Concise Greek-English dictionary
  5. morphologyCode - Robinson's Morphological Analysis Codes [RMAC]
  6. morphologyDescription - description of the morphology
  7. extendedStrongNumber - Tyndale House's extended Strong's number
  8. GoodrickKohlenbergerNumbers - Goodrick-Kohlenberger numbers; compatible with Mounce's Concise Greek-English dictionary
  9. LouwNidaNumbers - Louw-Nida numbers

8th Column - Gloss & Translation:
MounceGloss, TyndaleHouseGloss, OpenGNTGloss, NET2Words

  1. MounceGloss - English glosses (context-insensitive) -
    English glosses selected from Mounce's Concise Greek-English dictionary
  2. TyndaleHouseGloss - English glosses (context-insensitive) -
    Generated from glosses of TBESG, produced by Tyndale House, Cambridge UK
  3. OpenGNTGloss - English glosses (context-sensitive) -
    A full set of context-sensitive glosses for OpenGNT, worked out by Eliran Wong [initial data are drawn from "TyndaleHouseGloss" mentioned above; every gloss will be checked against its context; ongoing updates are integrated gradually; please check regularly]
  4. NET2Words - Words of The NET Bible® verse text (no Notes; 2nd Edition), mapped to OGNT [1st draft uploaded; subject to ongoing revision]

Enhanced features are gradually being integrated into this file.

9th Column - Textual Variants:
editionMarker1, editionMarker2, editions, variants

  1. editionMarker1 - a type of marker for details of editions, used in applications, e.g. BibleBento Plus
  2. editionMarker2 - a type of marker for details of editions, used in applications, e.g. e-Sword
  3. editions - GNT editions having the same spelling as the main word of OpenGNT. There may be variation in accentuation or capitalisation, though. [B=Byzantine, I=NIV Greek, N=NA27, M=NA28 where words are different from NA27, R=Textus Receptus, S=SBLGNT, T=Tregelles's GNT, W=Westcott-Hort, H=Tyndale House GNT]
  4. variants - variant(s), if any
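The single-letter edition codes listed above can be expanded with a lookup table. This sketch assumes that the editions value is a plain run of those letters, which this page does not state explicitly; `EDITION_CODES` and `decode_editions` are hypothetical names.

```python
# Edition letters as listed in the description of the editions field.
EDITION_CODES = {
    "B": "Byzantine",
    "I": "NIV Greek",
    "N": "NA27",
    "M": "NA28 (where words differ from NA27)",
    "R": "Textus Receptus",
    "S": "SBLGNT",
    "T": "Tregelles's GNT",
    "W": "Westcott-Hort",
    "H": "Tyndale House GNT",
}


def decode_editions(codes):
    """Expand a string of edition letters into edition names.

    Assumes the value is a plain run of the single-letter codes;
    characters outside the table (e.g. separators) are skipped.
    """
    return [EDITION_CODES[letter] for letter in codes if letter in EDITION_CODES]
```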

The last column - WordInHTML:
This last column provides the words of OGNT in HTML format, with tags for extended Strong's numbers, morphology, ot quotation [ot.../ot], reported speech [rs.../rs], embedded reported speech [ers.../ers], textual variant marker, Levinsohn's clause division & note marker, if applicable.

Remarks:
  • Lines / entries starting with the following numbers are created for mapping purposes only (mapping resources based on NA27, e.g. Levinsohn Discourse Features):
    122580, 122586, 122796, 123928, 123948, 124712, 125108, 125238, 127544, 127800, 128058, 128061.
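When only real words of the text are wanted, these mapping-only entries can be filtered out. The sketch below assumes that "starting with" means the number is the whole value of the 1st column (the Features Sort Number) and that rows have already been split on TAB characters; `MAPPING_ONLY`, `is_mapping_only` and `drop_mapping_only` are hypothetical names.

```python
# Numbers copied from the remark above.
MAPPING_ONLY = {
    "122580", "122586", "122796", "123928", "123948", "124712",
    "125108", "125238", "127544", "127800", "128058", "128061",
}


def is_mapping_only(columns):
    """True if the row's first column is one of the mapping-only numbers."""
    return columns[0].strip() in MAPPING_ONLY


def drop_mapping_only(rows):
    """Return the rows that are not mapping-only entries."""
    return [row for row in rows if not is_mapping_only(row)]
```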