OpenGNT/fileDescription.md

Two Major Files:

Two files are described on this page:

OpenGNT_BASE_TEXT.zip:

Usage:

  • unzip the zip file to unpack "OpenGNT_version3.csv"
  • open "OpenGNT_version3.csv" with a text editor
  • locate columns of data, separated from one another with a [TAB] character.
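
The same steps can be scripted. Below is a minimal Python sketch, assuming the archive member is named "OpenGNT_version3.csv" as described above and that the file is UTF-8 encoded (the encoding is an assumption, not stated on this page). The helper name `read_tsv_from_zip` is hypothetical.

```python
import csv
import io
import zipfile


def read_tsv_from_zip(zip_path, member):
    """Yield each row of a tab-separated member of a zip archive as a list of strings."""
    with zipfile.ZipFile(zip_path) as archive:
        with archive.open(member) as raw:
            # Assumption: the file is UTF-8; "utf-8-sig" also tolerates a leading BOM.
            text = io.TextIOWrapper(raw, encoding="utf-8-sig", newline="")
            # QUOTE_NONE treats quotation marks inside glosses as ordinary characters.
            reader = csv.reader(text, delimiter="\t", quoting=csv.QUOTE_NONE)
            for row in reader:
                yield row
```

For example, `read_tsv_from_zip("OpenGNT_BASE_TEXT.zip", "OpenGNT_version3.csv")` would iterate over the base text row by row. Whether the first row is a header should be checked against the actual file.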

1st Column - OGNTsort

This column contains sort numbers of all words of the base text of OGNT.

2nd Column - TANTTsort

This column of sort numbers represents the order of "M" variants in TANTT.
These numbers serve as bridges for mapping key features in the file OpenGNT_keyedFeatures.csv.zip.

3rd Column - FEATURESsort1

Sort numbers as in the first column of the file "OpenGNT_keyedFeatures.csv"
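
Because this column repeats the first column of "OpenGNT_keyedFeatures.csv", it can serve as a join key between the two files. Below is a minimal Python sketch, assuming both files have already been read into lists of rows (lists of strings) and that each key occurs at most once in the features file (an assumption). The function name `join_on_features_sort` is hypothetical.

```python
def join_on_features_sort(base_rows, feature_rows):
    """Pair each base-text row with the keyedFeatures row sharing its FEATURESsort1.

    base_rows: rows of OpenGNT_version3.csv (FEATURESsort1 is the 3rd column, index 2).
    feature_rows: rows of OpenGNT_keyedFeatures.csv (FEATURESsort1 is the 1st column, index 0).
    """
    features_by_key = {row[0]: row for row in feature_rows}
    # Base rows without a matching key are paired with None.
    return [(row, features_by_key.get(row[2])) for row in base_rows]
```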

4th Column - BGBsortI, BLBsortI, BSBsortI

  1. BGBsortI = Sort number of BGB (inclusive) as in original "berean_tables5.xlsx".
  2. BLBsortI = Sort number of BLB (inclusive) as in original "berean_tables5.xlsx".
  3. BSBsortI = Sort number of BSB (inclusive) as in original "berean_tables5.xlsx".

5th Column - Book, Chapter, Verse

  1. Book = Book number, ranging from 40 to 66, representing the books from Matthew to Revelation.
  2. Chapter = Chapter number
  3. Verse = Verse number
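
The book numbers can be turned into names with a simple lookup. Below is a minimal Python sketch, assuming the numbers between 40 and 66 follow the conventional canonical order of the New Testament books (this page states only the two endpoints). `NT_BOOKS` and `book_name` are hypothetical names.

```python
# Assumption: conventional canonical order, so that 40 = Matthew and 66 = Revelation.
NT_BOOKS = [
    "Matthew", "Mark", "Luke", "John", "Acts", "Romans",
    "1 Corinthians", "2 Corinthians", "Galatians", "Ephesians",
    "Philippians", "Colossians", "1 Thessalonians", "2 Thessalonians",
    "1 Timothy", "2 Timothy", "Titus", "Philemon", "Hebrews", "James",
    "1 Peter", "2 Peter", "1 John", "2 John", "3 John", "Jude", "Revelation",
]


def book_name(book_number):
    """Return the English name for a book number in the range 40..66."""
    if not 40 <= book_number <= 66:
        raise ValueError(f"book number out of range: {book_number}")
    return NT_BOOKS[book_number - 40]
```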

6th Column - OGNTu, OGNTa, lexeme, sn, rmac

  1. OGNTu = Greek word of OGNT in unaccented form
  2. OGNTa = Greek word of OGNT in accented form
  3. lexeme = Greek word of OGNT in lexical form
  4. sn = Extended Strong's number, according to conventions of TBESG - Tyndale Brief lexicon of Extended Strongs for Greek
  5. rmac = Robinson's Morphological Analysis Codes, morphological analysis combining James Tauber's work in TANTT and data in Berean translation table

7th Column - transSBL, modernGreek

  1. transSBL = transliteration according to SBL's conventions
  2. modernGreek = modern Greek pronunciation

8th Column - TBESG, BIB, BLB, BSB

  1. TBESG = Tyndale House's glosses, taken from TBESG (context-insensitive)
  2. BIB = translation from Berean Interlinear Bible (context-sensitive)
  3. BLB = translation from Berean Literal Bible (context-sensitive)
  4. BSB = translation from Berean Study Bible (context-sensitive)

9th Column - PMpWord, PMfWord

  1. PMpWord = punctuation mark(s) preceding the main word
  2. PMfWord = punctuation mark(s) following the main word
    Remarks: Punctuation marks were adapted from data available in TANTT - Tyndale Amalgamated NT Tagged texts.

10th Column - Note, Mvar, Mlexeme, Msn, Mrmac

  1. Note = Notes on a specific word
    (3 Types:
    '' means Greek words that are not in the original Berean Greek data: 3 words adapted from the Byzantine text and 2 words adapted from BHP;
    '' means the main word is different from NA28;
    '' means the main word is identical to the corresponding word in NA28, with a minor orthographical difference)
  2. Mvar = Greek variant in accented form, taken from the TANTT database; applies only where '' or '' appears in 'Note' on the same row.
  3. Mlexeme = lexical form of the Greek variant, Mvar; applies only where '' or '' appears in 'Note' on the same row.
  4. Msn = Extended Strong's number of the Greek variant, Mvar; applies only where '' or '' appears in 'Note' on the same row.
  5. Mrmac = Robinson's Morphological Analysis Codes of the Greek variant, Mvar; applies only where '' or '' appears in 'Note' on the same row.

OpenGNT_keyedFeatures.csv.zip:

Usage:

  • unzip the zip file to unpack "OpenGNT_keyedFeatures.csv"
  • open "OpenGNT_keyedFeatures.csv" with a text editor
  • locate columns of data, separated from one another with a [TAB] character.

1st Column - FEATURESsort1
This sort number is used to sort word order (TANTT) mapped in GNT features.

2nd Column - FEATURESsort2
This sort number is used to sort word order (OGNT) mapped in GNT features.

3rd Column - mapIDV2
This is a set of mapping IDs used to map resources; e.g. check the 1st column in the file mapping Levinsohn GNT discourse features to OGNT.

4th Column - mapIDV1
This is an old set of mapping IDs, used to map an early version of TANTT's data.

5th Column - book, chapter, verse

  1. Book number
  2. Chapter number
  3. Verse number

6th Column - TANTTsort, OpenTextWord_KEY

  1. OGNTsort - Same as "OGNTsort" in the file OpenGNT_BASE_TEXT.zip; this number is used as a mapping ID in this file, to map the base text of OGNT to various GNT features.
  2. TANTTsort - Same as "TANTTsort" in the file OpenGNT_BASE_TEXT.zip; this number is used as a mapping ID in this file, to map the base text of OGNT to various GNT features.
  3. OpenTextWordID - Base word IDs for mapping the data of OpenText.org's Linguistic Annotation of the Greek New Testament
    (Remarks: OpenText's GNT annotations place the shorter ending of Mark 16 at the end of Mark 16:8, whereas OpenGNT places it at the end of Mark 16:20)

7th Column - Mapping to Levinsohn GNTDF's Data:
LevinsohnWordID, noteMarker, noteMarkerNoClause, clause, otQuotation, reportedSpeech, embeddedReportedSpeech

  1. LevinsohnWordID - Word IDs for mapping Levinsohn's GNT Discourse Features
    Full mapping is available in the file OGNT_FullMapping_Levinsohn.csv.zip.
    (Remarks: Levinsohn's GNT Discourse Features place the shorter ending of Mark 16 at the end of Mark 16:8, whereas OpenGNT places it at the end of Mark 16:20)
  2. noteMarker - Note marker, mapped to notes of Levinsohn's GNT Discourse Features
  3. noteMarkerNoClause - Note marker, mapped to notes of Levinsohn's GNT Discourse Features [without clauses]
  4. clause - Clause markers, according to Levinsohn's GNT Discourse Features
  5. otQuotation - Old Testament Quotations, according to Levinsohn's GNT Discourse Features [ means "beginning of an OT quotation"; * means a word within an OT quotation; means "end of an OT quotation"; the slot is empty where it is not applicable.
  6. reportedSpeech - Reported speech, according to Levinsohn's GNT Discourse Features [ means "beginning of a reported speech"; * means a word within a reported speech; means "end of a reported speech"; the slot is empty where it is not applicable.
  7. embeddedReportedSpeech - Embedded reported speech, according to Levinsohn's GNT Discourse Features [ means "beginning of an embedded reported speech"; * means a word within an embedded reported speech; means "end of an embedded reported speech"; the slot is empty where it is not applicable.

8th Column - Lexical Entries & Morphology:
lexeme, BDAGentry, EDNTentry, MounceEntry, morphologyCode, morphologyDescription, extendedStrongNumber, GoodrickKohlenbergerNumbers, LN-LouwNidaNumbers

  1. lexeme - lexeme
  2. BDAGentry - BDAG catchwords
  3. EDNTentry - EDNT catchwords
  4. MounceEntry - Entry words of Mounce's Concise Greek-English dictionary
  5. morphologyCode - Robinson's Morphological Analysis Codes [RMAC]
  6. morphologyDescription - description of the morphology
  7. extendedStrongNumber - Tyndale House's extended Strong's number
  8. GoodrickKohlenbergerNumbers - Goodrick-Kohlenberger numbers; compatible with Mounce's Concise Greek-English dictionary
  9. LouwNidaNumbers - Louw-Nida numbers

9th Column - Gloss & Translation:
MounceGloss, TyndaleHouseGloss, OpenGNTGloss, NET2Words

  1. MounceGloss - English glosses (Context-insensitive) -
    English glosses selected from Mounce's Concise Greek-English dictionary
  2. TyndaleHouseGloss - English glosses (Context-insensitive) -
    Generated from glosses of TBESG, produced by Tyndale House, Cambridge UK
  3. OpenGNTGloss - English glosses (Context-sensitive) -
    A full set of context-sensitive glosses for OpenGNT, worked out by Eliran Wong [initial data are drawn from "TyndaleHouseGloss" mentioned above; every gloss will be checked against its context; on-going updates are gradually integrated HERE; please check regularly]
  4. NET2Words - Words of The NET Bible® verse text (no Notes; 2nd Edition), mapped to OGNT [1st draft uploaded; subject to on-going revision]

Enhanced features are gradually integrated in this file.

10th Column - Textual Variants:
editionMarker1, editionMarker2, editions, variants

  1. editionMarker1 - a type of marker for details of editions, used in applications, e.g. BibleBento Plus
  2. editionMarker2 - a type of marker for details of editions, used in applications, e.g. e-Sword
  3. editions - GNT editions having the same spelling as the main word of OpenGNT. There may be variation in accentuation or capitalisation, though. [B=Byzantine, I=NIV Greek, N=NA27, M=NA28 where words are different from NA27, R=Textus Receptus, S=SBLGNT, T=Tregelles's GNT, W=Westcott-Hort, H=Tyndale House GNT]
  4. variants - variant(s), if any
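
The edition letters listed for "editions" can be expanded with a lookup table. Below is a minimal Python sketch, assuming the field is a plain string of these single-letter codes (how the letters are laid out in the actual data is an assumption). `EDITION_CODES` and `decode_editions` are hypothetical names.

```python
# Letter codes as listed in the description of the "editions" field.
EDITION_CODES = {
    "B": "Byzantine",
    "I": "NIV Greek",
    "N": "NA27",
    "M": "NA28 (where words differ from NA27)",
    "R": "Textus Receptus",
    "S": "SBLGNT",
    "T": "Tregelles's GNT",
    "W": "Westcott-Hort",
    "H": "Tyndale House GNT",
}


def decode_editions(editions):
    """Expand a string of edition letters into names, skipping unknown characters."""
    return [EDITION_CODES[ch] for ch in editions if ch in EDITION_CODES]
```

For example, `decode_editions("BN")` gives the names for the Byzantine text and NA27.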

Remarks:

  • Lines / entries starting with the following numbers are created for mapping purposes only (mapping resources based on NA27, e.g. Levinsohn Discourse Features):
    122580, 122586, 122796, 123928, 123948, 124712, 125108, 125238, 127544, 127800, 128058, 128061.
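
When only real words are wanted, these mapping-only entries can be filtered out. Below is a minimal Python sketch, assuming "starting with" refers to the first tab-separated field of a row (FEATURESsort1) and that the number is stored there exactly as written above. Both points are assumptions to verify against the file; `MAPPING_ONLY_IDS` and `drop_mapping_only_rows` are hypothetical names.

```python
# Numbers listed above as created for mapping purposes only.
MAPPING_ONLY_IDS = {
    "122580", "122586", "122796", "123928", "123948", "124712",
    "125108", "125238", "127544", "127800", "128058", "128061",
}


def drop_mapping_only_rows(rows):
    """Return the non-empty rows whose first field is not a mapping-only number."""
    return [row for row in rows if row and row[0] not in MAPPING_ONLY_IDS]
```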