Minimal elements for lexical/conceptual resources

This page describes the minimal metadata elements specific to lexical/conceptual resources.


LexicalConceptualResource

Path MetadataRecord.DescribedEntity.LanguageResource.LRSubclass.LexicalConceptualResource

Data type component

Optionality Mandatory

Explanation & Instructions

Wraps together elements for lexical/conceptual resources

Example

<ms:LRSubclass>
        <ms:LexicalConceptualResource>
                <ms:lrType>LexicalConceptualResource</ms:lrType>
                ...
        </ms:LexicalConceptualResource>
</ms:LRSubclass>

lcrSubclass

Path MetadataRecord.DescribedEntity.LanguageResource.LRSubclass.LexicalConceptualResource.lcrSubclass

Data type CV (lcrSubclass)

Optionality Recommended

Explanation & Instructions

Introduces a classification of lexical/conceptual resources into types (used for descriptive reasons)

Example

<lcrSubclass>http://w3id.org/meta-share/meta-share/computationalLexicon</lcrSubclass>

<lcrSubclass>http://w3id.org/meta-share/meta-share/ontology</lcrSubclass>

encodingLevel

Path MetadataRecord.DescribedEntity.LanguageResource.LRSubclass.LexicalConceptualResource.encodingLevel

Data type CV (encodingLevel)

Optionality Mandatory

Explanation & Instructions

Classifies the contents of a lexical/conceptual resource or language description as regards the linguistic level of analysis it caters for

You can repeat the element for multiple encoding levels.

Example

<ms:encodingLevel>http://w3id.org/meta-share/meta-share/phonology</ms:encodingLevel>

<ms:encodingLevel>http://w3id.org/meta-share/meta-share/semantics</ms:encodingLevel>

ContentType

Path MetadataRecord.DescribedEntity.LanguageResource.LRSubclass.LexicalConceptualResource.ContentType

Data type CV (ContentType)

Optionality Mandatory

Explanation & Instructions

A more detailed account of the linguistic information contained in the lexical/conceptual resource

You can repeat the element for multiple encoding levels.

Example

<ms:ContentType>http://w3id.org/meta-share/meta-share/collocation</ms:ContentType>

<ms:ContentType>http://w3id.org/meta-share/meta-share/definition</ms:ContentType>

compliesWith

Path MetadataRecord.DescribedEntity.LanguageResource.LRSubclass.LexicalConceptualResource.ContentType

Data type CV (compliesWith)

Optionality Mandatory

Explanation & Instructions

Specifies the vocabulary/standard/best practice to which a resource is compliant with

You can repeat the element for multiple encoding levels.

Example

<ms:compliesWith>http://w3id.org/meta-share/meta-share/LMF</ms:compliesWith>

LexicalConceptualResourceTextPart

Path MetadataRecord.DescribedEntity.LanguageResource.LRSubclass.Corpus.CorpusMediaPart.LexicalConceptualResourceTextPart

Data type component

Optionality Mandatory if applicable

Explanation & Instructions

A part (or whole set) of a lexical/conceptual resource that consists of textual elements

You can repeat the group of elements for multiple textual parts.

The mandatory or recommended elements for the text part of lexical/conceptual resources are:

  • mediaType (Mandatory): Specifies the media type of a language resource (the physical medium of the contents representation). For text parts, always use the value ‘text’.
  • lingualityType (Mandatory ): Indicates whether the resource includes one, two or more languages.
  • multilingualityType (Mandatory if applicable): Indicates whether the resource (part) is parallel, comparable or mixed. If lingualityType = bilingual or multilingual, it is required; select one of the values for parallel (e.g., original text and its translations), comparable (e.g. corpus of the same domain in multiple languages) and multilingualSingleText (for corpora that consist of segments including text in two or more languages (e.g., the transcription of a European Parliament session with MPs speaking in their native language).
  • language (Mandatory): Specifies the language that is used in the resource part , expressed according to the BCP47 recommendation. See language.
  • languageVariety (Mandatory if applicable): Relates a language resource that contains segments in a language variety (e.g., dialect, jargon) to it. Please use for dialect corpora.
  • metalanguage (Mandatory): Specifies the language that is used in the resource part , expressed according to the BCP47 recommendation. See language.
  • modalityType (Recommended if applicable): Specifies the type of the modality represented in the resource. For instance, you can use ‘spoken language’ to describe transcribed speech corpora.

Example

<ms:LexicalConceptualResourceMediaPart>
        <ms:LexicalConceptualResourceTextPart>
                <ms:lcrMediaType>LexicalConceptualResourceTextPart</ms:lcrMediaType>
                <ms:mediaType>http://w3id.org/meta-share/meta-share/text</ms:mediaType>
                <ms:lingualityType>http://w3id.org/meta-share/meta-share/bilingual</ms:lingualityType>
                <ms:multilingualityType>http://w3id.org/meta-share/meta-share/parallel</ms:multilingualityType>
                <ms:language>
                        <ms:languageTag>en-US</ms:languageTag>
                        <ms:languageId>en</ms:languageId>
                        <ms:regionId>US</ms:regionId>
                </ms:language>
                <ms:language>
                        <ms:languageTag>es</ms:languageTag>
                        <ms:languageId>es</ms:languageId>
                </ms:language>
                <ms:metalanguage>
                        <ms:languageTag>es</ms:languageTag>
                        <ms:languageId>es</ms:languageId>
                </metalanguage>
                </ms:language>
        </ms:LexicalConceptualResourceTextPart>
</ms:LexicalConceptualResourceMediaPart>