Release information

Current release

  • Release date: 1/6/2022

  • ELG-SHARE schema version: v3.0.2

ELG platform feature updates

  • assignment of DOIs for ELG-hosted data and ELG-compatible services

  • export for MS-OWL/RDF, and, for metdata records with DOIs, to DataCite/XML and DataCite/JSON

  • support for upload of .tar, and .gz files

  • new stylesheet to match the new website

  • improvements on consumer’s grid (aggregation of data on “my downloads” & “my usage”

  • stylistic improvements on grid

  • stylistic improvements on view page

  • subfaceting of the facet “service functions”

  • removal of the facet “intended application”

  • minor bug fixes

Schema changes

Increase of the size of creationDetails

Changes required for XML metadata records

None

Release v2.1.0

  • Release date: 5/4/2022

  • ELG-SHARE schema version: v3.0.1

ELG platform feature updates

  • addition of tombstone page for deleted and unpublished metadata records

  • bulk service registration & technical validation for ELG-compatible services

  • action for making metadata records out of generic records through the django admin UI

  • move/copy technical metadata (parameters & samples) to code samples tab

  • adding links to previous version for metadata records added without explicit “replaces” relation

  • added display of version at lookup stage

  • minor bug fixes

Schema changes

None

Changes required for XML metadata records

None

Release v2.0.7

Overview

  • Release date: 14/3/2022

  • ELG-SHARE schema version: v3.0.1

ELG platform feature updates

  • Consumer’s grid with basic functionalities

  • Stylistic changes on grid layout

  • Updates on landing page: addition of “share”, “cite” and “counts” (of views, downloads, times used)

  • Addition of the glottolog code for languages and language varieties without an ISO code

  • Update of ISO vocabularies and removal of deprecated values

  • Change of the “language” facet to two subfacets

  • Change of the “condition of use” facet values to tick option

  • Additional validation rules in the import of XML files and editor (e.g., for duplicate values)

  • Export of inverse relations in the XML files

  • Enrichment of the index for free text search with synonyms for language and synonyms for service functions

Schema changes

  • Addition of inverse relations as materialized ones (for import and export purposes)

  • Addition of relations: qualifiedAttribution, isSupplementTo, isSupplementedBy

  • Removal of the distributionLocation element

Changes required for XML metadata records

  • Replace distributionLocation with accessLocation

Release v2.0.6

Overview

  • Release date: 18/12/2021

  • ELG-SHARE schema version: v3.0.0

ELG platform feature updates

  • Updates at the backend, editor and landing pages required for schema 3.0.0

  • Updates in validation rules required to import records marked as for information

  • Calculation of lingualityType based on the number of languages (taking into account values of collective languages)

  • Addition of flag for computed elements

  • Validation rule for blocking duplicate language values

  • Suppressed script on languageTag

  • Integration of the XML validation step in the upload procedure for metadata files

  • Different view pages for curators and administrators (e.g., allowing them to view hidden elements in accordance with the access rights on the metadata)

Schema changes

  • Catering for an enhanced description of models:

    • merged all types of models (e.g. n-gram models, ML models) into Model

    • added new elements (e.g. to describe the training process)

    • introduced the unspecifiedPart which replaces the media-type specific parts for models

  • Catering for metadata records imported from other catalogues with poorer information or more general schemas; these changes are allowed mainly for metadata records marked as for information:

    • changed the optionality status for specific elements

    • introduced the unspecifiedPart which may be used as an alternative when the media type value of a resource is not encoded in the original metadata

    • added elements with free text values as an alternative to elements with controlled value vocabularies or combined elements that cannot be distinguished from the source metadata record (e.g. when size is encoded as a free text combining amount and size unit together)

Detailed list of changes

Changes specific to language descriptions

  • introduced the element ldSubclass to distinguish between grammars, models and other (uncategorized) language descriptions

  • made the LanguageDescriptionSubclass recommended upon conditions (depending on the value of ldSubclass)

  • replaced the MLModel node with Model; it has the same elements and the additional elements: modelFunction (mandatory), modelType, requiresLR, hasOriginalSource, trainingProcessDetails, biasDetails

  • NGramModel has been moved under the Model node

  • for models, unspecifiedPart replaces media-type specific parts (i.e. LanguageDescriptionTextPart, LanguageDescriptionImagePart and LanguageDescriptionVideoPart)

  • for models, distributionUnspecifiedFeature replaces all media-type distribution features

  • the elements perplexity, typesystem and method have been made optional.

Changes for all resource types

  • moved compliesWith under LanguageResource, with optionality status ‘recommended’

  • made personalDataIncluded and sensitiveDataIncluded optional for language descriptions

  • changed data type for three elements: personalDataIncluded, sensitiveDataIncluded and anonymized take values from a controlled vocabulary

  • added elements (all optional or recommended):

    • packageFormat: on DatasetDistribution & SoftwareDistribution

    • mimetype: on DistributionTextFeature, DistributionAudioFeature, DistributionVideoFeature, DistributionImageFeature, DistributionUnspecifiedFeature

    • additionalHWrequirements: on languageDescription

    • spatial: on Corpus, ``LanguageDescription, LexicalConceptualResource

    • temporal: on Corpus, LanguageDescription, LexicalConceptualResource

    • organizationShortName: on GenericOrganization

    • projectShortName: on GenericProject

  • added values for the following elements: fundingType, sourceChannelType, originOfParticipants, colourSpace, accessRightsStatementScheme and conditionOfUse

  • for the elements function and intendedApplication removed the value http://w3id.org/meta-share/omtd-share/bPosTagging; instead, use the value http://w3id.org/meta-share/omtd-share/PosTagging

Changes specific to records marked as “for information”

  • unspecifiedPart and distributionUnspecifiedFeature can be used as an alternative to media type specific parts and distribution features (respectively) when the media type is not known; this applies to corpora, lexical/conceptual resources and language descriptions

  • sizeText (free text) can be used on LanguageResource

  • name can be used for persons related to resources

Changes required for XML metadata records

All metadata record examples and templates at https://gitlab.com/european-language-grid/platform/ELG-SHARE-schema have been updated to the most recent schema version. If you have uploaded XML files before with previous schema versions, please note that the following changes are required for any new XML files:

  • personalDataIncluded, sensitiveDataIncluded and anonymized:

    • change value true to http://w3id.org/meta-share/meta-share/yesP, http://w3id.org/meta-share/meta-share/yesS and http://w3id.org/meta-share/meta-share/yesA respectively

    • change value false to http://w3id.org/meta-share/meta-share/noP, http://w3id.org/meta-share/meta-share/noS and http://w3id.org/meta-share/meta-share/noA respectively

  • compliesWith: move to LanguageResource before the element LRSubclass

  • replace value http://w3id.org/meta-share/omtd-share/bPosTagging with http://w3id.org/meta-share/omtd-share/PosTagging

  • for models:

  1. replace the following structure

<ms:LanguageDescription>
        <ms:lrType>LanguageDescription</ms:lrType>
        <ms:LanguageDescriptionSubclass>
                <ms:MLModel>
                        <ms:ldSubclassType>MlModel</ms:ldSubclassType>
                        ...
    </ms:MLModel>
        </ms:LanguageDescriptionSubclass>

with the structure presented below; i.e. add the element ldSubclass, replace MlModel with Model, and add the elements modelType and modelFunction; if the source metadata record provides no information for these two elements, you can use the value http://w3id.org/meta-share/meta-share/unspecified.

<ms:LanguageDescription>
        <ms:lrType>LanguageDescription</ms:lrType>
        <ms:ldSubclass>http://w3id.org/meta-share/meta-share/model</ms:ldSubclass>
        <ms:LanguageDescriptionSubclass>
                <ms:Model>
                        <ms:ldSubclassType>Model</ms:ldSubclassType>
                        <ms:modelType>...</ms:modelType>
                        <ms:modelFunction>...</ms:modelFunction>
                        ...
                </ms:Model>
        </ms:LanguageDescriptionSubclass>
  1. replace any of the following structures

<ms:LanguageDescriptionMediaPart>
                <ms:LanguageDescriptionTextPart>...</ms:LanguageDescriptionTextPart>
</ms:LanguageDescriptionMediaPart>

<ms:LanguageDescriptionMediaPart>
                <ms:LanguageDescriptionImagePart>...</ms:LanguageDescriptionImagePart>
</ms:LanguageDescriptionMediaPart>

<ms:LanguageDescriptionMediaPart>
                <ms:LanguageDescriptionVideoPart>...</ms:LanguageDescriptionVideoPart>
</ms:LanguageDescriptionMediaPart>

with the following

<ms:unspecifiedPart>...</ms:unspecifiedPart>
  1. replace any of the following elements

<ms:distributionTextFeature>...</ms:distributionTextFeature>

<ms:distributionImageFeature>...</ms:distributionImageFeature>

<ms:distributionVideoFeature>...</ms:distributionVideoFeature>

with the element

<ms:distributionUnspecifiedFeature>...</ms:distributionUnspecifiedFeature>
  • for n-gram models:

  1. replace the following structure

<ms:LanguageDescription>
        <ms:lrType>LanguageDescription</ms:lrType>
        <ms:LanguageDescriptionSubclass>
                <ms:NGramModel>
                        <ms:ldSubclassType>NGramModel</ms:ldSubclassType>
                        ...
                </ms:NGramModel>
        </ms:LanguageDescriptionSubclass>

with the structure presented below; i.e. add the element ldSubclass, replace NGramModel with Model, move NGramModel under Model and add the elements modelType and modelFunction; if the source metadata record provides no information for these two elements, you can use the value http://w3id.org/meta-share/meta-share/unspecified.

<ms:LanguageDescription>
        <ms:lrType>LanguageDescription</ms:lrType>
        <ms:ldSubclass>http://w3id.org/meta-share/meta-share/model</ms:ldSubclass>
        <ms:LanguageDescriptionSubclass>
                <ms:Model>
                        <ms:ldSubclassType>Model</ms:ldSubclassType>
                        <ms:modelType>...</ms:modelType>
                        <ms:modelFunction>...</ms:modelFunction>
                        ...
                        <ms:NGramModel>
                        ...
                        </ms:NGramModel>
                </ms:Model>
        </ms:LanguageDescriptionSubclass>
  1. replace any of the following structures

<ms:LanguageDescriptionMediaPart>
                <ms:LanguageDescriptionTextPart>...</ms:LanguageDescriptionTextPart>
</ms:LanguageDescriptionMediaPart>

<ms:LanguageDescriptionMediaPart>
                <ms:LanguageDescriptionImagePart>...</ms:LanguageDescriptionImagePart>
</ms:LanguageDescriptionMediaPart>

<ms:LanguageDescriptionMediaPart>
                <ms:LanguageDescriptionVideoPart>...</ms:LanguageDescriptionVideoPart>
</ms:LanguageDescriptionMediaPart>

with the following

<ms:unspecifiedPart>...</ms:unspecifiedPart>
  1. replace any of the following elements

<ms:distributionTextFeature>...</ms:distributionTextFeature>

<ms:distributionImageFeature>...</ms:distributionImageFeature>

<ms:distributionVideoFeature>...</ms:distributionVideoFeature>

with the element

<ms:distributionUnspecifiedFeature>...</ms:distributionUnspecifiedFeature>
  • for grammars

<ms:LanguageDescription>
        <ms:lrType>LanguageDescription</ms:lrType>
        <ms:LanguageDescriptionSubclass>
                <ms:Grammar>
                        <ms:ldSubclassType>Grammar</ms:ldSubclassType>
                        ...
                </ms:Grammar>
        </ms:LanguageDescriptionSubclass>
<ms:LanguageDescription>
        <ms:lrType>LanguageDescription</ms:lrType>
        <ms:ldSubclass>http://w3id.org/meta-share/meta-share/grammar</ms:ldSubclass>
        <ms:LanguageDescriptionSubclass>
                <ms:Grammar>
                        <ms:ldSubclassType>Grammar</ms:ldSubclassType>
                        ...
                </ms:Grammar>
        </ms:LanguageDescriptionSubclass>

Release v2.0.5

Overview

  • Release date: 4/10/2021

  • ELG-SHARE schema version: v2.0.5

Schema changes

  • added option for free text values for the elements: annotationType, dataFormat and sizeUnit

  • added element bibliographicRecord for adding bibtex record for documents

  • changed cardinality of isDivisionOf for organizations (i.e. allowing multiple parent organizations)

Changes required for XML metadata records

If you have uploaded XML files before with previous schema versions, please note that the following changes are required for any new XML files:

  • annotationType: replace

<ms:annotationType>...</ms:annotationType>

with the following if you use a value from the recommended vocabulary: https://european-language-grid.readthedocs.io/en/stable/Documentation/ELG-SHARE_xsd.html#annotationTypeRecommended

<ms:annotationType>
        <ms:annotationTypeRecommended>...</ms:annotationTypeRecommended>
</ms:annotationType>

or use the following for free text

<ms:annotationType>
        <ms:annotationTypeOther>free text</ms:annotationTypeOther>
</ms:annotationType>
  • dataFormat: replace

<ms:dataFormat>...</ms:dataFormat>

with the following if you use a value from the recommended vocabulary: https://european-language-grid.readthedocs.io/en/stable/Documentation/ELG-SHARE_xsd.html#dataFormatRecommended

<ms:dataFormat>
        <ms:dataFormatRecommended>...</ms:dataFormatRecommended>
</ms:dataFormat>

or use the following for free text

<ms:dataFormat>
        <ms:dataFormatOther>free text</ms:dataFormatOther>
</ms:dataFormat>
  • sizeUnit: replace

<ms:sizeUnit>...</ms:sizeUnit>

with the following if you use a value from the recommended vocabulary: https://european-language-grid.readthedocs.io/en/stable/Documentation/ELG-SHARE_xsd.html#sizeUnitRecommended

<ms:sizeUnit>
        <ms:sizeUnitRecommended>...</ms:sizeUnitRecommended>
</ms:sizeUnit>

or use the following for free text

<ms:sizeUnit>
        <ms:sizeUnitOther>free text</ms:sizeUnitOther>
</ms:sizeUnit>