Validate an LRT hosted in ELG (at technical/metadata level)

See here how to access the “Validation Tasks”, which is the list of items assigned to you for validation. You can, then, apply the filters on the left to help you reduce the number of items presented or search for a specific item using the search box.

Technical Validations Tasks

By selecting one of the metadata records you will be directed to the its view page. Click on the actions box to access the validation form.

Technical/Metadata validation form

In the form that opens you must say whether you Approve or Reject the item after the technical and metadata validation. Please, check that the LRT is as expected, i.e.:

  • no malicious files are contained

  • the data format is as set in the metadata record

You can download the content files from the respective tab in the view page.

Download content files for validation

You are also asked to check whether the values of the following elements are included in the metadata record and whether their values match the description and contents of the dataset:

  • language(s), linguality & multilinguality type: important for findability purposes

  • resource creator(s) and publication date: although not mandatory, they are useful for citation purposes;

  • domain(s): recommended for findability purposes; if possible, recommend the use of an existing value

  • data format(s): the values “unspecified” or “other” must be avoided; if needed, you can use a broader term from the ontology

  • media type(s): check that they correspond to the contents; please use “text” for transcribed speech corpora; “audio” is to be used only for data resources in audio formats

  • corpus and lexical/conceptual resource subclass: important for findability purposes

  • encoding level(s) (for Lexical/conceptual resources): “unspecified” or “other” must be avoided; if needed, a broader term can be used

  • content type(s): recommended for findability purposes

  • documentation: user and installation manuals for tools are recommended; publications describing the use of the resource are also welcome

  • distribution(s): if a resource is available in multiple formats, it’s recommended to describe them as different distributions

  • size: a meaningful size unit depending on the resource type can be recommended (e.g. translation units for TMX files)

  • dataset distribution form: check the values at https://european-language-grid.readthedocs.io/en/stable/Documentation/ELG-SHAREschema.html#DatasetDistributionForm; depending on the form, a different element (access, download or distribution location) is recommended.

If you are satisfied, approve both types of validation and click on “Submit”.

If not, set the value of the Technical validation and/or the Metadata validation (depending on the source of the issue) to Reject. This will generate a new field where you can write the recommendations you would like to share with the curator. You can also add comments in the Validator notes field which will be visible only to other validators. When you have finished, click on submit.

Technical validation form with review comments

The provider will be notified by email (containing the review comments) in order to update the record. Once finished, the provider will re-submit the record for publication and you will be notified to perform the validation again.

Please, keep in mind that an item is published only when it has been approved at all validation levels (technical, metadata and legal).