GBIF Data Processing and Validation

التفاصيل البيبلوغرافية
العنوان: GBIF Data Processing and Validation
المؤلفون: Waller,John, Volik,Nikolay, Mendez,Federico, Hahn,Andrea
المصدر: Biodiversity Information Science and Standards 5: e75686
بيانات النشر: Pensoft Publishers
سنة النشر: 2021
المجموعة: Pensoft Publishers
مصطلحات موضوعية: tool, API, data quality, issue flag, workflow, reporting, hosted portals, data publication, occurrence data, publisher support, Darwin Core
الوصف: GBIF (Global Biodiversity Information Facility) is the largest data aggregator of biological occurrences in the world. GBIF was officially established in 2001 and has since aggregated 1.8 billion occurrence records from almost 2000 publishers. GBIF relies heavily on Darwin Core (DwC) for organising the data it receives. GBIF Data Processing PipelinesEvery single occurrence record that gets published to GBIF goes through a series of three processing steps until it becomes available on GBIF.org.source downloadingparsing into verbatim occurrences interpreting verbatim valuesOnce all records are available in the standard verbatim form, they go through a set of interpretations.In 2018, GBIF processing underwent a significant rewrite in order to improve speed and maintainablility. One of the main goals of this rewrite was to improve the consistency between GBIF's processing and that of the Living Atlases. In connection with this, GBIF's current data validator fell out of sync with GBIF pipelines processing.New GBIF Data ValidatorThe current GBIF data validator is a service that allows anyone with a GBIF-relevant dataset to receive a report on the syntactical correctness and the validity of the content contained within the dataset. By submitting a dataset to the validator, users can go through the validation and interpretation procedures usually associated with publishing in GBIF and quickly determine potential issues in data, without having to publish it. GBIF is planning to rework the current validator because the current validator does not exactly match current GBIF pipelines processing.Planned Changes The new validator will match the processing of the GBIF pipelines project.Validations will be saved and show up on user pages similar to the way downloads and derived datasets appear now (no more bookmarking validations!)A downloadable report of issues found will be produced.Suggested Changes/Ideas One of the main guiding philosophies for the new validator user interface will be avoiding information overload. The ...
نوع الوثيقة: conference object
وصف الملف: text/html
اللغة: English
العلاقة: info:eu-repo/semantics/altIdentifier/eissn/2535-0897
DOI: 10.3897/biss.5.75686
الإتاحة: https://doi.org/10.3897/biss.5.75686Test
https://biss.pensoft.net/article/75686Test/
حقوق: info:eu-repo/semantics/openAccess
رقم الانضمام: edsbas.B613C17B
قاعدة البيانات: BASE