raw_sample_sheets

Info

Data contract for sample sheet data moving from the landing zone into the raw layer.
- name: Sample Sheets
- version: 0.0.4
- status: active

Terms of Use

Purpose

Data contract for sample sheet data moving from the landing zone into the raw layer.

Servers

Name Type Attributes
production databricks No description.
environment: production
roles: [{'role': 'Admins', 'description': 'Access to all the data and settings'}]
catalog: raw_production
host: dbc-2030845a-6c3b.cloud.databricks.com
schema_: dienekes
development databricks No description.
environment: development
roles: [{'role': 'Admins', 'description': 'Access to all the data and settings'}]
catalog: raw_development
host: dbc-03ed8bbb-c0ec.cloud.databricks.com
schema_: dienekes
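
As a rough illustration of how the server attributes above might be used, the sketch below builds a three-level Databricks table name (catalog.schema.table) for each environment. The `SERVERS` dictionary and the `qualified_table_name` helper are hypothetical names introduced only for this example; the catalog, schema, and host values are copied from the entries above, and the table name `sample_sheets` is taken from the Schema section.

```python
# Server attributes copied from the Servers section of this contract.
# The dictionary layout itself is an assumption made for illustration.
SERVERS = {
    "production": {
        "host": "dbc-2030845a-6c3b.cloud.databricks.com",
        "catalog": "raw_production",
        "schema": "dienekes",
    },
    "development": {
        "host": "dbc-03ed8bbb-c0ec.cloud.databricks.com",
        "catalog": "raw_development",
        "schema": "dienekes",
    },
}


def qualified_table_name(environment: str, table: str = "sample_sheets") -> str:
    """Return catalog.schema.table for the given environment (hypothetical helper)."""
    server = SERVERS[environment]  # raises KeyError for an unknown environment
    return f"{server['catalog']}.{server['schema']}.{table}"


if __name__ == "__main__":
    for env in SERVERS:
        print(env, qualified_table_name(env))
```

Keeping the environment as the only parameter means the same query or job definition can target either workspace without hard-coding a catalog name.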

Schema

sample_sheets

Table storing the sample sheets in the raw layer of the Biocloud.

Field Type Attributes
id string The unique identifier for a row in the table
primaryKey
primaryKeyPosition: 1
plate_serial_number string The serial number of a plate, printed on the plate by the vendor
plate_be_number string The BE number of the plate, which should be the same for all rows of a sheet
project_number string The number assigned to the project
retry boolean A flag indicating whether the sample was retried
plate_position string The position of a sample on a plate, represented by a letter from A to H and a number from 01 to 12
registration_number string The registration number assigned to a specific sample throughout the identification process
field_number string An identifier given to the dwc:Event in the field. Often serves as a link between field notes and the dwc:Event.
full_scientific_name string A string representing the taxonomic identification as it appeared in the original record.
data_entered_by string The person who entered the data in the Sample Sheet.
date_entered date The date on which the row was entered in the sheet.
extraction_method string The method of DNA extraction from a specimen. One of: extraction of subsamples, non-destructive extraction, or destructive extraction.
specimen_id string The Specimen identifier
sample_type string The type of sample.
buffer string The buffer used for the sample.
contents string The contents of the sample.
priority string The priority of the sample.
sheet_name string The name of the Sample Sheet.
inserted_ts_utc timestamp Timestamp in UTC showing when a record was inserted in the Delta table in Biocloud
updated_ts_utc timestamp Timestamp in UTC showing when a record was last updated in the Delta table in Biocloud
last_updated_ts string Timestamp in UTC showing when a record was last updated in the sheet ingestion tool
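
The `plate_position` description above (a letter from A to H and a number from 01 to 12) can be checked with a small validator. This is a minimal sketch, not part of the contract: it assumes the letter and the two-digit number are written together with no separator (for example `A01`), which the contract does not state explicitly, and the names `PLATE_POSITION` and `is_valid_plate_position` are hypothetical.

```python
import re

# Assumed format: one letter A-H followed directly by a two-digit number 01-12.
# The contract gives the ranges only; the lack of a separator is an assumption.
PLATE_POSITION = re.compile(r"[A-H](0[1-9]|1[0-2])")


def is_valid_plate_position(value: str) -> bool:
    """Return True if value matches the assumed plate position format."""
    return PLATE_POSITION.fullmatch(value) is not None


if __name__ == "__main__":
    for candidate in ["A01", "H12", "I01", "A13", "A1"]:
        print(candidate, is_valid_plate_position(candidate))
```

Because the field is typed as `string`, a check like this would have to run as a separate quality rule; the type alone does not enforce the format.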

SLA Properties