Skip to content

enriched_amplicon

Info

Data contract for the Amplicon table of the dna domain of the Biocloud. - name: Amplicon - version: 0.0.1 - status: active

Terms of Use

Purpose

Data contract for the Amplicon table of the dna domain of the Biocloud.

Servers

Name Type Attributes
production databricks No description.
environment: production
roles: [{'role': 'Admins', 'description': 'Access to all the data and settings'}]
catalog: dna_production
host: dbc-2030845a-6c3b.cloud.databricks.com
schema_: enriched
development databricks No description.
environment: development
roles: [{'role': 'Admins', 'description': 'Access to all the data and settings'}]
catalog: dna_development
host: dbc-03ed8bbb-c0ec.cloud.databricks.com
schema_: enriched

Schema

amplicon

The amplicon table contains information about the amplicons. These are plastic tubes used in the lab to store DNA samples.

Field Type Attributes
amplicon_id bigint Biocloud generated identifier for the amplicon.
primaryKey
primaryKeyPosition: 1
required
dna_extract_golden_id bigint Foreign key to the DNA extract table. A Golden ID is a unique system generated identifier assigned to each record to ensure that each record is uniquely identifiable within our Data Lake.
sequencing_run_id bigint Foreign key to the Sequencing run table.
dna_extract_id string The string ID of the DNA extract used in the source system to identify the sample.
examples: ['e1100113138', 'e1100026669']
project_id string The string ID of the associated project from the source system.
examples: ['24026-5004000', '22033-375157']
pcr_id string Identifier of the PCR reaction in the source system.
required
primer_name_forward string Name of the forward primer used in the sequencing process.
required
examples: ['ONT_LepF1+ONT_LCO1490', 'ONT_FLop_Fw']
primer_name_reverse string Name of the reverse primer used in the sequencing process.
required
examples: ['ONT_LepR1+ONT_HCO2198', 'ONT_FLop_Rv']
marker string Genetic marker used for sequencing.
required
is_control boolean Indicates if a sample is used as a a control sample to compare against in the DNA sequencing.
source_id string The ID of this record in the source system. This can be used to track the origin of the data.
source string The name of the source system used to create this record in the table.
inserted_ts_utc timestamp UTC timestamp when record was inserted.
updated_ts_utc timestamp UTC timestamp when record was last updated.

SLA Properties