Biocloud

enriched_amplicon

Info

Data contract for the Amplicon table of the dna domain of the Biocloud.

- name: Amplicon
- version: 0.0.1
- status: active

Terms of Use

Purpose

Data contract for the Amplicon table of the dna domain of the Biocloud.

Servers

Name	Type	Attributes
production	databricks	No description. • environment: production • roles: [{'role': 'Admins', 'description': 'Access to all the data and settings'}] • catalog: dna_production • host: dbc-2030845a-6c3b.cloud.databricks.com • schema_: enriched
development	databricks	No description. • environment: development • roles: [{'role': 'Admins', 'description': 'Access to all the data and settings'}] • catalog: dna_development • host: dbc-03ed8bbb-c0ec.cloud.databricks.com • schema_: enriched
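
The catalog and schema values above combine into a Databricks three-level name (catalog.schema.table). The following is a minimal sketch, assuming the table is called `amplicon` as in the Schema section; the `SERVERS` dict and the `qualified_table_name` helper are hypothetical names used for illustration and are not part of the contract tooling.

```python
# Catalog and schema values copied from the Servers table.
# The dict layout itself is an assumption made for this example.
SERVERS = {
    "production": {"catalog": "dna_production", "schema": "enriched"},
    "development": {"catalog": "dna_development", "schema": "enriched"},
}


def qualified_table_name(environment: str, table: str = "amplicon") -> str:
    """Return catalog.schema.table for the given environment (hypothetical helper)."""
    server = SERVERS[environment]
    return f"{server['catalog']}.{server['schema']}.{table}"


print(qualified_table_name("production"))
print(qualified_table_name("development"))
```

Keeping the environment as the only parameter means a query can be pointed at development or production without editing the table reference by hand.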

Schema

amplicon

The amplicon table contains information about amplicons. An amplicon is a DNA fragment produced by PCR amplification of a DNA extract, using a forward and reverse primer pair that targets a genetic marker.

Field	Type	Attributes
amplicon_id	bigint	Biocloud generated identifier for the amplicon. • `primaryKey` • primaryKeyPosition: 1 • `required`
dna_extract_golden_id	bigint	Foreign key to the DNA extract table. A Golden ID is a unique, system-generated identifier assigned to each record so that the record can be identified unambiguously within our Data Lake.
sequencing_run_id	bigint	Foreign key to the Sequencing run table.
dna_extract_id	string	The string ID of the DNA extract used in the source system to identify the sample. • examples: ['e1100113138', 'e1100026669']
project_id	string	The string ID of the associated project from the source system. • examples: ['24026-5004000', '22033-375157']
pcr_id	string	Identifier of the PCR reaction in the source system. • `required`
primer_name_forward	string	Name of the forward primer used in the sequencing process. • `required` • examples: ['ONT_LepF1+ONT_LCO1490', 'ONT_FLop_Fw']
primer_name_reverse	string	Name of the reverse primer used in the sequencing process. • `required` • examples: ['ONT_LepR1+ONT_HCO2198', 'ONT_FLop_Rv']
marker	string	Genetic marker used for sequencing. • `required`
is_control	boolean	Indicates whether the sample is a control sample used for comparison in the DNA sequencing.
source_id	string	The ID of this record in the source system. This can be used to track the origin of the data.
source	string	The name of the source system used to create this record in the table.
inserted_ts_utc	timestamp	UTC timestamp when the record was inserted.
updated_ts_utc	timestamp	UTC timestamp when the record was last updated.
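
The field table above can be turned into a simple record check. The sketch below is one possible way to do that in plain Python and is not the validation used by the Biocloud itself. It assumes that `bigint` maps to `int`, `string` to `str`, `boolean` to `bool`, and `timestamp` to `datetime`; `AMPLICON_FIELDS` and `validate_amplicon` are hypothetical names, and the `pcr_id` and `marker` values in the usage example are placeholders rather than real data.

```python
from datetime import datetime

# Field name -> (Python type, required), transcribed from the field table.
# The SQL-to-Python type mapping is an assumption made for this example.
AMPLICON_FIELDS = {
    "amplicon_id": (int, True),
    "dna_extract_golden_id": (int, False),
    "sequencing_run_id": (int, False),
    "dna_extract_id": (str, False),
    "project_id": (str, False),
    "pcr_id": (str, True),
    "primer_name_forward": (str, True),
    "primer_name_reverse": (str, True),
    "marker": (str, True),
    "is_control": (bool, False),
    "source_id": (str, False),
    "source": (str, False),
    "inserted_ts_utc": (datetime, False),
    "updated_ts_utc": (datetime, False),
}


def validate_amplicon(record: dict) -> list[str]:
    """Return a list of contract violations for one record (hypothetical helper)."""
    errors = []
    for name, (expected_type, required) in AMPLICON_FIELDS.items():
        value = record.get(name)
        if value is None:
            if required:
                errors.append(f"{name}: required field is missing")
            continue
        # bool is a subclass of int in Python, so reject it explicitly for int fields.
        wrong_bool = expected_type is int and isinstance(value, bool)
        if wrong_bool or not isinstance(value, expected_type):
            errors.append(
                f"{name}: expected {expected_type.__name__}, got {type(value).__name__}"
            )
    # Report fields that the contract does not define.
    for name in record:
        if name not in AMPLICON_FIELDS:
            errors.append(f"{name}: field is not defined in the contract")
    return errors


# Usage: primer names come from the contract examples; pcr_id and marker are placeholders.
record = {
    "amplicon_id": 1,
    "pcr_id": "placeholder-pcr-id",
    "primer_name_forward": "ONT_LepF1+ONT_LCO1490",
    "primer_name_reverse": "ONT_LepR1+ONT_HCO2198",
    "marker": "placeholder-marker",
    "is_control": False,
}
print(validate_amplicon(record))
```

Returning a list of messages instead of raising on the first problem lets a caller report every violation in a record at once.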

SLA Properties