raw_metadata_sheets¶
Info¶
Data contract for the metadata sheets data from the landing zone into raw. - name: Metadata Sheets - version: 0.0.4 - status: active
Terms of Use¶
Purpose¶
Data contract for the metadata sheets data from the landing zone into raw.
Servers¶
| Name | Type | Attributes |
|---|---|---|
| production | databricks | No description. • environment: production • roles: [{'role': 'Admins', 'description': 'Access to all the data and settings'}] • catalog: raw_production • host: dbc-2030845a-6c3b.cloud.databricks.com • schema_: dienekes |
| development | databricks | No description. • environment: development • roles: [{'role': 'Admins', 'description': 'Access to all the data and settings'}] • catalog: raw_development • host: dbc-03ed8bbb-c0ec.cloud.databricks.com • schema_: dienekes |
Schema¶
metadata_sheets¶
Table storing the metadata sheets in the raw layer of the Biocloud.
| Field | Type | Attributes |
|---|---|---|
| id | string | Unique identifier of the specimen record • primaryKey• primaryKeyPosition: 1 |
| registration_number | string | Official registration number of the specimen |
| recorded_by_id | string | Identifier of the person who recorded the specimen |
| suffix | string | Additional suffix for registration or catalog number |
| mount | string | Type of specimen mount or preparation |
| preservation | string | General preservation method used |
| basis_of_record | string | Type of record (e.g., specimen, observation) |
| preserved_part | string | Specific part of the organism preserved |
| present_in_collection | string | Indicates if specimen is currently in collection |
| data_in_crs | boolean | Indicates if record is in collection management system |
| registration_type | string | Type of registration assigned to the specimen |
| bait_pheromones | string | Bait pheromones used during collection |
| remarks | string | General comments or remarks about the specimen |
| family_name | string | Taxonomic family name |
| subfamily_name | string | Taxonomic subfamily name |
| qualifier_entire_name | string | Qualifier for entire scientific name |
| genus | string | Taxonomic genus name |
| subgenus | string | Taxonomic subgenus name |
| qualifier_species | string | Qualifier for species name |
| species | string | Taxonomic species name |
| qualifier_subspecies | string | Qualifier for subspecies name |
| subspecies | string | Taxonomic subspecies name |
| determination_author_and_year | string | Author and year of taxonomic determination |
| full_name_no_authority | string | Scientific name without author citation |
| full_scientific_name | string | Full scientific name with author citation |
| type_status | string | Type status of the specimen |
| determination_name_comments | string | Comments on the determination name |
| determination_identifier | string | Identifier of person who made determination |
| identification_start_date | string | Date when identification process began |
| identification_2_genus | string | Second identification genus name |
| identification_2_species | string | Second identification species name |
| identification_2_subspecies | string | Second identification subspecies name |
| identification_2_author_and_year | string | Author and year of second identification |
| identification_2_qualifier_entire_name | string | Qualifier for entire name in second identification |
| identification_2_qualifier_species | string | Qualifier for species in second identification |
| identification_2_qualifier_subspecies | string | Qualifier for subspecies in second identification |
| identification_2_name_comments | string | Comments on second determination name |
| identification_2_identifier | string | Identifier of person for second determination |
| identification_2_start_date | string | Start date of second identification |
| count | bigint | Total count of specimens |
| count_approximate | bigint | Indicates if specimen count is approximate |
| sex | string | Sex of specimen(s) |
| phase_or_stage | string | Developmental stage or phase of specimen |
| detailed_specification_1_number_of_specimen | string | Number of specimens in first detailed group |
| detailed_specification_1_part | string | Body part in first detailed group |
| detailed_specification_1_approximate | string | Indicates if first group count is approximate |
| detailed_specification_1_sex | string | Sex of specimens in first group |
| detailed_specification_1_phase_or_stage | string | Stage of specimens in first group |
| detailed_specification_2_number_of_specimen | string | Number of specimens in second detailed group |
| detailed_specification_2_part | string | Body part in second detailed group |
| detailed_specification_2_approximate | string | Indicates if second group count is approximate |
| detailed_specification_2_sex | string | Sex of specimens in second group |
| detailed_specification_2_phase_or_stage | string | Stage of specimens in second group |
| detailed_specification_3_number_of_specimen | string | Number of specimens in third detailed group |
| detailed_specification_3_part | string | Body part in third detailed group |
| detailed_specification_3_approximate | string | Indicates if third group count is approximate |
| detailed_specification_3_sex | string | Sex of specimens in third group |
| detailed_specification_3_phase_or_stage | string | Stage of specimens in third group |
| detailed_specification_4_number_of_specimen | string | Number of specimens in fourth detailed group |
| detailed_specification_4_part | string | Body part in fourth detailed group |
| detailed_specification_4_approximate | string | Indicates if fourth group count is approximate |
| detailed_specification_4_sex | string | Sex of specimens in fourth group |
| detailed_specification_4_phase_or_stage | string | Stage of specimens in fourth group |
| collector_name | string | Name of person who collected specimen |
| collector_residence | string | Residence or affiliation of collector |
| collecting_method | string | Method used for collecting specimen |
| collecting_date_start | string | Start date of specimen collection |
| collecting_date_end | string | End date of specimen collection |
| collecting_date_string | string | Verbatim collection date text |
| collecting_time_start | string | Start time of collection |
| collecting_time_end | string | End time of collection |
| gatheringsites_country | string | Country where specimen was collected |
| gatheringsites_state_province | string | State or province of collection site |
| gatheringsites_island | string | Island where specimen was collected |
| gatheringsites_locality | string | Locality of collection site |
| gatheringsites_full_locality_text | string | Full locality description as provided |
| place_type | string | Type of place where collected |
| place_name | string | Name of place of collection |
| station_number | string | Station or site number |
| sample_id | string | Identifier of related sample |
| survey_name | string | Name of survey or expedition |
| synecology_scientific_or_informal_name | string | Associated species name or ecological partner |
| synecology_sex | string | Sex of associated organism |
| synecology_determination_type | string | Type of identification for associated organism |
| synecology_phase_or_stage | string | Developmental stage of associated organism |
| synecology_comment | string | Comments about associated species or ecology |
| biotope_text | string | Description of habitat or environment |
| verbatim_coordinates | string | Coordinates as originally recorded |
| decimal_latitude_wgs84 | double | Latitude in decimal degrees (WGS84) |
| decimal_longitude_wgs84 | double | Longitude in decimal degrees (WGS84) |
| gatheringcoordinates_uncertainty_m | string | Coordinate uncertainty in meters |
| minimum_altitude_in_meters | double | Minimum altitude at collection site |
| maximum_altitude_in_meters | double | Maximum altitude at collection site |
| minimum_height_in_meters | double | Minimum height where collected |
| maximum_height_in_meters | double | Maximum height where collected |
| minimum_depth_in_meters | double | Minimum depth where collected |
| maximum_depth_in_meters | double | Maximum depth where collected |
| preservation_technique | string | Technique used for preservation |
| applied_preservative | string | Chemical or method applied for preservation |
| acquisition_date_string | string | Date of specimen acquisition |
| acquired_from_agent | string | Source or agent from whom acquired |
| acquisition_residence | string | Residence or affiliation of acquisition source |
| additional_numbers_1_type | string | Type of first additional number |
| additional_numbers_1_value | string | Value of first additional number |
| additional_numbers_1_comment | string | Comment for first additional number |
| additional_numbers_2_type | string | Type of second additional number |
| additional_numbers_2_value | string | Value of second additional number |
| additional_numbers_2_comment | string | Comment for second additional number |
| additional_numbers_3_type | string | Type of third additional number |
| additional_numbers_3_value | string | Value of third additional number |
| additional_numbers_3_comment | string | Comment for third additional number |
| datagroup_description | string | Description of the data group or set |
| higher_name_1_name | string | Name of higher taxonomic rank 1 |
| higher_name_1_rank | string | Rank of higher taxonomic name 1 |
| higher_name_2_name | string | Name of higher taxonomic rank 2 |
| higher_name_2_rank | string | Rank of higher taxonomic name 2 |
| sheet_name | string | Name or code of specimen sheet |
| inserted_ts_utc | timestamp | UTC timestamp when record was inserted |
| updated_ts_utc | timestamp | UTC timestamp when record was last updated |
| located_within_trap | string | Which trap identifier was located in |
| data_in_brahms | boolean | Is it part of Brahms collection |
| location_remarks | string | General comments or remarks about the location |
| delete | boolean | Flag indicating a record was deleted from the batch |
| deleted_at | timestamp | Timestamp indicating if a record was deleted. Is used in enriched to perform soft deletes |
| last_updated_ts | timestamp | Timestamp indicating when a record was last updated. Used for the retrieval of deltas |