enriched_material_entity
Info
Data contract for the Material Entity table of the dna domain of the Biocloud. Contains information about specimen and trap registrations from collection management systems.

- name: Material Entity
- version: 0.0.1
- status: active

Terms of Use
Purpose
Data contract for the Material Entity table of the dna domain of the Biocloud. Contains information about specimen and trap registrations from collection management systems.
Servers
| Name | Type | Attributes |
|---|---|---|
| production | databricks | No description. • environment: production • roles: Admins (Access to all the data and settings) • catalog: dna_production • host: dbc-2030845a-6c3b.cloud.databricks.com • schema_: enriched |
| development | databricks | No description. • environment: development • roles: Admins (Access to all the data and settings) • catalog: dna_development • host: dbc-03ed8bbb-c0ec.cloud.databricks.com • schema_: enriched |
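
The catalog and schema attributes of each server combine with a table name into a three-part Databricks identifier. The following is a minimal sketch, assuming the physical table is named `material_entity` after the schema heading (the contract does not state this explicitly). The `SERVERS` dict and the `qualified_table_name` helper are hypothetical and only mirror the values listed in the table above.

```python
# Server attributes copied from the Servers table; the dict layout is illustrative.
SERVERS = {
    "production": {"catalog": "dna_production", "schema": "enriched"},
    "development": {"catalog": "dna_development", "schema": "enriched"},
}


def qualified_table_name(server: str, table: str = "material_entity") -> str:
    """Return catalog.schema.table for one of the servers in the contract.

    The default table name is an assumption, not a value from the contract.
    """
    try:
        entry = SERVERS[server]
    except KeyError:
        raise ValueError(f"unknown server: {server!r}") from None
    return f"{entry['catalog']}.{entry['schema']}.{table}"


print(qualified_table_name("production"))
```

Keeping the server name as the only input makes it straightforward to switch a query between the development and production catalogs.
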
Schema
material_entity
The Material Entity table contains all information about specimen and trap registrations. This includes physical specimens, environmental samples, and trap deployments used in biodiversity research.
| Field | Type | Attributes |
|---|---|---|
| material_entity_id | bigint | Biocloud generated identifier for the material entity. • primaryKey • primaryKeyPosition: 1 • required |
| bait | string | Type of bait used if this is a trap-based collection. • examples: ['sugar water', 'UV light', 'pheromone'] |
| basis_of_record | string | Darwin Core term indicating the nature of the record (e.g., PreservedSpecimen, MaterialSample). • examples: ['PreservedSpecimen', 'MaterialSample'] |
| catalog_number | string | Unique identifier for the specimen in the collection. • required • examples: ['RMNH.5143366', 'RMNH.INS.123456'] |
| collection_code | string | Code identifying the collection to which the specimen belongs. • examples: ['RMNH', 'WAG', 'L'] |
| dataset_name | string | Name of the dataset or collection type. |
| data_in_brahms | boolean | Whether the catalog number occurs in the BRAHMS botanical database. |
| data_in_crs | boolean | Whether the catalog number occurs in the CRS (Collection Registration System). |
| located_within_trap | string | Identifier of the trap, if the specimen was collected from within a trap. |
| decimal_latitude | double | Latitude of collection location in decimal degrees, using WGS84 datum. • examples: [52.0907, -3.7533] |
| decimal_longitude | double | Longitude of collection location in decimal degrees, using WGS84 datum. • examples: [5.1214, -3.7533] |
| event_date_end | string | End date of the collecting event in ISO 8601 format. |
| event_date_start | string | Start date of the collecting event in ISO 8601 format. |
| geodetic_datum | string | Geodetic datum used for coordinates (typically WGS84). • examples: ['WGS84'] |
| habitat | string | Description of the habitat where the specimen was collected. • examples: ['deciduous forest', 'coastal dune', 'agricultural field'] |
| lifestage | string | Life stage of the organism at time of collection. • examples: ['adult', 'larva', 'juvenile', 'egg'] |
| location_remarks | string | Additional remarks about the collection location. |
| location | string | Concatenation of country, state, island, locality, and place providing full location context. |
| maximum_elevation_meters | double | Maximum elevation of collection site in meters above sea level. |
| minimum_elevation_meters | double | Minimum elevation of collection site in meters above sea level. |
| minimum_depth_meters | double | Minimum depth below surface in meters (for aquatic or soil samples). |
| maximum_depth_meters | double | Maximum depth below surface in meters (for aquatic or soil samples). |
| minimum_distance_above_surface_meters | double | Minimum distance above ground/water surface in meters (for aerial samples). |
| maximum_distance_above_surface_meters | double | Maximum distance above ground/water surface in meters (for aerial samples). |
| occurrence_remarks | string | Comments or notes about the occurrence. |
| organism_remarks | string | Comments or notes about the individual organism. |
| parent_id | bigint | Foreign key to parent material entity if this is a sub-sample. |
| preservation | string | Method used to preserve the specimen. • examples: ['ethanol', 'pinned', 'dried', 'frozen'] |
| preserved_part | string | Part of organism that was preserved. • examples: ['whole organism', 'tissue sample', 'DNA extract'] |
| record_number | string | Identifier given at time of recording (e.g., collector number). |
| recorded_by_id | string | Identifier of the person who recorded/collected the specimen. |
| recorded_by | string | Name of the person who recorded/collected the specimen. |
| registration_status | string | Current registration status of the specimen. |
| rights_holder_id | string | Identifier of the institution or person holding rights to the specimen. |
| sample_kit_number | string | Identifier of the sample kit used for collection. |
| sample_plate_id | string | Identifier of the storage plate containing the sample. |
| sample_plate_position | string | Position on the storage plate (well position). • examples: ['A1', 'H12'] |
| sampling_protocol | string | Method or protocol used for sampling. • examples: ['malaise trap', 'hand collection', 'net sweep'] |
| sex | string | Sex of the organism. • examples: ['male', 'female', 'hermaphrodite'] |
| specimen_mount | string | How the specimen is mounted or stored. • examples: ['pinned', 'slide mounted', 'in vial'] |
| subject | string | Subject tags or keywords associated with the specimen. |
| coordinate_uncertainty_in_meters | string | Uncertainty radius for coordinates in meters. |
| country | string | Country where specimen was collected. |
| crs_datagroup_description | string | Description from CRS datagroup. |
| disposition | string | Current disposition of the specimen. • examples: ['in collection', 'not in collection', 'missing'] |
| event_date_verbatim | string | Verbatim collecting date as originally recorded. |
| event_name | string | Name of the collecting event or expedition. |
| event_time_end | string | End time of collecting event. |
| event_time_start | string | Start time of collecting event. |
| individual_count | bigint | Number of individuals in the sample. |
| individual_count_approximately | bigint | Approximate count when exact count unavailable. |
| island | string | Name of island where specimen was collected. |
| locality | string | Specific locality description. |
| locality_verbatim | string | Verbatim locality text as originally recorded. |
| material_sample_id | string | Identifier for material sample. |
| place_name | string | Name of the place. |
| place_type | string | Type of place. |
| preservative | string | Preservative used for storage. |
| state_province | string | State or province where collected. |
| station_number | string | Station number for the collection site. |
| synecology_associated_specimen_lifestage | string | Life stage of associated specimens in synecology study. |
| synecology_associated_specimen_sex | string | Sex of associated specimens in synecology study. |
| synecology_associated_taxon_name | string | Taxonomic name of associated taxa in synecology study. |
| synecology_associated_taxon_type | string | Type of taxonomic association in synecology study. |
| synecology_comment | string | Comments about synecological relationships. |
| verbatim_coordinates | string | Coordinates as originally recorded. |
| source | string | The name of the source system used to create this record in the table. • required • examples: ['ada', 'sampling-sheet'] |
| source_id | string | The ID of this record in the source system. • required |
| inserted_ts_utc | timestamp | UTC timestamp when record was inserted. |
| updated_ts_utc | timestamp | UTC timestamp when record was last updated. |
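
The `required` markers and the coordinate descriptions in the table above translate into simple record-level checks. The sketch below is illustrative only: `validate_record` is a hypothetical helper, not part of any Biocloud tooling, and the latitude and longitude bounds are the usual ranges for decimal degrees rather than constraints stated in the contract. The `source_id` value in the example row is a made-up placeholder.

```python
# Fields marked "required" in the material_entity schema table.
REQUIRED_FIELDS = ("material_entity_id", "catalog_number", "source", "source_id")

# Usual decimal-degree bounds (assumption: not stated in the contract itself).
COORDINATE_BOUNDS = {
    "decimal_latitude": (-90.0, 90.0),
    "decimal_longitude": (-180.0, 180.0),
}


def validate_record(record: dict) -> list[str]:
    """Return human-readable problems found in one material_entity row."""
    problems = []
    # A required field is a problem when it is absent or explicitly None.
    for field in REQUIRED_FIELDS:
        if record.get(field) is None:
            problems.append(f"missing required field: {field}")
    # Coordinates are optional, so only check them when a value is present.
    for field, (low, high) in COORDINATE_BOUNDS.items():
        value = record.get(field)
        if value is not None and not (low <= value <= high):
            problems.append(f"{field} out of range: {value}")
    return problems


example = {
    "material_entity_id": 1,
    "catalog_number": "RMNH.INS.123456",
    "source": "ada",
    "source_id": "placeholder-id",  # hypothetical value for illustration
    "decimal_latitude": 52.0907,
    "decimal_longitude": 5.1214,
}
print(validate_record(example))
```

Returning a list of problems instead of raising on the first one lets a caller report every issue in a row at once.
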