Skip to content

Metrics structuring for contribution

Prerequisites

This article requires that you understand how data is stored on SynDB, we recommend reading through the overview article if you are uncertain.

This article is a guide for contributors who wish to upload their data to SynDB. Please don't hesitate to ask for help on the Discord channel if you have any questions; this part can be challenging.

Data structuring

Schema

Each SynDB table has its own schema, which can be found on our GitHub repository. The schema defines the supported column names and their data types. The data must be structured in a way that is compatible with the schema of the table you are contributing to.

The column names and the data stored under them must be in a format that is compatible with the type. You can find the supported column names for each SynDB table in the . You may use the glossary at the end of this article for reference.

Nano

We use nanometers as the unit for all measurements; includes volume, radius, and distance.

Sourcing raw data

You may upload the sourcing raw data files including meshes or SWL to SynDB. Place the absolute path to the file in your table file. The following are supported:

  • Meshes in .glb format, column name: mesh_path
  • SWC files, .swc, column name: swc_path

This list is the main tracker for the supported formats. You may request additional formats on the Discord channel. The SynDB team will review the request and consider adding the new format to the platform.

Columns

Most column types are self-explanatory, but some require additional explanation.

Identifiers and relations

The CID column defined in your table can have any unique hashable value, it will be replaced by a UUID when uploaded to SynDB. When uploading a relational dataset, the cid column in the parent will be used to correlate the relations to the children by their parent_id; meaning the hashable value in the parent cid column must match the parent_id in the child. parent_enum can be omitted as the compartments are defined at the tabular level, and will, therefore, be added automatically.

Example

Notice the parent_id column in the child table, this is the cid of the parent table. The parent_enum column is not present in the child table, as it is defined at the tabular file name.

vesicle.csv, child

cid neurotransmitter voxel_radius distance_to_active_zone minimum_normal_length parent_id centroid_z centroid_x centroid_y
0 glutamate 26.9129 705.2450 23 1 4505.232 1996.224 4953.6
1 glutamate 25.5388 615.0213 23 1 4505.232 1996.224 4953.6
2 glutamate 29.5260 513.0701 23 1 4505.232 1996.224 4953.6
3 glutamate 30.5131 479.9224 23 1 4505.232 1996.224 4953.6
4 glutamate 28.3977 454.8248 23 1 4505.232 1996.224 4953.6
5 glutamate 30.2033 459.7557 23 2 4505.232 1996.224 4953.6
6 glutamate 33.4548 374.8131 23 2 4505.232 1996.224 4953.6
7 glutamate 32.0890 455.9293 23 4 4505.232 1996.224 4953.6

axon.csv, parent

voxel_volume mitochondria_count total_mitochondria_volume cid
385668034.56 1 93208043.52 1
1492089016.32 4 412054179.84 2
327740497.92 0 0 4

Glossary

Key Description
dataset_id The unique identifier for the dataset, of type uuid.
cid The unique identifier for a SynDB unit within the dataset, of type uuid.
parent_id The CID of the parent component, of type uuid.
parent_enum An integer representing the type or category of the parent component, of type int.
polarity The polarity of the neuron, of type ascii.
voxel_volume The volume of the voxel, of type double.
voxel_radius The radius of the voxel, of type double.
s3_mesh_location The location of the mesh in S3 storage, of type smallint.
mesh_volume The volume of the mesh, of type double.
mesh_surface_area The surface area of the mesh, of type double.
mesh_area_volume_ratio The ratio of the surface area to the volume of the mesh, of type double.
mesh_sphericity The sphericity of the mesh, of type double.
centroid_z The z-coordinate of the centroid, of type double.
centroid_x The x-coordinate of the centroid, of type double.
centroid_y The y-coordinate of the centroid, of type double.
s3_swb_location The location of the SWB in S3 storage, of type smallint.
terminal_count The count of terminals, of type int.
mitochondria_count The count of mitochondria, of type int.
total_mitochondria_volume The total volume of mitochondria, of type double.
neuron_id The unique identifier for the associated neuron, of type uuid.
vesicle_count The count of vesicles, of type int.
total_vesicle_volume The total volume of vesicles, of type double.
forms_synapse_with The unique identifier of the synapse that the component forms with, of type uuid.
connection_score The score representing the strength or quality of the connection, of type double.
cleft_score The score for the synaptic cleft, of type int.
GABA The concentration or presence of GABA neurotransmitter, of type double.
acetylcholine The concentration or presence of acetylcholine neurotransmitter, of type double.
glutamate The concentration or presence of glutamate neurotransmitter, of type double.
octopamine The concentration or presence of octopamine neurotransmitter, of type double.
serine The concentration or presence of serine neurotransmitter, of type double.
dopamine The concentration or presence of dopamine neurotransmitter, of type double.
cave_id The identifier of the cave structure, of type int.
pre_id The unique identifier of the pre-synaptic component, of type uuid.
post_id The unique identifier of the post-synaptic component, of type uuid.
dendritic_spine_count The count of dendritic spines, of type int.
neurotransmitter The type of neurotransmitter present in a vesicle, of type ascii.
distance_to_active_zone The distance from the vesicle to the active zone, of type double.
minimum_normal_length The minimum normal length, of type int.
ribosome_count The count of ribosomes within the endoplasmic reticulum, of type int.