Data Organization
The Databank includes three types of objects: Simulation, Experiment, and Molecule. Each is described by it’s own metadata file. Metadata is stored in YAML format. Simulation object stores precomputed properties in JSON format. Experiment object stores experimental data in JSON format.
Databank objects are stored in the separated repository; by default, it’s a BilayerData but any of its forks or even completely new repo could be used instead.
It is connected into the FAIRMD Lipids repository as a submodule into ./Data folder.
Data/
├── Simulations/
│ └── rw5/
│ └── g8v/
│ └── au3/
│ └── b8c/
│ ├── data-piece.json
│ └── README.yaml
├── experiments/
│ ├── FormFactors/
│ │ └── 10.1001/
│ │ └── j.some.thing.3242.11/
│ │ └── 1/
│ │ ├── form-factor-data.json
│ │ └── README.yaml
│ └── OrderParameter/
│ │ └── 10.1001/
│ │ └── j.some.thing.3242.11/
│ │ └── 1/
│ │ ├── lipid1-data.json
│ │ ├── lipid2-data.json
│ │ └── README.yaml
├── info_files/
│ └── some-folder/
│ ├── info1.yaml
│ └── info2.yaml
├── lipid_json_buildH/
│ ├── ua-dictionary-1.json
│ └── ua-dictionary-2.json
├── Ranking/
│ ├── lipid1-headgroup-ranking.json
│ └── lipid2-tail1-ranking.json
└── Molecules/
├── membrane/
│ └── lipid1/
│ ├── lipid1-forcefield2-mapping.yaml
│ └── metadata.yaml
└── solution/
└── ion1/
├── ion1-forcefield3-mapping.yaml
└── metadata.yaml
Simulation record
Simulation metadata README.yaml
A file containing all the relevant information of each simulation entry. These files are
stored at fairmd.lipids.FMDL_SIMU_PATH.
For more information see Simulation metadata (README.yaml).
Simulation computed properties
apl.jsoncontains area per lipid (Å2) as a function of time (ps) for a simulation.
Seefairmd.lipids.analyze.computeAPL()for computing details.XXXXOrderParameters.jsoncontains C-H bond order parameters and their uncertainties for each lipid XXXX. Key in json format is the merged universal C and H atom names. Three values are given: order parameter value for the C-H bond, standard deviation, and the standard error of the mean.
Seefairmd.lipids.analyze.computeOP()for computing details.FormFactor.jsoncontains simulated X-ray scattering form factor. X-axis value unit are Å-1 and y-axis value units are e/nm2.
Seefairmd.lipids.analyze.computeFF()for computing details.XXXXDensity.jsoncontains electron densities calculated from trajectory. We store separately lipid, water, and total densities. X-axis units are nm and y-axis units e/nm3.
Seefairmd.lipids.analyze.computeDensity()for computing details.thickness.jsoncontains thickness (nm) calculated from the trajectory.
Seefairmd.lipids.analyze.computeThickness()for computing details.eq_times.jsoncontains a dictionary with relative equilibration time of molecule principal components. It demonstrates how long time is required in this particular simulation to sample the ensemble for each lipid molecule.
Seefairmd.lipids.analyze.computeNMRPCA()for computing details.
Simulation quality files
POPC_OrderParameters_quality.jsoncontains a dictionary with the quality of each C-H bond against experiments if available. First key is the DOI for the source of experimental data. Second key gives C-H pair univeral atom names. Third key gives values for simulation order parameter, its stardard deviation, standard error of the mean, experimental order parameter, its error, and finally the quality of the order parameter. Quality is the probability for the agreement between simulated and experimental values taking into account the error bars.For more details, see the FAIRMD Lipids manuscript.
POPC_FragmentQuality.jsoncontains fragment qualities determined separately for each lipid in the simulation with experimental data available using Eq. (4) in the FAIRMD Lipids manuscript.SYSTEM_quality.jsoncontains total qualities averaged over different lipids for different membrane parts calculated from Eq. (5) in the FAIRMD Lipids manuscript.FormFactorQuality.jsoncontains quality of form factor against experimental data determined as described in the FAIRMD Lipids manuscript. Second term is the scaling coefficient for experimental intensities (Eq. (6) in the FAIRMD Lipids manuscript).
Experiment record
The data is located in subfolders named after the DOI of the original source publication
of the data. Because a publication can contain more than one experimental data set, each
data set is stored in a subfolder with integer name, for example,
[experiments/OrderParameters/10.1039/c2cp42738a/2](https://github.com/NMRLipids/BilayerData/tree/main/experiments/OrderParameters/10.1039/c2cp42738a/2).
Each of such folders then contain the experimental data and README.yaml files
describing the experimental conditions.
Order parameter NMR experiment
TODO: write the block
README.yamlcontains metadata of the experiment, such as DOI, lamellar phase preparation protocol, temperature, B0 field, ions concentration and other relevant information.POPC_OrderParameters.jsoncontains C-H bond order parameters and their experimental uncertainties defined for universal atom names.
Form factor X-ray experiment metadata
TODO: write the block
README.yamlcontains metadata of the experiment, such as DOI, properties of the source, type of liposome suspension, and other relevant information.FormFactor.jsoncontains X-ray scattering form factor data.
Molecule record
Molecule metadata
TODO: write the block
Mapping files
Describe connection between universal atom names and simulation specific atom names. These enable automatic analyses over simulations with different naming conventions. For more information, see Universal molecule and atom names
Other files
info.yaml
Contains information given by contributor when adding data into the FAIRMD Lipids, i.e.
all non-recomputable fields of simulation README.yaml. This file is given as an
input to _add_simulation_py to create README.yaml files:
fmdl_add_simulation -f info.yaml. These files are not required but currently are
stored historically in info_files subfolder of
fairmd.lipids.FMDL_DATA_PATH.
For more information see Simulation metadata (README.yaml) and Adding simulations.
Lipid united-atom dictionary
TODO: write the block