You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Runs the MetadataGenerator using the configuration provided in a TOML file.
8
+
9
+
The TOML configuration file must include the following fields:
10
+
11
+
- metadata_file: str
12
+
The path to the .csv, .xlsx, or .xls file where the biosample metadata is stored (include file extension).
13
+
Example: "metadata_file.csv"
14
+
- data_dir: str
15
+
The directory path where the data lives or will be stored.
16
+
Example: "/path/to/data/"
17
+
- ref_calibration_path: str
18
+
The path to the .ref file where the reference calibration data is stored (include .ref).
19
+
Example: "/path/to/calibration_file.ref"
20
+
- raw_data_object_type: str
21
+
The raw data object type. Must match one of the following options:
22
+
- 'Direct Infusion FT ICR-MS Raw Data'
23
+
- 'LC-DDA-MS/MS Raw Data'
24
+
- processed_data_object_type: str
25
+
The processed data object type. Must match one of the following options:
26
+
- 'FT ICR-MS Analysis Results'
27
+
- 'GC-MS Metabolomics Results'
28
+
- registration_file: str
29
+
The desired name of the output file where the data will be dumped (include .json).
30
+
Example: "output_file.json"
31
+
- execution_resource: str
32
+
The execution resource used for the analysis. Must match one of the following options:
33
+
- 'EMSL-RZR'
34
+
- 'EMSL'
35
+
- field_strength: int
36
+
The field strength for the NOM analysis. Must match one of the following values:
37
+
- 7
38
+
- 12
39
+
- 15
40
+
- 21
41
+
- minting_client_config_path: str
42
+
The path to the NMDC minting client configuration file. Defaults to 'enviroMS/nmdc_metadata_generation/config.yaml'.
43
+
44
+
Notes
45
+
-----
46
+
This function assumes:
47
+
- If a biosample_id does not exist for the sample in the metadata_file, a biosample will be generated. On the other hand,
48
+
if a biosample_id does exist in the spreadsheet, no biosample will be generated. And the biosample_id present will be used to
49
+
generate all other metadata.
50
+
- The metadata_file conforms to a predefined structure. See example spreadsheet: https://docs.google.com/spreadsheets/d/1-xHGkkG5Gpw5Pen1iM_JUP2XmphuF19ps2ZYONvtDUs/edit?gid=1112301083#gid=1112301083
51
+
- Necessary configuration and calibrations are already added to MongoDB. If new configurations are needed, they must be added beforehand.
0 commit comments