Prepare for Analysis
Last updated
Last updated
Prepare for Analysis is used to export participant-level data corresponding to your filters and variable selections. There are several steps to export the data, which are shown using this process.
The first step of the process is to review your cohort details. This provides a tabular summary of the variables that have been filtered and added.
To complete the export, the user will need to decide what format they would like their participant-level data to be in. There are two options: Export as Data Frame or CSV or Export as PFB.
In some instances, multiple values may relate to a single variable per participant. For example, some participants may have several timepoints of information, resulting in many values for a single participant. If there are multiple values for a given variable, these values will be separated by a tab or \t
character.
The data is now ready for export. Based on your export format selection, there will be options displayed for export.
If you chose Export as Data Frame or CSV, the code to complete this export into a data frame in Python or R is provided. Additionally, the file can be downloaded as a CSV file.
If you chose Export as PFB, you have the option to download the PFB avro file.
The Export as Data Frame or CSV option should be selected if you are interested in exporting your selected data as a Comma-Separated Values file or if you intend to complete your export using the PIC-SURE API via R or Python. This includes using Juptyer Notebooks or RStudio to export your data to an analysis workspace. For more information about using the PIC-SURE API for export, please refer to the .
The Export as PFB option should be selected if you are interested in exporting your selected data as a Portable Format for Biomedical Data file. For more information about the PFB format, please refer to the .
The next step is to save the dataset ID. The dataset ID is the unique identifier that is created for the specific cohort and data that you have selected for export. Type a name for the dataset ID into the field in order to save the dataset ID for future reference. For more information about accessing and managing previously saved dataset IDs, please refer to the .