PIC-SURE
  • Getting Started
    • Introduction to PIC-SURE
    • Requesting Access
    • Navigating PIC-SURE
  • Explore Data
    • Explore Page Overview
    • Genomic Filtering
    • Prepare for Analysis
    • Variant Explorer
      • What variant metadata is exposed?
    • Manage Datasets
  • Prepare Data for Analysis Using the PIC-SURE API
    • What is the PIC-SURE API?
    • PIC-SURE Personal Access Token
    • More information about the PIC-SURE API
  • Privacy Policy
  • Administrative Privileges
    • Introduction to Admin Privileges
    • Adding, Editing, and Removing Users
  • Release Notes
    • 12 May 2025
Powered by GitBook
On this page
  • Step 1: Review Cohort Details
  • Step 2: Select Export Type
  • Step 3: Save Dataset ID
  • Step 4: Export Data
  1. Explore Data

Prepare for Analysis

PreviousGenomic FilteringNextVariant Explorer

Last updated 12 days ago

Prepare for Analysis is used to export participant-level data corresponding to your filters and variable selections. There are several steps to export the data, which are shown using this process.

Step 1: Review Cohort Details

The first step of the process is to review your cohort details. This provides a tabular summary of the variables that have been filtered and added.

Queries with more than 1,000,000 data points will not be exportable.

Step 2: Select Export Type

To complete the export, the user will need to decide what format they would like their participant-level data to be in. There are two options: Export as Data Frame or CSV or Export as PFB.

Export as Data Frame or CSV

In some instances, multiple values may relate to a single variable per participant. For example, some participants may have several timepoints of information, resulting in many values for a single participant. If there are multiple values for a given variable, these values will be separated by a tab or \t character.

Export as PFB

Step 3: Save Dataset ID

Step 4: Export Data

The data is now ready for export. Based on your export format selection, there will be options displayed for export.

If you chose Export as Data Frame or CSV, the code to complete this export into a data frame in Python or R is provided. Additionally, the file can be downloaded as a CSV file.

If you chose Export as PFB, you have the option to download the PFB avro file.

The Export as Data Frame or CSV option should be selected if you are interested in exporting your selected data as a Comma-Separated Values file or if you intend to complete your export using the PIC-SURE API via R or Python. This includes using Juptyer Notebooks or RStudio to export your data to an analysis workspace. For more information about using the PIC-SURE API for export, please refer to the .

The Export as PFB option should be selected if you are interested in exporting your selected data as a Portable Format for Biomedical Data file. For more information about the PFB format, please refer to the .

The next step is to save the dataset ID. The dataset ID is the unique identifier that is created for the specific cohort and data that you have selected for export. Type a name for the dataset ID into the field in order to save the dataset ID for future reference. For more information about accessing and managing previously saved dataset IDs, please refer to the .

Data Analysis Using the PIC-SURE API section
PFB documentation
Manage Datasets section
Step 1: Review cohort details with the option to include sample identifiers
Step 2: Select export format, either Dataframe/CSV or PFB
Step 3: Save dataset ID step