NHLBI BioData Catalyst® Powered by PIC-SURE
  • NHLBI BioData Catalyst® Powered by PIC-SURE User Guide
    • Frequently Asked Questions
  • Introduction to PIC-SURE
    • General Layout
    • Browse vs. Explore
  • Browse
    • Browse All Data
    • Features of Browse
  • Explore
    • Log in to Explore
    • Features of Explore
      • Prepare for Analysis
      • PFB Handoff to BioData Catalyst Powered by Terra
    • Manage Datasets
  • Data in PIC-SURE
    • Data Organization in BDC-PIC-SURE
      • BDC-PIC-SURE Data Format
    • Available Data & Managing Data Access
      • Publicly Available Datasets
      • TOPMed and TOPMed Related Datasets
        • Harmonized Data (TOPMed DCC Harmonized Clinical Variables)
      • BioLINCC Datasets
      • CONNECTS Datasets
  • Prepare for Data Analysis Using the PIC-SURE API
    • What is the PIC-SURE API?
    • PIC-SURE Personal Access Token
    • Analysis in the BioData Catalyst Ecosystem
      • BDC Powered by Seven Bridges
      • BDC Powered by Terra
    • Data Dictionaries via PIC-SURE API
    • More information about the PIC-SURE API
  • Citation and Acknowledgement of BioData Catalyst
  • Release Notes
    • Release Notes
      • 2025 May 8 Release
      • 2025 April 3 Release
      • 2025 March 5 Release
      • 2025 February 10 Release
      • 2024 Release Notes
        • 2024 December 19 Release
        • 2024 November 21 Release
        • 2024 November 4 Release
        • 2024 October 3 Release
        • 2024 September 5 Release
        • 2024 August 20 Release
        • 2024 August 1 Release
        • 2024 June 18 Release
        • 2024 May 29/30 Release
        • 2024 May 10/14 Release
        • 2024 March 26/28 Release
        • 2024 February 20/22 Release
        • 2024 January 30/31
        • 2024 January 16 Release
        • 2024 June 27 Release
      • 2023 Release Notes
        • 2023 December 12/14 Release
        • 2023 November 17 Release
        • 2023 October 23/31 Releases
        • 2023 October 13 Release
        • 2023 October 6 Release
        • 2023 September 28 Release
        • 2023 August 29 Release
        • 2023 July 27 Release
        • 2023 May 25 Release
        • 2023 March 30 Release
        • 2023 January 26 Release
  • Video Tutorials
    • Introduction to BioData Catalyst Powered by PIC-SURE
    • Basics: Finding Variables
    • Basics: Applying a Filter on a Variable
    • Basics: Editing a Variable Filter
    • PIC-SURE Open Access: Interpreting the Results
    • PIC-SURE Authorized Access: Add Variables to Export
    • PIC-SURE Authorized Access: Applying a Genomic Filter
    • PIC-SURE Authorized Access: Variable Distributions Tool
    • PIC-SURE Open Application Programming Interface (API)
  • Appendix
    • Glossary
    • Appendix 1: BDC Identifiers - dbGaP, TOPMed, and PIC-SURE
    • Appendix 2: Table of TOPMed DCC Harmonized Variables in PIC-SURE
Powered by GitBook
On this page
  • Step 1: Review Cohort Details
  • Step 2: Select Export Type
  • Export as Data Frame or CSV
  • Export as PFB
  • Step 3: Save Dataset ID
  • Step 4: Export Data
  1. Explore
  2. Features of Explore

Prepare for Analysis

PreviousFeatures of ExploreNextPFB Handoff to BioData Catalyst Powered by Terra

Last updated 5 months ago

Prepare for Analysis is used to export participant-level data corresponding to your filters and variable selections. There are several steps to export the data, which are shown using this process.

Step 1: Review Cohort Details

The first step of the process is to review your cohort details. This provides a tabular summary of the variables that have been filtered and added.

Below the summary is an option to include sample identifiers in the export. This will allow you to connect the phenotypic data you have selected to the sample data associated with the participant. By checking the box, the sample identifier information will be added to your export if the selected participants have sample information available.

Note: Queries with more than 1,000,000 data points will not be exportable.

Step 2: Select Export Type

To complete the export, the user will need to decide what format they would like their participant-level data to be in. There are two options: Export as Data Frame or CSV or Export as PFB.

Export as Data Frame or CSV

In some instances, multiple values may relate to a single variable per participant. For example, some participants may have had several samples sequenced, resulting in many sample identifiers for a single participant. If there are multiple values for a given variable, these values will be separated by a tab or \t character.

Export as PFB

Step 3: Save Dataset ID

Step 4: Export Data

The data is now ready for export. Based on your export format selection, there will be options displayed for export.

If you chose Export as Data Frame or CSV, the code to complete this export into a data frame in Python or R is provided. Additionally, the file can be downloaded as a CSV file.

The Export as Data Frame or CSV option should be selected if you are interested in exporting your selected data as a Comma-Separated Values file or if you intend to complete your export using the PIC-SURE API via R or Python. This includes using Juptyer Notebooks or RStudio to export your data to BioData Catalyst Powered by Seven Bridges or BioData Catalyst Powered by Terra. For more information about using the PIC-SURE API for export, please refer to the .

The Export as PFB option should be selected if you are interested in exporting your selected data as a Portable Format for Biomedical Data file or if you intend to send your data to BioData Catalyst Powered by Terra. For more information about this, please refer to .

The next step is to save the dataset ID. The dataset ID is the unique identifier that is created for the specific cohort and data that you have selected for export. Type a name for the dataset ID into the field in order to save the dataset ID for future reference. For more information about accessing and managing previously saved dataset IDs, please refer to the .

If you chose Export as PFB, you have the option to export the file into a Terra workspace. Clicking either of these options will automatically put the file into the location of your choosing. For more information, please refer to .

Data Analysis Using the PIC-SURE API section
PFB Handoff to BioData Catalyst Powered by Terra
Manage Datasets section
PFB Handoff to BioData Catalyst Powered by Terra
Step 1: Review cohort details with the option to include sample identifiers
Step 2: Select export format
Step 3: Save dataset ID step, showing a dataset ID named as "test_dataset_id"