NHLBI BioData Catalyst® Powered by PIC-SURE
  • NHLBI BioData Catalyst® Powered by PIC-SURE User Guide
    • Frequently Asked Questions
  • Introduction to PIC-SURE
    • General Layout
    • Browse vs. Explore
  • Browse
    • Browse All Data
    • Features of Browse
  • Explore
    • Log in to Explore
    • Features of Explore
      • Prepare for Analysis
      • PFB Handoff to BioData Catalyst Powered by Terra
      • PFB Handoff to BioData Catalyst Powered by Seven Bridges
    • Manage Datasets
  • Data in PIC-SURE
    • Data Organization in BDC-PIC-SURE
      • BDC-PIC-SURE Data Format
    • Available Data & Managing Data Access
      • Publicly Available Datasets
      • TOPMed and TOPMed Related Datasets
        • Harmonized Data (TOPMed DCC Harmonized Clinical Variables)
      • BioLINCC Datasets
      • CONNECTS Datasets
  • Prepare for Data Analysis Using the PIC-SURE API
    • What is the PIC-SURE API?
    • PIC-SURE Personal Access Token
    • Analysis in the BioData Catalyst Ecosystem
      • BDC Powered by Seven Bridges
      • BDC Powered by Terra
    • Data Dictionaries via PIC-SURE API
    • More information about the PIC-SURE API
  • Citation and Acknowledgement of BioData Catalyst
  • Release Notes
    • Release Notes
      • 2025 June 4 Release
      • 2025 May 22 Release
      • 2025 May 8 Release
      • 2025 April 3 Release
      • 2025 March 5 Release
      • 2025 February 10 Release
      • 2024 Release Notes
        • 2024 December 19 Release
        • 2024 November 21 Release
        • 2024 November 4 Release
        • 2024 October 3 Release
        • 2024 September 5 Release
        • 2024 August 20 Release
        • 2024 August 1 Release
        • 2024 June 18 Release
        • 2024 May 29/30 Release
        • 2024 May 10/14 Release
        • 2024 March 26/28 Release
        • 2024 February 20/22 Release
        • 2024 January 30/31
        • 2024 January 16 Release
        • 2024 June 27 Release
      • 2023 Release Notes
        • 2023 December 12/14 Release
        • 2023 November 17 Release
        • 2023 October 23/31 Releases
        • 2023 October 13 Release
        • 2023 October 6 Release
        • 2023 September 28 Release
        • 2023 August 29 Release
        • 2023 July 27 Release
        • 2023 May 25 Release
        • 2023 March 30 Release
        • 2023 January 26 Release
  • Video Tutorials
    • Introduction to BioData Catalyst Powered by PIC-SURE
    • Basics: Finding Variables
    • Basics: Applying a Filter on a Variable
    • Basics: Editing a Variable Filter
    • PIC-SURE Open Access: Interpreting the Results
    • PIC-SURE Authorized Access: Add Variables to Export
    • PIC-SURE Authorized Access: Applying a Genomic Filter
    • PIC-SURE Authorized Access: Variable Distributions Tool
    • PIC-SURE Open Application Programming Interface (API)
  • Appendix
    • Glossary
    • Appendix 1: BDC Identifiers - dbGaP, TOPMed, and PIC-SURE
    • Appendix 2: Table of TOPMed DCC Harmonized Variables in PIC-SURE
Powered by GitBook
On this page
  1. Appendix

Appendix 2: Table of TOPMed DCC Harmonized Variables in PIC-SURE

Variable Name
Variable Description
TYPE
UNITS
VALUES

angina_incident_1

An indicator of whether a subject had an angina event (that was verified by adjudication or by medical professionals) during the follow-up period.

encoded

0=Angina event did not occur during follow-up || 1=Angina event occurred during follow-up

angina_prior_1

An indicator of whether a subject had an angina event prior to the baseline visit.

encoded

0=Did not have a history of angina before baseline || 1=Had a history of angina before baseline

annotated_sex_1

Subject sex, as recorded by the study.

encoded

female=Female || male=Male

antihypertensive_meds_1

Indicator for use of antihypertensive medication at the time of blood pressure measurement.

encoded

0=Not taking antihypertensive medication || 1=Taking antihypertensive medication

basophil_ncnc_bld_1

Count by volume, or number concentration (ncnc), of basophils in the blood (bld).

decimal

thousands / microliter

bmi_baseline_1

Body mass index calculated at baseline.

decimal

kg/m^2

bp_diastolic_1

Resting diastolic blood pressure from the upper arm in a clinical setting.

decimal

mmHg

bp_systolic_1

Resting systolic blood pressure from the upper arm in a clinical setting.

decimal

mmHg

cabg_incident_1

An indicator of whether a subject had a coronary artery bypass graft (CABG) procedure (that was verified by adjudication or by medical professionals) during the follow-up period.

encoded

0=CABG procedure did not occur during follow-up || 1=CABG procedure occurred during follow-up

cabg_prior_1

An indicator of whether a subject had a coronary artery bypass graft (CABG) procedure prior to the start of the baseline visit.

encoded

0=Did not have a CABG procedure before baseline || 1=Had a CABG procedure before baseline

cac_score_1

Coronary artery calcification (CAC) score using Agatston scoring of CT scan(s) of coronary arteries

decimal

cac_volume_1

Coronary artery calcium volume using CT scan(s) of coronary arteries

decimal

cubic millimeters

cad_followup_start_age_1

Age of subject at the start of the follow-up period during which atherosclerosis events were reviewed and adjudicated.

decimal

cad_followup_start_age

carotid_plaque_1

Presence or absence of carotid plaque.

encoded

0=Plaque not present || 1=Plaque present

carotid_stenosis_1

Extent of narrowing of the carotid artery.

encoded

0=None || 1=1%-24% || 2=25%-49% || 3=50%-74% || 4=75%-99% || 5=100%

cd40_1

Cluster of differentiation 40 ligand (CD40) concentration in blood.

decimal

ng/mL

chd_death_definite_1

An indicator of whether the cause of death was determined by medical professionals or technicians to be "definite" coronary heart disease for subjects who died during the follow-up period.

encoded

0=Death did not occur during follow-up, or cause of CHD death was not determined as definite CHD || 1=CHD death occurred during follow-up and was determined as definite

chd_death_probable_1

An indicator of whether the cause of death was determined by medical professionals or technicians to be "probable" or "definite" coronary heart disease for subjects who died during the follow-up period.

encoded

0=Death did not occur during follow-up, or cause of CHD death was not determined as definite or probable CHD || 1=CHD death occurred during follow-up and was determined as probable or definite

cimt_1

Common carotid intima-media thickness, calculated as the mean of two values: mean of multiple thickness estimates from the left far wall and from the right far wall.

decimal

mm

cimt_2

Common carotid intima-media thickness, calculated as the mean of four values: maximum of multiple thickness estimates from the left far wall, left near wall, right far wall, and right near wall.

decimal

mm

coronary_angioplasty_incident_1

An indicator of whether a subject had a coronary angioplasty procedure (that was verified by adjudication or by medical professionals) during the follow-up period.

encoded

0=Coronary angioplasty procedure did not occur during follow-up || 1=Coronary angioplasty procedure occurred

during follow-up

coronary_angioplasty_prior_1

An indicator of whether a subject had a coronary angioplasty procedure prior to the start of the baseline visit.

encoded

0=Did not have a coronary angioplasty procedure before baseline || 1=Had a coronary angioplasty procedure before baseline

coronary_revacularization_prior_1

An indicator of whether a subject had a coronary revascularization procedure prior to the start of the baseline visit. This includes angioplasty, CABG, and other coronary revascularization procedures.

encoded

0=Did not have a coronary revascularization procedure before baseline || 1=Had a coronary revascularization procedure before baseline

current_smoker_baseline_1

Indicates whether subject currently smokes cigarettes.

encoded

0=Does not currently smoke cigarettes || 1=Currently smokes cigarettes

crp_1

C-reactive protein (CRP) concentration in blood.

decimal

mg/L

eosinophil_ncnc_bld_1

Count by volume, or number concentration (ncnc), of eosinophils in the blood (bld).

decimal

thousands / microliter

eselectin_1

E-selectin concentration in blood.

decimal

ng/mL

ever_smoker_baseline_1

Indicates whether subject ever regularly smoked cigarettes.

encoded

0=Never a cigarette smoker || 1=Current or former cigarette smoker

fasting_lipids_1

Indicates whether participant fasted for at least eight hours prior to blood draw to measure lipids phenotypes.

encoded

0=Participant did not fast_or fasted for fewer than eight hours prior to measurement of lipids phenotypes. || 1=Participant fasted for at least eight hours prior to measurement of lipids phenotypes

geographic_site_1

Recruitment/field center, baseline clinic, or geographic region.

encoded

hdl_1

Blood mass concentration of high-density lipoprotein cholesterol

decimal

mg/dL

height_baseline_1

Body height at baseline.

decimal

cm

hematocrit_vfr_bld_1

Measurement of hematocrit, the fraction of volume (vfr) of blood (bld) that is composed of red blood cells.

decimal

% = percentage

hemoglobin_mcnc_bld_1

Measurement of mass per volume, or mass concentration (mcnc), of hemoglobin in the blood (bld).

decimal

g / dL = grams per deciliter

hispanic_or_latino_1

Indicator of reported Hispanic or Latino ethnicity.

encoded

ethnicity component dbGaP variable values for a subject were inconsistent/contradictory (e.g. over multiple visits) || Hispanic or Latino || not Hispanic or Latino

hispanic_subgroup_1

classification of Hispanic/Latino background for Hispanic/Latino subjects where country or region of origin information is available

encoded

CentralAmerican=Central American || CostaRican=from Costa Rica || Cuban=Cuban || Dominican=Dominican || Mexican=Mexican || PuertoRican=Puerto Rican || SouthAmerican=South American

icam1_1

Intercellular adhesion molecule 1 (ICAM1) concentration in blood.

decimal

ng/mL

il1_beta_1

Interleukin 1 beta (IL1b) concentration in blood.

decimal

pg/mL

il10_1

Interleukin 10 (IL10) concentration in blood.

decimal

pg/mL

il18_1

Interleukin 18 (IL18) concentration in blood.

decimal

pg/mL

il6_1

Interleukin 6 (IL6) concentration in blood.

decimal

pg/mL

isoprostane_8_epi_pgf2a_1

Isoprostane 8-epi-prostaglandin F2 alpha (8-epi-PGF2a) concentration in urine.

decimal

pg/mL

ldl_1

Blood mass concentration of low-density lipoprotein cholesterol

decimal

mg/dL

lipid_lowering_medication_1

Indicates whether participant was taking any lipid-lowering medication at blood draw to measure lipids phenotypes

encoded

0=Participant was not taking lipid-lowering medication || 1=Participant was taking lipid-lowering medication

lppla2_act_1

Activity of lipoprotein-associated phospholipase A2 (LP-PLA2), also known as platelet-activating factor acetylhydrolase, measured in blood.

decimal

nmol/min/mL

lppla2_mass_1

Mass of lipoprotein-associated phospholipase A2 (LP-PLA2), also known as platelet-activating factor acetylhydrolase, measured in blood.

decimal

ng/mL

lymphocyte_ncnc_bld_1

Count by volume, or number concentration (ncnc), of lymphocytes in the blood (bld).

decimal

thousands / microliter

mi_incident_1

An indicator of whether a subject had a myocardial infarction (MI) event (that was verified by adjudication or by medical professionals) during the follow-up period.

encoded

0=MI event did not occur during follow-up || 1=MI event occurred during follow-up

mi_prior_1

An indicator of whether a subject had a myocardial infarction (MI) prior to the start of the baseline visit.

encoded

0=Did not have a history of MI before baseline || 1=Had a history of MI before baseline

mch_entmass_rbc_1

Measurement of the average mass (entmass) of hemoglobin per red blood cell(rbc), known as mean corpuscular hemoglobin (MCH).

decimal

pg = picogram

mchc_mcnc_rbc_1

Measurement of the mass concentration (mcnc) of hemoglobin in a given volume of packed red blood cells (rbc), known as mean corpuscular hemoglobin concentration (MCHC).

decimal

g /dL = grams per deciliter

mcp1_1

Monocyte chemoattractant protein-1 (MCP1), also known as C-C motif chemokine ligand 2, concentration in blood.

decimal

pg/mL

mmp9_1

Matrix metalloproteinase 9 (MMP9) concentration in blood.

decimal

ng/mL

mcv_entvol_rbc_1

Measurement of the average volume (entvol) of red blood cells (rbc), known as mean corpuscular volume (MCV).

decimal

fL = femtoliter

monocyte_ncnc_bld_1

Count by volume, or number concentration (ncnc), of monocytes in the blood (bld).

decimal

thousands / microliter

mpo_1

Myeloperoxidase (MPO) concentration in blood.

decimal

ng/mL

neutrophil_ncnc_bld_1

Count by volume, or number concentration (ncnc), of neutrophils in the blood (bld).

decimal

thousands / microliter

opg_1

Osteoprotegerin (OPG) concentration in blood.

decimal

pmol/L

pad_incident_1

An indicator of whether a subject had peripheral arterial disease (that was verified by adjudication or by medical professionals) during the follow-up period.

encoded

0=No diagnosis of PAD during follow-up || 1=PAD was diagnosed during follow-up

pad_prior_1

An indicator of whether a subject had peripheral arterial disease prior to the baseline visit.

encoded

0=Did not have a history of PAD before baseline || 1=Had a history of PAD before baseline

platelet_ncnc_bld_1

Count by volume, or number concentration (ncnc), of platelets in the blood (bld).

integer

thousands / microliter

pmv_entvol_bld_1

Measurement of the mean volume (entvol) of platelets in the blood (bld), known as mean platelet volume (MPV or PMV).

decimal

fL = femtoliter

pselectin_1

P-selectin concentration in blood.

decimal

ng/mL

race_us_1

Harmonized race category of participant.

encoded

AI_AN=American Indian_Alaskan Native or Native American || Asian=Asian || Black=Black or African American || HI_PI=Native Hawaiian or other Pacific Islander || Multiple=More than one race || Other=Other race || White=White or Caucasian

rbc_ncnc_bld_1

Count by volume, or number concentration (ncnc), of red blood cells in the blood (bld).

decimal

millions / microliter

rdw_ratio_rbc_1

Measurement of the ratio of variation in width to the mean width of the red blood cell (rbc) volume distribution curve taken at +/- 1 CV, known as red cell distribution width (RDW).

decimal

% = percentage

sleep_duration_1

Usual amount of time slept per day.

decimal

hours/day

subcohort_1

A distinct subgroup within a study, generally indicating subjects who share similar characteristics due to study design. Subjects may belong to only one subcohort.

encoded

tnfa_1

Tumor necrosis factor alpha (TNFa) concentration in blood.

decimal

pg/mL

tnfa_r1_1

Tumor necrosis factor alpha receptor 1 (TNFa-R1) concentration in blood.

decimal

pg/mL

tnfr2_1

Tumor necrosis factor receptor 2 (TNFR2) concentration in blood.

decimal

pg/mL

total_cholesterol_1

Blood mass concentration of total cholesterol

decimal

mg/dL

triglycerides_1

Blood mass concentration of triglycerides

decimal

mg/dL

vte_case_status_1

An indicator of whether a subject experienced a venous thromboembolism event (VTE) that was verified by adjudication or by medical professionals.

encoded

0=Not known to ever have a VTE event_either self-reported or from medical records || 1=Experienced a VTE event as verified by adjudication or by medical professionals

vte_followup_start_age_1

Age of subject at the start of the follow up period during which venous thromboembolism (VTE) events were reviewed and adjudicated.

decimal

years

vte_prior_history_1

An indicator of whether a subject had a venous thromboembolism (VTE) event prior to the start of the medical review process (including self-reported events).

encoded

0=did not have prior VTE event || 1=had prior VTE event

wbc_ncnc_bld_1

Count by volume, or number concentration (ncnc), of white blood cells in the blood (bld).

decimal

thousands / microliter

weight_baseline_1

Body weight at baseline.

decimal

kg

age_at_*

For each phenotypic value for a given subject, an associated age at measurement is provided.

decimal

years

unit_*

For each harmonized variable, a paired “unit_variable” is provided, whose value indicates where in the documentation to look to find the set of component variables and the algorithm used to harmonize those variables.

encoded

PreviousAppendix 1: BDC Identifiers - dbGaP, TOPMed, and PIC-SURE

Last updated 24 days ago

See for more information.

See for more information.

TOPMed Harmonization Strategies
TOPMed Harmonization Strategies