Talk to the Veterans Crisis Line now
U.S. flag
An official website of the United States government

Office of Research & Development

print icon sign up for VA Research updates

The Big Data Scientist Training Enhancement Program (BD-STEP): Developing the Next Generation of Healthcare Data Scientists

Header - Big Data Scientist Training Enhancement Program (BD-STEP)


The Big Data Scientist Training Enhancement Program (BD-STEP) is a two-year fellowship program that uses data science to advance research and patient care. The Veterans Health Administration advanced fellowship launched in 2015 in collaboration with the National Cancer Institute (NCI), and the program provides well-rounded training and unparalleled access to VA data resources and NCI cancer research expertise. Abstract image - Integrating Big Data and healthcare The Veterans Health Administration is America's largest integrated healthcare system, providing care at 1,250 health care facilities and serving 9 million enrolled Veterans each year. The long-term care Veterans receive within this centralized healthcare system provides a rich source of longitudinal patient data- covering patients through period of health and illness. This is unique to the VHA, as the care patients receive in other US healthcare organization is often fragmented among different clinical sites, making it difficult to obtain a complete patient profile through the aggregation of medical records. Within the integrated healthcare system, there are many untapped opportunities to gain insights from patient data to advance cancer research and care. BD-STEP provides an avenue to access the rich, diverse data available in the VA Electronic Health Record (EHR), including longitudinal clinical patient data and diagnosis and treatment information from the VA Central Cancer Registry. BD-STEP utilizes the expertise of early-career data scientists to analyze these data and facilitate the execution of large-scale system changes in clinical care.

Abstract image - research fellowship lifecycle

Fellows are placed in VA medical centers across the country to work with clinicians and interdisciplinary researchers to address important patient-centered health challenges. The fellowship is centrally managed by a Coordinating Center which hosts a National Curriculum and works closely with sites to monitor fellow progress. The Coordinating Center is guided by a steering committee with VA and NCI membership, including the NCI's Center for Strategic Scientific Initiatives, Center for Cancer Training, and Center for Biomedical Informatics and Information Technology.

Over the course of their research, fellows network with healthcare and data science experts across government, industry, and academic institutions. They receive research mentorship from VA healthcare providers and academic researchers and curriculum oversight by VHA and NCI program leadership. This equips BD-STEP graduates with the skills and connection they need to pursue careers in healthcare data science after graduation.

Research Projects

Since the launch of the program, BD-STEP fellows have initiated diverse studies using VA healthcare data resources. These including predicting hepatocellular carcinoma in hepatitis VA patients using a cohort of more than 180,000 Veterans, comparing frailty assessment via clinical teams and machine learning to predict mortality in patients with congestive heart failure, and characterizing dynamic biological changes associated with prostate cancer progression in obese patients.

Service Projects

In addition to the fellow's individual research project, funded fellows spend 20% of their time on a Service Project that is of direct value and impact to the VA healthcare system and/or the fellowship. Service projects will be selected by the BD-STEP Coordinating Center based on their significance and impact on identified program and healthcare system needs. Examples of past service projects include: working with the National Artificial Intelligence Institute (NAII), supporting operations of the National Precision Oncology Program, and working with the Innovation Ecosystem to examine the social determinants of health related to COVID.


Developing and establishing fruitful collaborations with scientific peers is a vital component of a successful research career. BD-STEP provides various opportunities for fellows to form partnerships with researchers and organizations within the VA as well as academic institutions, commercial enterprises, and other government agencies. While the fellowship maintains formal collaborations with key VA organizations working within the data science space as well as the National Cancer Institute (listed below), the fellows' individual research projects are also typically multi-institutional efforts.

Fellow Eligibility

Applicants must meet all three criteria:

    Abstract image - areas of fellowship expertise Applicants must have obtained a PhD in engineering, computer science, physical science, or other related discipline including:
    • Engineering disciplines
    • Bioinformatics
    • Computational Biology
    • Economics
    • Epidemiology
    • Statistics
    • Chemistry
    In addition to a history of collaboration and teamwork with strong communication skills, applicants must have:
    • Experience in bioinformatics, modeling, or management of large data sets
    • Strong background in advanced mathematics and statistics
    • Proficiency in at least on programming language
    Applicants must be US citizens to be hired within a VA facility. Non-citizens are also encouraged to apply but are not eligible for VA stipend support.

For more information please see the FAQs.

How to Apply

  1. Contact your site of interest and discuss research projects.
  2. Have an academic mentor complete a letter of support for participation in the program.
  3. Compose a brief statement (no more than one page) describing interest and fit within the program, including proposed research areas and topics.
  4. Complete application and submit the following materials:
    • Letter of support from academic mentor
    • Statement of interest
    • Curriculum Vitae (CV)

Want to Learn More?

Questions about the R&D website? Email the Web Team

Any health information on this website is strictly for informational purposes and is not intended as medical advice. It should not be used to diagnose or treat any condition.