Type: Package
Title: A Comprehensive Collection of Cardiovascular and Heart Disease Datasets
Version: 0.1.0
Maintainer: Renzo Caceres Rossi <arenzocaceresrossi@gmail.com>
Description: Offers a diverse collection of datasets focused on cardiovascular and heart disease research, including heart failure, myocardial infarction, aortic dissection, transplant outcomes, cardiovascular risk factors, drug efficacy, and mortality trends. Designed for researchers, clinicians, epidemiologists, and data scientists, the package features clinical, epidemiological, and simulated datasets covering a wide range of conditions and treatments such as statins, anticoagulants, and beta blockers. It supports analyses related to disease progression, treatment effects, rehospitalization, and public health outcomes across various cardiovascular patient populations.
License: GPL-3
URL: https://github.com/lightbluetitan/cardiodatasets, https://lightbluetitan.github.io/cardiodatasets/
BugReports: https://github.com/lightbluetitan/cardiodatasets/issues
Encoding: UTF-8
LazyData: true
Suggests: ggplot2, testthat (≥ 3.0.0), dplyr, knitr, rmarkdown
Depends: R (≥ 4.2.0)
Imports: utils
RoxygenNote: 7.3.2
Config/testthat/edition: 3
VignetteBuilder: knitr
NeedsCompilation: no
Packaged: 2025-05-10 04:15:44 UTC; renzocrossi
Author: Renzo Caceres Rossi [aut, cre]
Repository: CRAN
Date/Publication: 2025-05-13 08:20:06 UTC

CardioDataSets: A Comprehensive Collection of Cardiovascular and Heart Disease Datasets

Description

This package provides a wide variety of datasets focused on heart and cardiovascular research, covering heart disease, myocardial infarction, heart failure, stroke, ischemic heart disease, risk factors, clinical trials, and treatment outcomes.

Details

CardioDataSets: A Comprehensive Collection of Cardiovascular and Heart Disease Datasets

logo

A Comprehensive Collection of Cardiovascular and Heart Disease Datasets.

Author(s)

Maintainer: Renzo Caceres Rossi arenzocaceresrossi@gmail.com

See Also

Useful links:


Acute Coronary Syndrome (ACS) Patient Data

Description

This dataset, acs_patients_df, is a data frame containing demographic and clinical data from 857 patients with Acute Coronary Syndrome (ACS). It includes 17 variables covering patient characteristics, vital signs, laboratory results, and risk factors.

Usage

data(acs_patients_df)

Format

A data frame with 857 observations and 17 variables:

age

Patient age in years (integer)

sex

Patient sex (character)

cardiogenicShock

Presence of cardiogenic shock (character)

entry

Method of hospital entry (character)

Dx

Diagnosis (character)

EF

Ejection fraction percentage (numeric)

height

Height in cm (numeric)

weight

Weight in kg (numeric)

BMI

Body Mass Index in kg/m² (numeric)

obesity

Obesity status (character)

TC

Total cholesterol in mg/dL (numeric)

LDLC

LDL cholesterol in mg/dL (integer)

HDLC

HDL cholesterol in mg/dL (integer)

TG

Triglycerides in mg/dL (integer)

DM

Diabetes mellitus status (character)

HBP

High blood pressure status (character)

smoking

Smoking status (character)

Details

The dataset name has been kept as 'acs_patients_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the moonBook package version 0.3.1


Age vs. Maximum Heart Rate

Description

This dataset, age_heartrate_df, is a data frame containing simulated data representing the relationship between age and maximum heart rate. It includes 15 observations based on established physiological models.

Usage

data(age_heartrate_df)

Format

A data frame with 15 observations and 2 variables:

age

Age in years (numeric)

maxrate

Maximum predicted heart rate in beats per minute (numeric)

Details

The dataset name has been kept as 'age_heartrate_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the UsingR package version 2.0-7. Original research: Tanaka H, Monahan KD, Seals DR (2001). "Age-predicted maximal heart rate revisited." Journal of the American College of Cardiology, 37(1):153-156.


Acute Myocardial Infarction (Heart Attack) Events

Description

This dataset, ami_occurrences_tbl_df, is a tibble containing simulated but realistic daily counts of Acute Myocardial Infarction (AMI) occurrences in New York City over one year (365 days). The data represents the number of heart attack events recorded each day.

Usage

data(ami_occurrences_tbl_df)

Format

A tibble with 365 observations and 1 variable:

ami

Number of Acute Myocardial Infarction events recorded each day (integer vector)

Details

The dataset name has been kept as 'ami_occurrences_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the openintro package version 2.5.0


Aortic dissection patients

Description

This dataset, aortaDiss_tbl_df, is a tibble containing clinical information from 226 patients with aortic dissection. It includes demographic variables, symptom presentation, and risk factor data.

Usage

data(aortaDiss_tbl_df)

Format

A tibble with 226 observations and 10 variables:

Gender

Patient gender (numeric)

Age

Patient age in years (numeric)

Age_C

Categorized age (numeric)

Aortadis

Aortic dissection status (numeric)

Acute

Acute presentation indicator (numeric)

Acute3

Three-level acute presentation classification (numeric)

Stomach_Ache

Presence of stomach ache (numeric)

Hyper

Hypertension status (numeric)

Smoking

Smoking status (numeric)

Radiation

Radiation exposure (numeric)

Details

The dataset name has been kept as 'aortaDiss_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the psfmi package version 1.4.0


FDA Beta Blockers Adverse Events

Description

This dataset, betablockers_matrix, is a matrix containing adverse event reports from the FDA Adverse Event Reporting System (FAERS) for 9 beta blockers from Q1 2021 to Q4 2023. The matrix includes 501 adverse events (rows) across 9 medications (columns).

Usage

data(betablockers_matrix)

Format

A matrix with 501 rows (adverse events) and 9 columns (beta blockers):

Acebutolol

Adverse event counts for Acebutolol (integer)

Atenolol

Adverse event counts for Atenolol (integer)

Bisoprolol

Adverse event counts for Bisoprolol (integer)

Carvedilol

Adverse event counts for Carvedilol (integer)

Metoprolol

Adverse event counts for Metoprolol (integer)

Nadolol

Adverse event counts for Nadolol (integer)

Propranolol

Adverse event counts for Propranolol (integer)

Timolol

Adverse event counts for Timolol (integer)

Other

Adverse event counts for other beta blockers (integer)

Details

The dataset name has been kept as 'betablockers_matrix' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'matrix' indicates that the dataset is a matrix object. The original content has not been modified in any way.

Source

Data taken from the MDDC package version 1.1.0. Original data: FDA Adverse Event Reporting System (FAERS) database, Q1 2021 to Q4 2023.


Anticoagulants for CAD Patients

Description

This dataset, cad_anticoagulants_df, is a data frame containing information from 34 clinical trials examining the effectiveness of oral anticoagulants in patients with coronary artery disease. It includes data on treatment outcomes comparing anticoagulant therapy with control groups.

Usage

data(cad_anticoagulants_df)

Format

A data frame with 34 observations and 9 variables:

study

Study identifier (character vector)

year

Year of publication (integer vector)

intensity

Intensity of anticoagulation treatment (character vector)

asp.t

Aspirin use in treatment group (integer vector)

asp.c

Aspirin use in control group (integer vector)

ai

Number of events in treatment group (integer vector)

n1i

Total number of participants in treatment group (integer vector)

ci

Number of events in control group (integer vector)

n2i

Total number of participants in control group (integer vector)

Details

The dataset name has been kept as 'cad_anticoagulants_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the metadat package version 1.2-0


Heart Failure Clinical Dataset

Description

This dataset, cardiac_failure_df, is a data frame containing clinical data from 299 patients with heart failure. It includes 13 variables covering demographic information, medical history, laboratory results, and mortality outcomes.

Usage

data(cardiac_failure_df)

Format

A data frame with 299 observations and 13 variables:

age

Patient age in years (numeric)

anaemia

Presence of anaemia (integer: 0=no, 1=yes)

creatinine_phosphokinase

Level of CPK enzyme in mcg/L (integer)

diabetes

Presence of diabetes (integer: 0=no, 1=yes)

ejection_fraction

Percentage of blood leaving heart (integer)

high_blood_pressure

Presence of hypertension (integer: 0=no, 1=yes)

platelets

Platelet count in kiloplatelets/mL (numeric)

serum_creatinine

Level of serum creatinine in mg/dL (numeric)

serum_sodium

Level of serum sodium in mEq/L (integer)

sex

Patient sex (integer: 0=female, 1=male)

smoking

Smoking status (integer: 0=no, 1=yes)

time

Follow-up period in days (integer)

DEATH_EVENT

Death during follow-up (integer: 0=no, 1=yes)

Details

The dataset name has been kept as 'cardiac_failure_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the SOPC package version 0.1.0


Coronary Artery Disease GWAS Meta-Analysis

Description

This dataset, cardiac_gwas_df, is a data frame containing genome-wide association study (GWAS) results from a multi-ethnic meta-analysis of coronary artery disease (CAD). It includes 9,919 genetic variants with their effect sizes and study characteristics.

Usage

data(cardiac_gwas_df)

Format

A data frame with 9,919 observations and 7 variables:

beta_flipped

Effect size estimates (numeric)

gcse

Genomic control standard error (numeric)

variants

Genetic variant identifiers (character)

studies

Participating studies (character)

cases

Number of cases (integer)

controls

Number of controls (integer)

fdr214_gwas46

False discovery rate adjusted p-values (numeric)

Details

The dataset name has been kept as 'cardiac_gwas_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the getmstatistic package version 0.2.2


Cardiovascular Risk Factors

Description

This dataset, cardioRiskFactors_df, is a data frame containing information from a study investigating the association between uric acid and cardiovascular risk factors in developing countries. It includes data from 998 participants (474 men and 524 women) aged 25-64 years.

Usage

data(cardioRiskFactors_df)

Format

A data frame with 998 observations and 14 variables:

age

Age in years (integer)

bmi

Body Mass Index in kg/m² (numeric)

waisthip

Waist-to-hip ratio (numeric)

smok

Smoking status (integer)

choles

Total cholesterol in mg/dL (numeric)

trig

Triglycerides in mg/dL (numeric)

hdl

HDL cholesterol in mg/dL (numeric)

ldl

LDL cholesterol in mg/dL (numeric)

sys

Systolic blood pressure in mmHg (integer)

dia

Diastolic blood pressure in mmHg (numeric)

Uric

Uric acid level in mg/dL (integer)

sex

Sex (integer)

alco

Alcohol consumption (numeric)

apoa

Apolipoprotein A in mg/dL (numeric)

Details

The dataset name has been kept as 'cardioRiskFactors_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the Rfit package version 0.27.0. Original study: Heritier S, Cantoni E, Copt S, Victoria-Feser M (2009). Robust Methods in Biostatistics. New York: John Wiley and Sons.


Cardiovascular risks of diabetes drugs

Description

This dataset, cardio_diabetes_tbl_df, is a tibble containing information comparing cardiovascular problems between two diabetes medications (Rosiglitazone and Pioglitazone) in elderly Medicare patients. It includes data from 227,571 patients.

Usage

data(cardio_diabetes_tbl_df)

Format

A tibble with 227,571 observations and 2 variables:

treatment

Type of diabetes medication (factor with 2 levels: Rosiglitazone or Pioglitazone)

cardiovascular_problems

Presence of cardiovascular problems (factor with 2 levels)

Details

The dataset name has been kept as 'cardio_diabetes_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the openintro package version 2.5.0. Original study: Graham DJ, et al. (2010). "Risk of acute myocardial infarction, stroke, heart failure, and death in elderly Medicare patients treated with rosiglitazone or pioglitazone." JAMA, 304(4):411.


Statin Dose Comparison Trials for CVD

Description

This dataset, cardiovascular_list, is a list containing data from 34 clinical trials comparing low dose (1), high dose (2), and placebo (3) statins for cardiovascular disease prevention. The dataset includes study identifiers, treatment assignments, and outcome counts.

Usage

data(cardiovascular_list)

Format

A list with 4 components:

Study

Study identifiers (integer vector of length 34)

Treat

Treatment assignments (numeric vector: 1=low dose, 2=high dose, 3=placebo)

Outcomes

Outcome matrix with 34 rows and 3 columns:

Alive

Number of patients alive (numeric)

FnCVD

Number with non-fatal CVD events (numeric)

FCVD

Number with fatal CVD events (numeric)

N

Sample sizes (numeric vector of length 34)

Details

The dataset name has been kept as 'cardiovascular_list' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The original content has not been modified in any way.

Source

Data taken from the bnma package version 1.6.0


High vs Moderate Statins for MI Prevention

Description

This dataset, coronary_death_df, is a data frame containing information from 4 clinical trials comparing intensive (high dose) versus moderate (standard dose) statin therapy for preventing coronary death or myocardial infarction. It includes data on treatment outcomes across multiple endpoints.

Usage

data(coronary_death_df)

Format

A data frame with 4 observations and 16 variables:

trial

Trial identifier (character vector)

pop

Patient population description (character vector)

nt

Number of patients in treatment group (integer vector)

nc

Number of patients in control group (integer vector)

ep1t

Endpoint 1 events in treatment group (integer vector)

ep1c

Endpoint 1 events in control group (integer vector)

ep2t

Endpoint 2 events in treatment group (integer vector)

ep2c

Endpoint 2 events in control group (integer vector)

ep3t

Endpoint 3 events in treatment group (integer vector)

ep3c

Endpoint 3 events in control group (integer vector)

ep4t

Endpoint 4 events in treatment group (integer vector)

ep4c

Endpoint 4 events in control group (integer vector)

ep5t

Endpoint 5 events in treatment group (integer vector)

ep5c

Endpoint 5 events in control group (integer vector)

ep6t

Endpoint 6 events in treatment group (integer vector)

ep6c

Endpoint 6 events in control group (integer vector)

Details

The dataset name has been kept as 'coronary_death_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the metadat package version 1.2-0


Blood thinners in CPR survival

Description

This dataset, cpr_survival_tbl_df, is a tibble containing information from a study examining the effect of blood thinners on survival rates in CPR patients. The study randomly assigned 90 patients to either receive a blood thinner (treatment group) or not receive one (control group), with the outcome being survival for at least 24 hours.

Usage

data(cpr_survival_tbl_df)

Format

A tibble with 90 observations and 2 variables:

group

Treatment assignment (factor with 2 levels: "control" and "treatment")

outcome

Survival status (factor with 2 levels: "died" and "survived")

Details

The dataset name has been kept as 'cpr_survival_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the openintro package version 2.5.0


LA pollution and cardiovascular mortality

Description

This dataset, cv_mortality_ts, is a time series containing weekly cardiovascular mortality data from Los Angeles County. It consists of 508 six-day smoothed averages obtained by filtering daily values over the 10-year period from 1970 to 1979.

Usage

data(cv_mortality_ts)

Format

A time series object (ts) with 508 observations:

cv_mortality

Weekly cardiovascular mortality counts (numeric vector)

Details

The dataset name has been kept as 'cv_mortality_ts' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'ts' indicates that the dataset is a time series object. The original content has not been modified in any way.

Time series characteristics: - Start: 1970, Week 1 - End: 1979, Week 40 - Frequency: 52 (weekly data)

Source

Data taken from the astsa package version 2.2


Anger recall effect on heart rate (Lakens, 2013)

Description

This dataset, emotion_heartrate_df, is a data frame containing heart rate measurements from a study investigating how recalling anger affects heart rate. It includes baseline and anger-induced heart rate measurements from 68 participants.

Usage

data(emotion_heartrate_df)

Format

A data frame with 68 observations and 3 variables:

ID

Participant identification number (integer vector)

HR_baseline

Baseline heart rate in beats per minute (numeric vector)

HR_anger

Heart rate during anger recall in beats per minute (numeric vector)

Details

The dataset name has been kept as 'emotion_heartrate_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the esci package version 1.0-7. Original study: Lakens D (2013). Conceptual replication of Ekman et al. (1983) emotion study.


Artificial Heart Transplant Durations

Description

This dataset, heartTransplantTime_tbl_df, is a tibble containing the durations (in hours) of 15 artificial heart transplant operations.

Usage

data(heartTransplantTime_tbl_df)

Format

A tibble with 15 observations and 1 variable:

duration

Operation duration in hours (numeric)

Details

The dataset name has been kept as 'heartTransplantTime_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the BSDA package version 1.2.3. Original source: Kitchens LJ (2003). "Basic Statistics and Data Analysis." Pacific Grove, CA: Brooks/Cole, a division of Thomson Learning.


Stanford Heart Transplant Data

Description

This dataset, heart_transplant_df, is a data frame containing survival data from the Stanford heart transplant program. It includes information on 172 patients with follow-up times, transplant status, and clinical covariates.

Usage

data(heart_transplant_df)

Format

A data frame with 172 observations and 8 variables:

start

Start time of interval (numeric)

stop

End time of interval (numeric)

event

Survival status (numeric: 1=event, 0=censored)

age

Patient age at enrollment (numeric)

year

Year of enrollment (numeric)

surgery

Prior bypass surgery (numeric)

transplant

Transplant status (factor: 0=no, 1=yes)

id

Patient identification number (numeric)

Details

The dataset name has been kept as 'heart_transplant_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the lrstat package version 0.2.13. Original source: Stanford Heart Transplant Study data from the survival package.


Heart Disease Patients Clinical Data

Description

This dataset, heartdisease_tbl_df, is a tibble containing information on individuals evaluated for heart disease. It is a cleaned version of the original "Heart Disease" dataset from the UCI Machine Learning Repository, and includes 303 observations on 9 variables.

Usage

data(heartdisease_tbl_df)

Format

A tibble with 303 observations and 9 variables:

Age

Age of the individual (numeric).

Sex

Sex of the individual (factor with 2 levels: typically "Male" and "Female").

ChestPain

Type of chest pain experienced (factor with 4 levels).

BP

Resting blood pressure (numeric).

Cholesterol

Serum cholesterol in mg/dl (numeric).

BloodSugar

Indicates if fasting blood sugar > 120 mg/dl (logical).

MaximumHR

Maximum heart rate achieved (numeric).

ExerciseInducedAngina

Exercise-induced angina (factor with 2 levels).

HeartDisease

Presence or absence of heart disease (factor with 2 levels).

Details

The dataset name has been kept as 'heartdisease_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the cheese package version 0.1.2. Original source: UCI Machine Learning Repository. Heart Disease Data Set. https://archive.ics.uci.edu/ml/datasets/Heart+Disease


Heart Disease Risk Factors

Description

This dataset, heartdiseaserisk_tbl_df, is a tibble containing cardiovascular risk factor data from 498 individuals. It includes measures of physical activity (biking), smoking habits, and heart disease prevalence.

Usage

data(heartdiseaserisk_tbl_df)

Format

A tibble with 498 observations and 3 variables:

Biking

Frequency of biking activity (numeric)

Heart.disease

Prevalence of heart disease (numeric)

Smoking

Smoking frequency or intensity (numeric)

Details

The dataset name has been kept as 'heartdiseaserisk_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the Path.Analysis package version 0.1


Heart Failure rehospitalization risk

Description

This dataset, heartfailure_df, is a data frame containing simulated data from 800 patients with heart failure who are at risk of recurrent hospitalization. The dataset includes 3,068 observations (2,268 events) tracking patient outcomes over time.

Usage

data(heartfailure_df)

Format

A data frame with 3,068 observations and 6 variables:

id

Patient identification number (integer vector)

treatment

Treatment assignment (factor with 2 levels)

t0

Start time of observation period (numeric vector)

t1

End time of observation period (numeric vector)

enum

Event number (numeric vector)

event

Event indicator (numeric vector)

Details

The dataset name has been kept as 'heartfailure_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the survPen package version 2.0-2. Based on hfaction_cpx12 dataset from package WA.


Statins for Heart Failure Prevention

Description

This dataset, hfPrevention_mtc_network, contains network meta-analysis data from 19 trials comparing statins versus placebo or usual care for cholesterol lowering in heart failure. The main outcome measured is the number of deaths. Trials are categorized as either primary prevention (no previous heart disease) or secondary prevention (previous heart disease).

Usage

data(hfPrevention_mtc_network)

Format

An 'mtc.network' object (list) with 4 components:

description

Character string describing the analysis: "Cholesterol lowering in HF (outcome: death)"

treatments

Data frame with 2 treatments:

id

Treatment ID (factor with 2 levels)

description

Treatment description (character vector)

data.ab

Data frame with 38 rows (arm-level data):

study

Study ID (factor with 19 levels)

treatment

Treatment assignment (factor with 2 levels)

responders

Number of deaths (integer vector)

sampleSize

Total sample size per arm (integer vector)

studies

Data frame with 19 rows (study-level data):

study

Study ID (factor with 19 levels)

secondary

Prevention type: 0 = primary, 1 = secondary (integer vector)

Details

The dataset name has been kept as 'hfPrevention_mtc_network' to maintain consistency with its original source and to avoid confusion with other datasets. This naming convention helps identify this specific network meta-analysis dataset from the CardioDataSets package. The dataset is structured as an 'mtc.network' object, which is the standard format for network meta-analysis in the gemtc package. The original content has not been modified.

Source

Data taken from the gemtc package version 1.0-2. Original publication: Dias S, Sutton AJ, Welton NJ, Ades AE (2013). "Heterogeneity - Subgroups, Meta-Regression, Bias, and Bias-Adjustment." Medical Decision Making, 33(5):618-640.


Elderly CV/MRI and Biomarkers

Description

This dataset, mriCardioVars_tbl_df, is a tibble containing MRI and clinical data from 735 elderly participants in a U.S. observational study of cardiovascular and cerebrovascular disease incidence. It includes 30 variables covering demographic, clinical, and imaging measures.

Usage

data(mriCardioVars_tbl_df)

Format

A tibble with 735 observations and 30 variables:

ptid

Patient identification number (numeric)

mridate

MRI date (Date)

age

Age in years (numeric)

sex

Sex (character)

race

Race (character)

weight

Weight in kg (numeric)

height

Height in cm (numeric)

packyrs

Smoking pack-years (numeric)

yrsquit

Years since quitting smoking (numeric)

alcoh

Alcohol consumption (numeric)

physact

Physical activity level (numeric)

chf

Congestive heart failure status (numeric)

chd

Coronary heart disease status (numeric)

stroke

Stroke history (numeric)

diabetes

Diabetes status (numeric)

genhlth

General health status (numeric)

ldl

LDL cholesterol in mg/dL (numeric)

alb

Albumin level (numeric)

crt

Creatinine level (numeric)

plt

Platelet count (numeric)

sbp

Systolic blood pressure in mmHg (numeric)

aai

Ankle-arm index (numeric)

fev

Forced expiratory volume (numeric)

dsst

Digit Symbol Substitution Test score (numeric)

atrophy

Brain atrophy measure (numeric)

whgrd

White matter hyperintensity grade (numeric)

numinf

Number of brain infarcts (numeric)

volinf

Volume of brain infarcts (numeric)

obstime

Observation time (numeric)

death

Mortality status (numeric)

Details

The dataset name has been kept as 'mriCardioVars_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the rigr package version 1.0.7


Muscatine pediatric CRF

Description

This dataset, muscatine_coronary_risk_df, is a data frame containing longitudinal observations from the Muscatine Coronary Risk Factor (MCRF) study, which examined the development of coronary disease risk factors in children. It includes 14,568 observations of 4,856 children tracked from 1977 to 1981.

Usage

data(muscatine_coronary_risk_df)

Format

A data frame with 14,568 observations and 7 variables:

id

Child identification number (integer)

gender

Gender of child (factor with 2 levels)

base_age

Age at first observation in years (integer)

age

Current age in years (integer)

occasion

Measurement occasion (integer)

obese

Obesity status (factor with 2 levels)

numobese

Numeric obesity indicator (numeric)

Details

The dataset name has been kept as 'muscatine_coronary_risk_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the geepack package version 1.3.12. Original study: The Muscatine Coronary Risk Factor Study, University of Iowa, 1977-1981.


Streptokinase Therapy in AMI

Description

This dataset, myocardialinfarction_df, is a data frame containing information from 33 clinical trials comparing intravenous streptokinase versus placebo or no therapy in patients hospitalized for acute myocardial infarction. It includes data on treatment outcomes between intervention and control groups.

Usage

data(myocardialinfarction_df)

Format

A data frame with 33 observations and 6 variables:

trial

Trial identifier (character vector)

year

Year of publication (integer vector)

ai

Number of events in treatment group (integer vector)

n1i

Total number of participants in treatment group (integer vector)

ci

Number of events in control group (integer vector)

n2i

Total number of participants in control group (integer vector)

Details

The dataset name has been kept as 'myocardialinfarction_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the metadat package version 1.2-0. Original publication: Lau J, Antman EM, Jimenez-Silva J, Kupelnick B, Mosteller F, Chalmers TC (1992). "Cumulative meta-analysis of therapeutic trials for myocardial infarction." New England Journal of Medicine, 327(4):248-254.


CAV in Heart Transplant Patients

Description

This dataset, patient_CAV_df, is a data frame containing longitudinal follow-up data from heart transplant recipients at Papworth Hospital, UK. It tracks 2,803 angiographic examinations for the onset of cardiac allograft vasculopathy and mortality.

Usage

data(patient_CAV_df)

Format

A data frame with 2,803 observations and 5 variables:

PTNUM

Patient identification number (integer)

years

Time since transplant in years (numeric)

state

Disease state (numeric)

dage

Donor age in years (integer)

pdiag

Primary diagnosis code (numeric)

Details

The dataset name has been kept as 'patient_CAV_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the flexmsm package version 0.1.2. Original data: Papworth Hospital, UK. Subset of cav data from msm package.


Radial Artery IVUS Patient Data

Description

This dataset, radial_ivus_df, is a data frame containing demographic and clinical data from 115 patients who underwent intravascular ultrasound (IVUS) examination of the radial artery following transradial coronary angiography. It includes 15 variables covering patient characteristics, laboratory results, and IVUS measurements.

Usage

data(radial_ivus_df)

Format

A data frame with 115 observations and 15 variables:

male

Male sex indicator (integer: 0/1)

age

Age in years (integer)

height

Height in cm (numeric)

weight

Weight in kg (numeric)

HBP

High blood pressure status (integer: 0/1)

DM

Diabetes mellitus status (integer: 0/1)

smoking

Smoking status (factor with 3 levels)

TC

Total cholesterol in mg/dL (integer)

TG

Triglycerides in mg/dL (integer)

HDL

HDL cholesterol in mg/dL (integer)

LDL

LDL cholesterol in mg/dL (integer)

hsCRP

High-sensitivity C-reactive protein in mg/L (numeric)

NTAV

Normalized total atheroma volume (numeric)

PAV

Percent atheroma volume (numeric)

sex

Sex (factor with 2 levels)

Details

The dataset name has been kept as 'radial_ivus_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the moonBook package version 0.3.1


Scottish Health Survey CVD

Description

This dataset, scottish_CVD_df, is a data frame containing cardiovascular health data from the 1998 Scottish Health Survey. It includes information from 8,804 respondents aged 18-64, with variables covering demographics, health behaviors, and cardiovascular disease status.

Usage

data(scottish_CVD_df)

Format

A data frame with 8,804 observations and 8 variables:

age

Respondent age in years (integer)

sex

Respondent sex (factor with 2 levels)

sc

Social class (factor with 3 levels)

cvddef

Doctor-diagnosed CVD status (integer: 0=no, 1=yes)

carstair

Carstairs deprivation score (numeric)

smoke

Smoking status (factor with 5 levels)

id

Respondent identification number (integer)

area

Geographic area code (integer)

Details

The dataset name has been kept as 'scottish_CVD_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the R2MLwiN package version 0.8-9. Original survey: 1998 Scottish Health Survey. Methodology reference: Charlton C, Rasbash J, Browne WJ, Healy M, Cameron B (2024). MLwiN Version 3.09. Centre for Multilevel Modelling, University of Bristol.


Statin intensity and MI risk

Description

This dataset, statinMIrisk_df, is a data frame containing results from 4 clinical trials investigating the effect of statin therapy intensity on the risk of myocardial infarction or coronary death. The data compares intensive versus standard statin regimens.

Usage

data(statinMIrisk_df)

Format

A data frame with 4 observations and 5 variables:

study

Study identifier (character)

eI

Number of events in intensive treatment group (numeric)

nI

Total patients in intensive treatment group (numeric)

eC

Number of events in control/standard group (numeric)

nC

Total patients in control/standard group (numeric)

Details

The dataset name has been kept as 'statinMIrisk_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the RTSA package version 0.2.2


Sulphinpyrazone for post-MI death prevention

Description

This dataset, sulphinpyrazone_tbl_df, is a tibble containing information from a clinical trial studying the efficacy of sulphinpyrazone in preventing sudden death after myocardial infarction. The data includes 1,475 patients randomly assigned to either the treatment or control group.

Usage

data(sulphinpyrazone_tbl_df)

Format

A tibble with 1,475 observations and 2 variables:

group

Treatment assignment (factor with 2 levels: "control" and "treatment")

outcome

Patient outcome (factor with 2 levels)

Details

The dataset name has been kept as 'sulphinpyrazone_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the openintro package version 2.5.0. Original study: Anturane Reinfarction Trial Research Group (1980). "Sulfinpyrazone in the prevention of sudden death after myocardial infarction." New England Journal of Medicine, 302(5):250-256.


US Mortality Rates by Cause and Gender

Description

This dataset, usMortality_df, is a data frame containing mortality rates across all ages in the USA from 2011-2013, stratified by cause of death, sex, and rural/urban status. It includes national aggregate rates for 10 causes of death, including Heart disease.

Usage

data(usMortality_df)

Format

A data frame with 40 observations and 5 variables:

Status

Residential status (factor: Rural/Urban)

Sex

Gender (factor: Male/Female)

Cause

Cause of death (factor with 10 levels)

Rate

Mortality rate per 100,000 population (numeric)

SE

Standard error of mortality rate (numeric)

Details

The dataset name has been kept as 'usMortality_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the lattice package version 0.22-6. Original source: Rural Health Reform Policy Research Center (2015). "Exploring Rural and Urban Mortality Differences." Bethesda, MD: August 2015.


View Available Datasets in CardioDataSets

Description

This function lists all datasets available in the 'CardioDataSets' package. If the 'CardioDataSets' package is not loaded, it stops and shows an error message. If no datasets are available, it returns a message and an empty vector.

Usage

view_datasets()

Value

A character vector with the names of the available datasets. If no datasets are found, it returns an empty character vector.

Examples

if (requireNamespace("CardioDataSets", quietly = TRUE)) {
  library(CardioDataSets)
  view_datasets()
}