Help for package CardioDataSets

Type:

Package

Title:

A Comprehensive Collection of Cardiovascular and Heart Disease Datasets

Version:

0.1.0

Maintainer:

Renzo Caceres Rossi <arenzocaceresrossi@gmail.com>

Description:

Offers a diverse collection of datasets focused on cardiovascular and heart disease research, including heart failure, myocardial infarction, aortic dissection, transplant outcomes, cardiovascular risk factors, drug efficacy, and mortality trends. Designed for researchers, clinicians, epidemiologists, and data scientists, the package features clinical, epidemiological, and simulated datasets covering a wide range of conditions and treatments such as statins, anticoagulants, and beta blockers. It supports analyses related to disease progression, treatment effects, rehospitalization, and public health outcomes across various cardiovascular patient populations.

License:

GPL-3

URL:

https://github.com/lightbluetitan/cardiodatasets, https://lightbluetitan.github.io/cardiodatasets/

BugReports:

https://github.com/lightbluetitan/cardiodatasets/issues

Encoding:

UTF-8

LazyData:

true

Suggests:

ggplot2, testthat (≥ 3.0.0), dplyr, knitr, rmarkdown

Depends:

R (≥ 4.2.0)

Imports:

utils

RoxygenNote:

7.3.2

Config/testthat/edition:

VignetteBuilder:

knitr

NeedsCompilation:

Packaged:

2025-05-10 04:15:44 UTC; renzocrossi

Author:

Renzo Caceres Rossi [aut, cre]

Repository:

CRAN

Date/Publication:

2025-05-13 08:20:06 UTC

CardioDataSets: A Comprehensive Collection of Cardiovascular and Heart Disease Datasets

Description

This package provides a wide variety of datasets focused on heart and cardiovascular research, covering heart disease, myocardial infarction, heart failure, stroke, ischemic heart disease, risk factors, clinical trials, and treatment outcomes.

Details

CardioDataSets: A Comprehensive Collection of Cardiovascular and Heart Disease Datasets

A Comprehensive Collection of Cardiovascular and Heart Disease Datasets.

Author(s)

Maintainer: Renzo Caceres Rossi arenzocaceresrossi@gmail.com

Acute Coronary Syndrome (ACS) Patient Data

Description

This dataset, acs_patients_df, is a data frame containing demographic and clinical data from 857 patients with Acute Coronary Syndrome (ACS). It includes 17 variables covering patient characteristics, vital signs, laboratory results, and risk factors.

Usage

data(acs_patients_df)

Format

A data frame with 857 observations and 17 variables:

age: Patient age in years (integer)
sex: Patient sex (character)
cardiogenicShock: Presence of cardiogenic shock (character)
entry: Method of hospital entry (character)
Dx: Diagnosis (character)
EF: Ejection fraction percentage (numeric)
height: Height in cm (numeric)
weight: Weight in kg (numeric)
BMI: Body Mass Index in kg/m² (numeric)
obesity: Obesity status (character)
TC: Total cholesterol in mg/dL (numeric)
LDLC: LDL cholesterol in mg/dL (integer)
HDLC: HDL cholesterol in mg/dL (integer)
TG: Triglycerides in mg/dL (integer)
DM: Diabetes mellitus status (character)
HBP: High blood pressure status (character)
smoking: Smoking status (character)

Details

The dataset name has been kept as 'acs_patients_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the moonBook package version 0.3.1

Age vs. Maximum Heart Rate

Description

This dataset, age_heartrate_df, is a data frame containing simulated data representing the relationship between age and maximum heart rate. It includes 15 observations based on established physiological models.

Usage

data(age_heartrate_df)

Format

A data frame with 15 observations and 2 variables:

age: Age in years (numeric)
maxrate: Maximum predicted heart rate in beats per minute (numeric)

Details

The dataset name has been kept as 'age_heartrate_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the UsingR package version 2.0-7. Original research: Tanaka H, Monahan KD, Seals DR (2001). "Age-predicted maximal heart rate revisited." Journal of the American College of Cardiology, 37(1):153-156.

Acute Myocardial Infarction (Heart Attack) Events

Description

This dataset, ami_occurrences_tbl_df, is a tibble containing simulated but realistic daily counts of Acute Myocardial Infarction (AMI) occurrences in New York City over one year (365 days). The data represents the number of heart attack events recorded each day.

Usage

data(ami_occurrences_tbl_df)

Format

A tibble with 365 observations and 1 variable:

ami: Number of Acute Myocardial Infarction events recorded each day (integer vector)

Details

The dataset name has been kept as 'ami_occurrences_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the openintro package version 2.5.0

Aortic dissection patients

Description

This dataset, aortaDiss_tbl_df, is a tibble containing clinical information from 226 patients with aortic dissection. It includes demographic variables, symptom presentation, and risk factor data.

Usage

data(aortaDiss_tbl_df)

Format

A tibble with 226 observations and 10 variables:

Gender: Patient gender (numeric)
Age: Patient age in years (numeric)
Age_C: Categorized age (numeric)
Aortadis: Aortic dissection status (numeric)
Acute: Acute presentation indicator (numeric)
Acute3: Three-level acute presentation classification (numeric)
Stomach_Ache: Presence of stomach ache (numeric)
Hyper: Hypertension status (numeric)
Smoking: Smoking status (numeric)
Radiation: Radiation exposure (numeric)

Details

The dataset name has been kept as 'aortaDiss_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the psfmi package version 1.4.0

FDA Beta Blockers Adverse Events

Description

This dataset, betablockers_matrix, is a matrix containing adverse event reports from the FDA Adverse Event Reporting System (FAERS) for 9 beta blockers from Q1 2021 to Q4 2023. The matrix includes 501 adverse events (rows) across 9 medications (columns).

Usage

data(betablockers_matrix)

Format

A matrix with 501 rows (adverse events) and 9 columns (beta blockers):

Acebutolol: Adverse event counts for Acebutolol (integer)
Atenolol: Adverse event counts for Atenolol (integer)
Bisoprolol: Adverse event counts for Bisoprolol (integer)
Carvedilol: Adverse event counts for Carvedilol (integer)
Metoprolol: Adverse event counts for Metoprolol (integer)
Nadolol: Adverse event counts for Nadolol (integer)
Propranolol: Adverse event counts for Propranolol (integer)
Timolol: Adverse event counts for Timolol (integer)
Other: Adverse event counts for other beta blockers (integer)

Details

The dataset name has been kept as 'betablockers_matrix' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'matrix' indicates that the dataset is a matrix object. The original content has not been modified in any way.

Source

Data taken from the MDDC package version 1.1.0. Original data: FDA Adverse Event Reporting System (FAERS) database, Q1 2021 to Q4 2023.

Anticoagulants for CAD Patients

Description

This dataset, cad_anticoagulants_df, is a data frame containing information from 34 clinical trials examining the effectiveness of oral anticoagulants in patients with coronary artery disease. It includes data on treatment outcomes comparing anticoagulant therapy with control groups.

Usage

data(cad_anticoagulants_df)

Format

A data frame with 34 observations and 9 variables:

study: Study identifier (character vector)
year: Year of publication (integer vector)
intensity: Intensity of anticoagulation treatment (character vector)
asp.t: Aspirin use in treatment group (integer vector)
asp.c: Aspirin use in control group (integer vector)
ai: Number of events in treatment group (integer vector)
n1i: Total number of participants in treatment group (integer vector)
ci: Number of events in control group (integer vector)
n2i: Total number of participants in control group (integer vector)

Details

The dataset name has been kept as 'cad_anticoagulants_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the metadat package version 1.2-0

Heart Failure Clinical Dataset

Description

This dataset, cardiac_failure_df, is a data frame containing clinical data from 299 patients with heart failure. It includes 13 variables covering demographic information, medical history, laboratory results, and mortality outcomes.

Usage

data(cardiac_failure_df)

Format

A data frame with 299 observations and 13 variables:

age: Patient age in years (numeric)
anaemia: Presence of anaemia (integer: 0=no, 1=yes)
creatinine_phosphokinase: Level of CPK enzyme in mcg/L (integer)
diabetes: Presence of diabetes (integer: 0=no, 1=yes)
ejection_fraction: Percentage of blood leaving heart (integer)
high_blood_pressure: Presence of hypertension (integer: 0=no, 1=yes)
platelets: Platelet count in kiloplatelets/mL (numeric)
serum_creatinine: Level of serum creatinine in mg/dL (numeric)
serum_sodium: Level of serum sodium in mEq/L (integer)
sex: Patient sex (integer: 0=female, 1=male)
smoking: Smoking status (integer: 0=no, 1=yes)
time: Follow-up period in days (integer)
DEATH_EVENT: Death during follow-up (integer: 0=no, 1=yes)

Details

The dataset name has been kept as 'cardiac_failure_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the SOPC package version 0.1.0

Coronary Artery Disease GWAS Meta-Analysis

Description

This dataset, cardiac_gwas_df, is a data frame containing genome-wide association study (GWAS) results from a multi-ethnic meta-analysis of coronary artery disease (CAD). It includes 9,919 genetic variants with their effect sizes and study characteristics.

Usage

data(cardiac_gwas_df)

Format

A data frame with 9,919 observations and 7 variables:

beta_flipped: Effect size estimates (numeric)
gcse: Genomic control standard error (numeric)
variants: Genetic variant identifiers (character)
studies: Participating studies (character)
cases: Number of cases (integer)
controls: Number of controls (integer)
fdr214_gwas46: False discovery rate adjusted p-values (numeric)

Details

The dataset name has been kept as 'cardiac_gwas_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the getmstatistic package version 0.2.2

Cardiovascular Risk Factors

Description

This dataset, cardioRiskFactors_df, is a data frame containing information from a study investigating the association between uric acid and cardiovascular risk factors in developing countries. It includes data from 998 participants (474 men and 524 women) aged 25-64 years.

Usage

data(cardioRiskFactors_df)

Format

A data frame with 998 observations and 14 variables:

age: Age in years (integer)
bmi: Body Mass Index in kg/m² (numeric)
waisthip: Waist-to-hip ratio (numeric)
smok: Smoking status (integer)
choles: Total cholesterol in mg/dL (numeric)
trig: Triglycerides in mg/dL (numeric)
hdl: HDL cholesterol in mg/dL (numeric)
ldl: LDL cholesterol in mg/dL (numeric)
sys: Systolic blood pressure in mmHg (integer)
dia: Diastolic blood pressure in mmHg (numeric)
Uric: Uric acid level in mg/dL (integer)
sex: Sex (integer)
alco: Alcohol consumption (numeric)
apoa: Apolipoprotein A in mg/dL (numeric)

Details

The dataset name has been kept as 'cardioRiskFactors_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the Rfit package version 0.27.0. Original study: Heritier S, Cantoni E, Copt S, Victoria-Feser M (2009). Robust Methods in Biostatistics. New York: John Wiley and Sons.

Cardiovascular risks of diabetes drugs

Description

This dataset, cardio_diabetes_tbl_df, is a tibble containing information comparing cardiovascular problems between two diabetes medications (Rosiglitazone and Pioglitazone) in elderly Medicare patients. It includes data from 227,571 patients.

Usage

data(cardio_diabetes_tbl_df)

Format

A tibble with 227,571 observations and 2 variables:

treatment: Type of diabetes medication (factor with 2 levels: Rosiglitazone or Pioglitazone)
cardiovascular_problems: Presence of cardiovascular problems (factor with 2 levels)

Details

The dataset name has been kept as 'cardio_diabetes_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the openintro package version 2.5.0. Original study: Graham DJ, et al. (2010). "Risk of acute myocardial infarction, stroke, heart failure, and death in elderly Medicare patients treated with rosiglitazone or pioglitazone." JAMA, 304(4):411.

Statin Dose Comparison Trials for CVD

Description

This dataset, cardiovascular_list, is a list containing data from 34 clinical trials comparing low dose (1), high dose (2), and placebo (3) statins for cardiovascular disease prevention. The dataset includes study identifiers, treatment assignments, and outcome counts.

Usage

data(cardiovascular_list)

Format

A list with 4 components:

Study

Study identifiers (integer vector of length 34)

Treat

Treatment assignments (numeric vector: 1=low dose, 2=high dose, 3=placebo)

Outcomes

Outcome matrix with 34 rows and 3 columns:

Alive: Number of patients alive (numeric)
FnCVD: Number with non-fatal CVD events (numeric)
FCVD: Number with fatal CVD events (numeric)

N

Sample sizes (numeric vector of length 34)

Details

The dataset name has been kept as 'cardiovascular_list' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The original content has not been modified in any way.

Source

Data taken from the bnma package version 1.6.0

High vs Moderate Statins for MI Prevention

Description

This dataset, coronary_death_df, is a data frame containing information from 4 clinical trials comparing intensive (high dose) versus moderate (standard dose) statin therapy for preventing coronary death or myocardial infarction. It includes data on treatment outcomes across multiple endpoints.

Usage

data(coronary_death_df)

Format

A data frame with 4 observations and 16 variables:

trial: Trial identifier (character vector)
pop: Patient population description (character vector)
nt: Number of patients in treatment group (integer vector)
nc: Number of patients in control group (integer vector)
ep1t: Endpoint 1 events in treatment group (integer vector)
ep1c: Endpoint 1 events in control group (integer vector)
ep2t: Endpoint 2 events in treatment group (integer vector)
ep2c: Endpoint 2 events in control group (integer vector)
ep3t: Endpoint 3 events in treatment group (integer vector)
ep3c: Endpoint 3 events in control group (integer vector)
ep4t: Endpoint 4 events in treatment group (integer vector)
ep4c: Endpoint 4 events in control group (integer vector)
ep5t: Endpoint 5 events in treatment group (integer vector)
ep5c: Endpoint 5 events in control group (integer vector)
ep6t: Endpoint 6 events in treatment group (integer vector)
ep6c: Endpoint 6 events in control group (integer vector)

Details

The dataset name has been kept as 'coronary_death_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the metadat package version 1.2-0

Blood thinners in CPR survival

Description

This dataset, cpr_survival_tbl_df, is a tibble containing information from a study examining the effect of blood thinners on survival rates in CPR patients. The study randomly assigned 90 patients to either receive a blood thinner (treatment group) or not receive one (control group), with the outcome being survival for at least 24 hours.

Usage

data(cpr_survival_tbl_df)

Format

A tibble with 90 observations and 2 variables:

group: Treatment assignment (factor with 2 levels: "control" and "treatment")
outcome: Survival status (factor with 2 levels: "died" and "survived")

Details

The dataset name has been kept as 'cpr_survival_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the openintro package version 2.5.0

LA pollution and cardiovascular mortality

Description

This dataset, cv_mortality_ts, is a time series containing weekly cardiovascular mortality data from Los Angeles County. It consists of 508 six-day smoothed averages obtained by filtering daily values over the 10-year period from 1970 to 1979.

Usage

data(cv_mortality_ts)

Format

A time series object (ts) with 508 observations:

cv_mortality: Weekly cardiovascular mortality counts (numeric vector)

Details

The dataset name has been kept as 'cv_mortality_ts' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'ts' indicates that the dataset is a time series object. The original content has not been modified in any way.

Time series characteristics: - Start: 1970, Week 1 - End: 1979, Week 40 - Frequency: 52 (weekly data)

Source

Data taken from the astsa package version 2.2

Anger recall effect on heart rate (Lakens, 2013)

Description

This dataset, emotion_heartrate_df, is a data frame containing heart rate measurements from a study investigating how recalling anger affects heart rate. It includes baseline and anger-induced heart rate measurements from 68 participants.

Usage

data(emotion_heartrate_df)

Format

A data frame with 68 observations and 3 variables:

ID: Participant identification number (integer vector)
HR_baseline: Baseline heart rate in beats per minute (numeric vector)
HR_anger: Heart rate during anger recall in beats per minute (numeric vector)

Details

The dataset name has been kept as 'emotion_heartrate_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the esci package version 1.0-7. Original study: Lakens D (2013). Conceptual replication of Ekman et al. (1983) emotion study.

Artificial Heart Transplant Durations

Description

This dataset, heartTransplantTime_tbl_df, is a tibble containing the durations (in hours) of 15 artificial heart transplant operations.

Usage

data(heartTransplantTime_tbl_df)

Format

A tibble with 15 observations and 1 variable:

duration: Operation duration in hours (numeric)

Details

The dataset name has been kept as 'heartTransplantTime_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the BSDA package version 1.2.3. Original source: Kitchens LJ (2003). "Basic Statistics and Data Analysis." Pacific Grove, CA: Brooks/Cole, a division of Thomson Learning.

Stanford Heart Transplant Data

Description

This dataset, heart_transplant_df, is a data frame containing survival data from the Stanford heart transplant program. It includes information on 172 patients with follow-up times, transplant status, and clinical covariates.

Usage

data(heart_transplant_df)

Format

A data frame with 172 observations and 8 variables:

start: Start time of interval (numeric)
stop: End time of interval (numeric)
event: Survival status (numeric: 1=event, 0=censored)
age: Patient age at enrollment (numeric)
year: Year of enrollment (numeric)
surgery: Prior bypass surgery (numeric)
transplant: Transplant status (factor: 0=no, 1=yes)
id: Patient identification number (numeric)

Details

The dataset name has been kept as 'heart_transplant_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the lrstat package version 0.2.13. Original source: Stanford Heart Transplant Study data from the survival package.

Heart Disease Patients Clinical Data

Description

This dataset, heartdisease_tbl_df, is a tibble containing information on individuals evaluated for heart disease. It is a cleaned version of the original "Heart Disease" dataset from the UCI Machine Learning Repository, and includes 303 observations on 9 variables.

Usage

data(heartdisease_tbl_df)

Format

A tibble with 303 observations and 9 variables:

Age: Age of the individual (numeric).
Sex: Sex of the individual (factor with 2 levels: typically "Male" and "Female").
ChestPain: Type of chest pain experienced (factor with 4 levels).
BP: Resting blood pressure (numeric).
Cholesterol: Serum cholesterol in mg/dl (numeric).
BloodSugar: Indicates if fasting blood sugar > 120 mg/dl (logical).
MaximumHR: Maximum heart rate achieved (numeric).
ExerciseInducedAngina: Exercise-induced angina (factor with 2 levels).
HeartDisease: Presence or absence of heart disease (factor with 2 levels).

Details

The dataset name has been kept as 'heartdisease_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the cheese package version 0.1.2. Original source: UCI Machine Learning Repository. Heart Disease Data Set. https://archive.ics.uci.edu/ml/datasets/Heart+Disease

Heart Disease Risk Factors

Description

This dataset, heartdiseaserisk_tbl_df, is a tibble containing cardiovascular risk factor data from 498 individuals. It includes measures of physical activity (biking), smoking habits, and heart disease prevalence.

Usage

data(heartdiseaserisk_tbl_df)

Format

A tibble with 498 observations and 3 variables:

Biking: Frequency of biking activity (numeric)
Heart.disease: Prevalence of heart disease (numeric)
Smoking: Smoking frequency or intensity (numeric)

Details

The dataset name has been kept as 'heartdiseaserisk_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the Path.Analysis package version 0.1

Heart Failure rehospitalization risk

Description

This dataset, heartfailure_df, is a data frame containing simulated data from 800 patients with heart failure who are at risk of recurrent hospitalization. The dataset includes 3,068 observations (2,268 events) tracking patient outcomes over time.

Usage

data(heartfailure_df)

Format

A data frame with 3,068 observations and 6 variables:

id: Patient identification number (integer vector)
treatment: Treatment assignment (factor with 2 levels)
t0: Start time of observation period (numeric vector)
t1: End time of observation period (numeric vector)
enum: Event number (numeric vector)
event: Event indicator (numeric vector)

Details

The dataset name has been kept as 'heartfailure_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the survPen package version 2.0-2. Based on hfaction_cpx12 dataset from package WA.

Statins for Heart Failure Prevention

Description

This dataset, hfPrevention_mtc_network, contains network meta-analysis data from 19 trials comparing statins versus placebo or usual care for cholesterol lowering in heart failure. The main outcome measured is the number of deaths. Trials are categorized as either primary prevention (no previous heart disease) or secondary prevention (previous heart disease).

Usage

data(hfPrevention_mtc_network)

Format

An 'mtc.network' object (list) with 4 components:

description

Character string describing the analysis: "Cholesterol lowering in HF (outcome: death)"

treatments

Data frame with 2 treatments:

id: Treatment ID (factor with 2 levels)
description: Treatment description (character vector)

data.ab

Data frame with 38 rows (arm-level data):

study: Study ID (factor with 19 levels)
treatment: Treatment assignment (factor with 2 levels)
responders: Number of deaths (integer vector)
sampleSize: Total sample size per arm (integer vector)

studies

Data frame with 19 rows (study-level data):

study: Study ID (factor with 19 levels)
secondary: Prevention type: 0 = primary, 1 = secondary (integer vector)

Details

The dataset name has been kept as 'hfPrevention_mtc_network' to maintain consistency with its original source and to avoid confusion with other datasets. This naming convention helps identify this specific network meta-analysis dataset from the CardioDataSets package. The dataset is structured as an 'mtc.network' object, which is the standard format for network meta-analysis in the gemtc package. The original content has not been modified.

Source

Data taken from the gemtc package version 1.0-2. Original publication: Dias S, Sutton AJ, Welton NJ, Ades AE (2013). "Heterogeneity - Subgroups, Meta-Regression, Bias, and Bias-Adjustment." Medical Decision Making, 33(5):618-640.

Elderly CV/MRI and Biomarkers

Description

This dataset, mriCardioVars_tbl_df, is a tibble containing MRI and clinical data from 735 elderly participants in a U.S. observational study of cardiovascular and cerebrovascular disease incidence. It includes 30 variables covering demographic, clinical, and imaging measures.

Usage

data(mriCardioVars_tbl_df)

Format

A tibble with 735 observations and 30 variables:

ptid: Patient identification number (numeric)
mridate: MRI date (Date)
age: Age in years (numeric)
sex: Sex (character)
race: Race (character)
weight: Weight in kg (numeric)
height: Height in cm (numeric)
packyrs: Smoking pack-years (numeric)
yrsquit: Years since quitting smoking (numeric)
alcoh: Alcohol consumption (numeric)
physact: Physical activity level (numeric)
chf: Congestive heart failure status (numeric)
chd: Coronary heart disease status (numeric)
stroke: Stroke history (numeric)
diabetes: Diabetes status (numeric)
genhlth: General health status (numeric)
ldl: LDL cholesterol in mg/dL (numeric)
alb: Albumin level (numeric)
crt: Creatinine level (numeric)
plt: Platelet count (numeric)
sbp: Systolic blood pressure in mmHg (numeric)
aai: Ankle-arm index (numeric)
fev: Forced expiratory volume (numeric)
dsst: Digit Symbol Substitution Test score (numeric)
atrophy: Brain atrophy measure (numeric)
whgrd: White matter hyperintensity grade (numeric)
numinf: Number of brain infarcts (numeric)
volinf: Volume of brain infarcts (numeric)
obstime: Observation time (numeric)
death: Mortality status (numeric)

Details

The dataset name has been kept as 'mriCardioVars_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the rigr package version 1.0.7

Muscatine pediatric CRF

Description

This dataset, muscatine_coronary_risk_df, is a data frame containing longitudinal observations from the Muscatine Coronary Risk Factor (MCRF) study, which examined the development of coronary disease risk factors in children. It includes 14,568 observations of 4,856 children tracked from 1977 to 1981.

Usage

data(muscatine_coronary_risk_df)

Format

A data frame with 14,568 observations and 7 variables:

id: Child identification number (integer)
gender: Gender of child (factor with 2 levels)
base_age: Age at first observation in years (integer)
age: Current age in years (integer)
occasion: Measurement occasion (integer)
obese: Obesity status (factor with 2 levels)
numobese: Numeric obesity indicator (numeric)

Details

The dataset name has been kept as 'muscatine_coronary_risk_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the geepack package version 1.3.12. Original study: The Muscatine Coronary Risk Factor Study, University of Iowa, 1977-1981.

Streptokinase Therapy in AMI

Description

This dataset, myocardialinfarction_df, is a data frame containing information from 33 clinical trials comparing intravenous streptokinase versus placebo or no therapy in patients hospitalized for acute myocardial infarction. It includes data on treatment outcomes between intervention and control groups.

Usage

data(myocardialinfarction_df)

Format

A data frame with 33 observations and 6 variables:

trial: Trial identifier (character vector)
year: Year of publication (integer vector)
ai: Number of events in treatment group (integer vector)
n1i: Total number of participants in treatment group (integer vector)
ci: Number of events in control group (integer vector)
n2i: Total number of participants in control group (integer vector)

Details

The dataset name has been kept as 'myocardialinfarction_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the metadat package version 1.2-0. Original publication: Lau J, Antman EM, Jimenez-Silva J, Kupelnick B, Mosteller F, Chalmers TC (1992). "Cumulative meta-analysis of therapeutic trials for myocardial infarction." New England Journal of Medicine, 327(4):248-254.

CAV in Heart Transplant Patients

Description

This dataset, patient_CAV_df, is a data frame containing longitudinal follow-up data from heart transplant recipients at Papworth Hospital, UK. It tracks 2,803 angiographic examinations for the onset of cardiac allograft vasculopathy and mortality.

Usage

data(patient_CAV_df)

Format

A data frame with 2,803 observations and 5 variables:

PTNUM: Patient identification number (integer)
years: Time since transplant in years (numeric)
state: Disease state (numeric)
dage: Donor age in years (integer)
pdiag: Primary diagnosis code (numeric)

Details

The dataset name has been kept as 'patient_CAV_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the flexmsm package version 0.1.2. Original data: Papworth Hospital, UK. Subset of cav data from msm package.

Radial Artery IVUS Patient Data

Description

This dataset, radial_ivus_df, is a data frame containing demographic and clinical data from 115 patients who underwent intravascular ultrasound (IVUS) examination of the radial artery following transradial coronary angiography. It includes 15 variables covering patient characteristics, laboratory results, and IVUS measurements.

Usage

data(radial_ivus_df)

Format

A data frame with 115 observations and 15 variables:

male: Male sex indicator (integer: 0/1)
age: Age in years (integer)
height: Height in cm (numeric)
weight: Weight in kg (numeric)
HBP: High blood pressure status (integer: 0/1)
DM: Diabetes mellitus status (integer: 0/1)
smoking: Smoking status (factor with 3 levels)
TC: Total cholesterol in mg/dL (integer)
TG: Triglycerides in mg/dL (integer)
HDL: HDL cholesterol in mg/dL (integer)
LDL: LDL cholesterol in mg/dL (integer)
hsCRP: High-sensitivity C-reactive protein in mg/L (numeric)
NTAV: Normalized total atheroma volume (numeric)
PAV: Percent atheroma volume (numeric)
sex: Sex (factor with 2 levels)

Details

The dataset name has been kept as 'radial_ivus_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the moonBook package version 0.3.1

Scottish Health Survey CVD

Description

This dataset, scottish_CVD_df, is a data frame containing cardiovascular health data from the 1998 Scottish Health Survey. It includes information from 8,804 respondents aged 18-64, with variables covering demographics, health behaviors, and cardiovascular disease status.

Usage

data(scottish_CVD_df)

Format

A data frame with 8,804 observations and 8 variables:

age: Respondent age in years (integer)
sex: Respondent sex (factor with 2 levels)
sc: Social class (factor with 3 levels)
cvddef: Doctor-diagnosed CVD status (integer: 0=no, 1=yes)
carstair: Carstairs deprivation score (numeric)
smoke: Smoking status (factor with 5 levels)
id: Respondent identification number (integer)
area: Geographic area code (integer)

Details

The dataset name has been kept as 'scottish_CVD_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the R2MLwiN package version 0.8-9. Original survey: 1998 Scottish Health Survey. Methodology reference: Charlton C, Rasbash J, Browne WJ, Healy M, Cameron B (2024). MLwiN Version 3.09. Centre for Multilevel Modelling, University of Bristol.

Statin intensity and MI risk

Description

This dataset, statinMIrisk_df, is a data frame containing results from 4 clinical trials investigating the effect of statin therapy intensity on the risk of myocardial infarction or coronary death. The data compares intensive versus standard statin regimens.

Usage

data(statinMIrisk_df)

Format

A data frame with 4 observations and 5 variables:

study: Study identifier (character)
eI: Number of events in intensive treatment group (numeric)
nI: Total patients in intensive treatment group (numeric)
eC: Number of events in control/standard group (numeric)
nC: Total patients in control/standard group (numeric)

Details

The dataset name has been kept as 'statinMIrisk_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the RTSA package version 0.2.2

Sulphinpyrazone for post-MI death prevention

Description

This dataset, sulphinpyrazone_tbl_df, is a tibble containing information from a clinical trial studying the efficacy of sulphinpyrazone in preventing sudden death after myocardial infarction. The data includes 1,475 patients randomly assigned to either the treatment or control group.

Usage

data(sulphinpyrazone_tbl_df)

Format

A tibble with 1,475 observations and 2 variables:

group: Treatment assignment (factor with 2 levels: "control" and "treatment")
outcome: Patient outcome (factor with 2 levels)

Details

The dataset name has been kept as 'sulphinpyrazone_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data taken from the openintro package version 2.5.0. Original study: Anturane Reinfarction Trial Research Group (1980). "Sulfinpyrazone in the prevention of sudden death after myocardial infarction." New England Journal of Medicine, 302(5):250-256.

US Mortality Rates by Cause and Gender

Description

This dataset, usMortality_df, is a data frame containing mortality rates across all ages in the USA from 2011-2013, stratified by cause of death, sex, and rural/urban status. It includes national aggregate rates for 10 causes of death, including Heart disease.

Usage

data(usMortality_df)

Format

A data frame with 40 observations and 5 variables:

Status: Residential status (factor: Rural/Urban)
Sex: Gender (factor: Male/Female)
Cause: Cause of death (factor with 10 levels)
Rate: Mortality rate per 100,000 population (numeric)
SE: Standard error of mortality rate (numeric)

Details

The dataset name has been kept as 'usMortality_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.

Source

Data taken from the lattice package version 0.22-6. Original source: Rural Health Reform Policy Research Center (2015). "Exploring Rural and Urban Mortality Differences." Bethesda, MD: August 2015.

View Available Datasets in CardioDataSets

Description

This function lists all datasets available in the 'CardioDataSets' package. If the 'CardioDataSets' package is not loaded, it stops and shows an error message. If no datasets are available, it returns a message and an empty vector.

Usage

view_datasets()

Value

A character vector with the names of the available datasets. If no datasets are found, it returns an empty character vector.

Examples

if (requireNamespace("CardioDataSets", quietly = TRUE)) {
  library(CardioDataSets)
  view_datasets()
}

CardioDataSets: A Comprehensive Collection of Cardiovascular and Heart Disease Datasets

Description

Details

Author(s)

See Also

Acute Coronary Syndrome (ACS) Patient Data

Description

Usage

Format

Details

Source

Age vs. Maximum Heart Rate

Description

Usage

Format

Details

Source

Acute Myocardial Infarction (Heart Attack) Events

Description

Usage

Format

Details

Source

Aortic dissection patients

Description

Usage

Format

Details

Source

FDA Beta Blockers Adverse Events

Description

Usage

Format

Details

Source

Anticoagulants for CAD Patients

Description

Usage

Format

Details

Source

Heart Failure Clinical Dataset

Description

Usage

Format

Details

Source

Coronary Artery Disease GWAS Meta-Analysis

Description

Usage

Format

Details

Source

Cardiovascular Risk Factors

Description

Usage

Format

Details

Source

Cardiovascular risks of diabetes drugs

Description

Usage

Format

Details

Source

Statin Dose Comparison Trials for CVD

Description

Usage

Format

Details

Source

High vs Moderate Statins for MI Prevention

Description

Usage

Format

Details

Source

Blood thinners in CPR survival

Description

Usage