Type: | Package |
Title: | A Comprehensive Collection of Cardiovascular and Heart Disease Datasets |
Version: | 0.1.0 |
Maintainer: | Renzo Caceres Rossi <arenzocaceresrossi@gmail.com> |
Description: | Offers a diverse collection of datasets focused on cardiovascular and heart disease research, including heart failure, myocardial infarction, aortic dissection, transplant outcomes, cardiovascular risk factors, drug efficacy, and mortality trends. Designed for researchers, clinicians, epidemiologists, and data scientists, the package features clinical, epidemiological, and simulated datasets covering a wide range of conditions and treatments such as statins, anticoagulants, and beta blockers. It supports analyses related to disease progression, treatment effects, rehospitalization, and public health outcomes across various cardiovascular patient populations. |
License: | GPL-3 |
URL: | https://github.com/lightbluetitan/cardiodatasets, https://lightbluetitan.github.io/cardiodatasets/ |
BugReports: | https://github.com/lightbluetitan/cardiodatasets/issues |
Encoding: | UTF-8 |
LazyData: | true |
Suggests: | ggplot2, testthat (≥ 3.0.0), dplyr, knitr, rmarkdown |
Depends: | R (≥ 4.2.0) |
Imports: | utils |
RoxygenNote: | 7.3.2 |
Config/testthat/edition: | 3 |
VignetteBuilder: | knitr |
NeedsCompilation: | no |
Packaged: | 2025-05-10 04:15:44 UTC; renzocrossi |
Author: | Renzo Caceres Rossi [aut, cre] |
Repository: | CRAN |
Date/Publication: | 2025-05-13 08:20:06 UTC |
CardioDataSets: A Comprehensive Collection of Cardiovascular and Heart Disease Datasets
Description
This package provides a wide variety of datasets focused on heart and cardiovascular research, covering heart disease, myocardial infarction, heart failure, stroke, ischemic heart disease, risk factors, clinical trials, and treatment outcomes.
Details
CardioDataSets: A Comprehensive Collection of Cardiovascular and Heart Disease Datasets
A Comprehensive Collection of Cardiovascular and Heart Disease Datasets.
Author(s)
Maintainer: Renzo Caceres Rossi arenzocaceresrossi@gmail.com
See Also
Useful links:
Acute Coronary Syndrome (ACS) Patient Data
Description
This dataset, acs_patients_df, is a data frame containing demographic and clinical data from 857 patients with Acute Coronary Syndrome (ACS). It includes 17 variables covering patient characteristics, vital signs, laboratory results, and risk factors.
Usage
data(acs_patients_df)
Format
A data frame with 857 observations and 17 variables:
- age
Patient age in years (integer)
- sex
Patient sex (character)
- cardiogenicShock
Presence of cardiogenic shock (character)
- entry
Method of hospital entry (character)
- Dx
Diagnosis (character)
- EF
Ejection fraction percentage (numeric)
- height
Height in cm (numeric)
- weight
Weight in kg (numeric)
- BMI
Body Mass Index in kg/m² (numeric)
- obesity
Obesity status (character)
- TC
Total cholesterol in mg/dL (numeric)
- LDLC
LDL cholesterol in mg/dL (integer)
- HDLC
HDL cholesterol in mg/dL (integer)
- TG
Triglycerides in mg/dL (integer)
- DM
Diabetes mellitus status (character)
- HBP
High blood pressure status (character)
- smoking
Smoking status (character)
Details
The dataset name has been kept as 'acs_patients_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the moonBook package version 0.3.1
Age vs. Maximum Heart Rate
Description
This dataset, age_heartrate_df, is a data frame containing simulated data representing the relationship between age and maximum heart rate. It includes 15 observations based on established physiological models.
Usage
data(age_heartrate_df)
Format
A data frame with 15 observations and 2 variables:
- age
Age in years (numeric)
- maxrate
Maximum predicted heart rate in beats per minute (numeric)
Details
The dataset name has been kept as 'age_heartrate_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the UsingR package version 2.0-7. Original research: Tanaka H, Monahan KD, Seals DR (2001). "Age-predicted maximal heart rate revisited." Journal of the American College of Cardiology, 37(1):153-156.
Acute Myocardial Infarction (Heart Attack) Events
Description
This dataset, ami_occurrences_tbl_df, is a tibble containing simulated but realistic daily counts of Acute Myocardial Infarction (AMI) occurrences in New York City over one year (365 days). The data represents the number of heart attack events recorded each day.
Usage
data(ami_occurrences_tbl_df)
Format
A tibble with 365 observations and 1 variable:
- ami
Number of Acute Myocardial Infarction events recorded each day (integer vector)
Details
The dataset name has been kept as 'ami_occurrences_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
Data taken from the openintro package version 2.5.0
Aortic dissection patients
Description
This dataset, aortaDiss_tbl_df, is a tibble containing clinical information from 226 patients with aortic dissection. It includes demographic variables, symptom presentation, and risk factor data.
Usage
data(aortaDiss_tbl_df)
Format
A tibble with 226 observations and 10 variables:
- Gender
Patient gender (numeric)
- Age
Patient age in years (numeric)
- Age_C
Categorized age (numeric)
- Aortadis
Aortic dissection status (numeric)
- Acute
Acute presentation indicator (numeric)
- Acute3
Three-level acute presentation classification (numeric)
- Stomach_Ache
Presence of stomach ache (numeric)
- Hyper
Hypertension status (numeric)
- Smoking
Smoking status (numeric)
- Radiation
Radiation exposure (numeric)
Details
The dataset name has been kept as 'aortaDiss_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
Data taken from the psfmi package version 1.4.0
FDA Beta Blockers Adverse Events
Description
This dataset, betablockers_matrix, is a matrix containing adverse event reports from the FDA Adverse Event Reporting System (FAERS) for 9 beta blockers from Q1 2021 to Q4 2023. The matrix includes 501 adverse events (rows) across 9 medications (columns).
Usage
data(betablockers_matrix)
Format
A matrix with 501 rows (adverse events) and 9 columns (beta blockers):
- Acebutolol
Adverse event counts for Acebutolol (integer)
- Atenolol
Adverse event counts for Atenolol (integer)
- Bisoprolol
Adverse event counts for Bisoprolol (integer)
- Carvedilol
Adverse event counts for Carvedilol (integer)
- Metoprolol
Adverse event counts for Metoprolol (integer)
- Nadolol
Adverse event counts for Nadolol (integer)
- Propranolol
Adverse event counts for Propranolol (integer)
- Timolol
Adverse event counts for Timolol (integer)
- Other
Adverse event counts for other beta blockers (integer)
Details
The dataset name has been kept as 'betablockers_matrix' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'matrix' indicates that the dataset is a matrix object. The original content has not been modified in any way.
Source
Data taken from the MDDC package version 1.1.0. Original data: FDA Adverse Event Reporting System (FAERS) database, Q1 2021 to Q4 2023.
Anticoagulants for CAD Patients
Description
This dataset, cad_anticoagulants_df, is a data frame containing information from 34 clinical trials examining the effectiveness of oral anticoagulants in patients with coronary artery disease. It includes data on treatment outcomes comparing anticoagulant therapy with control groups.
Usage
data(cad_anticoagulants_df)
Format
A data frame with 34 observations and 9 variables:
- study
Study identifier (character vector)
- year
Year of publication (integer vector)
- intensity
Intensity of anticoagulation treatment (character vector)
- asp.t
Aspirin use in treatment group (integer vector)
- asp.c
Aspirin use in control group (integer vector)
- ai
Number of events in treatment group (integer vector)
- n1i
Total number of participants in treatment group (integer vector)
- ci
Number of events in control group (integer vector)
- n2i
Total number of participants in control group (integer vector)
Details
The dataset name has been kept as 'cad_anticoagulants_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the metadat package version 1.2-0
Heart Failure Clinical Dataset
Description
This dataset, cardiac_failure_df, is a data frame containing clinical data from 299 patients with heart failure. It includes 13 variables covering demographic information, medical history, laboratory results, and mortality outcomes.
Usage
data(cardiac_failure_df)
Format
A data frame with 299 observations and 13 variables:
- age
Patient age in years (numeric)
- anaemia
Presence of anaemia (integer: 0=no, 1=yes)
- creatinine_phosphokinase
Level of CPK enzyme in mcg/L (integer)
- diabetes
Presence of diabetes (integer: 0=no, 1=yes)
- ejection_fraction
Percentage of blood leaving heart (integer)
- high_blood_pressure
Presence of hypertension (integer: 0=no, 1=yes)
- platelets
Platelet count in kiloplatelets/mL (numeric)
- serum_creatinine
Level of serum creatinine in mg/dL (numeric)
- serum_sodium
Level of serum sodium in mEq/L (integer)
- sex
Patient sex (integer: 0=female, 1=male)
- smoking
Smoking status (integer: 0=no, 1=yes)
- time
Follow-up period in days (integer)
- DEATH_EVENT
Death during follow-up (integer: 0=no, 1=yes)
Details
The dataset name has been kept as 'cardiac_failure_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the SOPC package version 0.1.0
Coronary Artery Disease GWAS Meta-Analysis
Description
This dataset, cardiac_gwas_df, is a data frame containing genome-wide association study (GWAS) results from a multi-ethnic meta-analysis of coronary artery disease (CAD). It includes 9,919 genetic variants with their effect sizes and study characteristics.
Usage
data(cardiac_gwas_df)
Format
A data frame with 9,919 observations and 7 variables:
- beta_flipped
Effect size estimates (numeric)
- gcse
Genomic control standard error (numeric)
- variants
Genetic variant identifiers (character)
- studies
Participating studies (character)
- cases
Number of cases (integer)
- controls
Number of controls (integer)
- fdr214_gwas46
False discovery rate adjusted p-values (numeric)
Details
The dataset name has been kept as 'cardiac_gwas_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the getmstatistic package version 0.2.2
Cardiovascular Risk Factors
Description
This dataset, cardioRiskFactors_df, is a data frame containing information from a study investigating the association between uric acid and cardiovascular risk factors in developing countries. It includes data from 998 participants (474 men and 524 women) aged 25-64 years.
Usage
data(cardioRiskFactors_df)
Format
A data frame with 998 observations and 14 variables:
- age
Age in years (integer)
- bmi
Body Mass Index in kg/m² (numeric)
- waisthip
Waist-to-hip ratio (numeric)
- smok
Smoking status (integer)
- choles
Total cholesterol in mg/dL (numeric)
- trig
Triglycerides in mg/dL (numeric)
- hdl
HDL cholesterol in mg/dL (numeric)
- ldl
LDL cholesterol in mg/dL (numeric)
- sys
Systolic blood pressure in mmHg (integer)
- dia
Diastolic blood pressure in mmHg (numeric)
- Uric
Uric acid level in mg/dL (integer)
- sex
Sex (integer)
- alco
Alcohol consumption (numeric)
- apoa
Apolipoprotein A in mg/dL (numeric)
Details
The dataset name has been kept as 'cardioRiskFactors_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the Rfit package version 0.27.0. Original study: Heritier S, Cantoni E, Copt S, Victoria-Feser M (2009). Robust Methods in Biostatistics. New York: John Wiley and Sons.
Cardiovascular risks of diabetes drugs
Description
This dataset, cardio_diabetes_tbl_df, is a tibble containing information comparing cardiovascular problems between two diabetes medications (Rosiglitazone and Pioglitazone) in elderly Medicare patients. It includes data from 227,571 patients.
Usage
data(cardio_diabetes_tbl_df)
Format
A tibble with 227,571 observations and 2 variables:
- treatment
Type of diabetes medication (factor with 2 levels: Rosiglitazone or Pioglitazone)
- cardiovascular_problems
Presence of cardiovascular problems (factor with 2 levels)
Details
The dataset name has been kept as 'cardio_diabetes_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
Data taken from the openintro package version 2.5.0. Original study: Graham DJ, et al. (2010). "Risk of acute myocardial infarction, stroke, heart failure, and death in elderly Medicare patients treated with rosiglitazone or pioglitazone." JAMA, 304(4):411.
Statin Dose Comparison Trials for CVD
Description
This dataset, cardiovascular_list, is a list containing data from 34 clinical trials comparing low dose (1), high dose (2), and placebo (3) statins for cardiovascular disease prevention. The dataset includes study identifiers, treatment assignments, and outcome counts.
Usage
data(cardiovascular_list)
Format
A list with 4 components:
- Study
Study identifiers (integer vector of length 34)
- Treat
Treatment assignments (numeric vector: 1=low dose, 2=high dose, 3=placebo)
- Outcomes
Outcome matrix with 34 rows and 3 columns:
- Alive
Number of patients alive (numeric)
- FnCVD
Number with non-fatal CVD events (numeric)
- FCVD
Number with fatal CVD events (numeric)
- N
Sample sizes (numeric vector of length 34)
Details
The dataset name has been kept as 'cardiovascular_list' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The original content has not been modified in any way.
Source
Data taken from the bnma package version 1.6.0
High vs Moderate Statins for MI Prevention
Description
This dataset, coronary_death_df, is a data frame containing information from 4 clinical trials comparing intensive (high dose) versus moderate (standard dose) statin therapy for preventing coronary death or myocardial infarction. It includes data on treatment outcomes across multiple endpoints.
Usage
data(coronary_death_df)
Format
A data frame with 4 observations and 16 variables:
- trial
Trial identifier (character vector)
- pop
Patient population description (character vector)
- nt
Number of patients in treatment group (integer vector)
- nc
Number of patients in control group (integer vector)
- ep1t
Endpoint 1 events in treatment group (integer vector)
- ep1c
Endpoint 1 events in control group (integer vector)
- ep2t
Endpoint 2 events in treatment group (integer vector)
- ep2c
Endpoint 2 events in control group (integer vector)
- ep3t
Endpoint 3 events in treatment group (integer vector)
- ep3c
Endpoint 3 events in control group (integer vector)
- ep4t
Endpoint 4 events in treatment group (integer vector)
- ep4c
Endpoint 4 events in control group (integer vector)
- ep5t
Endpoint 5 events in treatment group (integer vector)
- ep5c
Endpoint 5 events in control group (integer vector)
- ep6t
Endpoint 6 events in treatment group (integer vector)
- ep6c
Endpoint 6 events in control group (integer vector)
Details
The dataset name has been kept as 'coronary_death_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the metadat package version 1.2-0
Blood thinners in CPR survival
Description
This dataset, cpr_survival_tbl_df, is a tibble containing information from a study examining the effect of blood thinners on survival rates in CPR patients. The study randomly assigned 90 patients to either receive a blood thinner (treatment group) or not receive one (control group), with the outcome being survival for at least 24 hours.
Usage
data(cpr_survival_tbl_df)
Format
A tibble with 90 observations and 2 variables:
- group
Treatment assignment (factor with 2 levels: "control" and "treatment")
- outcome
Survival status (factor with 2 levels: "died" and "survived")
Details
The dataset name has been kept as 'cpr_survival_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
Data taken from the openintro package version 2.5.0
LA pollution and cardiovascular mortality
Description
This dataset, cv_mortality_ts, is a time series containing weekly cardiovascular mortality data from Los Angeles County. It consists of 508 six-day smoothed averages obtained by filtering daily values over the 10-year period from 1970 to 1979.
Usage
data(cv_mortality_ts)
Format
A time series object (ts) with 508 observations:
- cv_mortality
Weekly cardiovascular mortality counts (numeric vector)
Details
The dataset name has been kept as 'cv_mortality_ts' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'ts' indicates that the dataset is a time series object. The original content has not been modified in any way.
Time series characteristics: - Start: 1970, Week 1 - End: 1979, Week 40 - Frequency: 52 (weekly data)
Source
Data taken from the astsa package version 2.2
Anger recall effect on heart rate (Lakens, 2013)
Description
This dataset, emotion_heartrate_df, is a data frame containing heart rate measurements from a study investigating how recalling anger affects heart rate. It includes baseline and anger-induced heart rate measurements from 68 participants.
Usage
data(emotion_heartrate_df)
Format
A data frame with 68 observations and 3 variables:
- ID
Participant identification number (integer vector)
- HR_baseline
Baseline heart rate in beats per minute (numeric vector)
- HR_anger
Heart rate during anger recall in beats per minute (numeric vector)
Details
The dataset name has been kept as 'emotion_heartrate_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the esci package version 1.0-7. Original study: Lakens D (2013). Conceptual replication of Ekman et al. (1983) emotion study.
Artificial Heart Transplant Durations
Description
This dataset, heartTransplantTime_tbl_df, is a tibble containing the durations (in hours) of 15 artificial heart transplant operations.
Usage
data(heartTransplantTime_tbl_df)
Format
A tibble with 15 observations and 1 variable:
- duration
Operation duration in hours (numeric)
Details
The dataset name has been kept as 'heartTransplantTime_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
Data taken from the BSDA package version 1.2.3. Original source: Kitchens LJ (2003). "Basic Statistics and Data Analysis." Pacific Grove, CA: Brooks/Cole, a division of Thomson Learning.
Stanford Heart Transplant Data
Description
This dataset, heart_transplant_df, is a data frame containing survival data from the Stanford heart transplant program. It includes information on 172 patients with follow-up times, transplant status, and clinical covariates.
Usage
data(heart_transplant_df)
Format
A data frame with 172 observations and 8 variables:
- start
Start time of interval (numeric)
- stop
End time of interval (numeric)
- event
Survival status (numeric: 1=event, 0=censored)
- age
Patient age at enrollment (numeric)
- year
Year of enrollment (numeric)
- surgery
Prior bypass surgery (numeric)
- transplant
Transplant status (factor: 0=no, 1=yes)
- id
Patient identification number (numeric)
Details
The dataset name has been kept as 'heart_transplant_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the lrstat package version 0.2.13. Original source: Stanford Heart Transplant Study data from the survival package.
Heart Disease Patients Clinical Data
Description
This dataset, heartdisease_tbl_df, is a tibble containing information on individuals evaluated for heart disease. It is a cleaned version of the original "Heart Disease" dataset from the UCI Machine Learning Repository, and includes 303 observations on 9 variables.
Usage
data(heartdisease_tbl_df)
Format
A tibble with 303 observations and 9 variables:
- Age
Age of the individual (numeric).
- Sex
Sex of the individual (factor with 2 levels: typically "Male" and "Female").
- ChestPain
Type of chest pain experienced (factor with 4 levels).
- BP
Resting blood pressure (numeric).
- Cholesterol
Serum cholesterol in mg/dl (numeric).
- BloodSugar
Indicates if fasting blood sugar > 120 mg/dl (logical).
- MaximumHR
Maximum heart rate achieved (numeric).
- ExerciseInducedAngina
Exercise-induced angina (factor with 2 levels).
- HeartDisease
Presence or absence of heart disease (factor with 2 levels).
Details
The dataset name has been kept as 'heartdisease_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
Data taken from the cheese package version 0.1.2. Original source: UCI Machine Learning Repository. Heart Disease Data Set. https://archive.ics.uci.edu/ml/datasets/Heart+Disease
Heart Disease Risk Factors
Description
This dataset, heartdiseaserisk_tbl_df, is a tibble containing cardiovascular risk factor data from 498 individuals. It includes measures of physical activity (biking), smoking habits, and heart disease prevalence.
Usage
data(heartdiseaserisk_tbl_df)
Format
A tibble with 498 observations and 3 variables:
- Biking
Frequency of biking activity (numeric)
- Heart.disease
Prevalence of heart disease (numeric)
- Smoking
Smoking frequency or intensity (numeric)
Details
The dataset name has been kept as 'heartdiseaserisk_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
Data taken from the Path.Analysis package version 0.1
Heart Failure rehospitalization risk
Description
This dataset, heartfailure_df, is a data frame containing simulated data from 800 patients with heart failure who are at risk of recurrent hospitalization. The dataset includes 3,068 observations (2,268 events) tracking patient outcomes over time.
Usage
data(heartfailure_df)
Format
A data frame with 3,068 observations and 6 variables:
- id
Patient identification number (integer vector)
- treatment
Treatment assignment (factor with 2 levels)
- t0
Start time of observation period (numeric vector)
- t1
End time of observation period (numeric vector)
- enum
Event number (numeric vector)
- event
Event indicator (numeric vector)
Details
The dataset name has been kept as 'heartfailure_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the survPen package version 2.0-2. Based on hfaction_cpx12 dataset from package WA.
Statins for Heart Failure Prevention
Description
This dataset, hfPrevention_mtc_network, contains network meta-analysis data from 19 trials comparing statins versus placebo or usual care for cholesterol lowering in heart failure. The main outcome measured is the number of deaths. Trials are categorized as either primary prevention (no previous heart disease) or secondary prevention (previous heart disease).
Usage
data(hfPrevention_mtc_network)
Format
An 'mtc.network' object (list) with 4 components:
- description
Character string describing the analysis: "Cholesterol lowering in HF (outcome: death)"
- treatments
Data frame with 2 treatments:
- id
Treatment ID (factor with 2 levels)
- description
Treatment description (character vector)
- data.ab
Data frame with 38 rows (arm-level data):
- study
Study ID (factor with 19 levels)
- treatment
Treatment assignment (factor with 2 levels)
- responders
Number of deaths (integer vector)
- sampleSize
Total sample size per arm (integer vector)
- studies
Data frame with 19 rows (study-level data):
- study
Study ID (factor with 19 levels)
- secondary
Prevention type: 0 = primary, 1 = secondary (integer vector)
Details
The dataset name has been kept as 'hfPrevention_mtc_network' to maintain consistency with its original source and to avoid confusion with other datasets. This naming convention helps identify this specific network meta-analysis dataset from the CardioDataSets package. The dataset is structured as an 'mtc.network' object, which is the standard format for network meta-analysis in the gemtc package. The original content has not been modified.
Source
Data taken from the gemtc package version 1.0-2. Original publication: Dias S, Sutton AJ, Welton NJ, Ades AE (2013). "Heterogeneity - Subgroups, Meta-Regression, Bias, and Bias-Adjustment." Medical Decision Making, 33(5):618-640.
Elderly CV/MRI and Biomarkers
Description
This dataset, mriCardioVars_tbl_df, is a tibble containing MRI and clinical data from 735 elderly participants in a U.S. observational study of cardiovascular and cerebrovascular disease incidence. It includes 30 variables covering demographic, clinical, and imaging measures.
Usage
data(mriCardioVars_tbl_df)
Format
A tibble with 735 observations and 30 variables:
- ptid
Patient identification number (numeric)
- mridate
MRI date (Date)
- age
Age in years (numeric)
- sex
Sex (character)
- race
Race (character)
- weight
Weight in kg (numeric)
- height
Height in cm (numeric)
- packyrs
Smoking pack-years (numeric)
- yrsquit
Years since quitting smoking (numeric)
- alcoh
Alcohol consumption (numeric)
- physact
Physical activity level (numeric)
- chf
Congestive heart failure status (numeric)
- chd
Coronary heart disease status (numeric)
- stroke
Stroke history (numeric)
- diabetes
Diabetes status (numeric)
- genhlth
General health status (numeric)
- ldl
LDL cholesterol in mg/dL (numeric)
- alb
Albumin level (numeric)
- crt
Creatinine level (numeric)
- plt
Platelet count (numeric)
- sbp
Systolic blood pressure in mmHg (numeric)
- aai
Ankle-arm index (numeric)
- fev
Forced expiratory volume (numeric)
- dsst
Digit Symbol Substitution Test score (numeric)
- atrophy
Brain atrophy measure (numeric)
- whgrd
White matter hyperintensity grade (numeric)
- numinf
Number of brain infarcts (numeric)
- volinf
Volume of brain infarcts (numeric)
- obstime
Observation time (numeric)
- death
Mortality status (numeric)
Details
The dataset name has been kept as 'mriCardioVars_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
Data taken from the rigr package version 1.0.7
Muscatine pediatric CRF
Description
This dataset, muscatine_coronary_risk_df, is a data frame containing longitudinal observations from the Muscatine Coronary Risk Factor (MCRF) study, which examined the development of coronary disease risk factors in children. It includes 14,568 observations of 4,856 children tracked from 1977 to 1981.
Usage
data(muscatine_coronary_risk_df)
Format
A data frame with 14,568 observations and 7 variables:
- id
Child identification number (integer)
- gender
Gender of child (factor with 2 levels)
- base_age
Age at first observation in years (integer)
- age
Current age in years (integer)
- occasion
Measurement occasion (integer)
- obese
Obesity status (factor with 2 levels)
- numobese
Numeric obesity indicator (numeric)
Details
The dataset name has been kept as 'muscatine_coronary_risk_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the geepack package version 1.3.12. Original study: The Muscatine Coronary Risk Factor Study, University of Iowa, 1977-1981.
Streptokinase Therapy in AMI
Description
This dataset, myocardialinfarction_df, is a data frame containing information from 33 clinical trials comparing intravenous streptokinase versus placebo or no therapy in patients hospitalized for acute myocardial infarction. It includes data on treatment outcomes between intervention and control groups.
Usage
data(myocardialinfarction_df)
Format
A data frame with 33 observations and 6 variables:
- trial
Trial identifier (character vector)
- year
Year of publication (integer vector)
- ai
Number of events in treatment group (integer vector)
- n1i
Total number of participants in treatment group (integer vector)
- ci
Number of events in control group (integer vector)
- n2i
Total number of participants in control group (integer vector)
Details
The dataset name has been kept as 'myocardialinfarction_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the metadat package version 1.2-0. Original publication: Lau J, Antman EM, Jimenez-Silva J, Kupelnick B, Mosteller F, Chalmers TC (1992). "Cumulative meta-analysis of therapeutic trials for myocardial infarction." New England Journal of Medicine, 327(4):248-254.
CAV in Heart Transplant Patients
Description
This dataset, patient_CAV_df, is a data frame containing longitudinal follow-up data from heart transplant recipients at Papworth Hospital, UK. It tracks 2,803 angiographic examinations for the onset of cardiac allograft vasculopathy and mortality.
Usage
data(patient_CAV_df)
Format
A data frame with 2,803 observations and 5 variables:
- PTNUM
Patient identification number (integer)
- years
Time since transplant in years (numeric)
- state
Disease state (numeric)
- dage
Donor age in years (integer)
- pdiag
Primary diagnosis code (numeric)
Details
The dataset name has been kept as 'patient_CAV_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the flexmsm package version 0.1.2. Original data: Papworth Hospital, UK. Subset of cav data from msm package.
Radial Artery IVUS Patient Data
Description
This dataset, radial_ivus_df, is a data frame containing demographic and clinical data from 115 patients who underwent intravascular ultrasound (IVUS) examination of the radial artery following transradial coronary angiography. It includes 15 variables covering patient characteristics, laboratory results, and IVUS measurements.
Usage
data(radial_ivus_df)
Format
A data frame with 115 observations and 15 variables:
- male
Male sex indicator (integer: 0/1)
- age
Age in years (integer)
- height
Height in cm (numeric)
- weight
Weight in kg (numeric)
- HBP
High blood pressure status (integer: 0/1)
- DM
Diabetes mellitus status (integer: 0/1)
- smoking
Smoking status (factor with 3 levels)
- TC
Total cholesterol in mg/dL (integer)
- TG
Triglycerides in mg/dL (integer)
- HDL
HDL cholesterol in mg/dL (integer)
- LDL
LDL cholesterol in mg/dL (integer)
- hsCRP
High-sensitivity C-reactive protein in mg/L (numeric)
- NTAV
Normalized total atheroma volume (numeric)
- PAV
Percent atheroma volume (numeric)
- sex
Sex (factor with 2 levels)
Details
The dataset name has been kept as 'radial_ivus_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the moonBook package version 0.3.1
Scottish Health Survey CVD
Description
This dataset, scottish_CVD_df, is a data frame containing cardiovascular health data from the 1998 Scottish Health Survey. It includes information from 8,804 respondents aged 18-64, with variables covering demographics, health behaviors, and cardiovascular disease status.
Usage
data(scottish_CVD_df)
Format
A data frame with 8,804 observations and 8 variables:
- age
Respondent age in years (integer)
- sex
Respondent sex (factor with 2 levels)
- sc
Social class (factor with 3 levels)
- cvddef
Doctor-diagnosed CVD status (integer: 0=no, 1=yes)
- carstair
Carstairs deprivation score (numeric)
- smoke
Smoking status (factor with 5 levels)
- id
Respondent identification number (integer)
- area
Geographic area code (integer)
Details
The dataset name has been kept as 'scottish_CVD_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the R2MLwiN package version 0.8-9. Original survey: 1998 Scottish Health Survey. Methodology reference: Charlton C, Rasbash J, Browne WJ, Healy M, Cameron B (2024). MLwiN Version 3.09. Centre for Multilevel Modelling, University of Bristol.
Statin intensity and MI risk
Description
This dataset, statinMIrisk_df, is a data frame containing results from 4 clinical trials investigating the effect of statin therapy intensity on the risk of myocardial infarction or coronary death. The data compares intensive versus standard statin regimens.
Usage
data(statinMIrisk_df)
Format
A data frame with 4 observations and 5 variables:
- study
Study identifier (character)
- eI
Number of events in intensive treatment group (numeric)
- nI
Total patients in intensive treatment group (numeric)
- eC
Number of events in control/standard group (numeric)
- nC
Total patients in control/standard group (numeric)
Details
The dataset name has been kept as 'statinMIrisk_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the RTSA package version 0.2.2
Sulphinpyrazone for post-MI death prevention
Description
This dataset, sulphinpyrazone_tbl_df, is a tibble containing information from a clinical trial studying the efficacy of sulphinpyrazone in preventing sudden death after myocardial infarction. The data includes 1,475 patients randomly assigned to either the treatment or control group.
Usage
data(sulphinpyrazone_tbl_df)
Format
A tibble with 1,475 observations and 2 variables:
- group
Treatment assignment (factor with 2 levels: "control" and "treatment")
- outcome
Patient outcome (factor with 2 levels)
Details
The dataset name has been kept as 'sulphinpyrazone_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
Data taken from the openintro package version 2.5.0. Original study: Anturane Reinfarction Trial Research Group (1980). "Sulfinpyrazone in the prevention of sudden death after myocardial infarction." New England Journal of Medicine, 302(5):250-256.
US Mortality Rates by Cause and Gender
Description
This dataset, usMortality_df, is a data frame containing mortality rates across all ages in the USA from 2011-2013, stratified by cause of death, sex, and rural/urban status. It includes national aggregate rates for 10 causes of death, including Heart disease.
Usage
data(usMortality_df)
Format
A data frame with 40 observations and 5 variables:
- Status
Residential status (factor: Rural/Urban)
- Sex
Gender (factor: Male/Female)
- Cause
Cause of death (factor with 10 levels)
- Rate
Mortality rate per 100,000 population (numeric)
- SE
Standard error of mortality rate (numeric)
Details
The dataset name has been kept as 'usMortality_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the CardioDataSets package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a standard data frame. The original content has not been modified in any way.
Source
Data taken from the lattice package version 0.22-6. Original source: Rural Health Reform Policy Research Center (2015). "Exploring Rural and Urban Mortality Differences." Bethesda, MD: August 2015.
View Available Datasets in CardioDataSets
Description
This function lists all datasets available in the 'CardioDataSets' package. If the 'CardioDataSets' package is not loaded, it stops and shows an error message. If no datasets are available, it returns a message and an empty vector.
Usage
view_datasets()
Value
A character vector with the names of the available datasets. If no datasets are found, it returns an empty character vector.
Examples
if (requireNamespace("CardioDataSets", quietly = TRUE)) {
library(CardioDataSets)
view_datasets()
}