Type: | Package |
Version: | 1.1 |
Date: | 2025-02-07 |
Title: | Datasets to Accompany Wolfe and Schneider - Intuitive Introductory Statistics |
Description: | These datasets and functions accompany Wolfe and Schneider (2017) - Intuitive Introductory Statistics (ISBN: 978-3-319-56070-0) <doi:10.1007/978-3-319-56072-4>. They are used in the examples throughout the text and in the end-of-chapter exercises. The datasets are meant to cover a broad range of topics in order to appeal to the diverse set of interests and backgrounds typically present in an introductory Statistics class. |
Maintainer: | Grant Schneider <gschneidz@gmail.com> |
License: | GPL-2 |
LazyLoad: | yes |
Depends: | R (≥ 2.10) |
Imports: | asbio, BSDA, Hmisc, NSM3, Rfit |
RoxygenNote: | 6.0.1 |
NeedsCompilation: | no |
Packaged: | 2025-02-10 23:17:39 UTC; grant |
Repository: | CRAN |
Date/Publication: | 2025-02-11 16:50:05 UTC |
Author: | Grant Schneider [aut, cre] |
Datasets and functions to accompany Wolfe and Schneider - Intuitive Introductory Statistics (2017)
Description
This package contains the R datasets and functions referenced in the text and exercises of Wolfe and Schneider - Intuitive Introductory Statistics (2017)
Author(s)
Grant Schneider
Maintainer: Grant Schneider <gschneidz@gmail.com>
References
Wolfe and Schneider - Intuitive Introductory Statistics (2017)
Find "triples" of vector elements.
Description
FindTriples returns the number of left triples, right triples, and triples that are neither left nor right for the given vector.
Usage
FindTriples(z)
Arguments
z | Numeric vector |
Details
Triples are formed by taking any three data values from a vector and ordering them from smallest to largest. If the middle ordered value is closer to the smallest than to the largest, the triple is said to be a right triple. If the middle ordered value is closer to the largest, the triple is said to be a left triple. If the middle ordered value is exactly halfway between the other two, the triple is neither right nor left. This function does the necessary computations and returns the number of each type of triple.
Value
A list with three components:
- right: Number of right triples (where middle value is closer to smallest)
- left: Number of left triples (where middle value is closer to largest)
- neither: Number of triples that are neither left nor right (where middle value is exactly halfway between the other two)
Examples
FindTriples(1:10)
FindTriples(rnorm(20))
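The definition in Details can be sketched directly in base R. The function below is an illustrative reimplementation, not the package's own code: the name count_triples_sketch is hypothetical, and it simply enumerates every three-element subset, so it is only practical for short vectors. It compares the two gaps with exact equality, which may misclassify "neither" triples when the values are non-integer floating-point numbers.

```r
# Hypothetical helper: counts right, left, and neither triples by brute force.
count_triples_sketch <- function(z) {
  stopifnot(is.numeric(z), length(z) >= 3)
  # Each column holds one sorted triple: smallest, middle, largest.
  trip <- combn(z, 3, FUN = sort)
  lower_gap <- trip[2, ] - trip[1, ]  # middle minus smallest
  upper_gap <- trip[3, ] - trip[2, ]  # largest minus middle
  list(
    right   = sum(lower_gap < upper_gap),   # middle closer to smallest
    left    = sum(lower_gap > upper_gap),   # middle closer to largest
    neither = sum(lower_gap == upper_gap)   # middle exactly halfway
  )
}

res <- count_triples_sketch(1:10)
print(res)
```

For an evenly spaced vector such as 1:10 the right and left counts agree by symmetry, and the three counts always sum to choose(length(z), 3).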
Agricultural Chargeoff Rates by Quarter
Description
This data set contains twenty-five years (1991 through 2015) of quarterly charge-off rates for agricultural loans as reported by the Federal Reserve. This is the full dataset corresponding to the selected rows displayed in Table 2.5 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
agricultural_chargeoff_rates_by_quarter
Format
A data.frame with 100 rows and 3 columns indicating the chargeoff rates for agricultural loans for each quarter.
Source
https://www.federalreserve.gov/releases/chargeoff/chgallnsa.htm
Examples
data(agricultural_chargeoff_rates_by_quarter)
summary(agricultural_chargeoff_rates_by_quarter)
Airline Arrivals
Description
This data set contains on-time arrival records for U.S. flight carriers for all four quarters of the year 2015, the month of December 2015, and the entire year 2015. This data is displayed in Table 2.9 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
airline_arrivals
Format
A data.frame with 13 rows and 13 columns indicating the percentages and ranks for each airline.
Source
U.S. Department of Transportation (2016)
Examples
data(airline_arrivals)
summary(airline_arrivals)
American League Salary 2014
Description
This data set contains the 2014 salaries (as of March 26, 2014) for all baseball players in the American League.
Usage
american_league_salary_2014
Format
A data.frame with 447 rows and 3 columns corresponding to the name, salary, and team for each player.
Source
Petchesky (2014)
Examples
data(american_league_salary_2014)
summary(american_league_salary_2014)
Arion Subfuscus
Description
This data set contains the Acceptability Indices (AI) for Arion subfuscus from woodland and waste sites with the toxic woodland plant Allium ursinum (wild garlic) as test gel as reported by Whelan (1982). This data is displayed in Table 9.3 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
arion_subfuscus
Format
A data.frame with 10 rows and 2 columns indicating the AI for the woodland and waste sites.
Source
Whelan (1982)
Examples
data(arion_subfuscus)
summary(arion_subfuscus)
Average HDL Levels
Description
This data set contains the averages of two HDL measurements taken 24 hours apart for each of twelve women who participated in Kerr (1983). This data is displayed in Table 7.1 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
average_HDL_levels
Format
A vector with 12 observations corresponding to the average HDL measurements for each woman.
Source
Kerr (1983)
Examples
data(average_HDL_levels)
summary(average_HDL_levels)
Beer Head
Description
This data set contains the height of the initial head formation upon pouring, the percentage adhesion of the head, and the percentage collapse of the head 4 min after pouring for 20 bottles selected from two different production lots of beer as reported by Ault et al. (1967).
Usage
beer_head
Format
A list containing 2 data.frames (one for each of the two production lots). Each data.frame has 20 rows and 3 columns corresponding to the attributes for each bottle.
Source
Ault et al. (1967)
Examples
data(beer_head)
summary(beer_head)
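Because the Format above describes a list of data frames, calling summary() on the list itself reports only the length, class, and mode of each element. A minimal sketch of summarising each element separately is shown below; the list toy, its element names, its column names, and all of its values are invented for illustration and are not taken from beer_head.

```r
# Stand-in for a list of data frames; names and values are made up
# for illustration and are NOT the beer_head data.
toy <- list(
  lot1 = data.frame(height = c(1, 2, 3), adhesion = c(10, 20, 30)),
  lot2 = data.frame(height = c(4, 5, 6), adhesion = c(40, 50, 60))
)

# summary() on the list describes the container (Length, Class, Mode).
container_summary <- summary(toy)

# lapply() runs summary() on each data frame in turn.
per_lot <- lapply(toy, summary)
print(per_lot$lot1)

# Column means per element, as a matrix with one column per list element.
lot_means <- sapply(toy, colMeans)
print(lot_means)
```

The same lapply() pattern should apply to any list-valued data set, whatever its element names turn out to be.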
Body Temperature and Heart Rate
Description
This data set contains body temperature values artificially generated by Shoemaker (1996) to closely recreate the original data obtained by Mackowiak et al. (1992) for 65 male and 65 female subjects.
Usage
body_temperature_and_heart_rate
Format
A data.frame with 130 rows and 3 columns corresponding to the body temperature, gender (1 for male, 2 for female), and heart rate for each subject.
Source
Shoemaker (1996)
Examples
data(body_temperature_and_heart_rate)
summary(body_temperature_and_heart_rate)
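Since gender is stored as the numeric codes 1 and 2, summary() treats it as a number. One way to make the coding explicit is to convert the codes to a labelled factor. The sketch below uses a short made-up vector of codes, gender_code, rather than the actual column, whose name is not stated here.

```r
# Made-up codes standing in for the gender column (1 = male, 2 = female).
gender_code <- c(1, 2, 2, 1, 2)

# Map the numeric codes to labels using the coding given in Format.
gender <- factor(gender_code, levels = c(1, 2), labels = c("male", "female"))

# Counts per category instead of a numeric five-number summary.
print(table(gender))
```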
Chargeoff Rates
Description
This data set contains twenty-five years (1991 through 2015) of quarterly charge-off rates for eight different types of loans as reported by the Federal Reserve. This data is displayed in Table 2.4 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
chargeoff_rates
Format
A data.frame with 100 rows and 9 columns indicating the chargeoff rates for each of the eight loan types for each quarter.
Source
https://www.federalreserve.gov/releases/chargeoff/chgallnsa.htm
Examples
data(chargeoff_rates)
summary(chargeoff_rates)
College Rankings 2012
Description
This data set contains a subset of the College Scorecard Data reported by the U.S. Department of Education.
Usage
college_rankings_2012
Format
A data.frame with 7793 rows and 9 columns indicating various scorecard metrics for each school.
Source
United States Department of Education (2016)
Examples
data(college_rankings_2012)
summary(college_rankings_2012)
Delinquency Rates
Description
This data set contains twenty-five years (1991 through 2015) of quarterly delinquency rates for eight different types of loans as reported by the Federal Reserve.
Usage
delinquency_rates
Format
A data.frame with 100 rows and 9 columns indicating the delinquency rates for each of the eight loan types for each quarter.
Source
https://www.federalreserve.gov/releases/chargeoff/delallnsa.htm
Examples
data(delinquency_rates)
summary(delinquency_rates)
Desimipramine
Description
This data set contains the body weight of 12 subjects with anorexia nervosa before and after treatment with the medication desimipramine as reported by Brambilla et al. (1985). This data is displayed in Table 8.3 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
desimipramine
Format
A data.frame with 12 rows and 2 columns indicating the pre-treatment and post-treatment weights (in kg) of each subject.
Source
Brambilla et al. (1985)
Examples
data(desimipramine)
summary(desimipramine)
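For before-and-after data of this kind, the per-subject change is usually more informative than the two column summaries. The sketch below assumes a two-column data frame with the pre-treatment column first and the post-treatment column second, as the Format line suggests. The helper paired_change is hypothetical, and the data frame toy holds made-up weights, not the study data.

```r
# Hypothetical helper: post-treatment minus pre-treatment for each row,
# assuming column 1 is "before" and column 2 is "after".
paired_change <- function(d) {
  stopifnot(is.data.frame(d), ncol(d) == 2)
  d[[2]] - d[[1]]
}

# Made-up weights (kg) for illustration only.
toy <- data.frame(pre = c(50, 52, 48), post = c(51, 52, 50))

change <- paired_change(toy)
print(change)        # per-subject weight change
print(mean(change))  # average change across subjects
```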
Diamonds Carats Color Cost
Description
This data set contains weight in carats, color purity, grade of clarity, certification body, and value in Singapore dollars for 308 round diamond stones from an advertisement in the February 18, 2000 edition of Singapore's Business Times as discussed in Chu (2001).
Usage
diamonds_carats_color_cost
Format
A data.frame with 308 rows and 5 columns indicating the attributes of each diamond.
Source
Chu (2001)
Examples
data(diamonds_carats_color_cost)
summary(diamonds_carats_color_cost)
Emissions
Description
This data set contains annual emissions data (in million metric tons of carbon dioxide equivalents) for four common air pollutants from 1990 to 2014 in the United States. This data is displayed in Table 2.3 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
emissions
Format
A data.frame with 25 rows and 5 columns indicating the amount of the four pollutants for each year.
Source
U.S. Environmental Protection Agency (2016)
Examples
data(emissions)
summary(emissions)
Engineering Drawing Hours
Description
This data set contains the total engineering drawing hours contributing to the cost of pieces of machinery/mechanical devices for a large Ohio-based company. This data is displayed in Table 1.3 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
engineering_drawing_hours
Format
A vector with 96 observations corresponding to the hours for each piece of machinery/mechanical device.
Source
Ohio-based company
Examples
data(engineering_drawing_hours)
summary(engineering_drawing_hours)
Female Amerindians
Description
This data set contains the stature of 20 female Amerindians from the Turner site in Cincinnati, Ohio, representing the Ohio Hopewell period (200-400 AD) studied by Sciulli and Carlisle (1975). This data is displayed in Table 7.8 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
female_amerindians
Format
A vector with 20 observations corresponding to the stature (in centimeters) for each female Amerindian.
Source
Sciulli and Carlisle (1975)
Examples
data(female_amerindians)
summary(female_amerindians)
FMR White-Tailed Deer
Description
This data set contains data on the fasting metabolic rate (FMR) of white-tailed deer for different two-month periods of the year. This data is displayed in Table 12.9 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
fmr_white_tailed_deer
Format
A list with 3 vectors, each containing FMR (kcal/kg/day) for the deer observed in the corresponding time period.
Source
Silver et al. (1969)
Examples
data(fmr_white_tailed_deer)
summary(fmr_white_tailed_deer)
Gender Roles
Description
This data set contains counts, by gender, of the 7685 individuals for whom gender could be identified, classified into seven role categories. This data was used by Vigorito and Curry (1998) to investigate whether there is any relationship between gender and the roles that individuals play in magazine ad illustrations. This data is displayed in Table 10.12 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
gender_roles
Format
A data.frame with 7 rows and 2 columns indicating the counts for each gender and role category.
Source
Vigorito and Curry (1998)
Examples
data(gender_roles)
summary(gender_roles)
Goggled Green Turtles
Description
This data set contains the number of times that each of 36 turtle hatchlings, split into two groups, 'circled'. The study, reported by Mrosovsky and Shettleworth (1974), attempted to determine whether light had an effect on the orientation and sea-finding ability of such hatchlings. This data is displayed in Table 9.6 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
goggled_green_turtles
Format
A data.frame with 18 rows and 2 columns indicating the number of circles in a two-minute period for the nasal field goggles and for the harlequin goggles.
Source
Mrosovsky and Shettleworth (1974)
Examples
data(goggled_green_turtles)
summary(goggled_green_turtles)
Health Care by Affiliation
Description
This data set contains counts of respondents' preferences between two statements about healthcare grouped by political party from the report "Generation to Generation: American Values about Taking Care of Each Other" (1998). This data is displayed in Table 10.20 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
health_care_by_affiliation
Format
A data.frame with 2 rows and 2 columns indicating the counts for each statement and each party.
Source
Princeton Survey Research Associates of Princeton, New Jersey (1998)
Examples
data(health_care_by_affiliation)
summary(health_care_by_affiliation)
Homes Prices
Description
This data set contains the information about house prices for a random sample of 100 properties in Wake County, North Carolina, as collected by Woodard and Leone (2008).
Usage
homes_prices
Format
A data.frame with 100 rows and 6 columns indicating the attributes of each house.
Source
Woodard and Leone (2008)
Examples
data(homes_prices)
summary(homes_prices)
House Lot Sizes
Description
This data set contains the information about house and lot sizes for a random sample of 100 properties in Wake County, North Carolina, as collected by Woodard and Leone (2008).
Usage
house_lot_sizes
Format
A data.frame with 100 rows and 8 columns indicating the attributes of each house.
Source
Woodard and Leone (2008)
Examples
data(house_lot_sizes)
summary(house_lot_sizes)
Infant Walking
Description
This data set contains the ages at which 12 infants were reported by their mothers to have started walking. The infants were randomly split into "exercise" and "no-exercise" groups as part of the study conducted by Zelazo et al. (1972). This data is displayed in Table 9.8 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
infant_walking
Format
A data.frame with 6 rows and 2 columns indicating the age (in months) at which infants first walked for the exercise and no-exercise groups.
Source
Zelazo et al. (1972)
Examples
data(infant_walking)
summary(infant_walking)
Interstitial Lengths
Description
This data set contains the averages (mm) of interstitial lengths (distances between midpoints) of ten pairs of secondary branches for each of twelve blue spruce and twelve white pine trees. This data is displayed in Table 1.24 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
interstitial_lengths
Format
A data.frame with 12 rows and 2 columns corresponding to 12 observations each of blue spruce and white pine trees.
Source
Kayle (1984)
Examples
data(interstitial_lengths)
summary(interstitial_lengths)
Kentucky Derby 2012
Description
This data set contains race statistics for the Kentucky Derby for each year from 1990 to 2012. This data is displayed in Table 2.7 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
kentucky_derby_2012
Format
A data.frame with 23 rows and 6 columns indicating various statistics for each year.
Source
Examples
data(kentucky_derby_2012)
summary(kentucky_derby_2012)
Meniscal Repairs Load at Failure
Description
This data set contains the load at failure for 18 cadaveric menisci repaired by one of three techniques: the FasT-Fix Meniscal Repair Suture System (FasT-Fix), the use of biodegradable Meniscus Arrows (MA), and the Vertical Mattress Sutures (VMS) approach. This data is displayed in Table 12.1 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
meniscal_repairs_load_at_failure
Format
A list with 3 vectors, each containing the load (in Newtons) at failure of the 6 menisci assigned to the corresponding group.
Source
Borden et al. (2003)
Examples
data(meniscal_repairs_load_at_failure)
summary(meniscal_repairs_load_at_failure)
Mother Smoking Age
Description
This data set contains the percentages of mothers in the United States who smoked during pregnancies from 2010 to 2014 for various age categories. This data is displayed in Table 2.14 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
mother_smoking_age
Format
A data.frame with 45 rows and 4 columns indicating the age group, number of births, and percentage of smoking mothers for each year.
Source
Centers for Disease Control and Prevention (CDC) "WONDER Online Database" (2016)
Examples
data(mother_smoking_age)
summary(mother_smoking_age)
Mother Smoking Education
Description
This data set contains the percentages of mothers in the United States from 2010 to 2014 who smoked during pregnancy for various categories of educational level. A subset of this data is displayed in Table 2.13 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
mother_smoking_education
Format
A data.frame with 40 rows and 4 columns indicating the education, births, and percentage of smoking mothers for each year.
Source
Centers for Disease Control and Prevention (CDC) "WONDER Online Database" (2016)
Examples
data(mother_smoking_education)
summary(mother_smoking_education)
Mother Smoking Education 1989-1993
Description
This data set contains the percentages of mothers in the United States from 1989 to 1993 who smoked during pregnancy for various categories of educational level. This data is displayed in Table 2.15 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
mother_smoking_education_1989_1993
Format
A data.frame with 25 rows and 3 columns indicating the education and percentage of smoking mothers for each year.
Source
Centers for Disease Control and Prevention, National Center for Health Statistics (1995)
Examples
data(mother_smoking_education_1989_1993)
summary(mother_smoking_education_1989_1993)
Mother Smoking Education 2010
Description
This data set contains the percentages of mothers in the United States in 2010 who smoked during pregnancy for various categories of educational level. This is a subset of the mother_smoking_education dataset.
Usage
mother_smoking_education_2010
Format
A data.frame with 8 rows and 2 columns indicating the percentage of smoking mothers for each educational level in the year 2010.
Source
Centers for Disease Control and Prevention (CDC) "WONDER Online Database" (2016)
Examples
data(mother_smoking_education_2010)
summary(mother_smoking_education_2010)
Motor Vehicle Death Rate 2012
Description
This data set contains the total motor-vehicle fatalities per 100 million vehicle miles traveled in the 2012 calendar year. This data is displayed in Table 1.16 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
motor_vehicle_death_rate_2012
Format
A data.frame with 50 rows and 3 columns corresponding to the name, fatality rate, and region for each state.
Source
National Highway Traffic Safety Administration (2013)
Examples
data(motor_vehicle_death_rate_2012)
summary(motor_vehicle_death_rate_2012)
Movie Facts
Description
This data set contains a random sample of 100 movies from the 1996 edition of The Movie and Video Guide prepared annually by Leonard Maltin as selected by Moore (2006).
Usage
movie_facts
Format
A data.frame with 100 rows and 6 columns corresponding to the title, release year, length (minutes), number of cast members, rating, and number of lines of description for each movie.
Source
Moore (2006)
Examples
data(movie_facts)
summary(movie_facts)
National League Salary 2014
Description
This data set contains the 2014 salaries (as of March 26, 2014) for all baseball players in the National League.
Usage
national_league_salary_2014
Format
A data.frame with 437 rows and 3 columns corresponding to the name, salary, and team for each player.
Source
Petchesky (2014)
Examples
data(national_league_salary_2014)
summary(national_league_salary_2014)
NBA 2015-2016
Description
This data set contains NBA teams' performance in the 2015-2016 season as provided by the NBA's official website.
Usage
nba_2015_2016
Format
A data.frame with 30 rows and 19 columns indicating various performance statistics for each team.
Source
https://www.nba.com/stats/teams/traditional?Season=2015-16
Examples
data(nba_2015_2016)
summary(nba_2015_2016)
OSU Math Salaries 2015
Description
This data set contains 2015 salaries of faculty and staff in the Mathematics Department at The Ohio State University. This data is displayed in Table 3.1 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
osu_math_salaries_2015
Format
A vector with 226 observations corresponding to each reported salary.
Source
Cleveland.com (2016)
Examples
data(osu_math_salaries_2015)
summary(osu_math_salaries_2015)
Pennies' Age
Description
This data set contains the ages (based on minting dates) of 374 United States pennies collected by an introductory class of 25 students.
Usage
pennies_age
Format
A vector with 374 observations corresponding to the ages of the pennies.
Source
Classroom Experiment
Examples
data(pennies_age)
summary(pennies_age)
Percentage Hatched Eggs
Description
This data set contains the percentages of eggs that eventually hatched for dump (communal) nests and normal nests each year from 1966 through 1974 as reported by Clawson et al. (1979). This data is displayed in Table 8.1 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
percentage_hatched_eggs
Format
A data.frame with 9 rows and 2 columns indicating the percentage of hatched eggs for the two nest types for each year.
Source
Clawson et al. (1979)
Examples
data(percentage_hatched_eggs)
summary(percentage_hatched_eggs)
Pew Science Survey Data By Age Group
Description
This data set contains the percentages of people who answered "yes" to various questions about scientific topics in a survey conducted by the Pew Research Center grouped by age group. This data is displayed in Table 2.18 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
pew_science_survey_data_by_age_group
Format
A data.frame with 5 rows and 4 columns indicating the proportion of respondents answering "yes" to each of the four questions indicated in Exercise 2.3.4 of Wolfe and Schneider - Intuitive Introductory Statistics by age group.
Source
Pew Internet & American Life Project, Science Issues (2014)
Examples
data(pew_science_survey_data_by_age_group)
summary(pew_science_survey_data_by_age_group)
Pew Science Survey Data By Party
Description
This data set contains the percentages of people who answered "yes" to various questions about scientific topics in a survey conducted by the Pew Research Center grouped by political affiliation. This data is displayed in Table 2.17 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
pew_science_survey_data_by_party
Format
A data.frame with 3 rows and 4 columns indicating the proportion of respondents answering "yes" to each of the four questions indicated in Exercise 2.3.4 of Wolfe and Schneider - Intuitive Introductory Statistics by party.
Source
Pew Internet & American Life Project, Science Issues (2014)
Examples
data(pew_science_survey_data_by_party)
summary(pew_science_survey_data_by_party)
Pines 1997
Description
This data set contains a subset of the data collected by biology students at Kenyon College to determine which factors cause pines in the Kenyon Center for Environmental Study to vary in growth rates. A description of the dataset is displayed in Table 2.2 and Example 3.4 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
pines_1997
Format
A data.frame with 1000 rows and 5 columns corresponding to variables described in Table 2.2 of Intuitive Introductory Statistics for each tree.
Source
Kenyon Center for Environmental Study (1997)
Examples
data(pines_1997)
summary(pines_1997)
PMN Migration
Description
This data set contains the polymorphonuclear leukocytes (PMN) migration in the sera of the 8 patients with acute pancreatitis measured by Perez et al. (1983). This data is displayed in Table 7.4 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
pmn_migration
Format
A vector with 8 observations corresponding to the PMN migration for each patient.
Source
Perez et al. (1983)
Examples
data(pmn_migration)
summary(pmn_migration)
Population Estimates 2015
Description
This data set contains population estimates, birth rates (per 1000 population), and net migration (per 1000 population) for each year 2011 through 2015 at the state level as reported by the U.S. Census Bureau.
Usage
population_estimates_2015
Format
A data.frame with 50 rows and 17 columns indicating each metric for each state over the years covered.
Source
United States Census Bureau (2016)
Examples
data(population_estimates_2015)
summary(population_estimates_2015)
Presidential Election Polls
Description
This data set contains the results of the final pre-election Gallup Voter Opinion Poll and the eventual election results for every presidential election from 1936 through 2012.
Usage
presidential_election_polls
Format
A data.frame with 46 rows and 5 columns corresponding to the year, name, final polling estimate, actual election results, and deviation between polling and actual results for each candidate.
Source
Gallup (2015)
Examples
data(presidential_election_polls)
summary(presidential_election_polls)
Proportion For-Profit Hospitals
Description
This data set contains data on the proportion of a state's hospitals that are for-profit for a subset of states from each of four regions of the country. This data is displayed in Table 12.5 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
proportion_for_profit_hospitals
Format
A data.frame with 20 rows and 3 columns indicating the region and proportion of for-profit hospitals for each state.
Source
American Hospital Association (2012)
Examples
data(proportion_for_profit_hospitals)
summary(proportion_for_profit_hospitals)
Q2/Q4 Agricultural Chargeoff Rates
Description
This data set contains twenty-five years (1991 through 2015) of Q2 and Q4 charge-off rates for agricultural loans as reported by the Federal Reserve. This is a subset of the agricultural_chargeoff_rates_by_quarter dataset.
Usage
q2_q4_agricultural_chargeoff_rates
Format
A data.frame with 50 rows and 3 columns indicating the chargeoff rates for agricultural loans for each quarter.
Source
https://www.federalreserve.gov/releases/chargeoff/chgallnsa.htm
Examples
data(q2_q4_agricultural_chargeoff_rates)
summary(q2_q4_agricultural_chargeoff_rates)
Reading Habits 2011
Description
This data set contains data from the 2011 Pew Research Center report titled "The Rise of E-reading" intended to investigate how reading and e-reading habits vary by demographic and socio-economic categories. A subset of this data is displayed in Table 3.4 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
reading_habits_2011
Format
A data.frame with 2986 rows and 7 columns indicating various pieces of demographic information and reading habits for each study participant.
Source
Pew Internet & American Life Project: The rise of e-reading (2016)
Examples
data(reading_habits_2011)
summary(reading_habits_2011)
School Report Cards 2014
Description
This data set contains 2014 school performance data, or "School Quality Snapshots", for high schools in New York City. A subset of this data is displayed in Table 3.2 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
school_report_cards_2014
Format
A data.frame with 484 rows and 8 columns indicating various performance statistics for each high school.
Source
New York City Department of Education (2016)
Examples
data(school_report_cards_2014)
summary(school_report_cards_2014)
Sheep Weight
Description
This data set contains the mother's mating weight and her lamb offspring's weight at age 7 months for a subset of twenty ewe-lamb pairs from the Research Farm at Ataturk University, Erzurum, Turkey. This data is displayed in Table 11.6 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
sheep_weight
Format
A data.frame with 20 rows and 2 columns indicating the weight (in kg) for the mother and the lamb of each pair.
Source
Ozturk et al. (2005)
Examples
data(sheep_weight)
summary(sheep_weight)
State CDI
Description
This data set contains a sample of "chronic disease indicators", as reported by the CDC's Division of Population Health, for the four states California, Michigan, Ohio, and West Virginia. This data is displayed in Table 2.16 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
state_cdi
Format
A data.frame with 16 rows and 3 columns indicating the indicator and the percentage of the population having such an indicator for each state.
Source
Centers for Disease Control and Prevention (2016)
Examples
data(state_cdi)
summary(state_cdi)
State Poverty Levels 2013
Description
This data set contains the number of individuals in each state whose annual income was below the established poverty level in 2013. This data is displayed in Table 1.4 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
state_poverty_levels_2013
Format
A data.frame with 50 rows and 3 columns corresponding to the name, total population, and number of people at or below the poverty line in each state.
Source
United States Census Bureau: Population Division (2014) & American Community Surveys (2014)
Examples
data(state_poverty_levels_2013)
summary(state_poverty_levels_2013)
TIAA CREF
Description
This data set contains the historical unit values from January 2, 2015 to March 15, 2016 for seven TIAA and CREF variable annuities. This is the full dataset corresponding to the selected rows displayed in Table 2.6 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
tiaa_cref
Format
A data.frame with 302 rows and 8 columns indicating the unit values for each of the seven annuities for each day.
Source
Examples
data(tiaa_cref)
summary(tiaa_cref)
Traffic Accident Data
Description
This data set contains traffic accident data from 1949 and 1985. This data is displayed in Table 1.2 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
traffic_accidents
Format
A data.frame with 9 rows and 2 columns. The rows correspond to various types of accidents and the two columns correspond to the earlier and later of the two years, respectively.
Source
National Safety Council (1996)
Examples
data(traffic_accidents)
summary(traffic_accidents)
Weekly Salaries
Description
This data set contains the median weekly earnings of male and female employees in service occupations, sales and office occupations, and construction and extraction occupations for each quarter from 2005 to 2015. This data is displayed in Table 2.1 of Wolfe and Schneider - Intuitive Introductory Statistics.
Usage
weekly_salaries
Format
A data.frame with 44 rows and 6 columns corresponding to male and female earnings for each of the three occupation types for each quarter.
Source
U.S. Department of Labor, Bureau of Labor Statistics (2016)
Examples
data(weekly_salaries)
summary(weekly_salaries)
Weight of Euros
Description
This data set contains the weight of 2000 coins from 8 separate packages of 250 Euros each as weighed and reported by Shkedy et al. (2006).
Usage
weight_of_Euros
Format
A data.frame with 2000 rows and 3 columns corresponding to the ID, weight, and batch/package number for each coin.
Source
Shkedy et al. (2006)
Examples
data(weight_of_Euros)
summary(weight_of_Euros)