Help for package GGIRread

Type:

Package

Title:

Wearable Accelerometer Data File Readers

Version:

1.0.5

Date:

2025-05-09

Maintainer:

Vincent T van Hees <v.vanhees@accelting.com>

Description:

Reads data collected from wearable acceleratometers as used in sleep and physical activity research. Currently supports file formats: binary data from 'GENEActiv' https://activinsights.com/, .bin-format from GENEA devices (not for sale), and .cwa-format from 'Axivity' https://axivity.com. Further, it has functions for reading text files with epoch level aggregates from 'Actical', 'Fitbit', 'Actiwatch', 'ActiGraph', and 'PhilipsHealthBand'. Primarily designed to complement R package GGIR https://CRAN.R-project.org/package=GGIR.

URL:

https://github.com/wadpac/GGIRread/

BugReports:

https://github.com/wadpac/GGIRread/issues

License:

Apache License (== 2.0)

Suggests:

testthat

Imports:

matlab, bitops, Rcpp (≥ 0.12.10), data.table, readxl, jsonlite, digest

Depends:

stats, utils, R (≥ 3.5.0)

NeedsCompilation:

yes

LinkingTo:

Rcpp

ByteCompile:

yes

Packaged:

2025-05-09 11:58:57 UTC; vvanh

Author:

Vincent T van Hees [aut, cre], Patrick Bos

[aut], Lena Kushleyeva [ctb], Jing Hua Zhao [ctb], Evgeny Mirkes [ctb], Dan Jackson [ctb], Jairo H Migueles [ctb], Medical Research Council UK [cph, fnd], Accelting [cph, fnd]

Repository:

CRAN

Date/Publication:

2025-05-09 14:50:13 UTC

A package to read data files from wearable accelerometers

Description

This package offers a collection of functions to read data files from wearable accelerometers. Some functions were migrated from R package GGIR to make GGIR more modular and to reduce it's complexity, while other functions such as readGENEActiv (R) and GENEActivReader (C++) are newly written.

Details

Package:	GGIRread
Type:	Package
Version:	1.0.5
Date:	2025-05-09
License:	LGPL (>= 2.0, < 3)

Author(s)

Vincent T van Hees <v.vanhees@accelting.com> main creator and developer
Patrick Bos developed function GENEActivReader
Jing Hua Zhao <jinghua.zhao@mrc-epid.cam.ac.uk> co-developed function readGenea
Evgeny Mirkes created function readAxivity
Dan Jackson helped improve function readAxivity

Function (C++) to read binary files as produced by the GENEActiv accelerometer

Description

Function to read binary files as produced by the GENEActiv accelerometer by ActivInsights Ltd https://activinsights.com/

Usage

  GENEActivReader(filename, start = 0L, end = 0L, progress_bar = FALSE)

Arguments

filename

filename (required)

start

start page for reading data

end

end page for reading data

progress_bar

Boolean

Details

If only start page is defined then all data is read beyond start until the end of the file is reached

Value

info

List with ReadOK (good=0 or error=1), ReadErrors (Count of pages with read errors), SampleRate (Hertz), numBlocksTotal

time

Numeric vector with time in miliseconds since start page

x

Numeric vector with x-axis acceleration in gravitational units

y

Numeric vector with y-axis acceleration in gravitational units

z

Numeric vector with z-axis acceleration in gravitational units

temperature

matrix with battery voltage and corresponding timestamps

lux

Numeric vector with lux values in Volts

Author(s)

Patrick Bos <egpbos@gmail.com> Vincent T van Hees <v.vanhees@accelting.com>

Check timestamp format

Description

Helper function used internally. Generate informative error message if timestamp could not be recognisede.

Usage

  checkTimeFormat(timestamp_POSIX, rawValue = " ?? ", timeformat = " ?? ",
                           timeformatName = NULL)

Arguments

timestamp_POSIX

POSIX object with timestamp

rawValue

Timestamp value as encounterd in the data before processing

timeformat

Timestap format used for converting timestamp

timeformatName

Character with the argument name to specify the timeformat. If used as dependency of GGIR then this argument name will be different.

Value

No output value is generated, only an error message if timestamp could not be recognised.

Author(s)

Vincent T van Hees <v.vanhees@accelting.com>

Detect quote

Description

Helper function used internally, used to find out what value should be used for argument quote in function fread.

Usage

  detectQuote(filename, skip)

Arguments

filename

Character

skip

skip

Value

Character with quote object to be used for fread

Author(s)

Vincent T van Hees <v.vanhees@accelting.com>

Find start of data inside text file.

Description

Helper function used internally. Finds variables start of the data in Actiwatch and Actical data.

Usage

  findStartData(filename, quote, startindex)

Arguments

filename

Character

quote

Quote as extracted with detectQuote

startindex

Start index where to start searching. For Actical we start at 300 while for Actiwatch we start at 1000.

Value

Start index

Author(s)

Vincent T van Hees <v.vanhees@accelting.com>

Get File Extension

Description

Helper function used internally to get file extension.

Usage

  getExtension(filename)

Arguments

filename

Character

Value

Character with file extension

Author(s)

Vincent T van Hees <v.vanhees@accelting.com>

Merge Fitbit json files into one data.frame

Description

Read and merge Fitbit json files into one data.frame, and give warning if time series do not intersect.

Usage

  mergeFitbitData(filenames = NULL,
                    desiredtz = "", configtz = NULL)

Arguments

filenames

Character vector, specifying the filenames (at least two) to be merged.

desiredtz

Character, timezone name where the accelerometer was worn. Timezone names are expected to be the timezone database names, e.g. Europe/London. See also: https://en.wikipedia.org/wiki/List_of_tz_database_time_zones

configtz

Character, timezone name where the accelerometer was configured. Leave NULL if equal to desiredtz. Timezones name are expected to be the timezone database names, e.g. Europe/London. See also: https://en.wikipedia.org/wiki/List_of_tz_database_time_zones

Value

Merged data from the two or more files.

Author(s)

Vincent T van Hees <v.vanhees@accelting.com>

Merge Philips Health Band xlsx file pairs into one csv file

Description

Read, merge, and save Philips Health Band xlsx file pairs to csv. If the pair is incomplete the function will still convert the available xlsx file to csv.

Usage

  mergePHBdata(filenames = NULL,
                    timeformat = "%m/%d/%Y %H:%M:%S",
                    desiredtz = "", configtz = NULL,
                    timeformatName = "timeformat")

Arguments

filenames

Character vector, specifying the two filenames to be merged. Assumption is that there is one file starting with "Datalist_" and one starting with "Sleep_Wake_".

timeformat

Character, timestemp format.

desiredtz

configtz

timeformatName

Character, name of timeformat variable to print in error message when timeformat is incorrect, of use to GGIR where argument names can differ.

Value

Merged data from the two files.

Author(s)

Vincent T van Hees <v.vanhees@accelting.com>

Read ActiGraph Count data files (csv)

Description

Reads ActiGraph Count data file. Currently a variety of csv format are facilitated.

Usage

  readActiGraphCount(filename = NULL, 
                            timeformat = "%m/%d/%Y %H:%M:%S",
                            desiredtz = "", configtz = NULL,
                            timeformatName = "timeformat")

Arguments

filename

filename (required)

timeformat

Character, timestemp format.

desiredtz

configtz

timeformatName

Character, name of timeformat variable to print in error message when timeformat is incorrect, of use to GGIR where argument names can differ.

Value

data

Matrix with one or multiple columns

epochSize

epoch size in seconds of data

startTime

POSIXlt format timestamp on which recording starts

deviceSerialNumber

Device serial number if it could be extracted from the file header

Author(s)

Vincent T van Hees <v.vanhees@accelting.com>

Read Actical Count data files (csv)

Description

Reads Actical Count data file.

Usage

  readActicalCount(filename = NULL,
                            timeformat = "%m/%d/%Y %H:%M:%S",
                            desiredtz = "", configtz = NULL,
                            timeformatName = "timeformat")

Arguments

filename

filename (required)

timeformat

Character, timestemp format.

desiredtz

configtz

timeformatName

Character, name of timeformat variable to print in error message when timeformat is incorrect, of use to GGIR where argument names can differ.

Value

data

Matrix with one or multiple columns

epochSize

epoch size in seconds of data

startTime

POSIXlt format timestamp on which recording starts

Author(s)

Vincent T van Hees <v.vanhees@accelting.com>

Read Actiwatch Count data files (csv)

Description

Reads Actiwatch Count data file. Currently a variety of csv format are facilitated.

Usage

  readActiwatchCount(filename = NULL,
                            timeformat = "%m/%d/%Y %H:%M:%S",
                            desiredtz = "", configtz = NULL,
                            timeformatName = "timeformat")

Arguments

filename

filename (required)

timeformat

Character, timestemp format.

desiredtz

configtz

timeformatName

Character, name of timeformat variable to print in error message when timeformat is incorrect, of use to GGIR where argument names can differ.

Value

data

Matrix with one or multiple columns

epochSize

epoch size in seconds of data

startTime

POSIXlt format timestamp on which recording starts

Author(s)

Vincent T van Hees <v.vanhees@accelting.com>

Function to read .cwa-format files as produced by the accelerometer named 'Axivity'

Description

For reading .cwa-format data with the Axivity AX3 and AX6 sensors.

Usage

  readAxivity(filename, start = 0, end = 0, progressBar = FALSE, 
    desiredtz = "", configtz = c(), interpolationType=1, loadbattery = FALSE,
    header = NULL, frequency_tol = 0.1, maxAllowedCorruptBlocks = 20)

Arguments

filename

filename (required) is name of cwa file to read

start

non-negative integer which is a cwa file block number.

end

non-negative integer which is a cwa file block number. End must be not be less than start. If end is less or equal to start, then there is no data read.

progressBar

Is trigger to switch on/off the text progress bar. If progressBar is TRUE then the function displays the progress bar but it works slightly slower

desiredtz

Desired timezone, a character with timezone database name.

configtz

Timezone in which the accelerometer was configured. Only use this argument if the timezone of configuration and timezone in which recording took place are different.

interpolationType

Integer to indicate type of interpolation to be used, 1=linear, 2=nearest neighbour.

loadbattery

Boolean to indicate whether battery voltage should be loaded

header

Header to be reused if it was extracted earlier

frequency_tol

Numeric value representing the tolerance for frequency bias expressed as a fraction of the expected sampling frequency. For example, 0.10 indicates that we accept the sampling frequency configured at 100 Hertz to vary between 90 and 110 Hertz. If this condition is not met the data will be imputed with a constant value and the start and end time will be logged in the QClog output object. Regardless of the setting the function will always log instances when frequency differs by more than 5%, but if this is less than frequency_tol the block will not be imputed.

maxAllowedCorruptBlocks

Max number of consecutive blocks with a failed checksum that we'll tolerate.

Value

data dataframe with data from start inclusive till end exclusive. If start == end then data section of final structure is empty.
- timestamp in numeric format
- gx gyroscope x-axis (only available in AX6)
- gy gyroscope y-axis (only available in AX6)
- gz gyroscope z-axis (only available in AX6)
- x accelerometer x-axis
- y accelerometer y-axis
- z accelerometer z-axis
- temperature in Celsius (only recorded once per block)
- battery one value per block (only recorded once per block)
- light in LUX on (only recorded once per block)
header file header. list of header information:
- uniqueSerialCode is unque serial code of used device
- frequency is measurement frequency. All data will be resampled for this frequency
- start is timestamp in numeric form. To get text representation it is enough to use as.POSIXct(start, origin = "1970-01-01", tz=desiredtz)
- device is "Axivity"
- firmwareVersion
- blocks is number of datablocks with 80 or 120 raw observations in each.
QClog dataframe with log of integrity checks on each block including:
- checksum_pass Boolean to indicate whether checksum was zero.
- blockID_previous ID for previous block
- blockID_current ID for current block
- start_previous numeric start time of previous block
- start_current numeric start time of current block
- blockLengthSeconds length of previous block
- frequency_blockheader frequency derived from block header
- frequency_observed frequency as observed
- imputed Boolean to indicate whether this block was inputed

Author(s)

Evgeny Mirkes <em322@leicester.ac.uk> Lena Kushleyeva Vincent van Hees <v.vanhees@accelting.com>

Examples

  
    cwafile  = system.file("testfiles/ax3_testfile.cwa", package = "GGIRread")[1]
    AX3 = readAxivity(filename = cwafile, desiredtz = "Europe/Berlin", start = 1, end = 2)

Read Fitbit data files (json)

Description

Reads Fitbit data file (json) with sleep, steps or calories. Steps and calories are converted to 30 second resolution to match resolution of sleep data.

Usage

  readFitbit(filename = NULL, desiredtz = "", configtz = NULL)

Arguments

filename

Character, filename (required) of json file

desiredtz

configtz

Value

Data.frame with data converted to time series

Author(s)

Vincent T van Hees <v.vanhees@accelting.com>

Function (R) to read binary files as produced by the GENEActiv accelerometer

Description

R function wrapper around GENEActivReader to read binary files as produced by the GENEActiv accelerometer by ActivInsights Ltd https://activinsights.com/

Usage

  readGENEActiv(filename, start = 0L, end = 0L, progress_bar = FALSE,
                desiredtz = "", configtz = NULL)

Arguments

filename

filename (required)

start

start page for reading data

end

end page for reading data

progress_bar

Boolean

desiredtz

Character, timezone database name of the timezone where the accelerometer was worn.

configtz

Character, timezone database name of the timezone where the accelerometer was configured. Leave NULL if equal to timezone where experiment took place.

Details

If only start page is defined then all data is read beyond start until the end of the file is reached

Value

header:

serial_number

Device serial number as extracted from file header

firmware

Firmware version

tzone

Time zone as extracted from file header

ReadOK

see GENEActivReader

SampleRate

matrix with battery voltage and corresponding timestamps

ReadErrors

matrix with battery voltage and corresponding timestamps

numBlocksTotal

matrix with battery voltage and corresponding timestamps

StartTime

Start time as extracted from file header

data.out:

time

Unix time in seconds with decimal places for miliseconds

x

see GENEActivReader

y

see GENEActivReader

z

see GENEActivReader

light

Ligth values express in lux units

temperature

Temperature in Celsius

Author(s)

Patrick Bos <egpbos@gmail.com> Vincent T van Hees <v.vanhees@accelting.com>

Examples

  binfile  = system.file("testfiles/GENEActiv_testfile.bin", package = "GGIRread")[1]
  rdata = readGENEActiv(filename = binfile, start = 1, end = 1, desiredtz = "Europe/London")

Function to read binary files as produced by the accelerometer named 'Genea', not to be confused with the 'GENEActiv' (see package GENEAread for this)

Description

For reading the binary data as collected with a Genea accelerometer (Unilever Discover, UK). For reading GENEActive binary data, see package GENEAread.

Usage

  readGenea(filename, start = 0, end = 0)

Arguments

filename

filename (required)

start

start point for reading data, this can either be a timestamp in POSIXct format "year-month-day hr:min:sec" or a page number (optional)

end

end point for reading data, this can either be a timestamp in POSIXct format "year-month-day hr:min:sec" or a page number (optional)

Details

If only start is defined then readGenea will read all data beyond start until the end of the file is reached

Value

rawxyz

matrix with raw x, y, and, z acceleration values

header

file header

timestamps1

timestamps for rawxyz in seconds since 1970-01-01 00:00

timestamps2

timestamps for rawxyz in day time format

batt.voltage

matrix with battery voltage and corresponding timestamps

Author(s)

Vincent T van Hees <v.vanhees@accelting.com> Jing Hua Zhao <jinghua.zhao@mrc-epid.cam.ac.uk>

Examples

  geneafile  = system.file("testfiles/genea_testfile.bin", package = "GGIRread")[1]
  GENEA = readGenea(filename = geneafile, start = 0, end = 2)

Read PHB Count data files (xlsx)

Description

Reads Philips Health Band (PHB) Count data file.

Usage

  readPHBCount(filename = NULL,
                timeformat = "%m/%d/%Y %H:%M:%S",
                desiredtz = "", configtz = NULL,
                timeformatName = "timeformat")

Arguments

filename

filename (required)

timeformat

Character, timestemp format.

desiredtz

configtz

timeformatName

Character, name of timeformat variable to print in error message when timeformat is incorrect, of use to GGIR where argument names can differ.

Value

Data.frame with time series, exact column names can vary between xlsx files.

Author(s)

Vincent T van Hees <v.vanhees@accelting.com>

Read and Process Binary Data from Matrix Devices

Description

Reads a binary file generated by Parmay Tech Matrix devices, processes its header and packet data, validates data integrity using CRC32 checksums, and outputs structured sensor data and quality check information.

Usage

  readParmayMatrix(filename, output = c("all", "sf", "dynrange")[1], 
                   start = 1, end = NULL,
                   desiredtz = "", configtz = NULL, interpolationType = 1,
                   read_acc = TRUE, read_gyro = FALSE, 
                   read_temp = TRUE, read_heart = FALSE)

Arguments

filename

Character. Path to the binary file to be read.

output

Character. Specifies the type of output. Options include:

"all": Returns the full processed data.
"sf": Returns the sampling frequency of the accelerometer data.
"dynrange": Returns the dynamic range of the accelerometer.

start

Integer. The index of the starting packet to process. Default is 1.

end

Integer. The index of the ending packet to process. Default is NULL, which means all packets are processed.

desiredtz

Character. Time zone for the returned timestamps. Default is an empty string, which uses the system's default time zone.

configtz

Character. Time zone specified in the file's configuration. Default is NULL, which means that it uses desiredtz.

interpolationType

Integer. Specifies the type of interpolation (see resample) to use when resampling data: 1 for Linear interpolation, 2 for Nearest-neighbor interpolation.

read_acc

Logical. Indicates whether accelerometer data should be read.

read_gyro

Logical. Indicates whether gyroscope data should be read.

read_temp

Logical. Indicates whether temperature data should be read.

read_heart

Logical. Indicates whether heart rate data should be read.

Details

Matrix devices store binary data in packets, with varying lengths depending on the number of sensor recordings in each packet. The function processes the file's header to extract metadata such as the total number of packets and sensor ranges, validates data integrity using CRC32 checksums, and interpolates data to a consistent sampling frequency.

Header Information:

Remarks: Bytes 1-512.
Total packets: Bytes 513-516.
Header string: Bytes 517-520. If not "MDTC", the file is considered corrupt.
Accelerometer dynamic range: Bytes 521-522.
Gyroscope range: Bytes 523-524.

Packet Structure:

Each packet contains accelerometer, gyroscope, temperature, and heart rate data.

8-byte package header.
4-byte CRC32 indicator.
4-byte start timestamp.
4-byte end timestamp.
4-byte number of accelerometer recordings in packet.
4-byte number of gyroscope recordings in packet.
4-byte number of temperature recordings in packet.
4-byte number of heart rate recordings in packet.
Sensor data for accelerometer, gyroscope, temperature, and heart rate.

Value

A list containing the following elements (when output = "all"):

QClog: A data frame with quality control information, including checksum validation and data gaps.
output: A data frame with resampled sensor data, including:

time
Timestamps of the recordings.

acc_x, acc_y, acc_z
Resampled accelerometer data.

gyro_x, gyro_y, gyro_z
Resampled gyroscope data.

bodySurface_temp, ambient_temp
Resampled temperature data.

hr_raw, hr
Resampled heart rate data.

remarks
Remarks extracted from the file header.
header: A list with the following elements:

sf
Sampling frequency of the accelerometer data.

acc_dynrange
Dynamic range of the accelerometer.

starttime
Start time of the first packet in POSIXct format.
lastchunk: Logical, indicating if the processed data includes the last packet in the file.

If output = "sf", the function returns only the sampling frequency.

If output = "dynrange", it returns the dynamic range of the accelerometer.

Author(s)

Jairo H Migueles <jairo@jhmigueles.com>

References

For more details on Matrix devices', see: https://www.parmaytech.com/devices/en-matrix For additional details on Matrix bin/BIN files structure, please contact manufacturer: https://www.parmaytech.com/contact

Examples

  ## Not run: 
  # Example usage:
  binfile  = system.file("testfiles/mtx_12.5Hz_acc.BIN", package = "GGIRread")
  # Read full data and process all packets
  result <- readParmayMatrix(binfile)
  
  # Get sampling frequency only
  sf <- readParmayMatrix(binfile, output = "sf")
  
  # Get accelerometer dynamic range
  dynrange <- readParmayMatrix(binfile, output = "dynrange")
  
  # Process a subset of packets
  result_subset <- readParmayMatrix(binfile, start = 10, end = 20)
  
## End(Not run)

Deprecated function to read .wav files as can be stored with OMGUI software by Axivity Ltd

Description

Originally designed to read the wav accelerometer data format as stored with the OMGUI software by Axivity Ltd and documented here.

This function has been deprecated as wav files are hardly used, their formatting is not consistent causing ongoing challenges, and the wav format is not well facilitated for AX6 recordings.

Usage

  readWav(filename, start = 1, end = 100,units="minutes")

Arguments

filename

filename (required)

start

start point for reading data, see also units

end

end point for reading data, see also units

units

units used for defining start and end

Details

If only start is defined then g.binread will read all data beyond start until the end of the file is reached

Value

rawxyz

matrix with raw x, y, and, z acceleration values

header

file header

timestamps

local timestamps for rawxyz

Author(s)

Vincent T van Hees <v.vanhees@accelting.com>

Resample timeseries

Description

Resample single- or multi-variate time series using linear or nearest neighbour interpolation

Usage

  resample(raw, rawTime, time, stop, type = 1)

Arguments

raw

stop-by-3 matrix with raw values of x, y and z.

rawTime

vector with stop elements of raw time.

time

array with required time points.

stop

Number of rows in raw

type

integer to indicate type of interpolation, 1=linear, 2=nearest neighbour

Examples

  raw = cbind(1:10, 1:10, 1:10)
  rawTime = seq(0.1, 1, by = 0.1)
  time = seq(0.15001, 1.05001, by = 0.1)
  stop = 10
  dat_lin = resample(raw, rawTime, time, stop, type = 1)