manydata 1.1.3
Package
- Updated website address
- Updated authorship
manydata 1.1.0
Package
- Updated GitHub actions to use code coverage secrets
Wrangling
- Added filter_datacube()for filtering datasets in a
datacube by date
- Added find_ID()andfind_common_ID()for
identifying ID columns in datasets
Evaluation
- Added find_year()for extracting just the year from a
date (potentially unnecessary ifmessydates::year()available)
- Added compare_new()andcompare_diff()for
comparing what is new or different in one dataset over another
- Added a range of score_*()functions for scoring
datasets on various criteria, including consistency, completeness,
accuracy, timeliness, and uniqueness of the data
Maintaining
- Added find_duplicates()for identifying duplicate
observations in datasets
- Added code_extend_glove()andcode_extend_bert()for extending existing coding to new or
missing data
manydata 1.0.3
Connection
- Added new getID()helper that obtains the one or two ID
columns that appear as the first one or two columns in a datacube
- compare_overlap()now returns a list of each datasets
IDs to avoid issues with- {ggVennDiagram}
- plot.compare_overlap()now always returns an upset plot
(closes #292)
- Fixed testing of ggplot objects (closes #308)
- Fixed how plot.compare_categories()treated identifier
variables (closes #291)
manydata 1.0.2
Connection
- Fixed global variables in several resolve_*()functions
manydata 1.0.1
Package
Connection
- resolve_*()functions now have a parameter indicating
whether missing values should be included; unlike base R, by default
missing values are excluded
- Restored resolve_mean()
- Restored resolve_median()
- Added resolve_mode()for retaining the most common
values
- Added resolve_consensus()for retaining only values
where there are no conflicts
manydata 1.0.0
Package
- Updated GitHub checks and release actions
- Fixes to URLs
- Updated website
- Improved ease of operation by making {cli},{dplyr}, and{messydates}Depends
- Dropped {usethis}Suggest
Collection
- Updated emperorsdataset
- Using zero-padded messydates
- Added citation prompts
- Datasets capitalised:
- emperors$Wikipedia
- emperors$UNRV
- emperors$Britannica
 
- Fixed non-unique IDs bugs
- Fixed inc
 
Calling
- Added call_citations()to print citations added as
hidden information
- Fixed finicky call_sources()bug related to calling
help files
- Improved call_sources()andcall_citations()to accept datacubes or datasets, as
objects or characters
- Moved mreport()from messydates
- Added mreport.list()to make it easier to report on
datacubes
 
- Added describe_data()for describing key aspects of
datasets in datacubes
- Fixed call_releases()to usemessydates::vmin()
Connection
- Improved pluck()
- Function now wraps dplyr::pluck()but adds a citation
prompt
 
- Improved consolidate()
- Improved useability with {cli}progress messages and
success alerts
- Improved speed using {dtplyr}in place ofdplyr::full_join()(closes #288)
- {duckplyr}considered: faster, but couldn’t handle- mdateclass
- {collapse}considered: even faster, but inconsistent
output
 
- Improved compatibility by converting ‘rows’ argument to ‘join’
(breaking)
- “all” becomes “inner”
- “any” becomes “full”
- “favour” becomes “left”
 
- Fixed being passed a single dataset
- Prompts users to cite datasets (closes #280)
- Fixed bug in ‘resolve’ argument, named ‘resolve’ vector no longer
has to be same length as variables
- Dropped ‘cols’ argument
 
- Updated tests for consolidate()to use new ‘join’
argument
- testthat tests use {cli}on quiet mode
 
- Updated resolve_coalesce()for coalescing (taking first
non-NA value)
- Updated resolve_random()for returning random values
sampling from those available
- Updated resolve_min()andresolve_max()for returning min or max values
- Added resolve_unite()for returning all possible values
as a set
- Added resolve_precision()for returning most precise
values available (closes #265)
- Added precision.numeric()to return most significant
figures
- Added precision.character()to return most
characters
 
- Dropped resolve_median()andresolve_mean()as uncommon choices
- Dropped resolve_multiple()in favour of always using
more flexible for loop
- Dropped favour()in favour of left joins and
coalesces
- Dropped coalesce_rows()as no longer necessary
manydata 0.9.3
Connection
- Updated call_sources()to be more flexible when
gathering data from datacube documentation
- Closed #279 by updating documentation across many packages to be
compatible with call_sources()
- Updated compare_dimensions()by fixing bugs related to
dates and NA observations
manydata 0.9.2
Package
- Fixed the emperorsdata documentation issues related to
lost braces with CRAN submission
manydata 0.9.1
Package
- Updated test expectations to make package compatible with the new
release of {ggplot2}
Connection
- Closed #266 by adding startup messages to ‘many’ packages
- Closed #267 by adding links to package websites in console
messages
- Closed #282 by updating all references from ‘database’ to
‘datacube’
- Closed #293 by fixing bugs related to missing dates when using
consolidate()
- Closed #294 by updating how call_sources()identify
datasets within datacubes
manydata 0.9.0
Package
- Closed #259 by revising CCC package structure and updating the
package cheatsheet
- Updated documentation for ‘emperors’ data to new style to improve
visibility and transparency
- Closed #264 by removing {tibble}and{janitor}package imports in DESCRIPTION file
- Closed #276 by reviewing package vignettes
- Closed #277 by updating ‘manydata-defunct’ file
- Closed #284 by removing vignette and updating README to include more
information on how to use the package
- Updated all references and argument from ‘database’ to
‘datacube’
Connection
- Renamed and updated ‘call_’ family of functions
- Closed #250, #251, and #262 by renaming get_packages()tocall_packages()and updating how the function works and
look up packages, version updates, and availailabity
- Closed #269 and # by adding a call_sources()function
that displays sources and variable changes for datasets in
datacubes
- Closed #271 by updating the retrieve_family of
functions tocall_functions
- Closed #283 by renaming plot_releases()tocall_releases
 
- Renamed and updated ‘compare_’ family of functions
- Closed #243 and #257 by creating a compare_missing()function to compare missing values in datasets in a ‘many’ datacube
- Closed #249 and #253 by renaming db_plot()function tocompare_categories()and updating variable categories
- Closed #261 by renaming and updating other db_functions tocompare_functions
- Closed #268 by adding compare_overlap()to help users
investigate overlap for datasets within datacubes
- Closed #285 by adding compare_dimensions()andcompare_ranges()to compare dimensions and ranges in
datacubes
 
manydata 0.8.3
Connection
- Made ´network_map()´ function defunct
manydata 0.8.2
Connection
- Updated consolidate()to require two keys when joining
memberships’ databases
- Updated db_comp()to follow consolidation defaults for
memberships’ databases
- Closed #231 by adding a retrieve_texts()function to
retrieve treaty texts from other ‘many’ packages
manydata 0.8.1
Package
- Added ‘RDataTmp’ files to Rbuildignore and .gitignore
- Updated data_evolution()to useinherits()instead ofclass()for condition comparison
manydata 0.8.0
Package
- Closed #212 by implementing package caching in GitHub actions
workflows
- Closed #218 by fixing bug with GitHub actions workflows
- Closed #225 by changing the structure of datasets in “many” data
packages
- Closed #240 by updating the package cheatsheet
Connection
- Closed #134 by adding a data_evolution()function to
the report family of functions that gets original datasets, if
available, or opens the preparation scripts, if not available
- Added ‘db_profile’ family of functions to visualise databases
- Closed #214 by adding db_plot()function to plot a
profile of the database to facilitate comparison of matched observations
across datasets
- Closed #224 by adding db_comp()function that creates a
tibble of the database to facilitate comparison of matched observations
across datasets
 
- Updated get_packages()function
- Closed #215 by making get_packages()interactive so
that users can chose which branch to download
- Closed #219 by improving get_packages()printing
- Updated get_packages()andplot_releases()to use{messydates}, instead of{lubridate},
for dates coercion
 
- Closed #222 by adding network_map()function for
plotting geographical networks
- Updated consolidate()function to make function over 20
times faster
- Closed #227 by making consolidate()ignore text related
variables due to their size
- Closed #230 by making consolidate()more concise to
avoid running into memory limits
- Closed #228 and #232 by replacing coalesce_compatible()for a faster approach to coalescing compatible missing observations that
relies onzoo::na.locf()
- Made coalesce_compatible()function defunct
 
manydata 0.7.5
Package
- Removed {skimr}table fromemperorsdatabase documentation
- Updated path for binaries in push release GitHub actions
manydata 0.7.4
Package
- Closed #187 by updating GitHub actions to implement package
caching
- Closed #209 by removing all non-ASCII characters in package
- Closed #210 by removing {pkgdown}dependency
- Updated emperorsdata to contain correct date class
name consistent with{messydates}
manydata 0.7.3
Connection
- Updated how the get_packages()function identifies
installed packages to avoid usinginstalled.packages()
- Updated documentation for coalesce_compatible()function to include the returns
manydata 0.7.2
- Ignored CRAN-SUBMISSION and resubmitted.
manydata 0.7.1
Package
- Updated DESCRIPTION by removing ambiguous word from title
- Updated README by correcting the URL for life cycle badge
Connection
- Updated helper functions for consolidate()to useinherits()to identify variable’s class
manydata 0.7.0
Package
- Closed #194 by updating all remaining references from “qID” to
“manyID”
- Updated package website
- Closed #196 by updating elements that configure website to work
properly
- Updated ’_pkgdown.yml’ file to use bootstrap 5 template to build
website
 
Connection
- Updated consolidate()function
- Closed #191 by making consolidate()function more
concise and faster by removing redundant code lines
- Fixed dates-related warnings by changing how
{messydates}package is used to resolve dates
- Updated how consolidate()substitutes missing
observations with first non-missing observation from other datasets
- Closed #201 by fixing how consolidate()detects
variables to be resolved to avoid ambiguous variable matching
- Closed #202 by allowing for multiple key vectors to be declared as
arguments for consolidate()
 
- Closed #199 by adding favour()(alsofavor()) function that re-orders datasets within a
database
manydata 0.6.0
Package
- Closed #189 by renaming package from {qData}to{manydata}
- Updated user vignette to include more examples on working with
consolidate()
- Updated package website
- Closed #167 by adding a cheatsheet to README
Connection
- Updated consolidate()function
- Closed #169 by making default key variable “many_ID” instead of
“qID”
- Closed #183 by adding further methods to resolve conflicts between
observations:
- Added “max” resolve argument which resolves conflicts in favor of
the largest non NA value
- Added “min” resolve argument which resolves conflicts in favor of
the smallest non NA value
- Added “mean” resolve argument which resolves conflicts in favor of
the average non NA value
- Added “median” resolve argument which resolves conflicts in favor of
the median non NA value
- Added “random” resolve argument which resolves conflicts in favor of
a random non NA value
 
- Closed #185 by making so that users can specify resolve argument
differently for different variables
 
- Closed #188 by adding more informative warnings for GitHub download
limits for get_packages()function
- Added extraction functions to generate edgelists from agreements
membership datasets
- Added extract_bilaterals()for extracting adjacency
edgelist for bilateral agreements
- Added extract_multilaterals()for extracting adjacency
edgelist for multilateral agreements
 
qData 0.5.0
Package
- Updated get_packages()to load downloaded packages
- Closed #160 by updating how report family of functions to print
objects only if not assigned
- Updated user vignette for working with
consolidate()
- Fixed #165 by moving interleave()to qCreate
- Closed #149 by adding a citation file
- Closed #181 by adding data for running examples and tests
Connection
- Closed #28 by adding consolidate()for collapsing a set
or database of (q)dataset into a single dataset
- Closed #177 by adding coalesce_compact()to coalesce
all compatible rows of a data frame
- Closed #180 by adding progress bar for
coalesce_compact()
- Added tests for consolidate()andcoalesce_compact()
 
- Renamed function for getting first non-missing value
coalesce_rows()
- Added tests for coalesce_rows()
 
qData 0.4.0
Package
- Separated {qData}into two packages,{qData}for users and{qCreate}for developers
- Moved setup_package()and related functions to{qCreate}
- Moved import_data()and related functions to{qCreate}
- Moved export_data()and related functions to{qCreate}
 
- Moved plot_releases()to{migraph}
Connection
- Closed #137 by making data_contrast()anddata_source()return exportable objects
qData 0.3.5
Connection
- Updated get_packages()function
- Fixed bug with dates in get_packages()by changing
dependencies to{lubridate}
- Updated formatting of the returned tibble to display more clearly
the information on qPackages
- Fixed #144 by substituting NAwhere a listed qPackage
is not installed locally
 
qData 0.3.4
Package
- Fixed #128 by suppressing issues with warnings on pkgdown
website
- Updated contributor and user vignettes for new package changes
Collection
- Updated setup_package()to make it more intuitive and
precise for users
- Closed #127 by updating the form in which author arguments are taken
in setup_package()
- Added role argument to setup_package()to make more
precise
- Closed #139 by reducing file overhead generated by
setup_package()
- Simplified setup_package()to be less verbose
 
- Renamed new_author()function toadd_author()for naming consistency
- Closed #140 by fixing bug with add_author()
- Fixed #121 by allowing add_author()to treat multiple
comments
 
- Updated documentation for setup_package()andadd_author()
Correction
- Updated standardise_dates()to simplify function and
testing
- Closed #110 by simplifying helper functions and removing redundancy
in standardise_dates()
- Closed #108 by simplifying tests and removing redundancy for
standardise_dates()
 
Connection
- Divided, updated and renamed report_data()
- Closed #131 by dividing report_data()intodata_source()anddata_contrast()functions
- Closed #104 by allowing users to select multiple datasets with
data_contrast()
- Closed #105 by allowing for multiple databases to display in a
package with data_contrast()
- Fixed #111 by removing tests for outdated report_data()function
 
qData 0.3.3
Package
- Closed #122 by setting package logo to the correct aspect ratio
Collection
- Updated setup_package()to make it easier to set up
collaborative qPackages
- Closed #106 by making setup_package()accept more than
two authors as arguments
- Closed #119 by updating setup_package()function so
that it adds LICENSE file to new qPackages
- Closed #90 by adding a new_author()function that adds
new package contributors to description file
 
- Closed #115 by updating README templates for new qPackages to
include more sections
Correction
- Closed #116 by adding more tests for new qPackage templates for
standardizing NA formats
Connection
- Updated get_packages()to make it easier to install
globalgov qPackages
- Fixed #117 by making the get_packages()work with
package name only for qPackages part of the globalgov organization
- Closed #79 by updating get_packages()to accept listed
number for package as arguments
- Closed #103 by adding GitHub lookup for depends()function
 
- Updated user and developer vignettes to be compatible with these
changes
qData 0.3.2
Package
- Replaced logo with new ‘black box’ hex
- Added open graph image to GitHub account
 
- Closed #83 by improving CII best practices score
- Added depends()for loading and, if necessary,
installing CRAN packages
- Added plot_releases()function that visualises
historical milestones/releases for GitHub repositories
- Updated setup_package()function by removing license
lines in the codes
- Fixed #89 by adding different description templates depending on the
number of authors declared in setup_package()
- Added ‘dtc’ contributor type in the DESCRIPTION file
 
Collection
- Updated import_data()to accept .RData files
Correction
- Updated and fixed bugs with standardise_dates()function and tests
- Closed #73 standardise_dates()function by usinglubridate::as_date()in place ofanytime::anydate()to correctly treat historical BC and AD
dates
- Fixed #95 by simplifying standardise_dates()to make
sure it works with all types of date variables
- Closed #77 and #93 by devolving standardise_dates()into various helper functions
 
- Updated export_data()to better document new database
and dataset structure
- Documentation generated by export_data()references the
newly createdreport_data()function to generate metadata
dataframes at the desired level (Package, Database, Dataset)
- Closed #76 by adding some smart documentation updates to
export_data()
- Closed #92 by tweaking export_data()to document
datasets at the database level
- Closed #94 by adding a bibliographic attribute to each dataset with
export_data()
- Fixed #63 and #87 by setting up tests templates for the different
types of database created
 
- Added retain()to retain chosen objects in the
environment, removing all other variables
Connection
- Closed #78 by creating report_data()function that
displays information on the data within a qPackage
qData 0.3.1
Package
- Closed #81 by changing description of the Connecting data
section to include export_data()specificities
- Closed #85 by deleting the argument in the function
use_ccby_license(name = packageAuthor)in the description
of thesetup_package()function
- Corrected the typing error found in “develop” vignette
- Corrected styling issues in texts (delete double space in
files)
- Updated the description of the export_data()function
to its new version including database name as a string
qData 0.3.0
Package
- Changed package name from qDatr to qData
- Created a hex logo for qData
- Closed #32 by setting up a more informative README document
- Added codefactor badge
- Added CII badge, in progress
 
- Closed #69 by creating a vignette for users working with qData
- Closed #67 by creating a vignette for developers interested in
contributing to the qData ecosystem
- Added pkgdown workflow for updating the website upon release
Collection
- Updated import_data()by suggesting upper case letters
for dataset names and lower case letters for database names
- Now complains if pointed to unrecognised data format
 
Correction
- Closed #70 by updating stadardise_dates()so that
extreme future dates are standardised
- Updated standardise_dates()to accept multiple variable
dates as inputs and it standardises separators
- Updated standardise_dates()to be more efficient when
dealing with different date formats
 
- Deprecates recent()in favour ofstandardise_dates()
- Updates export_data()by making it save datasets as
named list elements in a database
- Updates exported test templates for data to test for datasets within
databases
Connection
- Updates get_packages()function to:
- Correctly identify latest release and installed versions
- Identify registered contributors
 
qDatr 0.2.1
Package
- Fixed #61 by adding tests for setup_package()andimport_data()
- Closed #50 by making sure workflow templates are created in the
proper folder with setup_package()
Correction
- Closed #62 by creating a function which vectorises dates and cleans
and standardizes diverse date formats,
standardise_dates()
- Closed #55 by deprecating rearrange()function in
favour ofdplyr::relocate()
- Closed #65 by correcting issues with the path call in
export_data()function
- Ensures consistency for path calls across qData functions
 
Connection
- Closed #64 by creating the get_packages()function
which displays other packages, and some information about these
packages, in the qData ecosystem. The function serves as a wrapper for
downloading qPackages from GitHub.
qDatr 0.2.0
Package
- Updated pushrelease.yml to upload all assets individually to
release
- ((Tests for following functions on hold…))
Collection
- Renamed create_qPackage()tosetup_package()
- Now establishes main testing framework
- Opens fewer files but is more verbose
- Closed #51 by trimming dependencies from qPackage-DESCtemplate
- Fixed bug in qPackage-DESCthat names weren’t
strings
 
- Renamed use_qData_raw()toimport_data()
- Closed #49 with path argument or selector
- Added delete_original argument (default = FALSE)
- Closed #58 by extracting file type information from path and adding
it and path to preparation template
 
Connection
- Renamed use_qData()toexport_data()
- Avoided reliance on usethis::use_data()
- Fixed bug where it wasn’t selecting object correctly
- Closed #57 by passing on object information into the documentation
template (now qData-doc.R)
- Adapted test template so that object name is passed forward
 
qDatr 0.1.0
Package
- Updated CONTRIBUTING
- Added README.Rmd render to pushrelease workflow
Collection
- Closed #47 by adding qtemplate()function for finding
and rendering templates
- Closed #44 by adding create_qPackage()that establishes
a{qData}consistent package framework
- Fixed #42 by drawing information from DESCRIPTION where
possible
- Added DESCRIPTION template
- Added README template
- Added COC template
- Added CONTRIBUTING template
- Added PR and issue templates for bugs and features
- Added workflow templates for push-release, pr-checks, and
pr-commands
 
- Closed #48 by adding use_qData_raw()for setting up
data cleaning and wrangling
- Added template for importing, cleaning and exporting raw data
 
Correction
- Closed #24 by adding repaint()for filling in missing
data by lookup
- Added recent()for sensible centuries for dates
Connection
- Closed #45 by adding use_qData()for setting up tests,
documentation, and lazy-loading of cleaned data
- Added template for documenting cleaned data
- Added template for testing cleaned data
 
qDatr 0.0.4
Collection
- Closed #25 by adding create_qpackage()
- Fixed ‘pushrelease.yml’ postfix bug
Correction
- Closed #26 by adding interleave()
- Closed #23 by adding resequence()
- Closed #30 by adding recollect()
qDatr 0.0.3
Package
- Closed #18 by setting up {lintr},{goodpractice}, and{spelling}in
prchecks.yml
Correction
- Closed #29 by adding rearrange()
- Closed #31 by adding reunite()
qDatr 0.0.2
Package
- Added a NEWS.mdfile to track changes to the
package.
Correction
- Added transmutate()for merging variables
qDatr 0.0.1
Package
Correction
- Added entitle()for standardising treaty titles,
etc.