NEWS.md
inspectdf 0.0.112021-04-02inspect_types() for pairwise comparison plotsinspect_types() for pairwise comparisonsvdiffr 1.0.0 for graphics testsinspect_cat() plots when number of categories is small. Thanks to Ivo Kwee for the suggestion.inspect_num() histogram bins are correct in comparisons of data frames. Thanks to Roel Verbelen for the report.inspectdf 0.0.102021-02-20include_int option in inspect_cat() to allow treatment of integer columns as categorical.pval in the resulting output.inspect_cat() respect any filtering or sorting of the summary output prior to show_plot(). Thanks to Roel Verbelen for the report.inspect_type() comparison of two dataframes to make it easier to see which columns and types differ.inspectdf 0.0.82020-06-25show_plot argument has been removed from all inspect_*() functions. To generate visualisations of data frame summaries, please use the more flexible show_plot(inspect_*()) or via the pipe inspect_*() %>% show_plot().show_plot() improvement that nudges points that might otherwise have coincided for dataframe comparisons of imbalance (for example, with inspect_imb(df1, df2) %>% show_plot())inspect_cor(), inspect_na() and `inspect.inspect_cor() slight speed up for dataframes with large numbers of columns.inspect_cor() can be filtered prior to plotting, for example inspect_cor(starwars) %>% filter(abs(corr) > 0.2) %>% show_plot(). Thanks to Roel Verbelen for the suggestion
inspect_imb() to fail on certain types of factor columns. Thanks to Roel Verbelen for the report.show_plot() has new arguments label_size, label_angle and label_color. Each provide adjustments to text annotation where applicable. Thanks to Bartosz Bursa for the suggestion.coord_flip() works on resulting plots. Thanks to Roel Verbelen for the report.inspectdf 0.0.72019-11-05bytes column to inspect_mem() output, for downstream numeric comparison and consistency with inspectpd.pcnt_nna column to inspect_cor() output containing the percentage of pairwise complete observations used calculated correlations. Thanks to Theo Broekman for the suggestion.inspect_ statements to be incorrect. Thanks to the report from Theo Broekman.inspect_num().inspectdf 0.0.62019-09-29inspect_* functions now returns results by group grouped dataframes.inspect_num() %>% show_plot() to show histograms with color palettes specified by the col_palette argument.inspect_imb() to sometimes fail when factors present. Thanks to Doug Friedman for the report.inspectdf 0.0.52019-08-26inspect_num() to fail when columns contained all NA values. Thanks to Ryan Tanner for the report
inspect_cor() for large data frames with many numeric columns.method = 'kendall' and method = 'spearman' in inspect_cor().inspectdf 0.0.42019-07-27inspect_na() %>% show_plot() to fail when 0 NA present. Thanks to the report by Metin Yazici.show_plot() now returns a ggplot2 object rather than printing the plot - thanks to Garrick Aden-Buie for the suggestion.inspect_cat plotting by avoiding text labels for small regions.tech dataset.inspect_cat() plots when labels are empty strings. By default "" will be shown. Thanks to Michael Swenson for the report
inspect_cor(method = ...) argument added, thanks to suggestion from George Dontas. Options for pearson, spearman and kendall. Note that confidence intervals and tests currently only supported for pearson.inspect_cat() & inspect_imb().inspectdf 0.0.32019-06-27text_labels autoscale size using ggfittext::geom_fit_text(). For an example see inspect_cat(). Thanks to David Wilkins for the PR.show_plot() via col_palette argument. Colorblind friendly option specified via show_plot(col_palette = 1) - thanks to Richard Careaga for the suggestion.inspect_imb().
include_na option for categorical columns that are 100% missing, or constant are underlined in plot for easier comprehension.inspect_cor()
inspect_cor() comparison plots - makes it easier to see smaller differences in correlations.NA correlations omitted from inspect_cor() comparison when plotted. Ordering of correlations reversed to be consistent with returned tibble.inspectdf 0.0.22019-05-23show_plot() function (show_plot argument in inspect_ functions will be dropped in a future version)high_cardinality argument in show_plot() for combining unique or near-unique categories for plotting inspect_cat().progress bars shown when processing larger datasets