Simple Tools for Examining and Cleaning Dirty Data
Append a totals column to a data.frame.
Append a totals row to a data.frame.
Add presentation formatting to a crosstabulation table.
Add underlying Ns to a tabyl displaying percentages.
Format a data.frame of decimals as percentages.
Convert a data.frame of counts to percentages.
Round the numeric columns in a data.frame.
Add column name to the top of a two-way tabyl.
Append a totals row and/or column to a data.frame.
Add tabyl attributes to a data.frame.
Apply stats::chisq.test to a two-way tabyl.
Cleans names of an object (usually a data.frame).
Do the data.frames have the same columns & types?
Generate a comparison of data.frames (or similar objects) that indicat...
Convert many date and datetime formats as may be received from Microso...
Convert string values to true NA values.
Generate a crosstabulation of two vectors.
Describe the class(es) of an object
Convert dates encoded as serial numbers to Date class.
Find the header row in a data.frame.
Apply stats::fisher.test to a two-way tabyl.
Get rows of a data.frame with identical values for the specified var...
Find the list of columns that have a 1:1 mapping to each other
Deprecated Functions in Package janitor
janitor
Cleans a vector of text, typically containing the names of an object.
Constant to help map from mu to u
Pipe operator
Remove constant columns from a data.frame or matrix.
Removes empty columns from a data.frame.
Removes empty rows from a data.frame.
Remove empty rows and/or columns from a data.frame or matrix.
Round a numeric vector; halves will be rounded up, ala Microsoft Excel...
Round to the nearest fraction of a specified denominator.
Elevate a row to be the column names of a data.frame.
Convert a SAS date, time or date/time to an R object
Round a numeric vector to the specified number of significant digits; ...
Ensure that a vector has only a single value throughout.
Generate a frequency table (1-, 2-, or 3-way).
Generate a frequency table of a factor grouped into top-n, bottom-n, a...
Remove tabyl attributes from a data.frame.
Returns first non-NA value from a set of vectors.
The main janitor functions can: perfectly format data.frame column names; provide quick counts of variable combinations (i.e., frequency tables and crosstabs); and explore duplicate records. Other janitor functions nicely format the tabulation results. These tabulate-and-report functions approximate popular features of SPSS and Microsoft Excel. This package follows the principles of the "tidyverse" and works well with the pipe function %>%. janitor was built with beginning-to-intermediate R users in mind and is optimized for user-friendliness.
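The three main tasks described above (cleaning column names, counting variable combinations, and exploring duplicate records) can be sketched roughly as follows. This is a minimal illustration, not an excerpt from the package documentation: the data frame `raw` and its values are invented for the example, and it assumes janitor is installed.

```r
library(janitor)

# Hypothetical messy data, invented purely for illustration.
raw <- data.frame(
  `First Name` = c("Ann", "Bob", "Ann", "Cy"),
  `Dept.`      = c("ops", "ops", "ops", "hr"),
  check.names  = FALSE
)

# 1. Format column names: "First Name" and "Dept." become snake_case.
clean <- clean_names(raw)
names(clean)  # "first_name" "dept"

# 2. Quick counts: a one-way frequency table of dept.
counts <- tabyl(clean, dept)

# 3. Explore duplicate records: rows sharing the same first_name and dept.
dupes <- get_dupes(clean, first_name, dept)

# Formatting a two-way tabulation with the adorn_* helpers:
# add a totals row, convert counts to row percentages, format them,
# then append the underlying Ns.
report <- clean %>%
  tabyl(dept, first_name) %>%
  adorn_totals("row") %>%
  adorn_percentages("row") %>%
  adorn_pct_formatting(digits = 0) %>%
  adorn_ns()

print(report)
```

The adorn_* calls are chained in this order because the percentages are computed from the counts (including the totals row) before being formatted as text; adorn_ns() then recovers the original counts from the tabyl's stored attributes.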