tidyfst1.8.2 package

Tidy Verbs for Fast Data Manipulation

add_prop

Add percentage to counts in data.frame

arrange_dt

Arrange entries in data.frame

as_fst

Save a data.frame as a fst table

bind_rows_dt

Bind multiple data frames by row

bind_tf_idf_dt

Compute TF–IDF Using data.table with Optional Counting and Grouping

col_max

Get the column name of the max/min number each row

complete_dt

Complete a data frame with missing combinations of data

count

Count observations by group

cummean

Cumulative mean

distinct_dt

Select distinct/unique rows in data.frame

dt

Short cut to data.table

dummy_dt

Fast creation of dummy variables

filter_dt

Filter entries in data.frame

fst_io

Read and write fst files

fst

Parse,inspect and extract data.table from fst file

group_by

Group by variable(s) and implement operations

group_dt

Data manipulation within groups

import_fst_chunked

Read a fst file by chunks

impute_dt

Impute missing values with mean, median or mode

join

Join tables

lag_lead

Fast lead/lag for vectors

longer

Pivot data from wide to long

missing

Dump, replace and fill missing values in data.frame

mutate_vars

Conditional update of columns in data.table

mutate

Mutate columns in data.frame

nest

Nest and unnest

nth

Extract the nth value from a vector

object_size

Nice printing of report the Space Allocated for an Object

pairwise

Count pairs of items within a group

pkg_load

Load or unload R package(s)

print_options

Set global printing method for data.table

pull_dt

Pull out a single variable

rec

Recode number or strings

reexports

Objects exported from other packages

relocate_dt

Change column order

rename_dt

Rename column in data.frame

replace_dt

Fast value replacement in data frame

round0

Round a number and make it show zeros

rownames

Tools for working with row names

sample

Sample rows randomly from a table

select

Select column from data.frame

separate_dt

Separate a character column into two columns using a regular expressio...

setops

Set operations for data frames

slice

Subset rows using their positions

sql_join

Case insensitive table joining like SQL

summarise_dt

Summarise columns to single values

sys_time_print

Convenient print of time taken

t_dt

Efficient transpose of data.frame

tidymat

Conversion between tidy table and named matrix

uncount_dt

"Uncount" a data frame

unite_dt

Unite multiple columns into one by pasting strings together

utf8_encoding

Use UTF-8 for character encoding in a data frame

wider_dt

Pivot data from long to wide

A toolkit of tidy data manipulation verbs with 'data.table' as the backend. Combining the merits of syntax elegance from 'dplyr' and computing performance from 'data.table', 'tidyfst' intends to provide users with state-of-the-art data manipulation tools with least pain. This package is an extension of 'data.table'. While enjoying a tidy syntax, it also wraps combinations of efficient functions to facilitate frequently-used data operations.

  • Maintainer: Tian-Yuan Huang
  • License: MIT + file LICENSE
  • Last published: 2025-05-07