rcldf1.5.1 package

Read Linguistic Data in the Cross Linguistic Data Format (CLDF)

add_dataframe

Adds a dataframe.

as.cldf.wide

Extracts a CLDF table as a 'wide' dataframe by resolving all foreign k...

cldf

Reads a Cross-Linguistic Data Format dataset into an object.

coalesce_truth

Coalesce value to truthiness

datatype_to_type

Map csvw datatypes to R types

default_dialect

CSVW default dialect

default_schema

Create a default table schema given a csv file and dialect

get_cache_dir

Returns the cache dir.

get_details

Returns a dataframe of with details on the CLDF dataset in path.

get_dir_size

Returns the filesize in bytes of a directory.

get_filename

Get a filename from url value in metadata (handles .zip files)

get_from_zenodo

Downloads and installs a CLDF dataset from a Zenodo endpoint

get_separators

Identifies the separator characters specified by the CLDF metadata.

get_table_from

Extracts a single table from a CLDF dataset.

get_tablename

Convert a CLDF URL tablename to a short tablename

is_github

Returns TRUE if url looks like a github URL

is_url

Returns TRUE if url looks like a URL

list_cache_files

Returns a dataframe of directories in the cache dir

load_clts

Returns a CLDF dataset object of the latest CLTS version.

load_concepticon

Returns a CLDF dataset object of the latest Concepticon version.

load_glottolog

Returns a CLDF dataset object of the latest glottolog version.

make_cache_key

Returns the cachekey for the given path.

nullify

Converts all values specified in the CLDF metadata as null to R's `N...

override_defaults

Override defaults

print.cldf

Summarises the CLDF file

rcldf-package

rcldf: Read Linguistic Data in the Cross Linguistic Data Format (CLDF)

read_bib

Adds BibTeX source information into a CLDF dataset

relabel

Relabels a column in a dataset for merging.

resolve_path

Helper function to resolve the path (e.g. directory or md.json file)

separate

Expands all values with separators.

set_cache_dir

Sets the cache dir for the current session.

summary.cldf

Summarises the CLDF file

Cross-Linguistic Data Format (CLDF) is a framework for storing cross-linguistic data, ensuring compatibility and ease of data exchange between different linguistic datasets see Forkel et al. (2018) <doi:10.1038/sdata.2018.205>. The 'rcldf' package is designed to facilitate the manipulation and analysis of these datasets by simplifying the loading, querying, and visualisation of CLDF datasets making it easier to conduct comparative linguistic analyses, manage language data, and apply statistical methods directly within R.

  • Maintainer: Simon J. Greenhill
  • License: Apache License (>= 2.0)
  • Last published: 2025-09-30