R DATASETS

Find sample data sets available in R packages

commune_names

Communes in Poland

rgugik

gene_expression_data

Gene expression data for patients suffering from breast cancer

idiffomix

World_Bank_Codes

World_Bank_Codes

fedmatch

items_cbs

Commodity balance sheet items

whep

HollywoodMovies2013

Hollywood Movies - 2013

Lock5Data

recid

recid

wooldridge

true_beta_3node

The true beta matrix (6 by 6) used in simulation.

pompom

chicago

Labor market and demographic data for employed Hispanic workers in met...

GeneralOaxaca

multitrait

Example Cross object from R/QTL with multiple traits

qtl

bcnt.emapw

Benthic count data for the western United States

bio.infer

all.abundance

K-mer abundances

ICAMS

corporate.payment

Corporate payments of a West Coast utility company - 2010

benford.analysis

masses

MS masses: a dataset containing approx. 150,000 MS1 precursor masses

prozor

finch2

Subset of data set finch

dynRB

ex11.42

R Data set: ex11.42

Devore7

Earthquake

Earthquake

spherepc

data.sda6

Dataset SDA6 (Jurich & Bradshaw, 2014)

CDM

corr_re

Correlation index

scRNAtools

nomogram_shaps

Nomogram SHAP values using categorical predictors and binary outcome

rmlnomogram

terraclimate_data

TerraClimate zonal statistics data

brclimr

irps_books

One-mode undirected network of co-purchased books about US politics on...

manynet

cultures

Cultures pairwise dissimilarities

pald

vign_trees_4

Tree data for vignette, version 4

BerkeleyForestsAnalytics

csu_ci5_mean

Cancer registry data

Rcan

fig3.18a

Figure 3.18a in "Applied Time Series Analysis with R, 2nd edition" by ...

tswge

Election1999

Presidential Election 1999

SLPresElection

Forbes2000

The Forbes 2000 Ranking of the World's Biggest Companies (Year 2004)

HSAUR3

Example

ID example dataset.

MantaID

example_data

Example Dataset for the mixedbiastest Package

mixedbiastest

apartment_apps

Apartment Apps

ExamPAData

college_grad_students

The Economic Guide To Picking A College Major

fivethirtyeight

rmvm

A multivariate normal dataset for data mining

asbio

sim

Simulation data.

IDSA

abvd

ABVD's Language identifiers

lingtypology

PSID1976

Panel Study of Income Dynamics 1976 Extract

HeckmanStan

generic_fhd_bootstraps

Bootstrap samples of generic FHDs of 25 seabird species

stochLAB

RMS_dat

ECLS-K (2011) Sample Dataset for Demonstration

nlpsem

df_om_source

Data source for df_om

multibias

Cardiological.CR

Cardiological Interval Data Set (Centre and Range)

iRegression

data_Orme_2009_2

data_Orme_2009_2

SIMPLE.REGRESSION

example.geno

Example datasets for pedgene

pedgene

dsq23_7_5

Dataset for Equation 23.7.5

aprean3

mmu_subset

Mouse stomach and intestine scRNA-seq data, microwell-seq Subset to 50...

scGOclust

tile

Control factor array and summary statistics for Ina tile experiment

daewr

dat.crede2010

Studies on the Relationship between Class Attendance and Grades in Col...

metadat

DC1912dels

Numbers of delegates for the individual states and groups

GmooG

SUGARCANETerrain

Sugar Cane terrain requirement for land evaluation

ALUES

kinshipdelta

Kinship Terms

smacof