fdm2id0.9.9 package

Data Mining and R Programming for Beginners

STUMP

Classification using one-level decision tree

summary.apriori

Print summary of a classification model obtained by APRIORI

SVD

Singular Value Decomposition

SVM

Classification using Support Vector Machine

SVMl

Classification using Support Vector Machine with a linear kernel

SVMr

Classification using Support Vector Machine with a radial kernel

SVR

Regression using Support Vector Machine

SVRl

Regression using Support Vector Machine with a linear kernel

SVRr

Regression using Support Vector Machine with a radial kernel

textmining-class

Text mining object

TEXTMINING

Text mining

toggleexport

Toggle graphic exports

treeplot

Dendrogram Plots

TSNE

t-distributed Stochastic Neighbor Embedding

ADABOOST

Classification using AdaBoost

apriori-class

APRIORI classification model

APRIORI

Classification using APRIORI

augmentation

Duplicate and add noise to a dataset

BAGGING

Classification using Bagging

boosting-class

Boosting methods model

boxclus

Clustering Box Plots

CA

Correspondence Analysis (CA)

CART

Classification using CART

cartdepth

Depth

cartinfo

CART information

cartleafs

Number of Leafs

cartnodes

Number of Nodes

cartplot

CART Plot

cda-class

Canonical Disciminant Analysis model

CDA

Classification using Canonical Discriminant Analysis

closegraphics

Close a graphics device

compare.accuracy

Comparison of two sets of clusters, using accuracy

compare.jaccard

Comparison of two sets of clusters, using Jaccard index

compare.kappa

Comparison of two sets of clusters, using kappa

compare

Comparison of two sets of clusters

confusion

Confuion matrix

cookplot

Plot the Cook's distance of a linear regression model

correlated

Correlated variables

cost.curves

Plot Cost Curves

data.diag

Square dataset

data.gauss

Gaussian mixture dataset

data.parabol

Parabol dataset

data.target1

Target1 dataset

data.target2

Target2 dataset

data.twomoons

Two moons dataset

data.xor

XOR dataset

dataset-class

Training set and test set

dbs-class

DBSCAN model

DBSCAN

DBSCAN clustering method

distplot

Plot a k-distance graphic

em-class

Expectation-Maximization model

EM

Expectation-Maximization clustering method

evaluation.accuracy

Accuracy of classification predictions

evaluation.adjr2

Adjusted R2 evaluation of regression predictions

evaluation.fmeasure

F-measure

evaluation.fowlkesmallows

Fowlkes–Mallows index

evaluation.goodness

Goodness

evaluation.jaccard

Jaccard index

evaluation.kappa

Kappa evaluation of classification predictions

evaluation.msep

MSEP evaluation of regression predictions

evaluation.precision

Precision of classification predictions

evaluation.r2

R2 evaluation of regression predictions

evaluation

Evaluation of classification or regression predictions

evaluation.recall

Recall of classification predictions

exportgraphics

Open a graphics device

factorial-class

Factorial analysis results

FEATURESELECTION

Classification with Feature selection

filter.rules

Filtering a set of rules

frequentwords

Frequent words

general.rules

Remove redundancy in a set of rules

getvocab

Extract words and phrases from a corpus

GRADIENTBOOSTING

Classification using Gradient Boosting

HCA

Hierarchical Cluster Analysis method

intern.dunn

Clustering evaluation through Dunn's index

intern.interclass

Clustering evaluation through interclass inertia

intern.intraclass

Clustering evaluation through intraclass inertia

intern

Clustering evaluation through internal criteria

kaiser

Kaiser rule

KERREG

Kernel Regression

kmeans.getk

Estimation of the number of clusters for K-means

KMEANS

K-means method

knn-class

K Nearest Neighbours model

KNN

Classification using k-NN

LDA

Classification using Linear Discriminant Analysis

leverageplot

Plot the leverage points of a linear regression model

LINREG

Linear Regression

loadtext

load a text file

LR

Classification using Logistic Regression

MCA

Multiple Correspondence Analysis (MCA)

meanshift-class

MeanShift model

MEANSHIFT

MeanShift method

MLP

Classification using Multilayer Perceptron

MLPREG

Multi-Layer Perceptron Regression

model-class

Generic classification or regression model

NB

Classification using Naive Bayes

NMF

Non-negative Matrix Factorization

params-class

Learning Parameters

PCA

Principal Component Analysis (PCA)

performance

Performance estimation

plot.cda

Plot function for cda-class

plot.factorial

Plot function for factorial-class

plot.som

Plot function for som-class

plotavsp

Plot actual vs. predictions

plotcloud

Plot word cloud

plotclus

Generic Plot Method for Clustering

plotdata

Advanced plot function

plotzipf

Plot rank versus frequency

POLYREG

Polynomial Regression

predict.apriori

Model predictions

predict.boosting

Model predictions

predict.cda

Model predictions

predict.dbs

Predict function for DBSCAN

predict.em

Predict function for EM

predict.kmeans

Predict function for K-means

predict.knn

Model predictions

predict.meanshift

Predict function for MeanShift

predict.model

Model predictions

predict.selection

Model predictions

predict.textmining

Model predictions

print.apriori

Print a classification model obtained by APRIORI

print.factorial

Plot function for factorial-class

pseudoF

Pseudo-F

QDA

Classification using Quadratic Discriminant Analysis

query.docs

Document query

query.words

Word query

RANDOMFOREST

Classification using Random Forest

regplot

Plot function for a regression model

resplot

Plot the studentized residuals of a linear regression model

roc.curves

Plot ROC Curves

rotation

Rotation

runningtime

Running time

scatterplot

Clustering Scatter Plots

selectfeatures

Feature selection for classification

selection-class

Feature selection

som-class

Self-Organizing Maps model

SOM

Self-Organizing Maps clustering method

spectral-class

Spectral clustering model

SPECTRAL

Spectral clustering method

splitdata

Splits a dataset into training set and test set

stability

Clustering evaluation through stability

vectorize.docs

Document vectorization

vectorize.words

Word vectorization

vectorizer-class

Document vectorization object

Contains functions to simplify the use of data mining methods (classification, regression, clustering, etc.), for students and beginners in R programming. Various R packages are used and wrappers are built around the main functions, to standardize the use of data mining methods (input/output): it brings a certain loss of flexibility, but also a gain of simplicity. The package name came from the French "Fouille de Données en Master 2 Informatique Décisionnelle".

  • Maintainer: Alexandre Blansché
  • License: GPL-3
  • Last published: 2023-06-12