DSAM R package [Documentation]

checkFull

Check whether the sample set is full

dataSplit

Main function of data splitting algorithm

DP.initialSample

Initial sampling of DUPLEX

DP.reSample

Repeat sampling of DUPLEX

DUPLEX

'DSAM' - DUPLEX algorithm

getAUC

Get the AUC value between two datasets

getMax

Get the maximum of the output column from the original data set

getMean

Get the mean and standard deviation of the output column from the orig...

getMin

Get the minimum of the output column from the original data set

getSnen

Get sampling number of each SOM neuron

MDUPLEX

'DSAM' - MDUPLEX algorithm

par.default

Default parameter list

remainUnsample

Get the remaining unsampled data after SSsample

SBSS.P

'DSAM' - SBSS.P algorithm

selectData

Select specific split data

somCluster

Self-organizing map clustering

SOMPLEX

'DSAM' - SOMPLEX algorithm

SS

'DSAM' - SS algorithm

SSsample

Core function of SS sampling

standardise

Standardize data

TIMECON

'DSAM' - Time-consecutive algorithm


Provides six algorithms for splitting available data into training, test, and validation subsets with similar distributions, for use in hydrological model development. The dataSplit() function divides the data according to the requirements you specify; see par.default() for the parameters that control the split. The getAUC() function measures how similar the distributions of two data subsets are. For details on the data splitting algorithms, see Chen et al. (2022) <doi:10.1016/j.jhydrol.2022.128340> and Zheng et al. (2022) <doi:10.1029/2021WR031818>.
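The idea in the description above (split the data into three subsets, then check how similar their distributions are) can be illustrated with a minimal base R sketch. It does not call DSAM: split_consecutive() and rank_auc() are hypothetical helpers written for this example, the 60/20/20 proportions and the column names I and O are assumptions, and the rank-based (Mann-Whitney) AUC used here is a stand-in that may differ from how getAUC() is computed. An AUC near 0.5 suggests the two samples are similarly distributed.

```r
# Hypothetical helpers for illustration only; they are NOT part of DSAM.

# Split a data frame into three consecutive blocks, keeping row order.
split_consecutive <- function(data,
                              prop = c(train = 0.6, test = 0.2, valid = 0.2)) {
  stopifnot(abs(sum(prop) - 1) < 1e-8)
  n <- nrow(data)
  n_train <- floor(prop[["train"]] * n)
  n_test <- floor(prop[["test"]] * n)
  n_valid <- n - n_train - n_test  # remainder goes to validation
  labels <- rep(c("train", "test", "valid"),
                times = c(n_train, n_test, n_valid))
  split(data, factor(labels, levels = c("train", "test", "valid")))
}

# Rank-based AUC (Mann-Whitney): P(x > y) + 0.5 * P(x == y).
# Assumption: a stand-in similarity measure, not DSAM's getAUC().
rank_auc <- function(x, y) {
  r <- rank(c(x, y))  # ties receive average ranks
  n_x <- length(x)
  n_y <- length(y)
  (sum(r[seq_len(n_x)]) - n_x * (n_x + 1) / 2) / (n_x * n_y)
}

# Toy data: one input column (I) and one output column (O).
set.seed(1)
toy <- data.frame(I = rnorm(100), O = rnorm(100))

parts <- split_consecutive(toy)
auc_train_test <- rank_auc(parts$train$O, parts$test$O)
print(auc_train_test)
```

With the package installed, dataSplit() and getAUC() would take the place of these helpers; their exact arguments should be taken from the package manual rather than from this sketch.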

Maintainer: Junyi Chen
License: MIT + file LICENSE
Last published: 2024-01-29

Useful links

DSAM 1.0.2 package
