MC_baseline_distribution function

Function to compute baseline heterocitation values for the graph under study with random permutation of corpus attributions

This function performs Monte Carlo runs with random permutations of corpus tags in the graph provided and computes the heterocitation balance on each permuted graph. The permutation is repeated over several iterations (set through the "rep" argument) and yields baseline Dx values for the graph topology considered. These can then be compared with the Dx value obtained for the original graph to evaluate whether it could merely be the result of chance (see significance_Dx).
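The permutation idea described above can be illustrated with a minimal base R sketch. It does not use the package: the toy edge list, the corpus tags "A" and "B", and the helper names cross_share and permutation_baseline are all hypothetical, and the statistic (share of citations crossing corpus boundaries) is only a stand-in, not the package's Dx formula.

```r
# Toy citation network: each row is one citation (from -> to), nodes are 1..6.
edges <- data.frame(
  from = c(1, 1, 2, 3, 4, 4, 5, 6, 6, 2),
  to   = c(2, 3, 3, 1, 5, 6, 6, 4, 1, 5)
)

# Corpus tag of each node (hypothetical labels).
corpus <- c("A", "A", "A", "B", "B", "B")

# Stand-in statistic: share of citations that cross corpus boundaries.
# This is NOT the package's Dx formula, only a simple placeholder.
cross_share <- function(edges, corpus) {
  mean(corpus[edges$from] != corpus[edges$to])
}

# Monte Carlo baseline: shuffle the tags across nodes while keeping the
# topology (the edge list) fixed, and recompute the statistic each time.
permutation_baseline <- function(edges, corpus, rep = 20) {
  vapply(
    seq_len(rep),
    function(i) cross_share(edges, sample(corpus)),
    numeric(1)
  )
}

set.seed(1)
observed <- cross_share(edges, corpus)
baseline <- permutation_baseline(edges, corpus, rep = 200)

# Visual comparison of the observed value against the permuted values.
hist(baseline, main = "Baseline under permuted corpus tags")
abline(v = observed, lty = 2)
```

Because only the tags are shuffled, every permuted graph has the same citation structure as the original, so any difference between the observed value and the baseline reflects how the tags are arranged rather than the topology.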

MC_baseline_distribution(gr, labels, infYearLimit, supYearLimit, rep = 20)

Arguments

  • gr: Graph file (created with build_graph)
  • labels: List of the names of the two corpora studied (e.g. c("Computer Science", "Mathematics")), present in the "Corpus" attribute
  • infYearLimit: Minimum year considered in this study
  • supYearLimit: Maximum year considered in this study
  • rep: Number of Monte Carlo iterations

Returns

This function currently plots histograms of the distribution of Dx values generated through random permutations of corpus tags among the records. Returns a list containing:

  • Dx1: Dx value for corpus 1 per iteration

  • Dx2: Dx value for corpus 2 per iteration

  • DxALL: Global Dx value per iteration
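As a rough illustration of how per-iteration values of this kind might be compared with an observed value, the sketch below computes a two-sided empirical p-value. Everything in it is an assumption: the baseline numbers are simulated placeholders rather than package output, the observed value is invented, the helper empirical_p is hypothetical, and the approach assumes the baseline is centred on 0. It is not necessarily the method used by significance_Dx.

```r
set.seed(42)

# Placeholder baseline: in practice this would be one element of the returned
# list (e.g. the global per-iteration values). These numbers are simulated.
baseline_global <- rnorm(20, mean = 0, sd = 0.05)

# Hypothetical observed value for the original graph.
observed_global <- -0.30

# Share of permuted values at least as extreme as the observed one.
# The +1 terms count the observed graph itself, so the result is never 0.
empirical_p <- function(observed, baseline) {
  (sum(abs(baseline) >= abs(observed)) + 1) / (length(baseline) + 1)
}

p <- empirical_p(observed_global, baseline_global)
print(p)
```

With rep = 20 the smallest attainable value is 1/21, so a larger number of iterations is needed if finer resolution is wanted.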

Author(s)

Christian Vincenot (christian@vincenot.biz)

See Also

significance_Dx, heterocitation

  • Maintainer: Christian Vincenot
  • License: GPL (>= 2)
  • Last published: 2020-04-19
