coauth_network function

Creating Co-Authorship Network with Different Measures for Weights

Creating Co-Authorship Network with Different Measures for Weights

This function creates an edge list for co-authorship networks from a data frame with a list of entities and their publications. The weight of edges can be calculated following different methods. The nodes of the network could be indifferently authors, institutions or countries.

coauth_network( dt, authors, articles, method = c("full_counting", "fractional_counting", "fractional_counting_refined"), cosine_normalized = FALSE )

Arguments

  • dt: The data frame with authors (or institutions or countries) and the list of documents they have published.
  • authors: The column name of the source identifiers, that is the authors (or institutions or countries).
  • articles: The column name of the documents identifiers.
  • method: Method for calculating the edges weights, to be chosen among "full_counting","fractional_counting" or "fractional_counting_refined".
  • cosine_normalized: Possibility to take into account the total number of articles written by two linked authors and to normalize the weight of their link using Salton's cosine.

Returns

A data.table with the authors (or institutions or countries) identifier in from and to columns, with a weight column whose values depend on the method chosen. It also keeps a copy of from and to in the Source and Target columns. This is useful is you are using the tidygraph package then, where from and to values are modified when creating a graph.

Details

Weights can be calculated with:

  1. the "full_counting" method: the linkds between authors correspond to their absolute number of collaborations.
  2. the "fractional_counting" method which takes into account the number of authors in each article, following if(!exists(".Rdpack.currefs")) .Rdpack.currefs \<-new.env();Rdpack::insert_citeOnly(keys="perianes-rodriguez2016b",package="biblionetwork",cached_env=.Rdpack.currefs) equation:
k=1Maik.ajknk1 \sum_{k = 1}^{M} \frac{a_{ik}.a_{jk}}{n_{k}-1}

with M the total number of articles, aik.ajka_{ik}.a_{jk}

which takes 1 if author i and j have co-written the article k, and nkn_{k} the number of authors for article k. 3. the fractional_counting_refined method, inspired by if(!exists(".Rdpack.currefs")) .Rdpack.currefs \<-new.env();Rdpack::insert_citeOnly(keys="leydesdorff2017",package="biblionetwork",cached_env=.Rdpack.currefs)

which is similar to fractional_counting but which is formalised in a way that allows the sum of weights to equal the number of articles in the corpus:

k=1Maik.ajk.2nk.(nk1) \sum_{k = 1}^{M} \frac{a_{ik}.a_{jk}.2}{n_{k}.(n_{k}-1)}

.

In addition, it is possible to take into account the total number of collaborations of two linked authors. If cosine_normalized is set to True, the weight calculated with one of the three methods above is divided by Ci.Cj\sqrt{C_{i}.C_{j}}, with CiC_{i} being the number of articles co-written by author i.

Examples

library(biblionetwork) coauth_network(Authors_stagflation, authors = "Author", articles = "ItemID_Ref", method = "fractional_counting")

References

if(!exists(".Rdpack.currefs")) .Rdpack.currefs <-new.env();Rdpack::insert_all_ref(.Rdpack.currefs)