PsychWordVec2025.11 package

Word Embedding Research Framework for Psychological Science

as_embed

Word vectors data class: wordvec and embed.

cosine_similarity

Cosine similarity/distance between two vectors.

data_transform

Transform plain text of word vectors into wordvec (data.table) or `e...

data_wordvec_load

Load word vectors data (wordvec or embed) from ".RData" file.

data_wordvec_subset

[S3 method] Extract a subset of word vectors data.

dict_expand

Expand a dictionary from the most similar words.

dict_reliability

Reliability analysis and PCA of a dictionary.

get_wordvec

Extract word vector(s).

most_similar

Find the Top-N most similar words.

normalize

Normalize all word vectors to the unit length 1.

orth_procrustes

Orthogonal Procrustes rotation for matrix alignment.

pair_similarity

Compute a matrix of cosine similarity/distance of word pairs.

plot_network

Visualize a (partial correlation) network graph of words.

plot_similarity

Visualize cosine similarity of word pairs.

plot_wordvec_tSNE

Visualize word vectors with dimensionality reduced using t-SNE.

plot_wordvec

Visualize word vectors.

PsychWordVec-package

PsychWordVec: Word Embedding Research Framework for Psychological Scie...

reexports

Objects exported from other packages

sum_wordvec

Calculate the sum vector of multiple words.

tab_similarity

Tabulate cosine similarity/distance of word pairs.

test_RND

Relative Norm Distance (RND) analysis.

test_WEAT

Word Embedding Association Test (WEAT) and Single-Category WEAT.

tokenize

Tokenize raw text for training word embeddings.

train_wordvec

Train static word embeddings using the Word2Vec, GloVe, or FastText al...

An integrative toolbox of word embedding research that provides: (1) a collection of 'pre-trained' static word vectors in the '.RData' compressed format <https://psychbruce.github.io/WordVector_RData.pdf>; (2) a group of functions to process, analyze, and visualize word vectors; (3) a range of tests to examine conceptual associations, including the Word Embedding Association Test <doi:10.1126/science.aal4230> and the Relative Norm Distance <doi:10.1073/pnas.1720347115>, with permutation test of significance; and (4) a set of training methods to locally train (static) word vectors from text corpora, including 'Word2Vec' <doi:10.48550/arXiv.1301.3781>, 'GloVe' <doi:10.3115/v1/D14-1162>, and 'FastText' <doi:10.48550/arXiv.1607.04606>.

  • Maintainer: Han Wu Shuang Bao
  • License: GPL-3
  • Last published: 2025-11-30