geneHummus R package [Documentation]

accessions_by_spp

Compute the total number of accession proteins per species

accessions_from_spp

Extract the accession ids (XP accession) for a given organism

accessions_warning

Get acessions and organism for each protein identifier

archids_warning

Get architecture identifiers for the conserved domains

extract_proteins

Get the protein identifiers

filterArch_ids

Filter the protein architectures based on conserved domains

filterarchids_warning

Filter protein architectures based on conserved domains

geneHummus

genehummus: A pipeline to define gene families in Legumes and beyond

get_spp

Get the species name from the description sequence

getAccessions

Get the acessions ids and the organism for each protein identifier

getArch_ids

Get the potential architecture identifiers for the conserved domains

getArch_labels

Get the description label for a protein architecture identifier

getProteins_from_tax_ids

Get the RefSeq protein identifiers for the given taxonomic species

getProtlinks

Get the protein identifiers for a given architecture

getSparcleArchs

Get the electronic architecture for a conserved domain

labels_warning

Get description label for a protein architecture identifier

proteins_warning

Get RefSeq protein identifiers for the given taxonomic species

sizeIds

Build a list containing N elements per element list

Download source package Read PDF manual

A pipeline with high specificity and sensitivity in extracting proteins from the RefSeq database (National Center for Biotechnology Information). Manual identification of gene families is highly time-consuming and laborious, requiring an iterative process of manual and computational analysis to identify members of a given family. The pipelines implements an automatic approach for the identification of gene families based on the conserved domains that specifically define that family. See Die et al. (2018) <doi:10.1101/436659> for more information and examples.

Maintainer: Jose V. Die
License: MIT + file LICENSE
Last published: 2019-04-04

Useful links

geneHummus1.0.11 package

Functions

Readme

Datasets

Dependencies

Imports

Versions