wrProteo1.13.0 package

Proteomics Data Analysis Functions

AAmass

Molecular mass for amino-acids

AucROC

AUC from ROC-curves

cleanListCoNames

Selective batch cleaning of sample- (ie column-) names in list

combineMultFilterNAimput

Combine Multiple Filters On NA-imputed Data

convAASeq2mass

Molecular mass for amino-acids

corColumnOrder

Order Columns In List Of Matrixes, Data.frames And Vectors

countNoOfCommonPeptides

Compare in-silico digested proteomes for unique and shared peptides, c...

dot-atomicMasses

Molecular mass for Elements

dot-checkKnitrProt

Checking presence of knitr and rmarkdown

dot-checkSetupGroups

Additional/final Check And Adjustments To Sample-order After readSampl...

dot-commonSpecies

Get Matrix With UniProt Abbreviations For Selected Species As Well As ...

dot-extrSpecPref

Extract Additional Information To Construct The Colum 'SpecType'

dot-imputeNA

Basic NA-imputaton (main)

dot-plotQuantDistr

Generic Plotting Of Density Distribution For Quantitation Import-funct...

exportAsWombatP

Export As Wombat-P Set Of Files

exportSdrfDraft

Export Sample Meta-data from Quantification-Software as Sdrf-draft

extractTestingResults

Extract Results From Moderated t-tests

extrSpeciesAnnot

Extract species annotation

foldChangeArrow2

Add arrow for expected Fold-Change to VolcanoPlot or MA-plot

fuseProteomicsProjects

Combine Multiple Proteomics Data-Sets

getUPS1acc

Accession-Numbers And Names Of UPS1 Proteins

inspectSpeciesIndic

Inspect Species Indictaion Or Group of Proteins

isolNAneighb

Isolate NA-neighbours

massDeFormula

Molecular mass from chemical formula

matrixNAinspect

Histogram of content of NAs in matrix

matrixNAneighbourImpute

Imputation of NA-values based on non-NA replicates

plotROC

Plot ROC curves

razorNoFilter

Filter based on either number of total peptides and specific peptides ...

readAlphaPeptFile

Read (Normalized) Quantitation Data Files Produced By AlphaPept

readDiaNNFile

Read Tabulated Files Exported by DIA-NN At Protein Level

readDiaNNPeptides

Read Tabulated Files Exported by DiaNN At Peptide Level

readFasta2

Read File Of Protein Sequences In Fasta Format

readFragpipeFile

Read Tabulated Files Exported by FragPipe At Protein Level

readIonbotPeptides

Read Tabulated Files Exported by Ionbot At Peptide Level

readMassChroQFile

Read tabulated files imported from MassChroQ

readMaxQuantFile

Read Quantitation Data-Files (proteinGroups.txt) Produced From MaxQuan...

readMaxQuantPeptides

Read Peptide Identification and Quantitation Data-Files (peptides.txt)...

readOpenMSFile

Read csv files exported by OpenMS

readProlineFile

Read xlsx, csv or tsv files exported from Proline and MS-Angel

readProtDiscovererPeptides

readProtDiscovererPeptides, depreciated

readProtDiscovFile

Read Tabulated Files Exported By ProteomeDiscoverer At Protein Level, ...

readProtDiscovPeptides

Read Tabulated Files Exported by ProteomeDiscoverer At Peptide Level, ...

readProteomeDiscovererFile

Read Tabulated Files Exported By ProteomeDiscoverer At Protein Level

readProteomeDiscovererPeptides

Read Tabulated Files Exported by ProteomeDiscoverer At Peptide Level

readSampleMetaData

Read Sample Meta-data from Quantification-Software And/Or Sdrf And Ali...

readSdrf

Read proteomics meta-data as sdrf file

readUCSCtable

Read annotation files from UCSC

readUniProtExport

Read protein annotation as exported from UniProt batch-conversion

readWombatNormFile

Read (Normalized) Quantitation Data Files Produced By Wombat At Protei...

removeSampleInList

Remove Samples/Columns From list of matrixes

replMissingProtNames

Complement Missing EntryNames In Annotation

shortSoftwName

Get Short Names of Proteomics Quantitation Software

summarizeForROC

Summarize statistical test result for plotting ROC-curves

test2grp

t-test each line of 2 groups of data

testRobustToNAimputation

Pair-wise testing robust to NA-imputation

VolcanoPlotW2

Deprecialed Volcano-plot

writeFasta2

Write sequences in fasta format to fileThis function writes sequences ...

Data analysis of proteomics experiments by mass spectrometry is supported by this collection of functions mostly dedicated to the analysis of (bottom-up) quantitative (XIC) data. Fasta-formatted proteomes (eg from UniProt Consortium <doi:10.1093/nar/gky1049>) can be read with automatic parsing and multiple annotation types (like species origin, abbreviated gene names, etc) extracted. Initial results from multiple software for protein (and peptide) quantitation can be imported (to a common format): MaxQuant (Tyanova et al 2016 <doi:10.1038/nprot.2016.136>), Dia-NN (Demichev et al 2020 <doi:10.1038/s41592-019-0638-x>), Fragpipe (da Veiga et al 2020 <doi:10.1038/s41592-020-0912-y>), ionbot (Degroeve et al 2021 <doi:10.1101/2021.07.02.450686>), MassChroq (Valot et al 2011 <doi:10.1002/pmic.201100120>), OpenMS (Strauss et al 2021 <doi:10.1038/nmeth.3959>), ProteomeDiscoverer (Orsburn 2021 <doi:10.3390/proteomes9010015>), Proline (Bouyssie et al 2020 <doi:10.1093/bioinformatics/btaa118>), AlphaPept (preprint Strauss et al <doi:10.1101/2021.07.23.453379>) and Wombat-P (Bouyssie et al 2023 <doi:10.1021/acs.jproteome.3c00636>. Meta-data provided by initial analysis software and/or in sdrf format can be integrated to the analysis. Quantitative proteomics measurements frequently contain multiple NA values, due to physical absence of given peptides in some samples, limitations in sensitivity or other reasons. Help is provided to inspect the data graphically to investigate the nature of NA-values via their respective replicate measurements and to help/confirm the choice of NA-replacement algorithms. Meta-data in sdrf-format (Perez-Riverol et al 2020 <doi:10.1021/acs.jproteome.0c00376>) or similar tabular formats can be imported and included. Missing values can be inspected and imputed based on the concept of NA-neighbours or other methods. Dedicated filtering and statistical testing using the framework of package 'limma' <doi:10.18129/B9.bioc.limma> can be run, enhanced by multiple rounds of NA-replacements to provide robustness towards rare stochastic events. Multi-species samples, as frequently used in benchmark-tests (eg Navarro et al 2016 <doi:10.1038/nbt.3685>, Ramus et al 2016 <doi:10.1016/j.jprot.2015.11.011>), can be run with special options considering such sub-groups during normalization and testing. Subsequently, ROC curves (Hand and Till 2001 <doi:10.1023/A:1010920819831>) can be constructed to compare multiple analysis approaches. As detailed example the data-set from Ramus et al 2016 <doi:10.1016/j.jprot.2015.11.011>) quantified by MaxQuant, ProteomeDiscoverer, and Proline is provided with a detailed analysis of heterologous spike-in proteins.

  • Maintainer: Wolfgang Raffelsberger
  • License: GPL-3
  • Last published: 2024-11-18