This dataset contains 999 coding sequences from the Escherichia coli chromosome
data
data(ec999)
Format
List of 999 vectors of characters, one for each coding sequence.
ECFOLE.FOLE: chr [1:672] "A" "T" "G" "C" ...
ECMSBAG.MSBA: chr [1:1749] "A" "T" "G" "C" ...
ECNARZYW-C.NARV: chr [1:681] "A" "T" "G" "A" ...
...: ... TRUNCATED ...
XYLEECOM.MALK: chr [1:1116] "A" "T" "G" "G" ...
XYLEECOM.LAMB: chr [1:1341] "A" "T" "G" "A" ...
XYLEECOM.MALM: chr [1:921] "A" "T" "G" "A" ...
References
Lobry, J.R., Gautier, C. (1994) Hydrophobicity, expressivity and aromaticity are the major trends of amino-acid usage in 999 Escherichia coli chromosome-encode genes. Nucleic Acids Research,22 :3174-3180.
citation("seqinr")
Examples
data(ec999)## How to export sequences in a FASTA file:#fname <- tempfile(pattern ="ecc999", tmpdir = tempdir(), fileext ="ffn")tempdir(check =FALSE)write.fasta(ec999, names(ec999), file = fname)