refseqR
workshop
2024-11-27
“On protein synthesis” - at University College London for the Society for Experimental Biology (1957).
Builds upon rentrez
Imports IRanges, Biostrings
Input: any character string object, passed as the first argument
Output: objects designed for interoperability and integration with Bioconductor
IRanges object with 1 range and 0 metadata columns:
                     start       end     width
                 <integer> <integer> <integer>
  NM_001030018.2        30       434       405
# A tibble: 1 × 4
  caption      moltype sourcedb slen
  <chr>        <chr>   <chr>    <chr>
1 NM_001030018 rna     refseq   667
AAStringSet object of length 1:
    width seq                                               names
[1]   134 MADSELQLVEQRIRSFPDFPTPG...IQKDALEPGQRVVVVDDLLATGV NP_001025189
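The three outputs above can be cross-checked with a little arithmetic. The base R sketch below assumes that NP_001025189 is the protein product of NM_001030018.2 and that the reported CDS range includes the stop codon; the variable names are illustrative and are not part of refseqR.

```r
# CDS coordinates copied from the IRanges output (1-based, inclusive)
cds_start <- 30L
cds_end   <- 434L
transcript_len <- 667L                   # slen from the tibble output

cds_width <- cds_end - cds_start + 1L    # inclusive range, so add 1
in_frame  <- cds_width %% 3L == 0L       # a complete CDS is a multiple of 3
n_codons  <- cds_width %/% 3L            # codons, including the stop codon
aa_len    <- n_codons - 1L               # assumption: the last codon is a stop

cat("CDS width:", cds_width, "nt\n")
cat("Codons:", n_codons, "| expected protein length:", aa_len, "aa\n")
cat("CDS fits inside transcript:", cds_end <= transcript_len, "\n")
```

Under these assumptions the computed CDS width agrees with the width column of the IRanges output, and the expected protein length agrees with the width reported in the AAStringSet output.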
Get the protein IDs from a set of transcript accessions:
Fetch the amino acid (AA) sequences for a set of 100 protein IDs:
refseq_AAseq()
mean time (5,000 MC) = 67.40673 sec
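The two steps above (transcript accessions to protein IDs, then protein IDs to AA sequences) could be chained roughly as follows. This is a minimal sketch, not the benchmark procedure behind the timing above: `fetch_protein_seqs()` is a hypothetical helper, and it assumes that `refseq_RNA2protein()` takes one transcript accession as its first argument and that `refseq_AAseq()` accepts a character vector of protein IDs. The two lookup functions are passed in as arguments so that the helper can be tried offline with stand-ins.

```r
# Hypothetical helper (not part of refseqR): map transcripts to protein IDs,
# fetch their sequences, and record the wall-clock time of the fetch.
fetch_protein_seqs <- function(transcripts,
                               rna2protein = refseqR::refseq_RNA2protein,
                               aaseq = refseqR::refseq_AAseq) {
  # One lookup per transcript; keep the first hit, or NA if nothing is returned
  protein_ids <- vapply(
    transcripts,
    function(x) as.character(rna2protein(x))[1],
    character(1),
    USE.NAMES = FALSE
  )
  # Single-run elapsed time in seconds for the sequence fetch
  elapsed <- system.time(seqs <- aaseq(protein_ids))[["elapsed"]]
  list(protein_ids = protein_ids, sequences = seqs, seconds = elapsed)
}

# Example call (needs refseqR installed and network access to NCBI):
# res <- fetch_protein_seqs("NM_001030018.2")
# res$protein_ids
# res$sequences
```

A single `system.time()` call gives one measurement only; repeated runs would be needed to estimate a mean time like the one reported above.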
Functions that apply to GeneID accessions:
refseq_GeneID()
refseq_description()
Functions that apply to transcript accessions:
refseq_GeneID()
refseq_description()
refseq_CDScoords()
refseq_CDSseq()
refseq_RNA2protein()
Functions that apply to protein accessions:
refseq_GeneID()
refseq_description()
refseq_AAmol_wt()
refseq_AAlen()
refseq_AAseq()
refseq_protein2RNA()