refseqR
workshop
2024-11-27

Francis Crick, “On protein synthesis”, lecture given at University College London for the Society for Experimental Biology (1957).
refseqR
Builds upon rentrez
Imports IRanges and Biostrings
Input (first argument) can be any character string object
Returns output designed for interoperability and integration with Bioconductor
refseqR package
IRanges object with 1 range and 0 metadata columns:
                     start       end     width
                 <integer> <integer> <integer>
  NM_001030018.2        30       434       405
# A tibble: 1 × 4
  caption      moltype sourcedb slen
  <chr>        <chr>   <chr>    <chr>
1 NM_001030018 rna     refseq   667
AAStringSet object of length 1:
    width seq                                               names
[1]   134 MADSELQLVEQRIRSFPDFPTPG...IQKDALEPGQRVVVVDDLLATGV NP_001025189
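The three outputs can be cross-checked against each other with simple arithmetic. The sketch below uses base R only and copies its numbers from the outputs shown here. It assumes that NP_001025189 is the protein encoded by NM_001030018.2, which the slides suggest but do not state.

```r
# Values copied from the console outputs on this slide.
cds_start <- 30L    # IRanges start for NM_001030018.2
cds_end   <- 434L   # IRanges end for NM_001030018.2
aa_width  <- 134L   # AAStringSet width for NP_001025189

# IRanges coordinates are inclusive, so width = end - start + 1.
cds_width <- cds_end - cds_start + 1L

# A complete CDS is a whole number of codons; the last codon is the stop
# codon and does not contribute an amino acid.
n_codons    <- cds_width %/% 3L
expected_aa <- n_codons - 1L

cds_width                # 405, the width reported by IRanges
cds_width %% 3L == 0L    # TRUE: the CDS length is a multiple of three
expected_aa == aa_width  # TRUE: 135 codons minus the stop codon = 134 residues
```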
Get the protein IDs from a set of transcript accessions:
Fetch the amino acid (AA) sequences for a set of 100 protein IDs:
refseq_AAseq()
mean time (5,000 MC) = 67.40673 sec
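A batch lookup of this kind can be sketched as a small wrapper that applies a one-accession function to a vector of accessions. This is an illustration, not the code used on the slide. `map_accessions()` and `stub_lookup()` are hypothetical helpers, and the stub replaces the NCBI query so the example runs offline. The commented lines at the end assume that `refseq_RNA2protein()` and `refseq_AAseq()` take a single accession as their first argument.

```r
# Apply a single-accession lookup function to each element of `ids`.
# A failed lookup (for example a network error) is recorded as NA
# instead of stopping the whole batch.
map_accessions <- function(ids, lookup) {
  out <- vapply(ids, function(id) {
    tryCatch(as.character(lookup(id))[1],
             error = function(e) NA_character_)
  }, character(1))
  names(out) <- ids
  out
}

# Hypothetical stand-in for a real lookup: a fixed table, no NCBI query.
# The transcript/protein pairing is the one suggested by the slides.
stub_lookup <- function(id) {
  tbl <- c(NM_001030018 = "NP_001025189")
  if (!id %in% names(tbl)) stop("unknown accession: ", id)
  tbl[[id]]
}

demo <- map_accessions(c("NM_001030018", "bad_id"), stub_lookup)
demo

# With refseqR installed and network access (signatures assumed):
# proteins <- map_accessions(transcripts, refseqR::refseq_RNA2protein)
# aa_seqs  <- lapply(proteins[!is.na(proteins)], refseqR::refseq_AAseq)
```

Passing the lookup function as an argument keeps the wrapper independent of the network, so it can be tested with a stub and timed separately with `system.time()`.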
refseqR
To apply to GeneID accessions:
refseq_GeneID()
refseq_description()
To apply to transcript accessions:
refseq_GeneID()
refseq_description()
refseq_CDScoords()
refseq_CDSseq()
refseq_RNA2protein()
refseqR
To apply to protein accessions:
refseq_GeneID()
refseq_description()
refseq_AAmol_wt()
refseq_AAlen()
refseq_AAseq()
refseq_protein2RNA()
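The grouping above can be written down as a small lookup, which helps when a script has to pick functions for mixed input. The sketch below is base R only. `accession_type()` and `applicable` are hypothetical names, and the prefix rules are an assumption based on the standard RefSeq conventions (NM_/XM_/NR_/XR_ for transcripts, NP_/XP_/YP_/WP_ for proteins, digits only for GeneIDs). Only the function names come from the lists on these slides, and none of them is called here.

```r
# Function names per accession type, as listed on the slides.
applicable <- list(
  GeneID     = c("refseq_GeneID", "refseq_description"),
  transcript = c("refseq_GeneID", "refseq_description", "refseq_CDScoords",
                 "refseq_CDSseq", "refseq_RNA2protein"),
  protein    = c("refseq_GeneID", "refseq_description", "refseq_AAmol_wt",
                 "refseq_AAlen", "refseq_AAseq", "refseq_protein2RNA")
)

# Classify one accession by its prefix (assumed RefSeq conventions).
accession_type <- function(id) {
  if (grepl("^[0-9]+$", id)) {
    "GeneID"
  } else if (grepl("^[NX][MR]_", id)) {
    "transcript"
  } else if (grepl("^[NXYW]P_", id)) {
    "protein"
  } else {
    NA_character_
  }
}

accession_type("NM_001030018.2")              # "transcript"
applicable[[accession_type("NP_001025189")]]  # the six protein-level functions
```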