suba logo
AT1G44910.1
Subcellular Consensus
(Prediction and Experimental)

min: heatmap :max

.
SUBAcon:
nucleus 1.000
ASURE: nucleus
What is SUBAcon?
What is ASURE?
SUBAcon computations
Experimental Localisations and PPI
FP MS/MS PPI
SUBAcon links
AGI-AGI relationships
Coexpression PPI
no PPI data
Description (TAIR10) protein_coding : pre-mRNA-processing protein 40A
Curator
Summary (TAIR10)
Binds the carboxyl-terminal domain (CTD) of the largest subunit of RNA polymerase II and functions as a scaffold for RNA processing machineries.
Computational
Description (TAIR10)
pre-mRNA-processing protein 40A (PRP40A); CONTAINS InterPro DOMAIN/s: FF domain (InterPro:IPR002713), WW/Rsp5/WWP (InterPro:IPR001202); BEST Arabidopsis thaliana protein match is: pre-mRNA-processing protein 40B (TAIR:AT3G19670.1); Has 113757 Blast hits to 61097 proteins in 2531 species: Archae - 390; Bacteria - 9660; Metazoa - 51291; Fungi - 12468; Plants - 6867; Viruses - 622; Other Eukaryotes - 32459 (source: NCBI BLink).
Protein Annotations
eggNOG:COG5104eggNOG:KOG0152EMBL:AC007915EMBL:AC020576EMBL:AK316856EMBL:CP002684EnsemblPlants:AT1G44910
EnsemblPlants:AT1G44910.1entrez:841057Gene3D:1.10.10.440GeneID:841057Genevisible:B6EUA9GO:GO:0000398GO:GO:0003723
GO:GO:0005685GO:GO:0005829GO:GO:0006351GO:GO:0006355GO:GO:0016592GO:GO:0071004hmmpanther:PTHR11864
hmmpanther:PTHR11864:SF0HOGENOM:HOG000030620InParanoid:B6EUA9InterPro:IPR001202InterPro:IPR002713iPTMnet:B6EUA9KEGG:ath:AT1G44910
KO:K12821ncoils:CoilOMA:TKPEEMMPaxDb:B6EUA9Pfam:B6EUA9Pfam:PF00397Pfam:PF01846
Pfscan:PS50020Pfscan:PS51676PhylomeDB:B6EUA9PRIDE:B6EUA9PRO:PR:B6EUA9PROSITE:PS50020PROSITE:PS51676
ProteinModelPortal:B6EUA9Proteomes:UP000006548RefSeq:NP_001117438.1RefSeq:NP_175113.2SMART:SM00441SMART:SM00456SMR:B6EUA9
STRING:3702.AT1G44910.1SUPFAM:SSF51045SUPFAM:SSF81698TAIR:AT1G44910tair10-symbols:ATPRP40Atair10-symbols:PRP40AUniGene:At.14888
UniGene:At.28268UniProt:B6EUA9
Coordinates (TAIR10) chr1:+:16975930..16982818
Molecular Weight (calculated) 109388.00 Da
IEP (calculated) 6.43
GRAVY (calculated) -1.15
Length 958 amino acids
Sequence (TAIR10)
(BLAST)
001: MANNPPQSSG TQFRPMVPGQ QGQHFVPAAS QPFHPYGHVP PNVQSQPPQY SQPIQQQQLF PVRPGQPVHI TSSSQAVSVP YIQTNKILTS GSTQPQPNAP
101: PMTGFATSGP PFSSPYTFVP SSYPQQQPTS LVQPNSQMHV AGVPPAANTW PVPVNQSTSL VSPVQQTGQQ TPVAVSTDPG NLTPQSASDW QEHTSADGRK
201: YYYNKRTKQS NWEKPLELMT PLERADASTV WKEFTTPEGK KYYYNKVTKE SKWTIPEDLK LAREQAQLAS EKTSLSEAGS TPLSHHAASS SDLAVSTVTS
301: VVPSTSSALT GHSSSPIQAG LAVPVTRPPS VAPVTPTSGA ISDTEATTIK GDNLSSRGAD DSNDGATAQN NEAENKEMSV NGKANLSPAG DKANVEEPMV
401: YATKQEAKAA FKSLLESVNV HSDWTWEQTL KEIVHDKRYG ALRTLGERKQ AFNEYLGQRK KVEAEERRRR QKKAREEFVK MLEECEELSS SLKWSKAMSL
501: FENDQRFKAV DRPRDREDLF DNYIVELERK EREKAAEEHR QYMADYRKFL ETCDYIKAGT QWRKIQDRLE DDDRCSCLEK IDRLIGFEEY ILDLEKEEEE
601: LKRVEKEHVR RAERKNRDAF RTLLEEHVAA GILTAKTYWL DYCIELKDLP QYQAVASNTS GSTPKDLFED VTEELEKQYH EDKSYVKDAM KSRKISMVSS
701: WLFEDFKSAI SEDLSTQQIS DINLKLIYDD LVGRVKEKEE KEARKLQRLA EEFTNLLHTF KEITVASNWE DSKQLVEESQ EYRSIGDESV SQGLFEEYIT
801: SLQEKAKEKE RKRDEEKVRK EKERDEKEKR KDKDKERREK EREREKEKGK ERSKREESDG ETAMDVSEGH KDEKRKGKDR DRKHRRRHHN NSDEDVSSDR
901: DDRDESKKSS RKHGNDRKKS RKHANSPESE SENRHKRQKK ESSRRSGNDE LEDGEVGE
See Also
Citation
If you find this resource useful please cite one of the following publications:

Hooper CM, Castleden I, Tanz SK, Aryamanesh, and Millar, AH (2017) SUBA4: the interactive data analysis centre for Arabidopsis subcellular protein locations Nucleic Acids Res. Jan 4;45(D1):D1064-D1074. doi: 10.1093/nar/gkw1041 (PubMed)

Hooper CM, Tanz SK, Castleden IR, Vacher MA, Small ID, Millar AH (2014) "SUBAcon: a consensus algorithm for unifying the subcellular localization data of the Arabidopsis proteome. Bioinformatics." 1;30(23):3356-64. (Bioinformatics) (PubMed)