PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Bobra.0491s0005.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Elliptochloris clade; Botryococcus
Family SBP
Protein Properties Length: 1041aa    MW: 108366 Da    PI: 4.9053
Description SBP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Bobra.0491s0005.1.pgenomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1SBP994e-31137214178
                          --SSTT-----TT--HHHHHTT--HHHHT-S-EEETTEEEEE-TTTSSEEETTT--SS--S-STTTT-------S--- CS
                  SBP   1 lCqvegCeadlseakeyhrrhkvCevhskapvvlvsgleqrfCqqCsrfhelsefDeekrsCrrrLakhnerrrkkqa 78 
                          +CqvegCe +l+++  yhrr+++Ce+h k  v + +g++qrfCqqC+r h+l+ f+ +krsCr+rL++hn+rrrkk++
  Bobra.0491s0005.1.p 137 ICQVEGCERELASLSPYHRRYRICEMHFKLDVFVRDGRRQRFCQQCGRCHDLNAFEGSKRSCRNRLNQHNARRRKKNP 214
                          5**************************************************************************975 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1041 aa     Download sequence    
MAQRTASDWQ AEEGVAALGT QETGNRGEGE AETRDAPGND AEATDALLSL AAPRGTLDPA  60
TAETWKYRLT PPQRPTRPTA TSVATGSIQR PSSDAMATTA EETPEQAGQP ALEEQPAPTT  120
MQGRRGPYRR STLDHMICQV EGCERELASL SPYHRRYRIC EMHFKLDVFV RDGRRQRFCQ  180
QCGRCHDLNA FEGSKRSCRN RLNQHNARRR KKNPAGQQGN GAQQGDKIEG GQYVGGKRGG  240
GEMNSGSKRR ATAEAAAFPT WGLGSGGASA FTPQGGARTA QVGKSGASAF GPPGRVDGDP  300
TRRGQSSEAS EGGLGQDDGG LDPGLAAPGM TGHPGTLGAN DSPSGAGVAA MYPGMMAAGM  360
SPGSNPLAAQ IPLAGFPPGA LGAHAGLWHP AAGLPGLPSN DPAALALQAD MMSAQMAALG  420
GHLGASMGIP PNAQHTQTVQ MLQAQLLRYQ YMLHAMVSGS WPGAAGNTAQ LGLEGLGFPP  480
MSAGLLGMPP GDPMLLGLGQ LPGMGAISEV PADGEQQPAT SGPPTGDLAA FAPGGSAEGV  540
TDGTAATSVS PAGLLEGAVV QGEVVNPGDE SNSVPNATMT PSTTAPTWRP KEEDPNLPEG  600
EIPQAGPGTF AAPHAIDGQV ATPGIGEVGA GLHSGVAALD PAAQILAPPP LGVDPSGIEG  660
SEVKLEEQAA PVMTPDPGVA LGEAIPPLPV AAGMEYAGQA DVERVAVKVY GRTPENMSDE  720
ERQKFTEWIS TAPVMTDATT KAGCVHMTWR AFLAPHDAGN LRENGARAFV EYLISHPDKQ  780
LSEQSVLLYI GGKSAFCRGG QLMWESELSL IPTVRDLRTS EDILAPVLPT DLPCISASYV  840
NGMFAAVIAQ VPGTLSLELK GSGLHCEGLH VFCHDGSSVL NSTVTPPIRH CCELNANHTS  900
LHDSSSLMVD LDGLPSVGLC WLEAERGPYI GATYPLLVLP PGCESAAIEL GLVLQVINNK  960
QGQQQQQQQQ ELKHDFGASV LPWSSWAPSA RGLVSDFGLL LRTMPLDKRP VWWESVRGKL  1020
LGFATDYALD AVAEMMMCWR *
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1235241GGKRGGG
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G43270.12e-16SBP family protein