PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cre16.g683953.t1.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas
Family SBP
Protein Properties  Length: 2538 aa    MW: 248999 Da    pI: 6.8046
Description SBP family protein
Gene Model
Gene Model ID       Type    Source
Cre16.g683953.t1.1  genome  JGI
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    SBP     81.4   1.3e-25  43     118  1          76
                         --SSTT-----TT--HHHHHTT--HHHHT-S-EEETTEEEEE-TTTSSEEETTT--SS--S-STTTT-------S- CS
                 SBP   1 lCqvegCeadlseakeyhrrhkvCevhskapvvlvsgleqrfCqqCsrfhelsefDeekrsCrrrLakhnerrrkk 76 
                         +C+vegC +dl++a + h+r+++C++h kapv++v+g+  rfCqqCsrfh l ef+ + ++Cr  L+k  +r+r +
  Cre16.g683953.t1.1  43 RCKVEGCPSDLETAPKTHQRFRLCNTHIKAPVIAVDGVLSRFCQQCSRFHPLAEFERSNKTCRMMLEKNRARQRSN 118
                         6*****************************************************************9988888765 PP
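
The Start and End values in the table above (43 and 118) can be checked against the protein sequence. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive; the helper name `slice_1based` is hypothetical, and only the first 120 residues from the Sequence section are included.

```python
# First 120 residues of Cre16.g683953.t1.1, copied from the Sequence section.
N_TERM = (
    "MSETEPQHHPDEGAISHHSYPVITTKEPEDVGVGGRRAPAPLRCKVEGCPSDLETAPKTH"
    "QRFRLCNTHIKAPVIAVDGVLSRFCQQCSRFHPLAEFERSNKTCRMMLEKNRARQRSNAA"
)


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, both 1-based and inclusive (assumed convention)."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates out of range")
    return seq[start - 1:end]


# Domain coordinates taken from the Signature Domain table.
sbp_domain = slice_1based(N_TERM, 43, 118)
print(len(sbp_domain))
print(sbp_domain)
```

Under that assumption the slice is 76 residues long, which matches the HMM span of 1 to 76, and it begins with the same residues as the query row of the alignment.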

Protein Features
3D Structure
Database         Entry ID            E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:4.10.1100.10  1.0E-25   39     105  IPR004333    Transcription factor, SBP-box
PROSITE profile  PS51141             22.888    41     118  IPR004333    Transcription factor, SBP-box
SuperFamily      SSF103612           5.89E-24  42     120  IPR004333    Transcription factor, SBP-box
Pfam             PF03110             1.1E-25   44     117  IPR004333    Transcription factor, SBP-box
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 2538 aa
MSETEPQHHP DEGAISHHSY PVITTKEPED VGVGGRRAPA PLRCKVEGCP SDLETAPKTH  60
QRFRLCNTHI KAPVIAVDGV LSRFCQQCSR FHPLAEFERS NKTCRMMLEK NRARQRSNAA  120
SQRQQQQPQP LAAAGSGGQL PAGAAAFPAG LGLLPAGAAA AGAPAGALLA PAAAAGTVGG  180
GPSAAATLRV AGATTTHPYM LLLQQQYLLN RANALTAAAS AGAAAIGVGG GGGGLPGQQL  240
APLGLPITQQ QAVALPAAGA GSRAASLSGL GALLTLQQQQ AQQQLAIAAA LAAAGPLSHA  300
AAATSAAVAA RAANAAAGGP AGGATAADTP AVVGEAATDP AAAGGAAAAA VRDSSAVGST  360
RGTMGADAGA AAGDTGNRAA SDYSDRQDAD EHDQQTVAVG RGGGGASRAG SRRQPAAASV  420
AAGEDASNAA GATGTAEGAA SAAARGAVAG GGGLPAEATA APLDAASQSQ LAAAMQQLQQ  480
QHMQLLSLSR MLSSWEGRPL PPALAAAAAN AAAGLPLQGT IAGLGSLPAA AAATRRVSDA  540
TGAVSGAGLA AHELAAFTRI SAETPGGSGG GLGAAGGAAA GRLPPAALHQ LDGRAAAAAL  600
QHPPELQQHT SLTAAAAQAA ARQQQGHQQL LPSRQQSQSL LQPLQELDEM VMRQDSGGVS  660
PADAGGGAAA TAAVAMDVDE PPNGGGGRHS SRQQSPAVAA AAAAAAAAAA ALQVQQQRQQ  720
AQHQQAQQQP RRRPKSDPDT VMLSGSDSSE EAEALRASQQ HEQQQTQQLQ QQALAGETRV  780
KIEPADSGAQ TQPMTEQVTE GESSSAVAGM PTSSAPLRVR SAPAIGSLGG SAAGAAAGRS  840
DTPAAAPSSR LLAAVAPVHG GVVNNATAGT AAGDSLQRRP RGVSDQAGGY VQKVAAAADA  900
GAGAPSAVAA GGRLPSAASG ANLVAGSSPG GWSGAGSAPR GSSADGGSSV GSAERLQLLE  960
RGTSGSGASG TAGGTGGGGG GMFGTVPWLV PPGSSGGGGS AVAAAAAAAA LGGISGGADT  1020
AGARRGVRPL LPGRSFAAAD ALPAGGSPTG VLPAPADAVG AAAAAAAGAA GPTAGMYASA  1080
ECMLLPPAAG AAGPLAGPAN VPPTLGRRSE PAMETAAGSS KRRRGQSGTL DAEAAVIDTS  1140
MMTAAGVDEF DEAADEPYPG ALLALRMGPQ LARNQHPRRA MSAMEPAAGA WSVPVLAPVG  1200
AGSSGGGAAA GSAPVFDRRS LDVVSSSPSV AAAAAAAAGA RKTSAANSPR RGSSARSRQQ  1260
RQIMWPPQLL QPLPALQQLY ATGGGAAGGA VASPTGSSEL LAAELAGSGG GLASARLAMV  1320
GAAVDLGGMT ALPRAAGGSG HGAVLGASAS AYPPFGLAPP PQQQQHHQQQ QQQQQHHEQQ  1380
QQQQQQHLLQ SAGASAAAAA AGNDGHRALN PSALASARAA AAAATAAATA VKALTRRRSS  1440
EEHRAAALAA QQLPIPMELF EPVDVDVDEA LSMLAEGAGG SDEDPGVSNK PAGSAGVGGA  1500
GGAGGAAGAG GSSHSAVQQM DASYAQQAAA GAAADAATVD AAAAVAARHD SSSARRVGSR  1560
AGTHGQPQQG GSGGGGDALA ALAAEWGRDV AMDEEGAERH AMQPAGLAGS LGEGWPGAAA  1620
ASQQLQQPQQ QHQQQQQQQP VSSASVLFQQ LSLLQQQQQQ QQDVIRAQLS RLAPSSQQLQ  1680
QAFAASPAAS AGQQQPAPLP VPYQQTQPRP WGTVAWTAGA ASGSAGLFAR RSDESSLASR  1740
SQSQHQHQQS SDSTPGDTAA AAGGVRGGGS GPLVLLPSAT AAAQATQQSL HQQHQPQPHQ  1800
RQRQMTAGGM TLPAAVDSAQ HSPAQLQQLL TQARQLPSLQ SLHAQVQRQQ QLQSLAAATA  1860
LGALTEAQLQ RVTAGGGGGG EYEAASSRAA TPLGPGATGS TSPGTGAGAG VGADAGAPTS  1920
GRHQQQPPLA LPLPQLQAQP PQMHMLQPQD LQEVLGPADE NQGRMSSMSA DWLDNLLRQQ  1980
QEASRQLMVL QQQLQEQQRQ LQQHQQLLQR STVAQLQRQQ QQGPSAAQQQ AAAVITAAGA  2040
ATGTITAAMA QPDHGAGGAG AGGASFLSMP ATEAAAAADA SANSSARNGA EGGAYEVSLQ  2100
QMQQLRGSSS GVVDTSTTEQ QQLSAQAHAQ AQQAALQAQQ PPGQWGLPAA GGWIGVNFGP  2160
VGAAGAVPPR PATPTSADAV TAAAAAAVAA ATAAVAEASG AIAVEGRGQQ QLLLQHQRSR  2220
GPASAAAAAT RSPADPDQQQ QQQRQLAIPG LATSMLPQAG GPGGLLPLYA AGMASTPVDV  2280
EQGRRRRARD AAARGRVSAA SSETSSLGSS LAPAVGPGGG APVFGRTSAG AAGSSGRAAE  2340
SGLASSLQQQ QQYEGTDAAT WALQTQLLLQ HDQQQEHQQQ FHVQMQMLRG TPPGPYGAGA  2400
DVGGHSGMSP QHQDARQGSR QLLHQQQQPR SQLQLEGVLA TSPAAGLVLV SNSGLTGAAA  2460
AAATAEAEAQ AALHARDGGY TAAAGTTTGE LAGAAEAAQV AALVAATGAD AMAAQVVVDE  2520
GSGSQPRAAQ QPGEETM*
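
The listing above groups residues in blocks of ten and ends each full row with a position label. A minimal sketch for turning such a listing into one plain sequence string, assuming every non-letter character (spaces, position labels, the trailing `*`) is formatting rather than sequence; the function name `parse_sequence_block` is hypothetical.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join a numbered, 10-residue-grouped listing into one uppercase string."""
    rows = []
    for line in block.splitlines():
        # Keep letters only: drops spaces, position labels and the '*' marker.
        rows.append(re.sub(r"[^A-Za-z]", "", line))
    return "".join(rows).upper()


# Two rows copied from the listing (the first and the last) as a small example.
example = """\
MSETEPQHHP DEGAISHHSY PVITTKEPED VGVGGRRAPA PLRCKVEGCP SDLETAPKTH  60
GSGSQPRAAQ QPGEETM*
"""
seq = parse_sequence_block(example)
print(len(seq))
print(seq[:10], seq[-7:])
```

Running the same function on the full listing gives a string whose length can be compared with the stated protein length.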
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
1ul4_A  4e-14   35           116        1          83       squamosa promoter binding protein-like 4
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    729    735  PRRRPKS
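
The page does not state which counting convention the NLS coordinates follow. A minimal sketch for checking this, assuming the position labels in the Sequence section are correct, so that the row ending at residue 780 starts at residue 721; the constant names are hypothetical.

```python
# Row of the protein sequence that ends at position label 780.
ROW_START = 721  # 1-based position of the row's first residue (assumed from the labels)
ROW = "AQHQQAQQQPRRRPKSDPDTVMLSGSDSSEEAEALRASQQHEQQQTQQLQQQALAGETRV"
MOTIF = "PRRRPKS"  # NLS sequence from the table above

offset = ROW.find(MOTIF)  # 0-based offset inside the row, -1 if absent
if offset < 0:
    raise ValueError("motif not found in this row")

# Coordinates counted from 1 (inclusive).
start_1based = ROW_START + offset
end_1based = start_1based + len(MOTIF) - 1

# The same span counted from 0 (inclusive).
start_0based = start_1based - 1
end_0based = end_1based - 1

print("1-based:", start_1based, end_1based)
print("0-based:", start_0based, end_0based)
```

In this row the motif spans residues 730 to 736 when counted from 1, which is 729 to 735 when counted from 0. That suggests, but does not confirm, that the NLS table counts from 0.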
Cis-element
Source       Link
PlantRegMap  Cre16.g683953.t1.1
Annotation -- Protein
Source  Hit ID      E-value  Description
TrEMBL  A0A2K3CUV6  0.0      A0A2K3CUV6_CHLRE; Uncharacterized protein
Orthologous Group
Lineage               Orthologous Group ID  Taxa Number  Gene Number
Chlorophytae          OGCP2012101
Representative plant  OGRP9717230
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G50670.1  1e-17    SBP family protein