PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Vocar.0003s0435.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Volvocaceae; Volvox
Family SBP
Protein Properties Length: 2020aa    MW: 203690 Da    PI: 6.8562
Description SBP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Vocar.0003s0435.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1SBP80.42.6e-2574152175
                          --SSTT-----TT--HHHHHTT--HHHHT-S-EEETTEEEEE-TTTSSEEETTT--SS--S-STTTT----....---S CS
                  SBP   1 lCqvegCeadlseakeyhrrhkvCevhskapvvlvsgleqrfCqqCsrfhelsefDeekrsCrrrLakhne....rrrk 75 
                          +Cqv+gC+  l++ k y++r+k+Ce h kap++ ++g+  r CqqC+rfhels f+ +kr+Cr +L++  +    rrrk
  Vocar.0003s0435.1.p  74 VCQVQGCNRSLQNSKLYYQRFKLCEDHLKAPAISIDGVLSRLCQQCGRFHELSAFEGKKRTCRAQLDRIRKakelRRRK 152
                          6***************************************************************998643211225554 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:4.10.1100.106.0E-2669136IPR004333Transcription factor, SBP-box
PROSITE profilePS5114121.18772149IPR004333Transcription factor, SBP-box
SuperFamilySSF1036121.31E-2274146IPR004333Transcription factor, SBP-box
PfamPF031101.7E-2675152IPR004333Transcription factor, SBP-box
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 2020 aa     Download sequence    Send to blast
MFRNDAEPST HPEWDPDAYT WDGVALTATR NPQQGSRSRT AVDASPYAEY LDADVPALMS  60
RPPVGVLSKA QRLVCQVQGC NRSLQNSKLY YQRFKLCEDH LKAPAISIDG VLSRLCQQCG  120
RFHELSAFEG KKRTCRAQLD RIRKAKELRR RKAGGSGGSE SPDPADADVR ESDADMKAEG  180
TDTVAMDAST AATRRRDANR GGGGRGSGSG GTSLNSRSSE LTSNGSGGGT GGGGGGGGRG  240
ATAAAAAATV TAGETVSGGR DGGIGSCSRG AKARSSQGVV DVWSQRDEAA AAATDVGVGD  300
GCGNVSDGSV LGAAAAQGMV VEQPAGHGLM APMPVLAVPY SHDHYHNHHY PHNYDHHHQQ  360
QQHDVFPSAV GRYHNGSGGA AAATESGGRS ANERGATNVD AAAGAAATAG STGSQIASKA  420
TGASVGADQR LAKQQQLLQQ LESVRQRKDT QKQQLQLLRS ELGLQPGGYG DGKGDGGSWL  480
PVASGRSTSW VGMTQHCGGT SSDPRVLGEQ LLPGAAPAAP FGPPRFLSSS ASSILASLEP  540
MARLARGSPH VGFGQPPQSY WSTGMGMEGG ADAMPRTAEH GSTVSTAMAA AAAAGFLGAP  600
GGGRAAAATH ARAAHGLTFG PGATTAIGVG GGGGADTSAR AGGLGAVPGG APILVGSSGG  660
GGGAGVEGGN VLGSGRPGTQ FYGSHYRDHQ LGAGVPSQSV VGLSGGEGGG YGRWLGDGNY  720
LPGPKQLQYQ DRKQSQQGQP DINTLRRQMQ TGGAAGGLLH AATTRPECTT LASEGELQGL  780
LEELASTAFT AIDKCGMAAL QSSCGGGRGG GAMTVGELAV AVPPEAAATA MMPGGIAFPG  840
EAGPRGATTM TGTATVGLAG TYGTDMMAKG HAMADDVEWA AVAAALESGS QVRLNRTTNT  900
ANTVQQAQQQ QQQLAHQLQS IESWGGPSAA ALAAGSGPDQ AHPGLQLFGD RNAGGGGSGG  960
SGGSGGGPPY IARDHMTRVT LKMMNCLPDE LPPNLGANLQ RWVSNRSAAE VLQACMRPGC  1020
LELLVDVVHV RSAHELAAAL LAEGRAEDAW TALKRVITHK LAADNEVVLE VCNLVLSATP  1080
GSPGGGVRIR TWEEAAEAFA VADGSASVVT HASAAVGPAG SSETAAMMTT TAAMGKEAAR  1140
SGTTDLAAAV IAAGMKGGPR GPSAVQAPAT AAPVPPLDSK RQSSQPVILD SSLVAVRSAT  1200
ALTFTLYGKG LSDDDVDILG RMSGQNLQIT TRKLLTTTPT TARTATAAAE HVSRIAVLVQ  1260
PVRQGGMLIL EPRRGRLLGG WWPLVVVPEP EVVGELNAMA QEVAVAANEA EAETAQLRSA  1320
GSMYGSTPGW LRDFLVDLGQ VLDWVGWKPL KESEGDPPSA PFSPVSGSCS GEVWGSGVAD  1380
SPTPTAMGVS DGAVVAGRTP GVGCADTVAA LPGFRAEAAG PQPIENHAFT AGTAALQPNR  1440
VLQAAAVHAP LAIGAAALMV PRRRGPMDRA PTAAAGASVE TYNLAGGDSG KQVQSPLYGV  1500
SHGSVSATPD GSWAVSYGSD MTAGGSRSVG EGVGAAGGGD GGCRLGSGAG ENSDEEDRNG  1560
GSAATARALA ALRDADAKLE LRLSFKACRL LSFCVVRGLV HTASLLWRLL QVQGMSPDSI  1620
HDHALYNGAG LLLCAVQSRS GPMVEELLNR DTDGSWAALH ALRLGPSGLS PLHVAATLPG  1680
GYDVGAVLLE RIRGGAQRWF LLRGACGGPT PAQLAARCGN DRLNTLAVEL VQRDVAAAAA  1740
AAAAAAAAAA VGSGVNANSG LGAGDGNESA GGRSHAGQRL QPAALAAERG FTDTRAWSSS  1800
WEQQVCHRQP YHYHQQVLPP PLDNQQQHHH RPDRGQVRSL GNLMEAGMPG RFGQGIIGAQ  1860
LLEAIAELGL LWIGPDTAMV ATDEAPVSAA AAAPQLEADV RYPEGRGSGG SGRGCGSGSG  1920
GGSSDSGGSD SGSSSDSGNG GSSVSDAGSG GGSGGSLVRQ RRPGALAPAA RPSDAVQPQA  1980
GGGADSDRSS GGFRRLKDLM KGFLYGSSGN GKAQSSQTR*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1ul4_A2e-15741521084squamosa promoter binding protein-like 4
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1130151KKRTCRAQLDRIRKAKELRRRK
2145151KELRRRK
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
ChlorophytaeOGCP2012101
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G43270.38e-18squamosa promoter binding protein-like 2