PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Vocar.0022s0024.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Volvocaceae; Volvox
Family SBP
Protein Properties Length: 2042aa    MW: 204622 Da    PI: 7.1807
Description SBP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Vocar.0022s0024.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1SBP76.15.6e-2441109169
                          --SSTT-----TT--HHHHHTT--HHHHT-S-EEETTEEEEE-TTTSSEEETTT--SS--S-STTTT-- CS
                  SBP   1 lCqvegCeadlseakeyhrrhkvCevhskapvvlvsgleqrfCqqCsrfhelsefDeekrsCrrrLakh 69 
                          +Cq++gC++dls+ak y++rh +C +h kap+v ++g++ r+CqqC++f +l+ f  + rsC+  L++ 
  Vocar.0022s0024.1.p  41 ACQAPGCTVDLSDAKPYFKRHSICATHMKAPQVWINGEKMRYCQQCGHFENLDMFIGANRSCKMSLDRR 109
                          5*************************************************************9999864 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:4.10.1100.107.2E-2536103IPR004333Transcription factor, SBP-box
PROSITE profilePS5114119.57239116IPR004333Transcription factor, SBP-box
SuperFamilySSF1036127.32E-2241110IPR004333Transcription factor, SBP-box
PfamPF031103.1E-2342111IPR004333Transcription factor, SBP-box
Gene3DG3DSA:1.25.40.203.1E-412551377IPR020683Ankyrin repeat-containing domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 2042 aa     Download sequence    Send to blast
MSASGFAHAQ WQAGPLEWNT VIMEAYDTRG AGKRRKARTS ACQAPGCTVD LSDAKPYFKR  60
HSICATHMKA PQVWINGEKM RYCQQCGHFE NLDMFIGANR SCKMSLDRRS AIAHASKRKE  120
LRSAKSRDQE GNQELSASDN TSSDKGERSG WKDLDSTWGS PEEMMRTESS RRNVLDPCLH  180
EADRPCPARD VDMALDSSHP QGSGGRLLSD RTAPFRNASN SNGLNVLPVL NDNGASSDNN  240
GHNNFNYQYS GSNPQVNGNN VVHLEPPSRC CPDVVMRDGG DDGGGGGGRG GSPEEGSSGH  300
MNFPWMRSLQ QHQQHHLQPG GPGAAGLPGL PSRTLIAFPS LHGGKPAAAA ATPHLTWDQQ  360
RQQRPEVLTA PLPAAAVTSS MYGGGIAAVA ASAASGNVTL NNATTGRLLA GASAPPQIFS  420
AAPPSVVVAL DPAELPGIHD PALERQLQDI VTSARLEGLG WYGRPPGSGT NMSPYLPTGN  480
EGASGFADVA GPPSQQAHYS LARGAPGPAG FDGSGGGGGG RHACTGSAAI GGGGSHQVRG  540
RGVAGAACTA AANGTAAAAA APSAPEPSLA TALWYHRQAA DDVGDVGGTA GGWVQPKSKP  600
GGQNGAASGS SGSSLETCMH YLSQQPPQQQ VPQHSYGYEY GDVGAGGSYG GGAASPATHG  660
PGSAGGRGHR AVGLGRGGAL RGLLGYRSGE PDQLVRMALK LTSRAPHELE PSTIASLRRM  720
LSVYDKLQSV QGYVRPGCTE LIVDAHRPGS TATERMMRIA AAAAAAGAGG GGGDGGDGGG  780
NNNNNAYTSG GAASDAAAAS SSSGNSSLST ATGGGLGCRF SLGGDALTTV SSSGGGGGGG  840
AVSAGSPSSS SWWYESDSGS CQGIRTTALA EAGLAVLKER LRPLPLDGEA AAAAAAAAVE  900
GRVEGGVRGR LGGLRALTLS RVLLERRSCE GGVVGAEAVL RSPDVLAALR DIATVNDTAL  960
TCQLGDMAVR VSEEGDVGDV QKLLALPTIV AASCAAISAD KPVTEVLVYG TNLERFGMAF  1020
WARMHGVCYQ LQFRTSRGGA VMLQLPALPQ VGLLQLEAVS PGGGAVGPSY PLLVLPSAGA  1080
VEEVHALARL MGPASLRRFI ADFGFVLGVG ALLSLPLNGI TPVKSGPSMS LGLGPESVSA  1140
SDSTSSAFNS AVNVDVQPGG GGTWGSTESA DVSCGNTGDA PAEETNSSSG QPAVAMAASQ  1200
ATAMAAAMAA AAAPLVMPMR RRGVLSRGGS RTSSRNTSPE NSGTRGLFSE RNDPTDAMTN  1260
ADDEQVVELL LSAGAVRSLL DTSVTLLHTL VERRMQHCIA LLVDRLSELG ERFPDRVHGG  1320
LAPPLAPGPA FVLMHAAARV GDLRLLMPLL ALEGLLGEHC SLTARGVNGV TPLHLIALLP  1380
QAPLLVRSLQ RLQPGLEQAL GTLEADDGVT PSQLFHMMHV AATTNDCAPT TATAGGGGGD  1440
GGGGGGGGGA SSSPPHQRNS HFAEGATVTA MGSWVLASSP ASSNYGSSPS PGRLPADMLL  1500
AAAAAGELDS PVSEAWSAQG AGIAAASSSS AGAAWRWRKA DGSIATTSIA AAAAAAATAS  1560
PHPLESLVRV ERREVQQPRA DDLRTGDSRG SSSSGGGGGD MEPIGGGGGT GGGDSRYLSP  1620
PLPVREAVAA VLISSSSCQG RTAVGPGSEQ QQQQQQQRRR QAALVPQYST APGSIVPAVL  1680
GTGGAAGGGG RERLDSVETV PQVCTFVHGS GGAAGAGAAG AGAASGIGAA TARLVGVTPG  1740
APGAAGPKGP RGDLPPSRPS SSSPLSFQQQ PGRNGPSWSA TATAKQGPSG TAIKVSGSEA  1800
PPGAPSPATA PQHPPQHQGR RSDTGEVPQG ERGPQGPVSG GCSGGGAVRP PTDPPLATSL  1860
ATPAAALARR GAGGAAAAAE GLGETEGAIT GTTTTTTTTT TTTTTATTNL SAALPGETCG  1920
ESVEREASKV ATATASRSLV RTAAARATLL VAALGVVAAI AMQRVRRSSG AWRLPFSALY  1980
GSGDGDDDGN AAAAIVSPLV ASLLAVLGVA APLLLLLLTA TRPRVRAKRV GGPAPGGEGD  2040
S*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1ul4_A1e-14421201184squamosa promoter binding protein-like 4
Search in ModeBase
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002957657.10.0hypothetical protein VOLCADRAFT_98744
TrEMBLD8UG650.0D8UG65_VOLCA; Uncharacterized protein
STRINGXP_002957657.10.0(Volvox carteri)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G33810.15e-17squamosa promoter binding protein-like 3