PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Vocar.0042s0082.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Volvocaceae; Volvox
Family SBP
Protein Properties Length: 2763aa    MW: 287586 Da    PI: 6.4393
Description SBP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Vocar.0042s0082.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1SBP79.26.1e-2565142178
                          --SSTT-----TT--HHHHHTT--HHHHT-S-EEETTEEEEE-TTTSSEEETTT--SS--S-STTTT-------S--- CS
                  SBP   1 lCqvegCeadlseakeyhrrhkvCevhskapvvlvsgleqrfCqqCsrfhelsefDeekrsCrrrLakhnerrrkkqa 78 
                          +Cq++gC adls +++y+rr++vCe+h +++vv + g+e rfC qCs fh l  fD  +r+Cr++L++   +rr +++
  Vocar.0042s0082.1.p  65 RCQADGCMADLSGLRRYFRRYHVCETHIRSQVVHIGGREVRFCDQCSTFHPLAFFDGVRRTCRDKLEQNRMKRRARKQ 142
                          6*****************************************************************998766665443 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:4.10.1100.103.0E-2462127IPR004333Transcription factor, SBP-box
PROSITE profilePS5114123.05963140IPR004333Transcription factor, SBP-box
SuperFamilySSF1036121.14E-2265141IPR004333Transcription factor, SBP-box
PfamPF031103.0E-2566139IPR004333Transcription factor, SBP-box
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 2763 aa     Download sequence    Send to blast
MDNDNPMRTL PSSEQPGAEQ PSDEHRSSSF QAQQQRHNQS IQEGDLRPSV MASATPVVQD  60
TGANRCQADG CMADLSGLRR YFRRYHVCET HIRSQVVHIG GREVRFCDQC STFHPLAFFD  120
GVRRTCRDKL EQNRMKRRAR KQQNANHAGA ITAAALAGAA AAASGNNNGG GGRSDDVDSD  180
DQPANGGGPA AAAGGDDAGP AVRAQGGHSL ARKRQSGTLA QGARRLQPSA RSVAAPDVTK  240
AVKVQRATTG QLGAARAEVW SPESADEEPD MQTPGPGDAG GGHHEDADWG AADMMTREAE  300
MGLGGALGAT LAAADGRDTG FGPAAATATA TAKRGRPWDP VGLAPDWPRV NHQDLRDQEQ  360
LQVAYGAVRD GAMYSPRGQL HGHLGQQGPQ VQVQVRAQTV KVQPLQLVPP LSQHQQQHQH  420
QQHQQHQQRL PMSEWEMETG GVIALERLPQ GAVQGTWRLH GGSAPNLPQE LPASSPKWAP  480
GGDARATWQS VPAIIVPASG TQDSHALPDK SMHDGGMRLA AVGGPTNNGG YQVHVRTPHA  540
ANGAGYPAPN TLGDPGCARD GRVGLPSRVA DQLITYKVDG EGAPAVTLLA QILRTEPEFR  600
QQLPRTSQAG PEPMSPPKIT LHNGGGSGSA DMLAGAAEGL TPAQQAALRL KLDAGLLASP  660
PNATTAAPQP QPPSSVSASA VPGPNMPKQW AQVQDDAPLQ QQRHHHQPLV RHELPGYGTS  720
RPGGQAATVW AGNGAAPQGS VWVTPDGTRF GAPLAAYPAP VGLGPQAVLI DRRISAGSSV  780
YTADEEAALA MEDVTSLQME AERAAIGTHN DALQQPIEEA PSLLDTPSFR DGIARDNADA  840
QMWEARRELQ KQQQQVQIVV SFPNHQNQRQ QLQGFDGEHR VTLLEHMQKQ EAFLEVVQQQ  900
QQEQHLGSQQ QHHSHMLEQH PRGQQHGLRQ NPANSLSHVD RWQQQQQHPQ QLEQQQPASR  960
VQVQLGSSAP PQNVFGKLCD EQRPVYLPSA FRMHVPEDLD ADSDYLQPQP QHAGIAAGAR  1020
ASGRGPVAAV GRPAPLGGIG GGGGQRPPSP APPQPHSWAG ASSGWRYSNG DSGGAPAMLE  1080
AQPQQQSASA IGYGTQQHMQ QLGDSLAGVA PSDQELILST LRKFAAHPPR AVKTAGGAHQ  1140
QQQQQPDSMG LSPSMPQGRG PQDRGGDSGS GGSSGCGGTL HMLRAEEGRA TQQQSYMQQQ  1200
QQRQPPPPQQ QPQQPAHNAA AGGEQLNLRR ESSYNDNING GPGSAEGLQQ VLASEGLQSM  1260
DSLAISEALL DGRGSLQLLD SVVFAQAQDA WVNTDTNPAA ASGADCGPAA MQGGPPTPPP  1320
HGSDGFTFGQ GGRVAGEGER WMGSTSDSQV HARSWQQQGA AAGEGFSGEE WVELPGGGPG  1380
MRVWERHQEQ QHRHVQQHER GPSHGGMFPA TERRPQHQQQ QQQQQGYRPQ SWNERHHHHH  1440
HQQQQQQQQE GLRDGCRPGH ALYTSPYDIN RPLFGGRVTA EGSGGGGGGQ TLSHVALKLS  1500
HARPDSLPME LVERMQEAAR GGGAAVLPAT LREGCVEVLV EVMHSCPTSQ LCNKLFGFPE  1560
ASGGASSALD LELDSALATA SGEQLRAWVG PHVFDSCERV TMQLLRGPVL DCKPGAPPKL  1620
RRWEAAMQDA GLGFAAHWPE VLMLCSGPPV LVADAVNAQQ IQLYGPAELA SPQGVVQLCV  1680
RMAGGAGSLG TVEAVAVDPE DGSSAPIPAD QASTAAPAAV AEGWLQPTTS TTSCWAAARR  1740
SPSPPVWPLA GPLSGLKAPV PGATSPALFR TVNASAGATN RLADLVAMAP RPTEQLKASP  1800
SLATGPCPFY CFAVEEVSEA EGPSSLGMSS RELCVRKGTP PAAGSAPGCT KGTVSETTEH  1860
PRSCFLTAPL PSSLPPSPPV AQQRQHRHQP GCLYNTIATA AATVARQPFS QLLLQQREQP  1920
RQQLQLCCRP PTFHCRSKSD DGCMTLCGSN DSGGRHAGSP SPRVPALASK GESLGRHSTP  1980
SAASHVCAGQ PYGGGGAPLV SVTSATTGLV HCLGPLSRRV SLCSSATLTT AMTLHDRLSC  2040
RSSATTAFTL GTTATLYDRL SLRSEWSWTS NMTSFGGMTA GAACGGGGIN DGGVGGAAPY  2100
SRTNEPQVSE PTYSVGASFG CTDQSLGPAH CARTARDDSK LEEENELMVS TPRRVAGPPA  2160
SCSSPRAVTA ASVAAQPYSS NSSDDASRSG GCCLRTGRLL DSAMVGLRPL RLQLPVLPCA  2220
GLALLEATGG HAVAASWPLL VVETVSQRNE LLALWRELAD AGREDVWREL AADLGLVLGN  2280
LPRRASVCDE PDPYLRSVPA AAAADRAFSS SALGSGEDFG FGGHGASAAV APLRWRSLGT  2340
PDRARDLLPG CYLNSGSLVS IESASQESWG TASATPRLAT AGQAVNTGPV SCMATLVHQR  2400
SSSSSDGDGS GSDVSRVGLP GSATSGASGD SGTSAVCVAD QHTGAGLVAA ATAAAGVGAV  2460
YLAAGAPARS RDGPDAATVD GSHAVRGAGA AAGAPVSAFS SHMATTSFGV GVVGVGGVEE  2520
AQEEGEGKGE QLQLPAVGRT CMLQRGDEVS LASGLAVIRG GCCDCDGEDD LHIASLTSFL  2580
GDLRMTAGRR ESSSDGRGRD DDAAARREGP GLGDGRRSNS SSQESASVLA AVESGTRAET  2640
GDVAVGKGEQ RKIGTAPKEV PGPPGLALPS PSPSLPVPPL AAVGLSAEAR LALQAAVSRV  2700
RRCIDARRLP ALWELLSRVA VDARTGGNTE EEESTSEVGG TAVLGPDEEA AAAVAAVAAA  2760
AV*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1ul4_A1e-1461138683squamosa promoter binding protein-like 4
Search in ModeBase
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002952986.10.0hypothetical protein VOLCADRAFT_93794
TrEMBLD8U3290.0D8U329_VOLCA; Uncharacterized protein
STRINGXP_002952986.10.0(Volvox carteri)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
ChlorophytaeOGCP2012101
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G27360.41e-17squamosa promoter-like 11