PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Vocar.0001s0492.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Volvocaceae; Volvox
Family SBP
Protein Properties Length: 2119 aa    MW: 213553 Da    pI: 6.7928
Description SBP family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
Vocar.0001s0492.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    SBP     78.4   1.1e-24  66     142  1          77
                          --SSTT-----TT--HHHHHTT--HHHHT-S-EEETTEEEEE-TTTSSEEETTT--SS--S-STTTT-------S-- CS
                  SBP   1 lCqvegCeadlseakeyhrrhkvCevhskapvvlvsgleqrfCqqCsrfhelsefDeekrsCrrrLakhnerrrkkq 77 
                          +Cqv+gC +dls  k y +r+ vCe h ka v +++g+e rfCqqC +f +l+ef+ ++rsC  r ++ n rrr ++
  Vocar.0001s0492.1.p  66 TCQVDGCGKDLSGEKAYLQRYSVCEGHFKADVSFLHGQEVRFCQQCNKFQDLREFEGARRSCAARSKDRNLRRRMQT 142
                          6************************************************************************9765 PP

Protein Features
3D Structure
Database         Entry ID            E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:4.10.1100.10  2.5E-24   60     127  IPR004333    Transcription factor, SBP-box
PROSITE profile  PS51141             22.059    64     141  IPR004333    Transcription factor, SBP-box
SuperFamily      SSF103612           4.19E-22  66     141  IPR004333    Transcription factor, SBP-box
Pfam             PF03110             1.2E-25   67     140  IPR004333    Transcription factor, SBP-box
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 2119 aa
MSSEGARGSS KAAWDSTAYS MDPYLGFARV QGADARKTGE ELRHASGHTD ITLLPKKSRK  60
ATRPSTCQVD GCGKDLSGEK AYLQRYSVCE GHFKADVSFL HGQEVRFCQQ CNKFQDLREF  120
EGARRSCAAR SKDRNLRRRM QTTLQKGSTA EELFAPEAIA LVTEQQHQHQ QQQQQQHQSG  180
SPGRSGGGQA GGGGSDDSGG GGGGRAGSGD ASMGGSAAFA RRRPQEVRGA SGGSEYSSEA  240
VGAAATLAVH SRAHPGGGHH HAVLGGGVPQ SRTLASSAAA APPPLMDSNP HQQLRRPTGS  300
GGNGGTSSSN PEAGGQEHLA GRGPTWMSTE KMGQSCDLDG GGGGGGANLP SHLAASLARR  360
SAASYMAVGN TGPGGGDGGP VPMQVDGFLS AASSKQYTPH GGGRQADGLL AAFEGDGGGA  420
ATSAAALREH DSAYAAKLSA TLGGPSSSSG GGGGGGGGGW AGPAALSLLD VFDNILPQQR  480
NQQQQPQQQH YPLGARQATP DLPPPQLQHQ RHSHSNSQQP PQPQQHQQVP AQRGGGTLLQ  540
HSSATGGSVM ASWGSGVGQR LQVLAAGGGS GAGGGGSGSG QYVGLHQQQR TLGGLALADD  600
DAAAALLEQL QGGGVGALAA AAREAGSLSG LLGPGGVRSG GGDIGGGGCG GGGTADGHTV  660
RAVLLQSQQQ QQPAGAMANL LSAASSSRHD DLSAMIKERR AALVRQQLLQ QQQQQRQQQQ  720
QQHHDHHNHH NQQQQHHLHG QLQGSDAGVL PLDQLTLLGQ PNGGGAFLLG LSHSEQAQLR  780
LLTAGGGGGG GNTAVLGEAA SAWAASSGLD DYYDVMPGYG MRGGGGGGGG VQGVLGGAGR  840
LLVSGGAGLG AAAGGSAAYG PAGELLGAAA RRYHVLGGPA LMTAQLLEQY ALGGGGGEDA  900
GPIDRPGFGG GGGGGGMNGL HGTGGRGLQQ HSAASRAGGG SSGDDGGGGG GRGFGRYDTG  960
DTAGAGFAPH GGGGGGGDRS NGPDGGGGGG GGGSSRLLSV LFPGAGGRED ALAARVCLKL  1020
FSCVPDDLPG DLLLRLRRWA TAADSDVVQI FMRPGCVHLI VNMRLSQGGQ ALLERSVLGG  1080
GDLDLAVTAL RGAFESSGLL RGRRVVVQAA EGPRGAFEYD GASGGQARWV ADADLARAPS  1140
VVMVQPAALT CGISTTVTLI GRNLHQHGTR FYARCGGKSY ELRPKVPPPS AAAAAAAAAA  1200
MMSLSTRYND ADLASPPEAA AAVATAAAAA AETPPLPPQL CRPAHVRSAP EPGRESWPDG  1260
MSTPSPISSV LGGGGGADGP QQLLYSELDG SLLPSPEAGL VPERFCVSAA ASPGLTLLAE  1320
ASKVSYSRQT SHQHAEDRST TAEAVAAAEE DDAAAAALVS PLPLDYCARG SVVCMQPSWC  1380
SELEVVHVEV PALPRHGLVA VEAINVSVDV IGTWAPGVVC HDPLAAEELN LWLRDSGDAF  1440
CGSWFLKELG VLLDYDAVMP SLLQGHAGLA GTGAWGRRPW EGSTAAPPPP GAAVGPSSPV  1500
AVRSGTSQRA LQHKHAAVHD NHSQQQHVQQ VRQPGAAQRV IPAMSNPLPR VATAEPGGMA  1560
AAAAAVRAGA LSAMPYTAAE CTDLPPILSA RPDGSLELRL LPSDPLVNGG GRAALTTVVQ  1620
SRASARPAQV PLAAATPRGA HGAAADLPYM SPGEPLDGGR LVPRRPLNPQ WPLSDDTKVP  1680
DAPGRLLGPS LPRLLSHPLL HAGCRRAMLS AGSRLLSFTV ASGLVVLANG LVEWMMALEA  1740
PSLGLAPGFD PTTDTFATIT TATASAPSHH HNHHHAVPDN TQPTTFSSPV GELFERVASR  1800
AAVATIRGAA NAAAACVRGP GGCSGGGGGG SGASTTTTTT TLAESSFRDG GSSSIESSLD  1860
SSCSRDGGFR NLNMGLTLLH LAVASCSSAM LMQVLDWAEE YGRPWAFDTR SWPGRITALH  1920
LAAAAAAAAA AAQHGGGGRG ADAARLLLSL GPEAQSAWLQ ARDGEGRTPA DIARAAGLDE  1980
LNYFCVYGEE APEAAPKELL LEEASETKMA TVGDDGGDSC VDIDSDANTR VAAAEMGTGP  2040
AAAVAAAGAS SHGGLSAGLD PRAPLQHVQV PQPPLPLPPP PPPPPILPRR LPANPFGWLL  2100
LLGALVVMLL ASALRMWP*
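
The numbered, space-separated layout of the listing above is not directly usable by most sequence tools. The following is a minimal Python sketch, assuming the first three lines of the listing are pasted in as text; the helper names `clean_sequence` and `slice_1based` are hypothetical and not part of PlantTFDB. It strips the position counters and spacing, then extracts the SBP domain using the 1-based Start and End values (66 and 142) from the Signature Domain table.

```python
import re

# First three lines of the protein sequence listing, copied verbatim
# (residues 1-180 only, to keep the example short).
RAW = """
MSSEGARGSS KAAWDSTAYS MDPYLGFARV QGADARKTGE ELRHASGHTD ITLLPKKSRK  60
ATRPSTCQVD GCGKDLSGEK AYLQRYSVCE GHFKADVSFL HGQEVRFCQQ CNKFQDLREF  120
EGARRSCAAR SKDRNLRRRM QTTLQKGSTA EELFAPEAIA LVTEQQHQHQ QQQQQQHQSG  180
"""


def clean_sequence(raw: str) -> str:
    """Remove everything that is not a letter: digits, whitespace, '*' (hypothetical helper)."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, both inclusive, in 1-based coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)
sbp_domain = slice_1based(seq, 66, 142)

print(len(seq))         # number of residues parsed from the three lines
print(len(sbp_domain))  # length of the extracted domain region
print(sbp_domain)
```

The extracted region can be compared by eye with the query row of the Signature Domain alignment, which covers the same residues 66 to 142.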
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
1ul4_A  1e-17   67           139        11         83       squamosa promoter binding protein-like 4
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    132    138  DRNLRRR
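
Coordinate conventions (0-based or 1-based) vary between tools, so it can be safer to locate a short motif by searching the sequence for it. The following is a minimal Python sketch, assuming the first 180 residues of the protein sequence given in the Sequence section; `locate_motif` is a hypothetical helper, not a PlantTFDB function. It reports the first match in both conventions.

```python
# First 180 residues of the protein sequence, with spacing and
# position counters removed (ten residues per quoted chunk).
SEQ_1_180 = (
    "MSSEGARGSS" "KAAWDSTAYS" "MDPYLGFARV" "QGADARKTGE" "ELRHASGHTD" "ITLLPKKSRK"
    "ATRPSTCQVD" "GCGKDLSGEK" "AYLQRYSVCE" "GHFKADVSFL" "HGQEVRFCQQ" "CNKFQDLREF"
    "EGARRSCAAR" "SKDRNLRRRM" "QTTLQKGSTA" "EELFAPEAIA" "LVTEQQHQHQ" "QQQQQQHQSG"
)


def locate_motif(seq, motif):
    """Return (start, end) of the first match as 0-based inclusive indices, or None."""
    i = seq.find(motif)
    if i < 0:
        return None
    return i, i + len(motif) - 1


zero_based = locate_motif(SEQ_1_180, "DRNLRRR")
# Shift both ends by one to obtain 1-based inclusive residue numbers.
one_based = (zero_based[0] + 1, zero_based[1] + 1)

print(zero_based)  # (132, 138)
print(one_based)   # (133, 139)
```

For this record the search returns 0-based positions 132 to 138, which correspond to residues 133 to 139 in 1-based numbering.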
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
ChlorophytaeOGCP1131122
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G53160.2  2e-18    squamosa promoter binding protein-like 4