PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cre01.g070932.t1.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas
Family SBP
Protein Properties Length: 2044aa    MW: 208015 Da    PI: 6.7506
Description SBP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cre01.g070932.t1.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1SBP80.32.8e-2556131176
                         --SSTT-----TT--HHHHHTT--HHHHT-S-EEETTEEEEE-TTTSSEEETTT--SS--S-STTTT-------S- CS
                 SBP   1 lCqvegCeadlseakeyhrrhkvCevhskapvvlvsgleqrfCqqCsrfhelsefDeekrsCrrrLakhnerrrkk 76 
                         +Cqv+gC  dls  k y +r+ vCe h ka+v++v+g+e rfCqqC +f +l+ef+ ++rsC  r ++ n rrr +
  Cre01.g070932.t1.1  56 VCQVDGCGRDLSAEKAYLQRYSVCEGHFKAEVAVVHGQEMRFCQQCNKFQDLREFEGSRRSCAARSKDRNMRRRLQ 131
                         6************************************************************************976 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5114122.52154131IPR004333Transcription factor, SBP-box
Gene3DG3DSA:4.10.1100.103.2E-2554117IPR004333Transcription factor, SBP-box
SuperFamilySSF1036123.01E-2355132IPR004333Transcription factor, SBP-box
PfamPF031101.3E-2657129IPR004333Transcription factor, SBP-box
Gene3DG3DSA:1.25.40.209.3E-414341447IPR020683Ankyrin repeat-containing domain
Gene3DG3DSA:1.25.40.209.3E-415201521IPR020683Ankyrin repeat-containing domain
Gene3DG3DSA:1.25.40.209.3E-418411935IPR020683Ankyrin repeat-containing domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0016021Cellular Componentintegral component of membrane
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 2044 aa     Download sequence    Send to blast
MDKDWGLNPN WDAYQARPAQ PAAAGLGGRK DAAPAPGSAE ASLLPKKARR ALRPTVCQVD  60
GCGRDLSAEK AYLQRYSVCE GHFKAEVAVV HGQEMRFCQQ CNKFQDLREF EGSRRSCAAR  120
SKDRNMRRRL QTTMQKGPGG EGSEVLLQST SNQRDGQNPQ ASGSPGRSPP QGGGAASETS  180
GGAGSAEGSA GGSAALLAAR RQGGSGGSDY SLDAHAVGLG LSAGAGMAGI GGPMSAGHAG  240
MRGAGMLARP LHAQQQQQQH QQQYAFGDGQ PQQRQQQHHH HHQTFSSGHG GGGGGGGSSS  300
NMAMDADHHY TADQRQPGGA GLGSWARLEQ VGQSPYDAEG ALGLGMRHGP QSADAYGRRS  360
YAPGASVAGV GHVMGSEAGF MEGGGGGMMD LDTRALDTLI SGGEHPFPTQ QQAHMQIQAQ  420
QQMQPQQQPQ APAGLLAGSS RQHGGLGQQA QGHFNQQGRS DSPAAGFGPP GGGMDSFGGA  480
PDAAGPGTSW GPSPDQLQMM DGPYDGGRGS APYLMAGHQH QQQLLQHHGS MPLPGSHGAQ  540
QQVGTSQQGL GTVRNGVSAD LLEQLHTSHS HRHHMAGAME GPQYRGSGSG SGGYPPGMSV  600
GTAMDEMQMG VAGMPRGSNL GPTGIGSAPP VRGPSTTGMQ GGGDFSAQRT RVMQYQQQQQ  660
QPAHAYSQPL PSIMPGRSQQ QAPHQQQEDQ SNVAQERRAV LIRQQLLGGQ QLQGPGRSGM  720
SQQEQLAQQE QLAMLSAYGD SNFLGLTPSE DMQLQLLAGG AGLDRRGGRG GGGHMGVGHS  780
LADIAEYSGQ DAGPNSTSSM AMDMSTGGVA GTGRMMHGPA SAPPNTRYGG GAMPGAGGAM  840
GGALHGSMPL QQAHGGMAPR GQHPQHHMMD PQHTSSASAM DVLQQCALGS GSSGGAAQHQ  900
HQAALMRPSG PGSAGVRIGG DEGAGRYGRG GGRMGMDSAG RGVDLMHHPA MDQMGGDGGG  960
LGESSRLLSG LLPGTGGRDD ALAARVCLKL FSCIPEDLPG DMLARLRRWA MAADSDAMQV  1020
FMRPGCVHVI VSIRHSGGQE AIERHVLGNG DLDAAVAALR GAFESSGLLR GRRVVVQAAE  1080
GPRGAFEHDG LAAGAARWVS DADMARAPGV VSMMPAAIPC GTATVAVLAG RNLLRPGTRY  1140
YVRCGGRSYE LRPLVQQPHT VSIKASTPTS GSPAEAASSA PAVHTPRAQP HGGGGHASGG  1200
PATPPMVSGD ASVALDSPTA AAAPSTSSSD SYASSVRSAA ARSEANTTEH PLSTSTSVDH  1260
ASGPPATPSL TPGAGAVKGG SPLRRTSRGD AAKAIAPVPA GVERRLASSS APAEPWRAPA  1320
AAGAAGQLEC VMVHVPVLPK HGLMAVEAVH AEVDVMGAWA AGVACQDPSS ACEINAWMRR  1380
CADSRSARWL LLELGVLLDY DSVLPSLLET SAHTRPAAAA VGAMDASPPA LRSGINAHAA  1440
DVEGASPSAP APAAQAVQPL PGLAVAPVAS NPLPLGRGPD HVEMSAAASA AMAGAGALSA  1500
VPYQHQLHAA TATATAAEEA AESMLDPRGD GTLALRLLPS SGGESARRGP PGPASQEQLE  1560
QLRQLEQLQQ EQQQQHPEHQ SARGSSGVPY TVISFAPTGS DVNGSNAGRG ARNSDGTEDT  1620
ASISDDTLKV PDAAVRPAGV SYGRLLAHPL LHPNCRRNML ASASRLLVYS VASGLPLLAS  1680
GLLEWLMGME APVMGLSAAA VTSAASQALH PVADAAVGTG TGSSTHLTAL SALFSRVLVR  1740
AAAATERAVA ASLSVAGTVS ASGTDGEGEN LYPAVLDSAA IVPGLVPPQS PGLLASGYEP  1800
LPLTPPISGQ SFSSNSFTLE SRLESFTDGA QRSSLLGLGL LHLAVASGSS GMVLAALDWA  1860
DMYGRPWSLD EQAGPRRTSP LHLAAALADG GEISRLLLSL SGTGPHGGQW HDAWFMLRDG  1920
QGRTPADIAA ANSDPGMAGV NHFCMHGEEL PDAEEVGGHG GDAGVTVGGL HRGDAVGDAS  1980
TTAGPAKLRD APTTPSLLPM EVQESAGQQQ QLGHVGSPLL LVGGLMLALA LALMLLSGAW  2040
RRE*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1ul4_A2e-16561291083squamosa promoter binding protein-like 4
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1286297SGHGGGGGGGGS
2766772GGRGGGG
Cis-element ? help Back to Top
SourceLink
PlantRegMapCre01.g070932.t1.1
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_001700248.10.0predicted protein
TrEMBLA0A2K3E8I60.0A0A2K3E8I6_CHLRE; Uncharacterized protein
STRINGEDO983630.0(Chlamydomonas reinhardtii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
ChlorophytaeOGCP1131122
Representative plantOGRP9717230
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G69170.11e-19SBP family protein