PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cre05.g233551.t1.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas
Family SBP
Protein Properties Length: 2621aa    MW: 257538 Da    PI: 6.1452
Description SBP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cre05.g233551.t1.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1SBP73.83e-231792275
                        -SSTT-----TT--HHHHHTT--HHHHT-S-EEETTEEEEE-TTTSSEEETTT--SS--S-STTTT-..------S CS
                 SBP  2 CqvegCeadlseakeyhrrhkvCevhskapvvlvsgleqrfCqqCsrfhelsefDeekrsCrrrLak..hnerrrk 75
                        Cq++gC   l++ k y++r+k+Ce h k p++ ++g+  r CqqC+rfhel  f+ +kr+C+ +L++  h + +r+
  Cre05.g233551.t1.1 17 CQIQGCGRSLQNSKLYYQRFKLCEDHLKRPAISIDGVLSRLCQQCGRFHELAAFEGKKRTCKAQLDRirHAKAKRR 92
                        *****************************************************************95336665554 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:4.10.1100.105.2E-241178IPR004333Transcription factor, SBP-box
PROSITE profilePS5114120.4181491IPR004333Transcription factor, SBP-box
SuperFamilySSF1036124.84E-211690IPR004333Transcription factor, SBP-box
PfamPF031103.1E-241790IPR004333Transcription factor, SBP-box
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 2621 aa     Download sequence    Send to blast
MSRPPVGALS KAQRLSCQIQ GCGRSLQNSK LYYQRFKLCE DHLKRPAISI DGVLSRLCQQ  60
CGRFHELAAF EGKKRTCKAQ LDRIRHAKAK RRAGGPGTGT GTGDNSSDSP ERDPGPHGDP  120
GSYGGGPGGG GGGGGGGGSL GTGSGPTTTG TGGTGGTGHQ VVSGSSSASA DNDNTAAAAA  180
AAAAAAAAAA AGGARKGRKP AGKAAAAAAV VSGGGKALLV ARDKAVLRKG AGGWGRGGRG  240
DTSDTEEGDD QRPGERGARS GGGGGGGDGD GDVDMLPAAH EAEVAAQQHQ RQQQQPQQPL  300
LPGAVGLAGN MAGVSLAAGL GPAGSVGPLG PLGLGLHGPL DPVVAIPEEL LPHQHHHQQQ  360
QQQQQQQQQQ QQGLLMPGPT PGPGPGQRHL AVASNSLPPP PLPQLGSGGG GGGGSGGYSP  420
EGGGGGGGGG GVHAPPQPLQ QAGSGSLHGM RQPSPPEYSP HPQMQQAQMQ PQQGYHQQQH  480
PHPHPHANAL HTQHQQMLMQ GRGGGGGGGR GCASGGSVGP QPGHVGGGGG GHGPSLFGVT  540
DRGGVSSGGG IQHQQQQYQQ QQQQQHPNQY QQQQLAGGYQ GGHGGASPDD SAGRVSSPSG  600
SGGAAAGGGG GGGMMGSQLS SHVGNATAAG GWARPAGGPG YGDTQYGISN DNRGWGSSGR  660
DTAGAQQLLN RRAGGSDGTM SGVQQQHPQQ QQLGDSGRGG SGMPGPGGLG GGGGGGSEPW  720
HARGSASSLP VMGGLGGGGA GWGGGSAFAG GMGGDGGGLG MGMGGAGRGP QGLAGLGGGA  780
AANGSPALLR AVQSTPLWAD YPGLAALRGA GVGGMGGGGG GGGGLCGSGS GGLYGGGRHQ  840
HQHQQLGGGG GGGGGGGGGG SGGSFGRPPL GGLGGRGGDP DVEIDSIFDD LAAPSPSLLH  900
GGLGGGGGAA AGAAAAPGFP RLSHQPAGYP PQQQQQQQQP QGVPHRLYAA AGGANAGGRL  960
STSAAGSPDA PPGGSPQVAA AVQQGPPRGR RLYEAAGGGG DGDVADGITA AAAELLGASV  1020
GHHRNGGGGG GGATTAAAAL AARLDALSGS SSGNSLLETL KMLSKSSSAG SAGDGNDLMG  1080
LYDAMMTQGF FGADAPAAAA AQSGAPLGPG SAAAVCGHLR AAAGIAGEGS GGLLPTPPLL  1140
GGQQQQQGGG LPPHLQHQQS FNIPGQGSLL GAMDAGGGGG GGGGGGMGLN QHQQQQQYQQ  1200
QHYNQQQYHQ NQNHTQRQQQ QQNQSRQYSP MHAGSFQASM AAGGGGGGGG WGTAPGGGGG  1260
MSLSGGSGGG GPAYVSRDSL SRLTLKLMNC LPEDLPPLLR ANMQRWAAGG GGASAEVLQA  1320
CMRPGCLELV VDVVHSAPPH EFAAGLLMLG RADAAARGLR RAVGPRLADD NELVLELEGL  1380
VLSVPPGWYM PGGAAGAGVP EAAAAPGEKD QQQQQQQQQR EEGKAVTATG AASAAVPGPV  1440
IRTWREAAER YERQDAQGEE GEEDAGGSKW PCVRPTILAC SLAAARCAPL LALTVCGMGL  1500
DHPSVTLCAR MGGQVVPAAA AQLPPPRPPP PPPPPPPVRD QDQGQDQGPA GASEPGAPSL  1560
QQPGTAAAAA TATASTGDHS NQGSSGSSSG GGVSGWSRLS VQVAPPRREG LLVLEARRGR  1620
LLGPWWPVLV VADGAVAAEL NHLSLTVQGE EEAAAAVGGG GGGGLALPDW LRCFLVDLGQ  1680
LLDWVGWTGL EEAPEAAFPQ PLAAPHSTST STGAGSAAPS TASARTWGSG TDSLTASASV  1740
SNSISNSNSC SGSGSAAAEA AGMAAAAAAL AAGFSAAASA AATSSTDAPY RILPSAVSLF  1800
NRRRRSIIPS AVLAAEAAAA AATSRTLTPT GIVAAMAAAD AASGGGSHGG GGGYSVGSSS  1860
AAASSVAASS VAACTDSSWA PSYSGSHSRS NAASGVPIGG GAATEVLMAG RGSGGGGSGG  1920
GCELYDCRVV AALRDAEASL ERRLSYKACR LLAFCVLRGL LATSNLLWGV LTLQGQALAA  1980
VLDNALHEGE GLLMCAVRSR CRHMLASVLR RGAVPMSGPA PWVTRDLSRV GPTGLSPLHL  2040
AASLPGGAPL AQALLELVPT AAPLWFGLRG AGGVTPSELA TRTHGAGGVG GALNALAAHL  2100
VLLAAGPGVN IPLLPRAPGH CGAGASPYTP YSAIAHGGGG GGSLAVSSAV ASDAAITAAA  2160
AAAAAAAAAA AAAAGDGSGP LDLALVTSLL VNKRRALTAI DGSGGGGRGR VRGSLQGVAE  2220
EVEQEEQEEE AEGLGGARGR TPQQAEVEWT DDEEEEEEEG EEEEEEEEEE EEEEEAEEEG  2280
AEEEEEEGEP AAVVEEEGAM PTPPPQEALL LLRHQEQEEE QEQEQEEEIR RGLEHMQLRA  2340
LAEEEPREPR EPRKPKAPAQ EAQQGEIAPH AGAISGGGSG SDSGADGSLP AAVQQPQPPQ  2400
SLPQSPPQLL HSPPQPQPPQ PLSQLLHSPP QPQPPQPLSQ LLQSPPQPQL LQSPPQPQLL  2460
HSPPLSQPPP PQPQLLQSPP QPQLLQSPPQ PQLLHSPPLS QPPPPQPQLL QEVQLLWESP  2520
QQCQPAAGSD GGGGGGRTQA HDAAWPAGPS SAGGSQAPTP TSAGDSYQRA SSSGGTGGSG  2580
AGGKPGGSAS RLSGWLKGLV KGRSDKDESS KGSKAPKSRK *
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1ul4_A2e-1317811175squamosa promoter binding protein-like 4
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1501509RGGGGGGGR
Cis-element ? help Back to Top
SourceLink
PlantRegMapCre05.g233551.t1.1
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
TrEMBLA0A2K3DRT20.0A0A2K3DRT2_CHLRE; Uncharacterized protein
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
ChlorophytaeOGCP2012101
Representative plantOGRP48281320
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G27360.42e-16squamosa promoter-like 11