PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID fgenesh3_pg.C_scaffold_9000137
Common NameCHLNCDRAFT_52112
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Chlorellales; Chlorellaceae; Chlorella
Family SBP
Protein Properties Length: 1719aa    MW: 179716 Da    PI: 8.61
Description SBP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
fgenesh3_pg.C_scaffold_9000137genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1SBP971.6e-30277354178
                                     --SSTT-----TT--HHHHHTT--HHHHT-S-EEETTEEEEE-TTTSSEEETTT--SS--S-STTTT-------S--- CS
                             SBP   1 lCqvegCeadlseakeyhrrhkvCevhskapvvlvsgleqrfCqqCsrfhelsefDeekrsCrrrLakhnerrrkkqa 78 
                                     +Cqv+gC+  l ++++y++r+k+C  h + p  +v+g++ rfCqqC+rf  l++f+ ++rsCrr+L+khnerrrk++a
  fgenesh3_pg.C_scaffold_9000137 277 VCQVPGCDRSLHKLRDYYKRYKICPYHLELPCLVVEGQTIRFCQQCGRFQLLTDFEGDRRSCRRKLDKHNERRRKAEA 354
                                     6**************************************************************************875 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:4.10.1100.108.0E-26270339IPR004333Transcription factor, SBP-box
PROSITE profilePS5114125.777275352IPR004333Transcription factor, SBP-box
SuperFamilySSF1036121.83E-27277355IPR004333Transcription factor, SBP-box
PfamPF031106.9E-28278351IPR004333Transcription factor, SBP-box
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0010468Biological Processregulation of gene expression
GO:0042742Biological Processdefense response to bacterium
GO:0005634Cellular Componentnucleus
GO:0016021Cellular Componentintegral component of membrane
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1719 aa     Download sequence    Send to blast
MGLQEIPVSQ RGATDAQQVW RRQAAAADPE AAAQPGVDAS EIKPPRLVDR PRPFINSIDP  60
AAGSLLACHY GNDRAPPMQY SVFPAAAEQH APLGAQLGKR VRKPLKQFAD HMLYDEGNAA  120
AATADGRQHI PVAASGPREQ GWGMGNVVRV SRAWEEGGGW ARLHGRLHTW YKPLNCSWQG  180
TSWSNACIHG TITFSCIGHE ASALGMNVAT RVPEVRPCLL TFSKTPADWR NLDATAVLIA  240
RPSPKLPRGP ETLAVLLPRA RGRGRSFFKE QPKKGAVCQV PGCDRSLHKL RDYYKRYKIC  300
PYHLELPCLV VEGQTIRFCQ QCGRFQLLTD FEGDRRSCRR KLDKHNERRR KAEAEQKARM  360
MDSGSDDSPG NPRSAKAYRP SPYRGSLAGI KGYASHGDLM MAGGGGAGEL SSRLQQLLGD  420
PALQDLLSPA LLQSLAEPAP AAGMDWLAAG SLAGLAMGAA QPPPQPQLAP PPAAPVLPQE  480
LLALLPAEVQ QQVLQQGHGV LNGALVMQLQ QLQAAAGAPS QPLAHHASPP APRAPGFGVY  540
GGSPAAQRPL AATSPPYPYD SPLDAALSPG SLGHSLQRQL HLQQQQRVQL AQGLAAAVPA  600
LSANGPGASA ASVFLQHMQQ PQPQQHAQQQ QQQQELQQAA VQPGHSIPFL RSEPVAEPQP  660
EPAASAADVS DKAALLVALA RSVGVSSEAL TQALREGGRG SPDEAAQAAA AAAAGASTSS  720
GAAQPGGEPR LPQQQPLLAL LQAPQQGQHG SNGGHVPPAA AAGLPSLHTN GTAPGSSGNV  780
NSSSIGQPPS VGFQLPQLGQ QQQEEEGAPP AGPPPPPPFA AAAGPSHLLD VDQLQSAVQN  840
GGLDAGFASD ASLLLQLVAA VEASQRQAGP GAERMQSLSL KLFNCTPDQL PKDIMEQLEE  900
WVVQNRSLLD GATRPGCVHV SLSALMSGEE ARHLSANFPA MVQQLAGGAL GGLRTSILAQ  960
LGIQAAALDA SGQQVVRLDL GGSAAIAPHI LSVRPLAVSP AYAGPVLVTG RHIGGPQDTI  1020
YCRNAGTYPT TEVLGCGMLL ASPEGQAGDL SWALLRIPAL KEGCHQVEAQ HGVLVSTPHS  1080
LLVLDDEEAV AELRQLELPG CHAAHASELL HRLGAVLRFG RGRWRRGAAG GSSADGALLR  1140
RVDGAAQELA ATCILRRWAA VLRLVLPMTC SACTPSQAVA DIERRLCGIP VLHAAGTPLM  1200
GGAEVVRTLA AWAEFVGLPL CLNTRWGGGL TALHVATLLR EPEAVALALT DLCPATATEH  1260
WVAARAEHGE ESPLGLACRL NRRRLLDALR EHGVPEAGAM LAALRLQDRP AGATPRLPPL  1320
GVEDPARELD QQRRQWTKVC GGLPTEQAPT TPGSLQTHRT WPLAGAAAPG AGLRLPAGSP  1380
SSAPPQACRP GGGGDPADWM QAGHQSPSSV LLPLEASCRA AAGQREQQKR GSGAAPSSEV  1440
VEEEEEEGPA GAAPAAPAKG GAHAPPAAAC QAAATRSGGL QAQPLCPAMF KPRVHDTCAA  1500
RQQLRTWRQP KLSKASACRG MDAAVYALLA ALLYALVVAA PAPARQRMHS WLLSTVGRAP  1560
LIFQPMVLLC TPVQSMYRKA HRKSRYVLLA AVHVALFGVQ SWEARTHCSD WLAGSAVLAA  1620
HPGGVWMLAV LRLALRSPGL LFVYCANVML LPTACGGQLA SCPGGSVAAC VAFGGARILA  1680
TVTALPLLSG LVARRRARHA LKKPAAAAWA ARPKQCQH*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1ul5_A4e-20276356484squamosa promoter binding protein-like 7
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1338349RRKLDKHNERRR
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00603PBMTransfer from AT2G47070Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMap-Retrieve
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_005847965.10.0hypothetical protein CHLNCDRAFT_52112
TrEMBLE1ZEF90.0E1ZEF9_CHLVA; Uncharacterized protein
STRINGXP_005847965.10.0(Chlorella variabilis)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
ChlorophytaeOGCP2012101
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G69170.17e-19SBP family protein
Publications ? help Back to Top
  1. Blanc G, et al.
    The Chlorella variabilis NC64A genome reveals adaptation to photosymbiosis, coevolution with viruses, and cryptic sex.
    Plant Cell, 2010. 22(9): p. 2943-55
    [PMID:20852019]