PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID IGS.gm_28_00045
Common Name CHLNCDRAFT_140014
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Chlorellales; Chlorellaceae; Chlorella
Family SBP
Protein Properties Length: 497 aa    MW: 52091.6 Da    pI: 8.4503
Description SBP family protein
Gene Model
Gene Model ID      Type      Source
IGS.gm_28_00045    genome    JGI
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1      SBP       38.3     3.5e-12    79       137    20           76
                      HTT--HHHHT-S-EEETTEEE..EE-TTTSSEEETTT--SS--S-STTTT-------S- CS
              SBP  20 rhkvCevhskapvvlvsgleq..rfCqqCsrfhelsefDeekrsCrrrLakhnerrrkk 76 
                       h++C +h  a ++ v+g+ q  r+CqqC++ h+++ef  + rsC++ L++  +r+r++
  IGS.gm_28_00045  79 LHRICAAHRAALQIEVAGQAQalRYCQQCTKVHTVDEFGGDARSCQHSLQRRRQRKRRQ 137
                      49***********9999988777****************************99999987 PP

Protein Features
3D Structure
Database           Entry ID              E-value     Start    End    InterPro ID    Description
Gene3D             G3DSA:4.10.1100.10    4.3E-14     44       124    IPR004333      Transcription factor, SBP-box
PROSITE profile    PS51141               15.725      47       137    IPR004333      Transcription factor, SBP-box
SuperFamily        SSF103612             7.46E-11    49       138    IPR004333      Transcription factor, SBP-box
Pfam               PF03110               2.8E-13     77       136    IPR004333      Transcription factor, SBP-box
Gene Ontology
GO Term       GO Category           GO Description
GO:0005634    Cellular Component    nucleus
GO:0003677    Molecular Function    DNA binding
Sequence
Protein Sequence    Length: 497 aa
MTEGEPGGSC EQQTQASAGS GTSDGTGAAA GSDASKKQRQ RRTTPASPAC RIAGCSQALT  60
HSYNKASMRA GHAARSSPLH RICAAHRAAL QIEVAGQAQA LRYCQQCTKV HTVDEFGGDA  120
RSCQHSLQRR RQRKRRQLQR EAGGGGDPAH DCQPEPEPRQ QPGSRPGERS PSQRQRQASF  180
GASSGAAPAG GASWRGSPAS GGTLSQAEED PGLQGQPWEL QPGGHPPAQL IRCHSAPQPL  240
TAPMSAPPTP PVLLPPTPQP SALQQHPQLL IAGRQLGGQQ ALQGQLPLLP DVPAEIRTGP  300
WQPGPGPDKP ALSATSAAAL PPYASPQALP PTRELHWCQQ QPAAPRLGRA CSVPAPGNLP  360
PVPLLALRTQ PCPRLVEQAA WLLEDDLGLL MGDGHNVVPS EAEMLAIAQE LEAAHPQPAP  420
QLPVLPQEQL PAAAAAPAAA PGWPASPALA LLAAQQAQRA QQAAHALHLQ WRQLLQQEQG  480
LAPFPSRFFQ TTPFDR*
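The sequence above is displayed in groups of ten residues with running position counters and a trailing `*`, so it needs cleaning before coordinates from the domain table can be applied. The following is a minimal Python sketch of that cleanup; the helper names `clean_sequence` and `region` are hypothetical, and it assumes the Start/End values of the SBP signature domain (79 and 137) are 1-based and inclusive, which matches the alignment shown in this record.

```python
import re

# Sequence block copied from this record: groups of ten residues,
# a running position counter at the end of each line, and a '*' stop marker.
RAW = """
MTEGEPGGSC EQQTQASAGS GTSDGTGAAA GSDASKKQRQ RRTTPASPAC RIAGCSQALT  60
HSYNKASMRA GHAARSSPLH RICAAHRAAL QIEVAGQAQA LRYCQQCTKV HTVDEFGGDA  120
RSCQHSLQRR RQRKRRQLQR EAGGGGDPAH DCQPEPEPRQ QPGSRPGERS PSQRQRQASF  180
GASSGAAPAG GASWRGSPAS GGTLSQAEED PGLQGQPWEL QPGGHPPAQL IRCHSAPQPL  240
TAPMSAPPTP PVLLPPTPQP SALQQHPQLL IAGRQLGGQQ ALQGQLPLLP DVPAEIRTGP  300
WQPGPGPDKP ALSATSAAAL PPYASPQALP PTRELHWCQQ QPAAPRLGRA CSVPAPGNLP  360
PVPLLALRTQ PCPRLVEQAA WLLEDDLGLL MGDGHNVVPS EAEMLAIAQE LEAAHPQPAP  420
QLPVLPQEQL PAAAAAPAAA PGWPASPALA LLAAQQAQRA QQAAHALHLQ WRQLLQQEQG  480
LAPFPSRFFQ TTPFDR*
"""


def clean_sequence(raw: str) -> str:
    """Drop whitespace, position counters and the stop marker; keep letters only."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)

# SBP signature domain coordinates taken from the domain table (Start 79, End 137).
sbp = region(seq, 79, 137)
print(sbp)
```

The extracted string can be compared against the query row of the domain alignment as a sanity check on the coordinate convention before reusing `region` for other tables in the record.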
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      129      136    RRQRKRRQ
2      131      136    QRKRRQ
Annotation -- Protein
Source    Hit ID            E-value    Description
Refseq    XP_005843654.1    0.0        hypothetical protein CHLNCDRAFT_140014
TrEMBL    E1ZRE1            0.0        E1ZRE1_CHLVA; Uncharacterized protein
STRING    XP_005843654.1    0.0        (Chlorella variabilis)
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT1G69170.1    8e-09      SBP family protein
Publications
  1. Blanc G, et al.
    The Chlorella variabilis NC64A genome reveals adaptation to photosymbiosis, coevolution with viruses, and cryptic sex.
    Plant Cell, 2010. 22(9): p. 2943-55
    [PMID:20852019]