PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID estExt_fgenesh3_pg.C_130070
Common NameCHLNCDRAFT_58151
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Chlorellales; Chlorellaceae; Chlorella
Family SBP
Protein Properties Length: 2031aa    MW: 210324 Da    PI: 7.8256
Description SBP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
estExt_fgenesh3_pg.C_130070genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1SBP89.15e-28130207178
                                  --SSTT-----TT--HHHHHTT--HHHHT-S-EEETTEEEEE-TTTSSEEETTT--SS--S-STTTT-------S--- CS
                          SBP   1 lCqvegCeadlseakeyhrrhkvCevhskapvvlvsgleqrfCqqCsrfhelsefDeekrsCrrrLakhnerrrkkqa 78 
                                  +Cq++gCeadl     yh+r k+C+vh ka + l +g++ rfCq+C++ h+l efD +k+sCr++L+khn+rrrk+q+
  estExt_fgenesh3_pg.C_130070 130 ACQADGCEADLRAHTYYHQRNKICTVHIKADMFLRNGEQLRFCQRCGHAHALAEFDPGKHSCRKQLEKHNARRRKRQQ 207
                                  5**************************************************************************997 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:4.10.1100.109.6E-23125192IPR004333Transcription factor, SBP-box
PROSITE profilePS5114124.606128205IPR004333Transcription factor, SBP-box
SuperFamilySSF1036121.18E-25130209IPR004333Transcription factor, SBP-box
PfamPF031102.7E-25131204IPR004333Transcription factor, SBP-box
PfamPF024852.0E-3616371860IPR003406Glycosyl transferase, family 14
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0016021Cellular Componentintegral component of membrane
GO:0003677Molecular FunctionDNA binding
GO:0008375Molecular Functionacetylglucosaminyltransferase activity
Sequence ? help Back to Top
Protein Sequence    Length: 2031 aa     Download sequence    Send to blast
MSWSEADWHW DPIAMTATAR QPAATASPAT GLPHRPPHPA GCSAAGSLQV EFLASLACVG  60
DGASKADAAD GMKPALPPAG TPGSSSGGTV AVEGAGGAAT AAGPARATPA RGRSDAKQKK  120
GAAGGGGNLA CQADGCEADL RAHTYYHQRN KICTVHIKAD MFLRNGEQLR FCQRCGHAHA  180
LAEFDPGKHS CRKQLEKHNA RRRKRQQDQA AAAAGLHGAT AGGGEAGGTV RRGRRQQAPK  240
RSRLRQDSSP EAEATAGSAD VAPSADAVEA FAAAWRSSAS WPSGGVQDES GSADEAGPAS  300
TSPRSASLQQ QLQQQQQQQR EQGQQAEASA DQLHAGPLPT HSGPDTSASA SAVATTLAPA  360
GPLGDMAGAA AAAGGAARPA SPPLPLLDFD IDLRPTPYEE LFQQQKPMHP AEAGLADDLA  420
AWLNRNLEED EQQQQQQQQQ QQQHAWQHAP GAAPAALPRP AAGAGPAWGM AAHQEGQPAS  480
LQLARGIMGQ QVMAPLDGLL PGYQQPLLHP HRHHHHHQLY GGTASSPAVS PMPAPVLGAA  540
LPLGAAAAGG SPPPHAPPLT SPTLATVSVK LFGCTPAELP MGLREHLKGW FDGAVHSLDG  600
YLRPGCVHLT AQAMLGGYAE ERLGRHETPP AEAAPGGVRA EQQEEHKSCC HRPTPAGVAA  660
PNPAGACCGR KRAAESPAEP AGAVGVASAV RRVVDRMLRS GEALWRTRTM LVQAGSHVAL  720
VHQGRLRQTW DIESASARAV PAILEALPPL LLASQPAALR VRGINMLQND CRLLLRLQGR  780
YVQPAAAMCT DCACSAPVHP GSSVGMAGQV AAVTAAASFE QRCCGCCVNK LQLAGLLPQA  840
ATATPAAVPP AAGLAVAEPP PGCCHSASPT AEAAAAGPPL DSATAAAAAP APAPGGCCGC  900
KEEQEPACQP PSRLQQQLGA RVLVVSSPAV HRELLQLVER HGPGALAPHL LDQARASAAR  960
RRHLGVVHEW VGRPEKMQYQ LVEATAARLL YWAVGSGLPA TADLLLSVLQ AQGGSSSAAA  1020
AAVLQRALRH EAAGGAAAAA LATMDGLSQL HLSMQATSAA AMQRAAELAC HDGLSLLHRA  1080
VESRCGATLR AVLGWGEDAG APWRCDLAGP MAITPLHLAA LLPDPAAARH AVLLLLSRCA  1140
PGARAWQCCC TVDGQTPADF YRAAGAARSS GGSSGAAEAT ATLEMEVAAL LHLQQAKAQR  1200
EHVPGVPLAA AAVPEPSPAP WLKVQAAVAE ALSLAPAQQQ AQPPAAAPGP ASVRPAAAAD  1260
PSGPEMQPST PRKNPGCLCA PGCPCALLDR CACCSSDSDE EGGPQEDGRH HQGTCGAGSG  1320
KCCCCSTDAS CHVVGCAGCA GAGHHGSGAP MPGSNAGGGG GLRRGVAASP SKAKACGDAP  1380
RSSTAAAAAP APTPQRRPVR RQRMQCVLLL HGTWLSALML LTLASIHGRQ HRGQLLAESL  1440
LDAVSEGGAR AAAGAAGAHR AFTAAHACGE ALQLPQVALL FLVKGPMHHE VLWRLWFESA  1500
AGLLPTDALA AALCSSNSSS GGNAASSSSS QPSAGERQRR VLQACSSRSG ADGGVEPVDG  1560
GVLSDSEGVT APQPGLAEEH EGQPLEEPAA SPSHSSRLLQ QAHSKHSATS NRAGTVNAGF  1620
VAPKGRAMPP ASGVLGEQIL FDVYVHPHPS FKGYPANSLF HGRELPRIER VATEWGQHSL  1680
VDAARALLKA AHRNPRNVKF VLMSESDLPL YSPHVLYTQL LGEPLSRLNA CNTTDGWRLF  1740
DHRWVPRMET KVLKPHHWRK SWQWFALGRR HVDLVLSDTA VDASFRAHCR TMFEQDRGAE  1800
RECYSDEHYI PTLLAVHGRD EETDCQGWLM DTDWSRVSNI SPHPWEYMPD ELTDKLLHQL  1860
RRPGQPRCGH AARVAVTARG AFVEAADLAG VSTAAAGSVQ ADAALTGMCR QLEQLAVRHR  1920
QEHQPLGPHC RLLARKFLNT TVAAALAAAA PCESMALVLN NGTCPGETPM QAALRRAWRR  1980
AAPYALLLLG TLLLPVLAAL CLGAVVVPAA VAVNGSRRGT RTDAACLRIH *
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1ul4_A2e-18121204184squamosa promoter binding protein-like 4
Search in ModeBase
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_005846749.10.0hypothetical protein CHLNCDRAFT_58151
TrEMBLE1ZHQ50.0E1ZHQ5_CHLVA; Uncharacterized protein
STRINGXP_005846749.10.0(Chlorella variabilis)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G50670.13e-17SBP family protein
Publications ? help Back to Top
  1. Blanc G, et al.
    The Chlorella variabilis NC64A genome reveals adaptation to photosymbiosis, coevolution with viruses, and cryptic sex.
    Plant Cell, 2010. 22(9): p. 2943-55
    [PMID:20852019]