PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID IGS.gm_3_00397
Common Name CHLNCDRAFT_140658
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Chlorellales; Chlorellaceae; Chlorella
Family CPP
Protein Properties Length: 665 aa    MW: 65785.9 Da    pI: 6.7964
Description CPP family protein
Gene Model
Gene Model ID    Type    Source
IGS.gm_3_00397   genome  JGI
Signature Domain
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     50.2   5.1e-16  261    299  2          40
             TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                     e+++CnCk+s Clk+YCeCfaag +C+ +C+C +C+N+ 
  IGS.gm_3_00397 261 EQRPCNCKRSMCLKMYCECFAAGGFCAPSCSCLSCSNTP 299
                     789**********************************86 PP

2    TCR     49.2   1e-15    330    369  2          40
             TCR   2 ekkgCnCkkskClkkYCeCfaagkkCsee.CkCedCkNke 40 
                     +++gC+Ck+skClkkYCeCf+ag+ C+ e C+CedC+N+e
  IGS.gm_3_00397 330 HRRGCRCKRSKCLKKYCECFHAGARCNPEvCQCEDCRNTE 369
                     789************************888********97 PP
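The Start and End columns above appear to be 1-based, inclusive positions on the protein, since residues 261 to 299 and 330 to 369 reproduce the two TCR alignment rows. A minimal Python sketch of that slicing follows, assuming this coordinate convention. The helper name `slice_1based` and its `fragment_start` parameter are hypothetical, and `FRAGMENT` is residues 241 to 370 copied from this record's protein sequence rather than the full protein.

```python
# Residues 241-370 of IGS.gm_3_00397, copied from the Protein Sequence listing.
FRAGMENT_START = 241
FRAGMENT = (
    "PRRQSATLPMLPPLGASQAREQRPCNCKRSMCLKMYCECFAAGGFCAPSCSCLSCSNTPA"
    "EMGVVMAAREVVLAKNPNAFEVKVTAATGHRRGCRCKRSKCLKKYCECFHAGARCNPEVC"
    "QCEDCRNTEG"
)


def slice_1based(fragment, start, end, fragment_start=1):
    """Return residues start..end (1-based, inclusive) from a fragment.

    fragment_start is the protein position of fragment[0]. This helper is
    illustrative only; it is not part of any PlantTFDB tool.
    """
    lo = start - fragment_start
    hi = end - fragment_start + 1
    if lo < 0 or hi > len(fragment) or lo >= hi:
        raise ValueError("coordinates fall outside the fragment")
    return fragment[lo:hi]


# (Start, End) pairs taken from the Signature Domain table.
TCR_DOMAINS = [(261, 299), (330, 369)]
domain_seqs = [
    slice_1based(FRAGMENT, start, end, FRAGMENT_START)
    for start, end in TCR_DOMAINS
]

for (start, end), seq in zip(TCR_DOMAINS, domain_seqs):
    print(start, end, len(seq), seq)
```

Comparing each printed sequence with the matching alignment row (ignoring the lowercase letter the alignment uses for an insert position) is a quick check that the coordinates were read correctly.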

Protein Features
Database         Entry ID  E-value  Start  End  InterPro ID  Description
SMART            SM01114   1.8E-14  260    301  IPR033467    Tesmin/TSO1-like CXC domain
PROSITE profile  PS51634   28.626   261    370  IPR005172    CRC domain
Pfam             PF03638   1.9E-12  263    298  IPR005172    CRC domain
SMART            SM01114   8.7E-13  329    371  IPR033467    Tesmin/TSO1-like CXC domain
Pfam             PF03638   4.4E-12  332    369  IPR005172    CRC domain
Sequence
Protein Sequence    Length: 665 aa
MQDAFAAVAL RLESLKAAVA HKELGGGQDA HQLAAIMSLT QELEALLQPQ DPAAAAAGPH  60
SAPGAAMQGA AAAEAAMAQP AAAAAVKAAG LPATPFEQRR VPATGPATAV KAPAPTPGGG  120
LPATTPLAAE VLASLNNLGP GGGGGSNVAA ATAGPAVAAD RADGDAAATL SGLFGGMDRL  180
PSGAGSTLLQ QQQQAAAAHA DPALASLLQG SQLLGPPLPA HAAVMAAAAA VAAAGPALPV  240
PRRQSATLPM LPPLGASQAR EQRPCNCKRS MCLKMYCECF AAGGFCAPSC SCLSCSNTPA  300
EMGVVMAARE VVLAKNPNAF EVKVTAATGH RRGCRCKRSK CLKKYCECFH AGARCNPEVC  360
QCEDCRNTEG DAMLLPLLGR AKAQAAAWAD ASPDVLAMFQ QPLPPLAAAG SGGSQASDGE  420
GGARGLPLPH APFSFALPPM PEPALHSAGS LPAFQPATSA SLLAAFPPGS LFAPPPMPEF  480
DPAAAAAFQL HPAAAAAAAT FAAAATVAEQ QAQLEESMAD AASFPVAAPG APPAVPAAAR  540
VASALTSHLS DVAGEAADPE MHDTATSPAP HMAVAKLAAR RTLAAAMAPT LTGGAAAGNT  600
GRPPLPSRCA ARACPPAPRP EPSERAGNFD GYDQEAAQAA AYAAGCYNKA VITDARTKRA  660
ACVD*
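The listing above groups residues in blocks of ten, ends each full line with a running position counter, and closes with `*`. A minimal Python sketch for turning such a listing into one contiguous string follows. The function name `parse_sequence_block` is hypothetical, and `SAMPLE` holds only the first and last lines of this record for illustration, so it is not the full protein.

```python
import re


def parse_sequence_block(text):
    """Join a space-grouped, position-numbered sequence listing into one string."""
    parts = []
    for line in text.splitlines():
        # Drop the trailing position counter (for example "  60"), if present.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces that separate the ten-residue groups.
        parts.append(re.sub(r"\s+", "", line))
    # Strip the terminal "*" that marks the end of the translation.
    return "".join(parts).rstrip("*")


# First and last lines of the listing only; NOT the complete sequence.
SAMPLE = """\
MQDAFAAVAL RLESLKAAVA HKELGGGQDA HQLAAIMSLT QELEALLQPQ DPAAAAAGPH  60
ACVD*
"""

sequence = parse_sequence_block(SAMPLE)
print(len(sequence), sequence)
```

Running the same function over the full listing gives a string that can be sliced with the domain coordinates reported elsewhere in this record.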
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5fd3_A  2e-20    257          369        6          122      Protein lin-54 homolog
5fd3_B  2e-20    257          369        6          122      Protein lin-54 homolog
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_005850650.1  0.0      hypothetical protein CHLNCDRAFT_140658
TrEMBL  E1Z5W7          0.0      E1Z5W7_CHLVA; Uncharacterized protein
STRING  XP_005850650.1  0.0      (Chlorella variabilis)
Orthologous Group
Lineage       Orthologous Group ID  Taxa Number  Gene Number
Chlorophytae  OGCP323               15           30
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G22760.1  3e-29    Tesmin/TSO1-like CXC domain-containing protein
Publications
  1. Blanc G, et al.
    The Chlorella variabilis NC64A genome reveals adaptation to photosymbiosis, coevolution with viruses, and cryptic sex.
    Plant Cell, 2010. 22(9): p. 2943-55
    [PMID:20852019]