PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cz14g02220.t1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Sphaeropleales; Chromochloridaceae; Chromochloris
Family C2H2
Protein Properties Length: 858aa    MW: 94457 Da    PI: 4.581
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cz14g02220.t1genomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H214.10.00014300322323
                    ETTTTEEESSHHHHHHHHHH..T CS
        zf-C2H2   3 CpdCgksFsrksnLkrHirt..H 23 
                    C  C+k+F++  +L +H r+  H
  Cz14g02220.t1 300 CVVCDKRFRSAGQLANHERSkkH 322
                    99***************998666 PP

2zf-C2H213.90.00016830852323
                    ETTTTEEESSHHHHHHHHHH..T CS
        zf-C2H2   3 CpdCgksFsrksnLkrHirt..H 23 
                    C+ Cg +F+++ +L +Hi++  H
  Cz14g02220.t1 830 CSVCGVVFPTRNQLFKHIQSsgH 852
                    ******************87666 PP

Sequence ? help Back to Top
Protein Sequence    Length: 858 aa     Download sequence    
MRCLYEVLGV ERDADDDAIK KAYRKQALIW HPDKNQHRLE EADKQFKDIS NAYEVLSDKH  60
ERAWYDSHRE QILRSGDHHQ AGGTGGFTPG QKPDDEIDLF QYFTSGCFQG FGDGPKGFYG  120
VYAELFDTLA DQELSAYEAD TARTGNNPPT YPGFGSSTSP AAEVYAFYSN WSSFATYKNF  180
AWADHYNPAA APNRQVRRRM EEENRKARRI ARREYMDNVR ELVAFVKKRD KRVAKFQAEE  240
AARRAEREAA ELERRAAEKY ERLRRAAEYE EPEWVTDTAQ DHGDVDDDGE SDAELDTLFC  300
VVCDKRFRSA GQLANHERSK KHLDALASLK QVLADEEQWI LEEGEAGVGR EGEDDVDARG  360
LESGGKNGVQ GAEGVPGPYA NDGASLGKKN KKKAKKHQQQ QSKAVESQQQ QQQQQQGVQL  420
LDAEADDDVT VDSSWSAKRQ KQKKKRAGMA AAVDEQHTAQ PGSQLNKGHS GPANQHKQQH  480
TQQAHQPSMD DQTSSVSEPS EHEQDESDVD EDALLARLVS SKARHQQDAN PAATGTAAAG  540
DGNDDAIDEE AEEASGSSAD DEEHCTDDDD DEEAMLARMV SGRQTSQQPK QRQRQQQQQQ  600
QHDSASLEDG GDQPGSVDAA TDDDSGDDVG GDDDSDDDDD DDDDDDDDDD GADDDDEDAL  660
LARLVSNRTS STSRQAPSGK ATNSKQQRRQ QQQTLEAGND SVDAMGVQQQ HQQQHRQQQD  720
TACNAPGSIN GLAHAVQDMH VNAAETQGLQ DRESSESSTM NGHADKKAPK SLRRDAQQSK  780
SSTAGTVSAA QDDGSDDEDA HGRTSSKKDK RKQKQEKKQK ADAAAAGLLC SVCGVVFPTR  840
NQLFKHIQSS GHAALKH*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1388396KKNKKKAKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G74250.16e-60C2H2 family protein