PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cz02g27040.t1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Sphaeropleales; Chromochloridaceae; Chromochloris
Family CPP
Protein Properties Length: 1410 aa    MW: 145837 Da    pI: 8.6265
Description CPP family protein
Gene Model
Gene Model ID   Type    Source     Coding Sequence
Cz02g27040.t1   genome  Phytozome  View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     42.7   1.1e-13  516    552  4          41
            TCR   4 kgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                    ++C+Ckks+Clk+YC+Cfa+g +C++ C C dCkN++ 
  Cz02g27040.t1 516 RSCHCKKSQCLKLYCDCFAKGMYCTN-CLCLDCKNTKA 552
                    68************************.********986 PP

2    TCR     47.9   2.7e-15  583    621  2          40
            TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                    +kkgCnC+ks+ClkkYCeC++ ++ C++ CkC +CkN e
  Cz02g27040.t1 583 SKKGCNCRKSHCLKKYCECYQMNVRCGSMCKCVECKNLE 621
                    689**********************************76 PP
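The Start and End columns are 1-based, inclusive positions on the protein, so each hit should cover End - Start + 1 residues once gap symbols are removed from its alignment row. A minimal sketch of that consistency check follows; the helper names `ungapped_length` and `span_length` are illustrative only and are not part of PlantTFDB or any HMM search tool.

```python
def ungapped_length(aligned_row: str) -> int:
    """Count residues in an alignment row, ignoring '-' and '.' gap symbols."""
    return sum(1 for ch in aligned_row if ch not in "-.")


def span_length(start: int, end: int) -> int:
    """Number of residues covered by a 1-based, inclusive interval."""
    return end - start + 1


# (start, end, sequence row of the alignment), copied from the record above.
hits = [
    (516, 552, "RSCHCKKSQCLKLYCDCFAKGMYCTN-CLCLDCKNTKA"),
    (583, 621, "SKKGCNCRKSHCLKKYCECYQMNVRCGSMCKCVECKNLE"),
]

for start, end, row in hits:
    # A mismatch would indicate a mis-split table row or a truncated alignment.
    assert ungapped_length(row) == span_length(start, end)
    print(start, end, span_length(start, end))
```

The same check is a quick way to confirm that a flattened table row such as `516552` was split into the right Start and End values.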

Sequence
Protein Sequence    Length: 1410 aa
MRRRRRRSSK AKAQARSSST PSTNHMAASE TVAPIARTPE KPKPAISGLS MLASPELGSP  60
RLFSPSHLHL PLFQASPIRR APVSPAHHYS GLAEFIQSPL RTPGGAHLNP GDLLNFDTPC  120
SKQASLAHGF FSPSPHKRQS RLATGSGHQS SPLHLGQFDW GDLHLVKPAG GGFTPDHHTY  180
AVNQHGGVDD AADLLSDILQ SPGAKEDRPV TRSRSGGAHT RRRCLDFSKA DGAGPRLGAD  240
LLHSHAPGSP ALISPYTVTH SQVAAATVAL LAAAAGGQPS QYGAASLPAS DQLNAPAMQP  300
HVRTRGLGQQ DTSVMIPLQQ DGALFAGQQM LLPEDDPQML MAQESLSSDS DHKEQLDPAL  360
AVAEQTTAET HHAYTGAAAT SASEAMPADN TAQPPAYKRG GAGRTGVQYS IVQQAVANAA  420
AAAGAAGRRA ARAQLPVSEP TVLAEVAQPP QQASQLSPDD PPKSLPGAAL QQQQQLHSAI  480
PTPDTRIPHT SESPMSVVPP VTPDDARKRD SSIVARSCHC KKSQCLKLYC DCFAKGMYCT  540
NCLCLDCKNT KANSELVMER RQCIKQRDPE AFSNKLREDK VSSKKGCNCR KSHCLKKYCE  600
CYQMNVRCGS MCKCVECKNL ETPGAENPTA PTARLQHPSV DWHVSPAAGV TSTPPLSRTV  660
HQVHSTPQAQ AAPVAPVMLR IALPGPSTPV SAPMVYFPMQ TPLAPDGSMA VLQGLSTGLS  720
GERGSGASQQ TFAPVTSTPA VSIALEATPA SVPVSFESNY WQRQLVKSED EDDGEKENSG  780
HVQEQPAAKA AAQPAASEPL QLATPEQAAA QTPLLAAGTP SSRASSTVSV QSMPNTRLRH  840
RLSTVSTGDV HKPQAAQLGG VSPDAAAAAA APGVAAGMTT RGRQAKLKVE ADDDKYVAIP  900
DSKQGTSTQH QPHAKVHQQQ QHHQQHRQQH QHQQDQQQQV SSSSSKALMP PPPPLRRGAA  960
AGQPSAVVMP GAASVPVPTA VLPGQGLLTS ALTAAGIEPV SGPSACSSTT VSPAAAAAAA  1020
ATTPAGHIIQ LGVNGPLLQL PPEYLAPNAA PVAMNTTATA VAGDGAQGQI GLLLYIGDER  1080
GKVSTFYVPT LQQAIPSSVA QPTAVVAAQP RLPAGSTAPA AVVAALDVSD TTHTANVTVA  1140
YPASAPIVMD PGAAVAAAAY CGDSGMTSSS GMGATASDDS LSHHAALEYL EQQQYGTQES  1200
QAPSTPFTGA ATATAAAAAG VPDQPAPKLS TPVKRKVADT GGNISTHTEM SEGASPLKLH  1260
SGARSTAASP AHPTHSHTPL APCSAANAMM MALSPTMQQP IPMGYVLLAG RKGVQYAAPV  1320
SAAGNVGVPM IPGYVGTMSS GMVEVSGGSN KQQPLNGMQS PKKGLLTPSK RTGQFMAGSS  1380
KLSSPTLFRF APGSTTVMPS PPQPRPRQI*
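The listing above is laid out in blocks of 10 residues, 60 per row, with a running position counter at the end of each full row and a terminal `*`. A minimal sketch of turning such a listing back into one contiguous string is shown here; `parse_blocked_sequence` is a hypothetical helper written for this illustration, and the example input is only the first two rows of the record.

```python
def parse_blocked_sequence(text: str) -> str:
    """Join a blocked protein listing into one contiguous string.

    Whitespace, numeric position counters, and a trailing '*' are dropped.
    """
    residues = []
    for line in text.splitlines():
        for token in line.split():
            token = token.rstrip("*")  # terminal stop symbol, if present
            if token.isalpha():        # numeric tokens are position counters
                residues.append(token.upper())
    return "".join(residues)


# First two rows of the record above, copied verbatim.
example = """\
MRRRRRRSSK AKAQARSSST PSTNHMAASE TVAPIARTPE KPKPAISGLS MLASPELGSP  60
RLFSPSHLHL PLFQASPIRR APVSPAHHYS GLAEFIQSPL RTPGGAHLNP GDLLNFDTPC  120
"""

seq = parse_blocked_sequence(example)
print(len(seq))   # 120
print(seq[1:7])   # residues 2-7 (1-based, inclusive): RRRRRR
```

Because the page reports 1-based, inclusive coordinates, residues Start..End correspond to the Python slice `seq[Start - 1:End]`.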
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    2      7    RRRRRR
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G16160.1  9e-29    CPP family protein