PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_023909383.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fagales; Fagaceae; Quercus
Family CPP
Protein Properties Length: 792aa    MW: 86420.5 Da    PI: 5.8523
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_023909383.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR51.32.3e-16497535240
             TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                      +k+CnCkkskClk+YCeCfaag++C e C+C+dC+Nk 
  XP_023909383.1 497 ACKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP 535
                     589**********************************96 PP

2TCR50.63.8e-16582620139
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkks+ClkkYCeC++ g+ Cs  C+Ce+CkN 
  XP_023909383.1 582 RHKRGCNCKKSNCLKKYCECYQGGVGCSIGCRCEGCKNA 620
                     589***********************************7 PP

Sequence ? help Back to Top
Protein Sequence    Length: 792 aa     Download sequence    
MDTPERNQIG TPISKFEDSP VFNYINSLSP IKPVKSTHIT QTFNPLNFSS LPSVFTSPHV  60
NSHKESRLKR HNFSDPSKPE FSSENGNKVC TNEGIGVDAT QLYDNSAELQ DNFDSRVSDG  120
EASVEPSSEH SKFAIELPRT LKYDCGSPVC DPTPCCGTEA GCMPTSLVQC GQEASAKGSS  180
EGEVHLSEMC RTDQTKEGLE CDWESLISDC PDILIFNSPN DSEAFKGIIQ KSVEPLTRFC  240
SSLMSQLPLN EFNDVHKMQI VDPVDSGELE TEDLSAKPGE ASELEEIDQL QSNPADTVLY  300
KGMSSNASEK LDNEVGMYMP DGSKAVTILH RGMRRRCLDF EMVGARRKNL GDGSNSSSSM  360
LSQSDENIAC HDKQLVPFKP GGDSSRCILP GIGLHLNALA TTSKDYKNVK HEDLSSGRQS  420
YLPSSTASLH SPSNSQEPFL QSLATASSER ETDHGENGAA PVEDASQASA YQVSDDFNQN  480
SPKKKRRRLE HAGESDACKR CNCKKSKCLK LYCECFAAGV YCIEPCSCQD CFNKPIHEDT  540
VLATRKQIES RNPLAFAPKV IRNSDSVTEI GDESSKTPAS ARHKRGCNCK KSNCLKKYCE  600
CYQGGVGCSI GCRCEGCKNA FGRKDGSAPI GTEAELEEET EACENSAVDK VLQKPEIQNN  660
EEQHPACALP MTPLRISRPM VPLPFSSNVK LSRASFITVG SSSGLYTSQK LGKPSILRSV  720
PKFEKHFQTV PEDEMPEILR GNCTPSTGVK SSSPNSKRVS PPQSDFGASP SRRSGRKLIL  780
QSIPSFPSLT QH
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1482489PKKKRRRL
2483488KKKRRR
3485489KRRRL
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.11e-168CPP family protein