PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_022761084.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Helicteroideae; Durio
Family CPP
Protein Properties Length: 780 aa    MW: 84988.1 Da    pI: 5.3336
Description CPP family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
XP_022761084.1   genome  NCBI    View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     51     2.8e-16  485    523  2          40
             TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                      +k+CnCkkskClk+YCeCfaag++C e C+C+dC+Nk 
  XP_022761084.1 485 ACKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP 523
                     589**********************************96 PP

2    TCR     50.4   4.3e-16  570    608  1          39
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkks+ClkkYCeC++ g+ Cs +C+Ce+CkN 
  XP_022761084.1 570 RHKRGCNCKKSSCLKKYCECYQGGVGCSINCRCEGCKNA 608
                     589***********************************7 PP

Sequence
Protein Sequence    Length: 780 aa
MMDTPEKTQI NSSISKFDDS PVFNYINSLS PIKPVKSIHV TPTFNPFCFA SLPSIFTSPH  60
VSSHKESRFL KRHSNTDPSK PELSPGEGTT VSTNEDAGVE AGHLCDSSDE LQENFDTAVS  120
LGETSLELPN EPSRFANELP QTLKFDCGSP NCDPEPCIIE MNCVSESACT SVSIVPFVQE  180
ASEKGLSDGV EVVGICQIDQ KRETIGCEWE SLISDASDLL IFNSPNGPEA FRGVIQKSLD  240
PGARFCPALI SRFPQDDINE VCQTTVDSNE YKDPSLQPGE TGELKEINHV NGNFENTSVT  300
NCMAGSLTDY VESGMCAPFS LKPGSNLHRG LRRRCLDFEM LSARRKNLDD GSNSSSSVDN  360
QLVLGKPGND SSRCIVPGIG LHLNALAITS TDNKNTKHET FSSGTQKLSF PSSTPSILPP  420
TAGQEAVHEF FTSASTEKET NAVENGVQFA EDASQASAYL VNEEFNQNSP KKKRRRLEQA  480
GESEACKRCN CKKSKCLKLY CECFAAGVYC IEPCSCQDCF NKPIHEDTVL ATRKQIESRN  540
PLAFAPKVIR TSDSIPEVGD DSSKTPASAR HKRGCNCKKS SCLKKYCECY QGGVGCSINC  600
RCEGCKNAFG RKDGSAIVET EEEPEEEETD LSDNSGMNKK LEKTDILNSE EQNPVSALPT  660
TPLQLCRSLV QLPFSSKSKP PRSFIAIGSS STLFNGQRSK PNIIRPQNII EKHFQTVAED  720
EMPEILRGNS SPGTGIKTSS PNSKRISPPQ CELGSTPGRR SGRKLILKSI PSFPSLTPQH  780
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    470    477  PKKKRRRL
2    471    476  KKKRRR
3    473    477  KRRRL
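
The NLS coordinates above can be cross-checked against the protein sequence listing. The following is a minimal sketch, assuming the Start and End values are 1-based and inclusive (the three listed rows are consistent with that reading). The names `SEGMENT`, `SEGMENT_START`, `subsequence`, and `check_rows` are illustrative and not part of PlantTFDB; only residues 461-480 are copied in, to keep the example short.

```python
# Residues 461-480 of XP_022761084.1, copied from the sequence listing
# (the last two 10-residue blocks of the row that ends at position 480).
SEGMENT = "VNEEFNQNSP" "KKKRRRLEQA"
SEGMENT_START = 461  # 1-based position of the first residue in SEGMENT

# (No., Start, End, Sequence) rows as given in the NLS table.
NLS_ROWS = [
    (1, 470, 477, "PKKKRRRL"),
    (2, 471, 476, "KKKRRR"),
    (3, 473, 477, "KRRRL"),
]


def subsequence(start: int, end: int) -> str:
    """Return residues start..end (assumed 1-based, inclusive) from SEGMENT."""
    segment_end = SEGMENT_START + len(SEGMENT) - 1
    if start > end or start < SEGMENT_START or end > segment_end:
        raise ValueError(f"range {start}-{end} is outside {SEGMENT_START}-{segment_end}")
    # Convert 1-based inclusive protein coordinates to a 0-based Python slice.
    return SEGMENT[start - SEGMENT_START : end - SEGMENT_START + 1]


def check_rows() -> bool:
    """True if every NLS row matches the residues at its coordinates."""
    return all(subsequence(start, end) == seq for _, start, end, seq in NLS_ROWS)


if __name__ == "__main__":
    for number, start, end, seq in NLS_ROWS:
        print(number, start, end, subsequence(start, end) == seq)
```

The same slicing applies to the Start and End columns of the signature domain table, provided the full 780-residue sequence is supplied in place of the short segment.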
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G14770.1  1e-169   CPP family protein