PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_021670960.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Euphorbiaceae; Crotonoideae; Micrandreae; Hevea
Family CPP
Protein Properties Length: 909 aa    MW: 99818.5 Da    pI: 7.1331
Description CPP family protein
Gene Model
Gene Model ID   Type    Source
XP_021670960.1  genome  NCBI
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     47.4   3.9e-15  490    527  3          40
             TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                     +k+CnCk++kClk+YC+Cfaag +C+e C C++C+N+ 
  XP_021670960.1 490 CKRCNCKRTKCLKLYCDCFAAGIYCAEPCACQGCFNRP 527
                     89**********************************86 PP

2    TCR     50.4   4.6e-16  573    611  1          39
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCk+s ClkkYCeC++a++ Cs+eC+Ce+CkN 
  XP_021670960.1 573 RHKRGCNCKRSMCLKKYCECYQANVGCSSECRCEGCKNV 611
                     589***********************************6 PP
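
The coordinates in the domain table can be cross-checked against the aligned residues. The following is a minimal sketch, not part of the database itself: the `DomainHit` class and its method names are hypothetical, and the check assumes gap-free alignments (true for both hits shown here), so the sequence span, the HMM span, and the number of aligned residues should all agree.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """One row of the signature domain table plus its aligned query residues.

    Hypothetical helper for illustration; coordinates are 1-based and inclusive.
    """
    seq_start: int
    seq_end: int
    hmm_start: int
    hmm_end: int
    aligned: str  # query residues copied from the alignment block

    def seq_span(self) -> int:
        return self.seq_end - self.seq_start + 1

    def hmm_span(self) -> int:
        return self.hmm_end - self.hmm_start + 1

    def is_consistent(self) -> bool:
        # Assumption: the alignment has no gaps, so all three lengths match.
        return len(self.aligned) == self.seq_span() == self.hmm_span()


hits = [
    DomainHit(490, 527, 3, 40, "CKRCNCKRTKCLKLYCDCFAAGIYCAEPCACQGCFNRP"),
    DomainHit(573, 611, 1, 39, "RHKRGCNCKRSMCLKKYCECYQANVGCSSECRCEGCKNV"),
]

for number, hit in enumerate(hits, start=1):
    print(number, hit.seq_span(), hit.hmm_span(), hit.is_consistent())
```

An alignment with insertions or deletions would need the gap characters removed before comparing lengths, and the two spans would then be allowed to differ.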

Sequence
Protein Sequence    Length: 909 aa
MKAFCLLKLA SYHRDFESPF SNYISNLSPI KPVKTAHVPQ GFLGVSSPPL VFTSPRTIPH  60
RETSFFQRSQ FTQIASAEIP ENDDGRKNFA GLSNDIGESD NYSSKLIADV RQNNDGENSA  120
RDQPGSSSGC VDEYLYDPVD VDCASSANLV NPNAKQSNDV LQSSVSSLTD SNKGILKSDG  180
KNHLRAEVDK SQALSKQAEE DVQGQSRFEI KPVQVEEDQS GNKKSSIKCP NVQSDHASEK  240
NQCDVLETQV VQAHEDYNEN VAASLQGAMH NMVQLEQEAS QLQRGLSRRC LQFEEAQWKT  300
IVNSTCSPNL TNYVTGSGSP GSATELESLN SSLVDLTDSS NKKEMVNLSR PATSMFPLRC  360
NEKSPIVVSK PSGIGLHLNS IVNTLPVGHT AAASIKSSNC LKLSNLVEKV PITPKDRMLE  420
TKASLAASDT TAESFHNAEP LNMLQSLGHQ LTPSNKRKFN PEHEDNFEEV GQESPTKKKR  480
KKSSLDGEGC KRCNCKRTKC LKLYCDCFAA GIYCAEPCAC QGCFNRPDYE DTVLETRQQI  540
ESRNPLAFAP KIVPHVTGFA AEDGNQLMPS LARHKRGCNC KRSMCLKKYC ECYQANVGCS  600
SECRCEGCKN VYGRKEEYGR TGEIASNIVG EERLDGRIHD KLEMMATNKD LLHAELYDLR  660
NLTPSTPSFQ HSDHGKDAQK SRFNSSRYVP SPQSDFSILP SYAKSTRSPR NSHNNNMIPE  720
TSEEILDIDT CGQGMDYNVA DMMNQISPRH NALENICDLT PFQNPSMTGA SSASSKARDW  780
AGGSGLQLCP GSGCFSSGRS LRWRSSPITP MTRLVESKNQ EHGTDSGLYD IQEDDTPEIL  840
KEASTPVTSV KASSPNKKRV SPPHKHIQDL RSSSSGGLKS GRKFILKAVP SFPPLTPCID  900
SKGSKNEKQ
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    478    482  KKRKK
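
The NLS and domain coordinates refer to 1-based positions in the protein sequence listed above. As a rough sketch of how such coordinates map onto the blocked sequence layout, the snippet below cleans two of the sequence lines (residues 421 to 540) and slices out the NLS and the first TCR domain. The function names `clean_block` and `slice_1based` and the `offset` parameter are illustrative assumptions, not part of any PlantTFDB tooling.

```python
import re


def clean_block(text: str) -> str:
    """Drop spaces, line breaks, and trailing position counters, keeping residues only."""
    return re.sub(r"[^A-Za-z]", "", text)


def slice_1based(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the sequence position of seq[0]; it is 1 for a full-length sequence.
    """
    return seq[start - offset : end - offset + 1]


# Two lines copied from the sequence block; the first residue is position 421.
fragment = """
TKASLAASDT TAESFHNAEP LNMLQSLGHQ LTPSNKRKFN PEHEDNFEEV GQESPTKKKR  480
KKSSLDGEGC KRCNCKRTKC LKLYCDCFAA GIYCAEPCAC QGCFNRPDYE DTVLETRQQI  540
"""

residues = clean_block(fragment)
nls = slice_1based(residues, 478, 482, offset=421)
domain1 = slice_1based(residues, 490, 527, offset=421)

print(nls)      # KKRKK
print(domain1)  # CKRCNCKRTKCLKLYCDCFAAGIYCAEPCACQGCFNRP
```

With the full 909-residue sequence the same calls work with the default `offset=1`.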
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G14770.1  9e-61    CPP family protein