PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_020685601.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; Asparagales; Orchidaceae; Epidendroideae; Malaxideae; Dendrobiinae; Dendrobium
Family CPP
Protein Properties Length: 1045 aa    MW: 114702 Da    pI: 7.0966
Description CPP family protein
Gene Model
Gene Model ID   Type    Source  Coding Sequence
XP_020685601.1  genome  NCBI    View CDS
Signature Domain
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     48.4   1.8e-15  665    704  3          42
             TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkeek 42 
                     +k+CnCk+skClk+YC+Cfaag +C+++C+C++C+N++e+
  XP_020685601.1 665 QKRCNCKRSKCLKLYCDCFAAGMFCNKTCSCQGCSNNSEN 704
                     799*********************************9985 PP

2    TCR     50.4   4.4e-16  749    787  1          39
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkkskClkkYCeC++ag+ Cs  C+Ce+C+N 
  XP_020685601.1 749 RHKRGCNCKKSKCLKKYCECYQAGVGCSLGCRCEGCRNA 787
                     589***********************************7 PP

Sequence
Protein Sequence    Length: 1045 aa
MSSPHGFKPT PNTPSAKDSP TSIFFNNLSP IKVTSSRLSS QAFDNVDVPS PPPVFATPQN  60
RLINSKMWKR SENLAFPGSQ KICPGDADLV GTSLEYPDAT IQFIPVAAAQ LSNSSDKQYN  120
NDHHPQDEVC RGLSNGVDDF FSIPSEDCEN TSCSPGFFLG EADDLVRDKE ITPFSSNGDQ  180
MAFFNADSAQ FPAVKSAATV TNSVAEYTKH LNLVSTPNLQ PNKDALPQEF PKNSIAYEGS  240
NKEFLISNAG DSSNLNIVDQ SSSCQPEKDI SKKKPSNRTV PAAFEEIEKK GGLLGKKIEA  300
NLKSFVDLIS STQNEPITPD SNGSNHSGVP NNMNFKSICH VNEHPSTSSS CLLKMQDVNL  360
FSSRTDHLGK LGCIYEEKCE IPRNIQSHAN HLVIASQMRD ASGMSQIPYY QEDLTHHSRG  420
MYKRLQFEDV EKHEQSIADS ENYMKLKSPR FMSERTTMAS NMESQYPSTT NSSCRIQTTQ  480
SLSGNISCNY LVKTNSSEQS KGRFTAVAHR SSGIGLHLNA IGMPGKISGH IDMKMAMESL  540
GVEGKTCFHN TCEAPSQILK NMIVSRNGTG EVSLQLISNS EDSSFGSKDE CNNDRLQKSS  600
DSAPDYHSAL DVEPLDMRMQ PEFSDHHITP CSRKRQHYVE ASQSEDSAQT TLGRKRKKDP  660
EIEGQKRCNC KRSKCLKLYC DCFAAGMFCN KTCSCQGCSN NSENEEMVSS TRQLIETRNP  720
LAFAPKVVHA HGDLKENADN KHITPPSARH KRGCNCKKSK CLKKYCECYQ AGVGCSLGCR  780
CEGCRNAYGT KEAYGDISEI VVEHQKPKED SINGLPSVEA LKVVEGNSKE ASNTKQFTTH  840
FTSPAPLVQD TNINSVARLQ LPASYHPSSK SSFATLSSFE SPTSTMTDTR SSAHEEKNEQ  900
TLSLIPYNQK FDSSSGLNVD SLSPGWNISP SSVPSQSLCG STTLRTKQAK ALGEGKFIIK  960
PATDEPNYNL AEETPDILKD THSPINTVKT SSPNQKRVSP PKTVLHEAGS SFSPGLRSGR  1020
KYVLQSIPQF PSLTPYRNNM ENKDN
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    654    658  RKRKK
2    654    659  RKRKKD
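
The Start and End columns in the Signature Domain and NLS tables read as 1-based, inclusive positions in the protein sequence. The sketch below is one way to check this against the sequence block. It is not part of PlantTFDB: the helper names parse_block and region are hypothetical, and the excerpt constant holds only the four sequence lines covering residues 601-840, copied from this record.

```python
import re

# Excerpt of this record's sequence block (residues 601-840), copied verbatim,
# including the spacing every 10 residues and the trailing position counters.
EXCERPT = """\
DSAPDYHSAL DVEPLDMRMQ PEFSDHHITP CSRKRQHYVE ASQSEDSAQT TLGRKRKKDP  660
EIEGQKRCNC KRSKCLKLYC DCFAAGMFCN KTCSCQGCSN NSENEEMVSS TRQLIETRNP  720
LAFAPKVVHA HGDLKENADN KHITPPSARH KRGCNCKKSK CLKKYCECYQ AGVGCSLGCR  780
CEGCRNAYGT KEAYGDISEI VVEHQKPKED SINGLPSVEA LKVVEGNSKE ASNTKQFTTH  840
"""
EXCERPT_START = 601  # 1-based position of the first residue in EXCERPT


def parse_block(text: str) -> str:
    """Join a grouped sequence block into one string.

    Everything that is not a letter (spaces, newlines, position counters)
    is dropped.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()


def region(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the 1-based position of seq[0] in the full protein, so the
    function also works on an excerpt that does not begin at residue 1.
    """
    if start < offset or start > end or end - offset + 1 > len(seq):
        raise ValueError("region outside the available sequence")
    return seq[start - offset : end - offset + 1]


seq = parse_block(EXCERPT)

# Coordinates taken from the Signature Domain and NLS tables of this record.
tcr1 = region(seq, 665, 704, EXCERPT_START)
tcr2 = region(seq, 749, 787, EXCERPT_START)
nls1 = region(seq, 654, 658, EXCERPT_START)
nls2 = region(seq, 654, 659, EXCERPT_START)

for name, value in [("TCR 1", tcr1), ("TCR 2", tcr2), ("NLS 1", nls1), ("NLS 2", nls2)]:
    print(f"{name}: {value}")
```

The extracted TCR regions can be compared with the XP_020685601.1 rows of the two domain alignments, and the NLS regions with the Sequence column of the NLS table. To work with the whole protein instead of an excerpt, the same functions apply to the full sequence block with offset left at 1.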
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G14770.1  4e-51    CPP family protein