PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_019155008.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Convolvulaceae; Ipomoeeae; Ipomoea
Family CPP
Protein Properties Length: 802 aa    MW: 87080.2 Da    pI: 6.5494
Description CPP family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
XP_019155008.1   genome  NCBI    View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     50.5   4.2e-16  511    547  3          39
             TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     +k+CnCkkskClk+YCeCfaag++C e C+C+dC+Nk
  XP_019155008.1 511 CKRCNCKKSKCLKLYCECFAAGVYCVEPCSCQDCFNK 547
                     89**********************************8 PP

2    TCR     51.1   2.7e-16  590    628  1          39
             TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                     ++k+gCnCkks ClkkYCeC++ g+ Cs +C+Ce+CkN 
  XP_019155008.1 590 RHKRGCNCKKSGCLKKYCECYQGGVGCSINCRCEGCKNA 628
                     589***********************************7 PP

Sequence
Protein Sequence    Length: 802 aa
MGGVEANMES ENAERKSSGS EVGGSEEGGP AVMDTPDRTK INQIVTPLSK IEESPVFNFL  60
NSLSPIEPVK SVHITQTFNP LNFSSPPSVF TSPHLSMLKE SRFLRRHQLP DPSKPEFSSD  120
NDHKVNSSQV HNLDVSGNTS EQPESYLGSC TIEASIRASY EFSKLAVELV KSLDYDCSSP  180
NSSPVTNCDF KDKSFSALAG SSDTIVPVVH DVPGKKCSLR SQVKMEGASQ IDRNEEDSGC  240
DFEGLISGTG DALIFDSPND TGTIKKEVDP ISRTYSFGRN EIQNIKTFSA VGSGEKVEEI  300
IETENQSTQS GEGSELNQYA EVQDRNPDSS VCTKSLVGCT NEKMEADMVS GFNRGIRRRC  360
LVFDMTGIRK KHLDENTGSD SGSSLWTQSD GNTTSTDKHL VGTKSRREPS RCILPGIGLH  420
LNALAVAAKD GHAVKHEALA SGHQLLVAPG SAANYRSLTL TSKARVALED GIHLNEDANQ  480
SPGCSVNEEL NQSSSPRKKK RRMESGESEG CKRCNCKKSK CLKLYCECFA AGVYCVEPCS  540
CQDCFNKPIY EDTVIATRKQ IESRNPLAFA PKVIRATETL AEIGTPASAR HKRGCNCKKS  600
GCLKKYCECY QGGVGCSINC RCEGCKNAFG RKDGTEAELE EEETDVHEKS SIDESSQKSA  660
FQTEIEQIPD SFLPATPLRS KRPPMQPLVS LKNKPTRMPS ILGVGSSSGG TYAASQGLGR  720
PTFFRPLPKF HKQFESIEED EIPSELQGNT SPTSGIVKST SPNSKRVSPP HTELGTSPSR  780
RSSSRKLILQ SIPSFPSLTP NN
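
The Start and End columns of the Signature Domain table are 1-based, inclusive positions in the protein sequence listed above. As a minimal sketch of how to use them, the Python below joins the numbered listing into one string and slices out the two TCR domains. The names `RAW_BLOCK`, `parse_numbered_block` and `slice_1based` are illustrative choices made for this example and are not part of PlantTFDB.

```python
import re

# Sequence listing copied from this record; the trailing numbers are
# cumulative residue counts and are not part of the sequence.
RAW_BLOCK = """\
MGGVEANMES ENAERKSSGS EVGGSEEGGP AVMDTPDRTK INQIVTPLSK IEESPVFNFL  60
NSLSPIEPVK SVHITQTFNP LNFSSPPSVF TSPHLSMLKE SRFLRRHQLP DPSKPEFSSD  120
NDHKVNSSQV HNLDVSGNTS EQPESYLGSC TIEASIRASY EFSKLAVELV KSLDYDCSSP  180
NSSPVTNCDF KDKSFSALAG SSDTIVPVVH DVPGKKCSLR SQVKMEGASQ IDRNEEDSGC  240
DFEGLISGTG DALIFDSPND TGTIKKEVDP ISRTYSFGRN EIQNIKTFSA VGSGEKVEEI  300
IETENQSTQS GEGSELNQYA EVQDRNPDSS VCTKSLVGCT NEKMEADMVS GFNRGIRRRC  360
LVFDMTGIRK KHLDENTGSD SGSSLWTQSD GNTTSTDKHL VGTKSRREPS RCILPGIGLH  420
LNALAVAAKD GHAVKHEALA SGHQLLVAPG SAANYRSLTL TSKARVALED GIHLNEDANQ  480
SPGCSVNEEL NQSSSPRKKK RRMESGESEG CKRCNCKKSK CLKLYCECFA AGVYCVEPCS  540
CQDCFNKPIY EDTVIATRKQ IESRNPLAFA PKVIRATETL AEIGTPASAR HKRGCNCKKS  600
GCLKKYCECY QGGVGCSINC RCEGCKNAFG RKDGTEAELE EEETDVHEKS SIDESSQKSA  660
FQTEIEQIPD SFLPATPLRS KRPPMQPLVS LKNKPTRMPS ILGVGSSSGG TYAASQGLGR  720
PTFFRPLPKF HKQFESIEED EIPSELQGNT SPTSGIVKST SPNSKRVSPP HTELGTSPSR  780
RSSSRKLILQ SIPSFPSLTP NN
"""


def parse_numbered_block(block: str) -> str:
    """Keep only the residue letters, dropping spaces, newlines and counters."""
    return "".join(re.findall(r"[A-Za-z]+", block)).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end for 1-based, inclusive coordinates."""
    return seq[start - 1:end]


sequence = parse_numbered_block(RAW_BLOCK)

# (Start, End) pairs taken from the Signature Domain table of this record.
tcr_domains = {1: (511, 547), 2: (590, 628)}
tcr_segments = {
    number: slice_1based(sequence, start, end)
    for number, (start, end) in tcr_domains.items()
}

if __name__ == "__main__":
    print("length:", len(sequence))
    for number, segment in tcr_segments.items():
        print("TCR", number, segment)
```

The extracted segments can be compared directly with the XP_019155008.1 rows of the domain alignments as a check that the coordinates were read correctly.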
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    498    502  KKKRR
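
The signal coordinates can be cross-checked against the sequence listing by searching for the motif and converting the match offset to a 1-based position. The sketch below works on the single listing row that covers residues 481 to 540; `FRAGMENT`, `FRAGMENT_START` and `locate` are hypothetical names used only for illustration.

```python
# Row of the sequence listing covering residues 481-540, with spaces removed.
FRAGMENT = "SPGCSVNEELNQSSSPRKKKRRMESGESEGCKRCNCKKSKCLKLYCECFAAGVYCVEPCS"
FRAGMENT_START = 481  # 1-based position of the first residue in FRAGMENT


def locate(motif: str, fragment: str, fragment_start: int):
    """Return the 1-based inclusive (start, end) of the first match, or None."""
    offset = fragment.find(motif)
    if offset < 0:
        return None
    start = fragment_start + offset
    return start, start + len(motif) - 1


nls_span = locate("KKKRR", FRAGMENT, FRAGMENT_START)

if __name__ == "__main__":
    print("KKKRR span:", nls_span)
```

This only confirms where the listed motif sits in the sequence; it does not reproduce whatever method the database used to predict the signal.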
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G22780.1  1e-141   CPP family protein