PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Pav_sc0000800.1_g1060.1.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family CPP
Protein Properties Length: 1203 aa    MW: 132169 Da    pI: 8.7391
Description CPP family protein
Gene Model
Gene Model ID    Type    Source    Coding Sequence
Pav_sc0000800.1_g1060.1.mk    genome    GDR    View CDS
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1    TCR    44.3    3.5e-14    264    302    2    41
                         TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                                 ++k+CnCk+s+Clk+YCeCfaag +C+  C+C++C+N+ +
  Pav_sc0000800.1_g1060.1.mk 264 KQKQCNCKNSRCLKLYCECFAAGIYCEG-CNCSNCHNNVD 302
                                 89*************************9.********986 PP

2    TCR    48.9    1.3e-15    348    387    1    40
                         TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                                 k++kgC+C+ks ClkkYCeCf+a+  Cse+CkC +CkN e
  Pav_sc0000800.1_g1060.1.mk 348 KHNKGCHCRKSGCLKKYCECFQANILCSENCKCMGCKNFE 387
                                 589***********************************65 PP
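The Start and End columns appear to be 1-based and inclusive, so each domain span should equal the number of non-gap residues in the aligned query row. A minimal Python sketch of that consistency check follows; the aligned strings are copied from the two TCR alignments above, and the helper names `ungapped_length` and `span_matches` are hypothetical, not part of PlantTFDB.

```python
def ungapped_length(aligned: str) -> int:
    """Count residues in an aligned sequence row, ignoring '-' gap columns."""
    return sum(1 for ch in aligned if ch != "-")


def span_matches(start: int, end: int, aligned: str) -> bool:
    """True if the 1-based inclusive span start..end has as many
    positions as the aligned row has residues."""
    return end - start + 1 == ungapped_length(aligned)


# (start, end, aligned query row) copied from the two TCR hits
hits = [
    (264, 302, "KQKQCNCKNSRCLKLYCECFAAGIYCEG-CNCSNCHNNVD"),
    (348, 387, "KHNKGCHCRKSGCLKKYCECFQANILCSENCKCMGCKNFE"),
]

results = [span_matches(start, end, row) for start, end, row in hits]
```

The first hit has 40 alignment columns but one gap, which is why its span covers 39 residues rather than 40.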

Sequence
Protein Sequence    Length: 1203 aa
MEQSETVSQI APKKLARQLD FTTVCRASAN AALPEVQLQL QLQSPSQAQL SQNSPAKSHV  60
HLQLQSPLPP PVPQSPKQLT PPPAIPQFPA RPSPAPAANA RIQHPGHKLS ISTLQSAIIF  120
HDFEHLLMEQ SETVSQIAPK KLARQLDFTT VCRASANAAL PEVQLQLQLQ SPSQAQLSQN  180
SPAKSHVHLQ LQSPLPPPVP QSPKQLTPPP AIPQFPARPS PAPAANARIQ HPVHKLSIST  240
LQTSKQESPI SRQRGDGKDA TPKKQKQCNC KNSRCLKLYC ECFAAGIYCE GCNCSNCHNN  300
VDNEAARQEA VGLILERNPN AFRPKIASSP QESRDGREDA GEIQAPGKHN KGCHCRKSGC  360
LKKYCECFQA NILCSENCKC MGCKNFEGSE ERRALHHEDH NTVAYMQQAN AAISGAIGSS  420
GYGTPLVSRK RKSYELYFGA TNQTTHPIKQ PQQENHLRPP MASSSLSSVP TCRTANAAVS  480
RSSKSTYRSP LADIIQSKNI KDLCSRLVVV SGAAAEALAG NRHRETIDKS STNLSTQEGK  540
EGKKEHDIQN SVHDDHLGVN EADRDESNYS GLNGGDVQNS RPMSPGTLAL MCDEQDKMFM  600
AAGLPNGVGS SSPSMTQKST QEIGCTEVYA EQERLVLTGF RDFLNHLITR GSIKETMCSP  660
QAKRERVSQK EPVQGGTAKP SPEPRCQKEA YSNGVAKSPV SANGKMLQPV TMLHPVATVT  720
SGDNDLSLKV VKKKHNPPCA NIIKPSPSKK ALIICIYYFI VPLCGKRIII MHRSLLNAHQ  780
PPMSFTLTSR NSNSPRTVLS HVAHTMAHTC PRSLPTSAPK LSYKYRSSSQ TRSVSDSTET  840
AMSPAMEEAL DDVQKLRDEF LKVLRSRRSG EVPLSVEPAK PVAHPLFQEA SPPTFSEAMN  900
ACPKANIPNF KDKLHEENLY LITEEGEQGR LPVWILSMKE NNTQKRPAVV FLHSTNKNKE  960
WLRPLLEAYA SRGYVAIAID SRYHGERASN ISTYRDALIS SWEKGDTMPF LFDTTWDLIK  1020
LADYLTTQRE DVDHSRIGIT GESLGGMHAW FAAAADTRYA VVVPIIGVQG FRWAIDNDKW  1080
QARVDSIKPV FEAAQIDLGK TSIDKEVVEK VWDRIAPGLA SKFDSPYTIP AIAPRPLLIV  1140
NGAEDPRCPL AGLEIPKSRA CKAYEDAQSM HNFKLIAEPG IGHQMTAFMV KEASYWLDQF  1200
LMP
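The sequence block above is grouped in tens with a running position count at the end of each full row. As a rough sketch, assuming that layout holds for every row, the block can be collapsed into a plain string like this; `parse_numbered_block` is a hypothetical helper name, and only the first two rows are used as sample input.

```python
import re


def parse_numbered_block(text: str) -> str:
    """Join a grouped, position-numbered sequence block into one string."""
    residues = []
    for line in text.splitlines():
        # Drop the trailing running position count, if the row has one.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces that separate the groups of ten residues.
        residues.append("".join(line.split()))
    return "".join(residues)


# First two rows of the sequence block, copied verbatim.
sample = """\
MEQSETVSQI APKKLARQLD FTTVCRASAN AALPEVQLQL QLQSPSQAQL SQNSPAKSHV  60
HLQLQSPLPP PVPQSPKQLT PPPAIPQFPA RPSPAPAANA RIQHPGHKLS ISTLQSAIIF  120
"""

seq = parse_numbered_block(sample)
```

Running the same function over the whole block and comparing `len(seq)` with the stated length of 1203 aa is a quick way to catch copy errors.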
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1    428    433    SRKRKS
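The reported coordinates can be checked against the protein sequence, assuming they are 1-based and inclusive. The sketch below works on a short fragment rather than the full protein; `slice_1based` is a hypothetical helper, and the fragment is residues 421-440 copied from the sequence block (the row that ends at position 480 begins at 421).

```python
def slice_1based(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment whose
    first residue sits at position fragment_start of the full protein."""
    lo = start - fragment_start          # convert to a 0-based offset
    hi = end - fragment_start + 1        # Python slices exclude the end index
    if lo < 0 or hi > len(fragment) or lo >= hi:
        raise ValueError("requested range lies outside the fragment")
    return fragment[lo:hi]


# Residues 421-440 of the protein, copied from the sequence block.
fragment = "GYGTPLVSRKRKSYELYFGA"

nls = slice_1based(fragment, 421, 428, 433)
```

Here `nls` should come out equal to the listed NLS sequence; a mismatch would suggest an off-by-one in the coordinate convention.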
Best hit in Arabidopsis thaliana
Hit ID    E-value    Description
AT4G29000.1    1e-123    CPP family protein