PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID OMO54647
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus
Family CPP
Protein Properties Length: 1037aa    MW: 115002 Da    PI: 6.3374
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
OMO54647genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR50.44.5e-16481519240
       TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
               ++k+CnCkkskClk+YCeCfaag++C e C+C+dC+Nk 
  OMO54647 481 SCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP 519
               689**********************************96 PP

2TCR506e-16566604139
       TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
               ++k+gCnCkks+ClkkYCeC++ g+ Cs +C+Ce+CkN 
  OMO54647 566 RHKRGCNCKKSSCLKKYCECYQGGVGCSINCRCEGCKNA 604
               589***********************************7 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1037 aa     Download sequence    
MDTPEKTQIS SSLSKFEDSP VFNYINSLSP IKPVKSIHLT QTFNPLSFAS LPSIFTSPHV  60
SAQKESRFLK RHSYADPSKP ELLSGEVTKA STIVEVGNEA GQLCDSSAEL QGNFDPGVSL  120
GEASLELPNE PSRYAMELPR TLKYDCGSPN CDPAPCLIET NCVSESTCTS IVPFVQEASH  180
EKGLSDSGVE VAGVEQKRDS AGCDWESLIS DAADLLIFNS PNDSEAFRCI IQKPMDPGTR  240
FCPTLNSRFP QNDINGVPQT TVDSDEYKDP CTQTEEDGEL KGTYPAHDNF ENTGVDNCMS  300
GSLTDNVETG ISVPFSFKPG SSLHRGFRRR CLDFEMLAAR RKSLDGGSNT SSSVDNQLVP  360
GKPSNDSSRR ILPGIGLHLN ALAITSRDDN NIKHETLSSG TQKLSFPGST ASILTPTAKP  420
EAVHESLNSA SIERDTDPVE NGLQLAEDAS QASTYLRNEE FNQNSPKKKR RRLEQAGEGE  480
SCKRCNCKKS KCLKLYCECF AAGVYCIEPC SCQDCFNKPI HEDTVLATRK QIESRNPLAF  540
APKVIRSADS IPEVGDDSSK TPASARHKRG CNCKKSSCLK KYCECYQGGV GCSINCRCEG  600
CKNAFGRKDG SAIVETEGEP EEEERDPSDK NGIERSLEKT DILNNEEQNP VSALPTTPLQ  660
LCRPLLQLPF SSKGKPPRSF IAIGSSTALY NGQRYGKPNI IRPQNIIEKH FQTVAEDEMP  720
DILRDNCSPN TGVKSSSPNT LKGKLPVSIR LKEKLMHRSS GHLGRYGRED STNWSLNDDI  780
TGEVLCRLPG KTVLVSWEWR RSKDVFHGNF RIRKNNMIFV RNRIHWLTGE HHIITFDVET  840
ELSAVIKLPG RVMQLSHVQD MDRGAICIGA SSGCLHYVCS HPSEIRVWEL KDYGESDQWV  900
LKYNLNLIDL VMENKKRFGQ RDLENFDEFV KVKDETVANY LFSSLAYSEE VVFMKVNTVI  960
FSYDFRSREL KERFCLLVQF HRTPLTDPTV IPYTICLAAA GVLKEIKNVV SGLFQLKQDS  1020
GSRPSIRSCP CCETHPP
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1466473PKKKRRRL
2467472KKKRRR
3469473KRRRL
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.11e-133CPP family protein