PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID TRIDC4BG004770.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family CPP
Protein Properties Length: 779aa    MW: 83950.6 Da    PI: 6.9587
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
TRIDC4BG004770.1genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR50.44.4e-16468506240
               TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                       ++k+C+CkkskClk+YCeCfaag++Cse C+C++C Nk 
  TRIDC4BG004770.1 468 SCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKP 506
                       689**********************************96 PP

2TCR49.31e-15553592140
               TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                       ++k+gCnCkks+ClkkYCeC++ g+ Cs++C+Ce CkN+ 
  TRIDC4BG004770.1 553 RHKRGCNCKKSSCLKKYCECYQGGVGCSNNCRCETCKNTF 592
                       589***********************************85 PP

Sequence ? help Back to Top
Protein Sequence    Length: 779 aa     Download sequence    
AAAARAEDSP LFGFIDSLSP IEPLKSAYSG NGLQAYHQSL NAASVSSIFT SPHHNAHKEG  60
SKLSKSSFGD YTENEISMED GTDKNKSPTS STAVRLFACT STITRESHTM ITCSVNEGIV  120
DPPKEPNDLP QPGRFDSGSP DHNTAPCHGV SVRSDLKQDK CPKLEAVQAT NNTVEKRKCL  180
FSSDMQPQDG CQPAKENNEV MGCEWEDLVS VTSGELLAFD SSMDQHHTGV QLAVNNAESC  240
GYLLSKLAGG ADIPDRTHPT TSSQAYYHEM VVGEDKTENG QLFPEDKKTI LSEEIQDNIN  300
EENACIPLGC KVETQQRGVR RRCLVFEASG YSHRTVQKEY VGDLSFSTSK GKSSAQNHRN  360
PGKTPSPHVF RGIGLHLNAL ALTSKDKMAC QDPLATALVP SLKTEQDVHG NLLSAGGNFV  420
HSGSGSLDLQ MDNDDCSVGG FLGNDHNSSQ SSSPPKKRRK SDNGDDDSCK RCSCKKSKCL  480
KLYCECFAAG VYCSEPCSCQ GCLNKPIHEE IVLSTRKQIE FRNPLAFAPK VIRMSEAGQE  540
TQEDPKNTPA SARHKRGCNC KKSSCLKKYC ECYQGGVGCS NNCRCETCKN TFGTRDVAVS  600
AENEEMKQEG DQTENREQEK ENDQQKANVH SEDHKLVELV VPITPPLDVS SCLLKQPNFS  660
NAKPPRPCKA RSGSSSRPSK ASETVQSRKT SKAGDSVFIE EMPGILREPS SPGIVKTCSP  720
NGKRVSPPHN ALGVSPSRKG GRKLILKSIP SFPSLAGDAN GGSATCSSDS ATALALGPS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1455461PKKRRKS
2456460KKRRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.12e-79CPP family protein