PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID TRIDC4BG004770.7
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family CPP
Protein Properties Length: 680aa    MW: 74371.9 Da    PI: 7.1662
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
TRIDC4BG004770.7genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR50.63.8e-16445483240
               TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                       ++k+C+CkkskClk+YCeCfaag++Cse C+C++C Nk 
  TRIDC4BG004770.7 445 SCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKP 483
                       689**********************************96 PP

2TCR49.58.5e-16530569140
               TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                       ++k+gCnCkks+ClkkYCeC++ g+ Cs++C+Ce CkN+ 
  TRIDC4BG004770.7 530 RHKRGCNCKKSSCLKKYCECYQGGVGCSNNCRCETCKNTF 569
                       589***********************************85 PP

Sequence ? help Back to Top
Protein Sequence    Length: 680 aa     Download sequence    
SSPPTPGTAS KRTISRSTPP PFPQSSRPRI TTRTRRDQSS PRSSFGDYTE NEISMEDGTD  60
KNKSPTSSTA VRLFACTSTI TRESHTMITC SVNEGIVDPP KEPNDLPQPG RFDSGSPDHN  120
TAPCHGVSVR SDLKQDKCPK LEAVQATNNT VEKRKCLFSS DMQPQDGCQP AKENNEVMGC  180
EWEDLVSVTS GELLAFDSSM DQHHTGVQLA VNNAESCGYL LSKLAGGADI PDRTHPTTSS  240
QAYYHEMVVG EDKTENGQLF PEDKKTILSE EIQDNINEEN ACIPLGCKVE TQQRGVRRRC  300
LVFEASGYSH RTVQKEYVGD LSFSTSKGKS SAQNHRNPGK TPSPHVFRGI GLHLNALALT  360
SKDKMACQDP LATALVPSLK TEQDVHGNLL SAGGNFVHSG SGSLDLQMDN DDCSVGGFLG  420
NDHNSSQSSS PPKKRRKSDN GDDDSCKRCS CKKSKCLKLY CECFAAGVYC SEPCSCQGCL  480
NKPIHEEIVL STRKQIEFRN PLAFAPKVIR MSEAGQETQE DPKNTPASAR HKRGCNCKKS  540
SCLKKYCECY QGGVGCSNNC RCETCKNTFG TRDAENEEMK QEGDQTENRE QEKENDQQKA  600
NVHSEDHKLV ELVVPITPPL DVSSPVVCSN SQTSQMQSHL DHAKPAAGVP PGPQKPLKQF  660
SLARPRRPVT ACSSRKCLVF
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1432438PKKRRKS
2433437KKRRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.12e-66CPP family protein