PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID TRIDC4AG044020.5
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family CPP
Protein Properties Length: 773aa    MW: 83627.5 Da    PI: 7.0333
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
TRIDC4AG044020.5genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR50.44.4e-16462500240
               TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                       ++k+C+CkkskClk+YCeCfaag++Cse C+C++C Nk 
  TRIDC4AG044020.5 462 SCKRCSCKKSKCLKLYCECFAAGVYCSEPCSCQGCLNKP 500
                       689**********************************96 PP

2TCR49.39.9e-16547586140
               TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                       ++k+gCnCkks+ClkkYCeC++ g+ Cs++C+Ce CkN+ 
  TRIDC4AG044020.5 547 RHKRGCNCKKSSCLKKYCECYQGGVGCSNNCRCETCKNTF 586
                       589***********************************85 PP

Sequence ? help Back to Top
Protein Sequence    Length: 773 aa     Download sequence    
MQDSPLFSFI DSLSPIEPLK SAYSGNGLQA YHQSLNVTSV SSIFTSPHHN AHKESKLSKG  60
SFADYTENEL CMEEGTDKNK SPTSSTAVRL FACTSTITRE SHTMTTCSVN EGIVDPPKGP  120
NDLPQPGRFD SGSPDHNTAP CHGVSVRSDL KQDKCPKLEA VQTTNNTVEK RKCLFSTDMQ  180
LQDGCQPAKE NNEVMGCEWD DLVSVTSGEL LAFDSSMDQH HTGVQLAVNN AESCGYLLSK  240
LAGGADISDR THPTTSSQAY YHEMVVGEDK AENGQLFPED KKTILSEEIQ DNINEENACI  300
PLGCKVETQQ RGVRRRCLVF EAAGYSHRTV QKESAGDLSF STCKGKSSAQ NHRNPGKTPS  360
PHVFRGIGLH LNALALTSKD KMACQDPLAT ALVPSLKTEQ DVHGNLLSAG GNFFHSGSGS  420
LDLQMDNDDC SVGGFLGNDH NSSQSSSPPK KRRKSDNGDD DSCKRCSCKK SKCLKLYCEC  480
FAAGVYCSEP CSCQGCLNKP IHEEIVLSTR KQIEFRNPLA FAPKVIRMSE AGQETQEDPK  540
NTPASARHKR GCNCKKSSCL KKYCECYQGG VGCSNNCRCE TCKNTFGTRD VAVSAENEEM  600
KQEGEQTESC EKEKENDQQK ANVHSEDHKL VELVVPITPP LDVSSSLLKQ PNFSNAKPPR  660
PCKARSGSSS RSSKASVTVQ SRKISKVGDS VFIEEMPDIL REPSSPGIVK TCSPNGKRVS  720
PPHNALSISP NRKGGRKLIL KSIPSFPSLV GDTNGGSAIC SSDSTTALAL GPS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1449455PKKRRKS
2450454KKRRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.11e-82CPP family protein