PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID TRIDC5AG046620.7
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Triticum
Family HD-ZIP
Protein Properties Length: 851aa    MW: 90628.4 Da    PI: 5.9368
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
TRIDC5AG046620.7genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox67.32e-21123178156
                       TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                       +++ +++t++q++eLe++F+++++p++++r eL+++l+L+ rqVk+WFqNrR+++k
  TRIDC5AG046620.7 123 KKRYHRHTPQQIQELEAVFKECPHPDEKQRMELSRRLNLESRQVKFWFQNRRTQMK 178
                       688999***********************************************999 PP

2START172.62.5e-543355614205
                       HHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEE CS
             START   4 eeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetle 83 
                       ++a++elvk  ++++p+W  s     e +n de+++ f++  +     + +ea r++g+ + ++++lv++l++  ++W+e+++    +a+t+e
  TRIDC5AG046620.7 335 LAAMEELVKVTQVDDPLWQPSLdiglETLNFDEYRRAFARVLGpspagYVSEATREVGIAIINSVDLVNSLMNEVPRWSEMFPcvvaRASTME 427
                       789*******************9***************88666************************************************** PP

                       EECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--......-TTSEE-EESSEEEEEEEECTCEEEE CS
             START  84 vissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe.....sssvvRaellpSgiliepksnghskv 164
                        issg      g +qlm aelq+lsplvp R+++f+R+++q+ +g wvivdvSvd    p +      ++++ ++llpSg+++e+++ng+ kv
  TRIDC5AG046620.7 428 IISSGmggtrsGSIQLMRAELQVLSPLVPiREVTFLRFCKQHADGLWVIVDVSVDGVLRPDSgtgagPAGYMGCRLLPSGCIVEDMQNGYAKV 520
                       *******************************************************99888877778789************************ PP

                       EEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
             START 165 twvehvdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                       twv h++++++ +h+l+r+l++sg+a ga++w+a lqrqce
  TRIDC5AG046620.7 521 TWVVHAEYDEAAVHELYRPLLRSGQALGARRWLASLQRQCE 561
                       ****************************************8 PP

Sequence ? help Back to Top
Protein Sequence    Length: 851 aa     Download sequence    
MSFGGMFDGA GSGVFSYDAG GGGPGMHNPG RLLAPPPIPR PGAGAGGGGF ASSTGLSLGL  60
QTNMEGGGQL GGAFMGSTGS GGDGDSLGRA REDENDSRSG SDNLDGASAD ELDPDNSNPR  120
KKKKRYHRHT PQQIQELEAV FKECPHPDEK QRMELSRRLN LESRQVKFWF QNRRTQMKTQ  180
IERHENALLR QENDKLRTEN MTIREAMRSP TCGNCGGAAV LGEVSLEEQH LRIENSRLKD  240
ELDRVCALAG KFLGRPVSAI SSPLSLPSSL CSGLDLAVGS NNGFMGMGMQ SIPDLMGGGS  300
AAMRLPAGIM GGGLDDGLGG EGVAIDRGAL LELGLAAMEE LVKVTQVDDP LWQPSLDIGL  360
ETLNFDEYRR AFARVLGPSP AGYVSEATRE VGIAIINSVD LVNSLMNEVP RWSEMFPCVV  420
ARASTMEIIS SGMGGTRSGS IQLMRAELQV LSPLVPIREV TFLRFCKQHA DGLWVIVDVS  480
VDGVLRPDSG TGAGPAGYMG CRLLPSGCIV EDMQNGYAKV TWVVHAEYDE AAVHELYRPL  540
LRSGQALGAR RWLASLQRQC EYHAILCSNP HPNHGDRHEP ISPAGRRCML RLAQRMADNF  600
CAGVCATAAQ KWRRLDEWRV EGAGGRDPAG GEDKVRMMAR QSVGAPGEPP GVVLSATTSV  660
RLPGTPPQRV FDYLRDEQRR GEWDILANGE AMQEMDHIAK GQHHGNAVSL LRPNATSGNQ  720
NNMLILQETC TDASGSLVVY APVDVQSMHV VMGGGDSAYV SLLPSGFAIL PDGHTAPAAA  780
TDPSSQGSSP NAHGGSSNNS PGSLVTVAFQ ILVNNLPTAK LTVESVDTVS NLLSCTIQKI  840
KSALQASIIS P
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1120125RKKKKR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein