PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Tp4g14740
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Brassicaceae incertae sedis; Schrenkiella
Family HD-ZIP
Protein Properties Length: 712 aa    MW: 78772.9 Da    pI: 6.5516
Description HD-ZIP family protein
Gene Model
Gene Model ID  Type    Source
Tp4g14740      genome  thellungiella
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  59.9   4.1e-19  54     109  1          56
                TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
   Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                r++ +++t++q++e+e++F+++++p+ ++r+eL ++lgL+  q+k+WFqN+R++ k
  Tp4g14740  54 RKRYHRHTQHQIQEMESFFKECPHPDDKQRKELGRQLGLDHLQIKFWFQNKRTQNK 109
                789999***********************************************998 PP

2    START     171.3  6.1e-54  239    458  2          206
                HHHHHHHHHHHHHHHC-TT-EEEE...EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEECTT. CS
      START   2 laeeaaqelvkkalaeepgWvkss...esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevissg. 88 
                la  a++el++ a+++ep+W   +     +n de++++f  + +     +++ea++a+++v+m++ ++v+ l+d +  W+++++    +a+t+e+i +g 
  Tp4g14740 239 LAVGAMEELMATASVGEPLWNVGAngsLDLNLDEYTRSFLNGLGprlngFRTEASKATTIVFMNHLNVVQRLMDMN-LWSTMFIgmvaRAMTHETIFTGv 337
                78899***************9999888889*********888779*******************************.*********************** PP

                .....EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXX.HHHH CS
      START  89 .....galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlp.hwll 181
                     ga +lm+ae+q+lsp+v+ R+ +fvRy++q+g+g w++vdvS+d+  ++ +     +++++pSg+li++++ng skvtwvehv++++r     l+
  Tp4g14740 338 qgnfdGAFHLMTAEYQVLSPIVStRECYFVRYCKQQGDGLWAVVDVSIDHLVPNLQ----LKCRRRPSGCLIQQLPNGFSKVTWVEHVEVDDRGGvNPLY 433
                *****************************************************986....99999****************************988**** PP

                HHHHHHHHHHHHHHHHHHTXXXXXX CS
      START 182 rslvksglaegaktwvatlqrqcek 206
                ++l++sg+a+ga++wvatl+rqce+
  Tp4g14740 434 KHLISSGQAFGANRWVATLERQCER 458
                ***********************97 PP

Protein Features
3D Structure
Database         Entry ID           E-value    Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60   4.3E-21    33     109  IPR009057    Homeodomain-like
SuperFamily      SSF46689           1.42E-18   39     112  IPR009057    Homeodomain-like
PROSITE profile  PS50071            16.358     51     111  IPR001356    Homeobox domain
SMART            SM00389            7.6E-17    52     115  IPR001356    Homeobox domain
CDD              cd00086            1.33E-17   53     112  No hit       No description
Pfam             PF00046            7.5E-17    54     109  IPR001356    Homeobox domain
PROSITE profile  PS50848            39.869     229    461  IPR002913    START domain
SuperFamily      SSF55961           7.88E-32   231    460  No hit       No description
CDD              cd08875            3.06E-105  233    457  No hit       No description
SMART            SM00234            1.1E-48    238    458  IPR002913    START domain
Pfam             PF01852            1.6E-45    239    458  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  1.2E-4     342    428  IPR023393    START-like domain
SuperFamily      SSF55961           6.23E-13   506    703  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 712 aa
MVPVDTNNNN NNGDNDNNNN MNGGTIVEHE ELDSANTSEN QEDGSDQDPR PSKRKRYHRH  60
TQHQIQEMES FFKECPHPDD KQRKELGRQL GLDHLQIKFW FQNKRTQNKN HQERHENSQL  120
RAENNRLRAE NHQYREAIAN ALCPKCGGRT AIGEMSFEEH HLRLENAGLN DEIRQLSAVA  180
TKCTGNPAMN YPLMSPPIQP RPFEVGMGSN GREVYGSLRN LSGSIIGVKD ADKPLVIELA  240
VGAMEELMAT ASVGEPLWNV GANGSLDLNL DEYTRSFLNG LGPRLNGFRT EASKATTIVF  300
MNHLNVVQRL MDMNLWSTMF IGMVARAMTH ETIFTGVQGN FDGAFHLMTA EYQVLSPIVS  360
TRECYFVRYC KQQGDGLWAV VDVSIDHLVP NLQLKCRRRP SGCLIQQLPN GFSKVTWVEH  420
VEVDDRGGVN PLYKHLISSG QAFGANRWVA TLERQCERLA SIVATNIPSV EPDGLITMTN  480
GAKQNILKLA ERMGRSFFAG VTTSTADMWC NLSGFTGNTV RVMTRKSVND PGRPHGLILS  540
AFTSVWVPVS PNTVFEFLRN ENNRINWDVL SNGGAVQLLS QIANGRDSRN CVSVLRSANT  600
CQSKMMMIQD SSTDPTASYV IYAPIDISFL EGVLTGGDSD YVPLLPSGFA ILPDGISQQG  660
REGGGSLVNV AFQVLVESVP SAMLTFSSVA TIENLIIATA QKVKAHFTCQ AA
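
The numbered layout above (blocks of ten residues, with a running position count at the end of each full line) can be collapsed into a plain string and then sliced with the 1-based, inclusive Start/End coordinates used in the Signature Domain table. The following is a minimal Python sketch under those assumptions; `parse_numbered_sequence` and `domain_slice` are hypothetical helper names, not part of any PlantTFDB tool, and only the first line of the sequence is used as sample input.

```python
def parse_numbered_sequence(text: str) -> str:
    """Collapse a numbered, space-grouped protein sequence into one string.

    Assumption: residues are letters, and any purely numeric token is a
    running position counter, as in the listing above.
    """
    residues = []
    for token in text.split():
        if token.isdigit():
            continue  # skip the position counter at the end of a line
        if not token.isalpha():
            raise ValueError(f"unexpected token in sequence: {token!r}")
        residues.append(token.upper())
    return "".join(residues)


def domain_slice(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# First line of the protein sequence shown above (positions 1-60).
first_line = "MVPVDTNNNN NNGDNDNNNN MNGGTIVEHE ELDSANTSEN QEDGSDQDPR PSKRKRYHRH  60"
seq = parse_numbered_sequence(first_line)
print(len(seq))                   # 60
# Start of the Homeobox match (position 54) through the end of this line.
print(domain_slice(seq, 54, 60))  # RKRYHRH
```

The slice `RKRYHRH` agrees with the first seven residues of the Tp4g14740 row in the Homeobox alignment, which starts at position 54. Passing the full 712-residue listing to the same functions should give the complete domain segments for the Start/End pairs in the table.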
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  Tp4g14740
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_024003849.1  0.0      homeobox-leucine zipper protein HDG2
Swissprot  Q94C37          0.0      HDG2_ARATH; Homeobox-leucine zipper protein HDG2
TrEMBL     V4M9N2          0.0      V4M9N2_EUTSA; Uncharacterized protein (Fragment)
STRING     XP_006410369.1  0.0      (Eutrema salsugineum)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM491               28           149
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G05230.3  0.0      homeodomain GLABROUS 2