PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thhalv10026759m
Common Name EUTSA_v10026759mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Eutremeae; Eutrema
Family HD-ZIP
Protein Properties Length: 712 aa    MW: 79749.4 Da    pI: 6.2933
Description HD-ZIP family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
Thhalv10026759m  genome  JGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  53.7   3.6e-17  77     128  5          56
                      SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
         Homeobox   5 ttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                      +++t++q++e+e++Fe+n++p+ ++r +L+++lgLt  q+k+WFqN+R + k
  Thhalv10026759m  77 HRHTAYQIREMEAFFEENPHPNDKHRVRLSQELGLTPLQIKFWFQNKRNQIK 128
                      6799********************************************8876 PP

2    START     138.6  6.4e-44  228    460  2          206
                      HHHHHHHHHHHHHHHC-TT-EEEE.EXCCTTEEEEEEESSS...........SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEE CS
            START   2 laeeaaqelvkkalaeepgWvkss.esengdevlqkfeeskv..........dsgealrasgvvdmvlallveellddkeqWdetla....kae 80 
                      +a ++ qel k+ + +ep+W k   +++n+  +l ++e  k            + ea+ras+vv m++  lv+ +ld   +W+e++     +a+
  Thhalv10026759m 228 IAVSCVQELAKMCYTNEPLWIKKIsDNNNESLYLNEEEYFKIsqwppmdndhIRREASRASTVVLMNSISLVNAFLDAE-KWSEMFCsivsRAK 320
                      67899****************999777777777777765555999****9887889***********************.******999*9*** PP

                      EEEEECTT........EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE..TTS-EEEEEEEEE-......TTS--.-TTSEE-EESSEEEEEEEEC CS
            START  81 tlevissg........galqlmvaelqalsplvp.RdfvfvRyirq.lgagdwvivdvSvds......eqkppesssvvRaellpSgiliepks 158
                      t++ issg        g l lm+aelq lsplvp R+ +f+Ry +q  ++g+w+ivd  +ds      +++ +++ ++ R    pSg++i++++
  Thhalv10026759m 321 TIQIISSGvsevsgasGPLLLMYAELQGLSPLVPtREGYFLRYVEQkAEEGKWMIVDFPIDSfhglinPDSATTTDQYRR---KPSGCIIQDLP 411
                      **********************************************9*********977664221111555554666666...*********** PP

                      TCEEEEEEEE-EE--SSXX.HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
            START 159 nghskvtwvehvdlkgrlp.hwllrslvksglaegaktwvatlqrqcek 206
                      ng+s+vtwvehv+++++++ h+++r  vksg+a+   +w++ l+rqce+
  Thhalv10026759m 412 NGYSQVTWVEHVEVEEKHVqHEAVREYVKSGVAFDSERWLSVLKRQCER 460
                      ***********************************************97 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SuperFamily      SSF46689          9.61E-19  52     131  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  3.7E-20   57     124  IPR009057    Homeodomain-like
PROSITE profile  PS50071           16.18     70     130  IPR001356    Homeobox domain
SMART            SM00389           2.7E-16   71     134  IPR001356    Homeobox domain
CDD              cd00086           3.23E-16  73     128  No hit       No description
Pfam             PF00046           1.2E-14   77     128  IPR001356    Homeobox domain
PROSITE pattern  PS00027           0         105    128  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848           41.241    218    463  IPR002913    START domain
SuperFamily      SSF55961          1.51E-27  221    462  No hit       No description
CDD              cd08875           4.63E-95  224    459  No hit       No description
SMART            SM00234           4.0E-27   227    460  IPR002913    START domain
Pfam             PF01852           4.6E-37   228    460  IPR002913    START domain
SuperFamily      SSF55961          1.79E-9   480    668  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 712 aa
MYGEEEEDEG NVLINPDNIV GSASSSPRGT IQNPDFKFTT FENPNFIPKE EYGMMSMMGN  60
GSGGSIGSGN DPKKRFHRHT AYQIREMEAF FEENPHPNDK HRVRLSQELG LTPLQIKFWF  120
QNKRNQIKTL QERRENVKLK AENDTLRRVN QNLRSNLKCI SCSSCDGSGD KLRLENSRLR  180
QELDLFRSIA SLMNPPLSPQ DTASFFSETN NSNVKLIAEE ENTIAMDIAV SCVQELAKMC  240
YTNEPLWIKK ISDNNNESLY LNEEEYFKIS QWPPMDNDHI RREASRASTV VLMNSISLVN  300
AFLDAEKWSE MFCSIVSRAK TIQIISSGVS EVSGASGPLL LMYAELQGLS PLVPTREGYF  360
LRYVEQKAEE GKWMIVDFPI DSFHGLINPD SATTTDQYRR KPSGCIIQDL PNGYSQVTWV  420
EHVEVEEKHV QHEAVREYVK SGVAFDSERW LSVLKRQCER MASLMATNIT DLGVIPSAEA  480
RRNLMRLSQR MVRIFCLNLN GSYGRALSES TKDTVRITTR KVSGGVVLCA VSTTFLPYSH  540
HQVFDLLCDD YHRSQILFNG NTLQEVSHIA NGSHLRNCIS LLRNINVATK SSNNVELVLQ  600
ETFTDISGSL LVYSTVDVKT VQLVLNGKDL SSIPLLPLGF SVVPVNPPEG ISANSPYCLL  660
TVGIQVLVSN VATARLNLST VNDRICSTVK QIISALKSSG SSAEPKQEIS W*
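The Start and End coordinates listed for each signature domain can be checked against the sequence above. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which is consistent with the alignment blocks in this record); the helper names `clean_sequence` and `domain_slice` are hypothetical and not part of any PlantTFDB tool.

```python
import re

# Protein sequence as displayed in this record: 10-residue groups,
# running position counters at line ends, and '*' marking the stop.
RAW = """
MYGEEEEDEG NVLINPDNIV GSASSSPRGT IQNPDFKFTT FENPNFIPKE EYGMMSMMGN  60
GSGGSIGSGN DPKKRFHRHT AYQIREMEAF FEENPHPNDK HRVRLSQELG LTPLQIKFWF  120
QNKRNQIKTL QERRENVKLK AENDTLRRVN QNLRSNLKCI SCSSCDGSGD KLRLENSRLR  180
QELDLFRSIA SLMNPPLSPQ DTASFFSETN NSNVKLIAEE ENTIAMDIAV SCVQELAKMC  240
YTNEPLWIKK ISDNNNESLY LNEEEYFKIS QWPPMDNDHI RREASRASTV VLMNSISLVN  300
AFLDAEKWSE MFCSIVSRAK TIQIISSGVS EVSGASGPLL LMYAELQGLS PLVPTREGYF  360
LRYVEQKAEE GKWMIVDFPI DSFHGLINPD SATTTDQYRR KPSGCIIQDL PNGYSQVTWV  420
EHVEVEEKHV QHEAVREYVK SGVAFDSERW LSVLKRQCER MASLMATNIT DLGVIPSAEA  480
RRNLMRLSQR MVRIFCLNLN GSYGRALSES TKDTVRITTR KVSGGVVLCA VSTTFLPYSH  540
HQVFDLLCDD YHRSQILFNG NTLQEVSHIA NGSHLRNCIS LLRNINVATK SSNNVELVLQ  600
ETFTDISGSL LVYSTVDVKT VQLVLNGKDL SSIPLLPLGF SVVPVNPPEG ISANSPYCLL  660
TVGIQVLVSN VATARLNLST VNDRICSTVK QIISALKSSG SSAEPKQEIS W*
"""


def clean_sequence(raw: str) -> str:
    """Drop whitespace, position counters and the stop marker."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end (assumed 1-based, inclusive)."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)
homeobox = domain_slice(seq, 77, 128)     # Signature Domain row 1
start_dom = domain_slice(seq, 228, 460)   # Signature Domain row 2

print(len(seq))
print(homeobox)
print(start_dom[:10], "...", start_dom[-10:])
```

The Homeobox slice reproduces the query line of the first alignment block (residues 77 to 128). The cleaned string holds 711 residues, so the listed length of 712 aa appears to count the terminal `*`; treat that reading as an inference from this record rather than documented behaviour.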
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  Thhalv10026759m
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_024004764.1  0.0      LOW QUALITY PROTEIN: homeobox-leucine zipper protein HDG4
Swissprot  Q8L7H4          0.0      HDG4_ARATH; Homeobox-leucine zipper protein HDG4
TrEMBL     V4LYB1          0.0      V4LYB1_EUTSA; Uncharacterized protein
STRING     XP_006414197.1  0.0      (Eutrema salsugineum)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM4356              25           48
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G17710.1  0.0      homeodomain GLABROUS 4
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]