PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thhalv10017809m
Common Name EUTSA_v10017809mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Eutremeae; Eutrema
Family HD-ZIP
Protein Properties Length: 713aa    MW: 79747.8 Da    PI: 6.4762
Description HD-ZIP family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
Thhalv10017809m  genome  JGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  61.9   9.8e-20  69     124  1          56
                      TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
         Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                      r++ +++t++q++e+e++F+++++p+ ++r+eL+++lgL+  q+k+WFqN+R++ k
  Thhalv10017809m  69 RKRYHRHTQHQIQEMEAFFKECPHPDDKQRKELSRQLGLDHLQIKFWFQNKRTQNK 124
                      789999***********************************************998 PP

2    START     171.7  4.8e-54  251    473  1          206
                      HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEE CS
            START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaet 81 
                      ela +a++el+++a+++ep+W+  +      ++ de++++f+ + +     +++ea+r+++vv m++++ v+ l+d++  W++++a    +a+t
  Thhalv10017809m 251 ELAFRAMDELIAMAQVGEPLWKGGVngtsLALDLDEYTRTFQNGLGprlngFRTEASRETAVVYMNHMNIVKRLMDVN-LWSTMFAgmvaRAMT 343
                      57899*****************999987778999*******66555********************************.*************** PP

                      EEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE CS
            START  82 levissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwve 168
                      +ev+  g      g+l+lm+ e+q lsplv+ R+++f+Ry+rq+g+g w++vdvS+d+  ++ e     R++++pSg+li++ +ng skvtwve
  Thhalv10017809m 344 HEVLYAGvngtfdGTLHLMTVEYQILSPLVStRESYFLRYCRQQGEGLWAVVDVSIDHLIPNLE----LRCRRRPSGCLIQQIQNGFSKVTWVE 433
                      **************************************************************98....************************** PP

                      -EE--SSXX..HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
            START 169 hvdlkgrlp..hwllrslvksglaegaktwvatlqrqcek 206
                      hv+l++r    h+l+++ ++sg+a +a++wv tl+rq e+
  Thhalv10017809m 434 HVELDDRGAiiHNLYKQYISSGQALAANRWVTTLDRQSER 473
                      ****9765445*************************9886 PP

Protein Features
Database         Entry ID            E-value    Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60    2.5E-21    48     124  IPR009057    Homeodomain-like
SuperFamily      SSF46689            9.19E-19   52     125  IPR009057    Homeodomain-like
PROSITE profile  PS50071             16.779     66     126  IPR001356    Homeobox domain
SMART            SM00389             5.8E-17    67     130  IPR001356    Homeobox domain
Pfam             PF00046             2.1E-17    69     124  IPR001356    Homeobox domain
CDD              cd00086             7.64E-18   69     126  No hit       No description
PROSITE pattern  PS00027             0          101    124  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848             36.095     242    476  IPR002913    START domain
SuperFamily      SSF55961            4.81E-29   243    474  No hit       No description
CDD              cd08875             2.21E-106  246    472  No hit       No description
Pfam             PF01852             3.2E-46    251    473  IPR002913    START domain
SMART            SM00234             5.4E-46    251    473  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20   1.2E-4     353    456  IPR023393    START-like domain
SuperFamily      SSF55961            6.59E-13   494    685  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 713 aa
MYQPDMLVPV DTNKNGDDDN NENNNNNENN NNNMDGQGDT SFGDEEFDST NTSETQEDGS  60
EQDPRSNKRK RYHRHTQHQI QEMEAFFKEC PHPDDKQRKE LSRQLGLDHL QIKFWFQNKR  120
TQNKNHLERH ENSQLRSENN RLRNENHQYR EAIANALCPK CGGHTAIGEM SFEEHHLRLE  180
NAGLAQEIQQ LSAVATKYAG RPRYALMAPP VPARPSEPGM ATNGRQVHES IQNHLRSIIG  240
VKDADKPLII ELAFRAMDEL IAMAQVGEPL WKGGVNGTSL ALDLDEYTRT FQNGLGPRLN  300
GFRTEASRET AVVYMNHMNI VKRLMDVNLW STMFAGMVAR AMTHEVLYAG VNGTFDGTLH  360
LMTVEYQILS PLVSTRESYF LRYCRQQGEG LWAVVDVSID HLIPNLELRC RRRPSGCLIQ  420
QIQNGFSKVT WVEHVELDDR GAIIHNLYKQ YISSGQALAA NRWVTTLDRQ SERLACMMAT  480
NIPSIEPGAL TNQSKHNILK LAERMGRSFF SGVTASTADL WCNLSGYAGD SIRLMTRTSV  540
NDPGRPPGVI LCAATSFWVP VPPSTVFDFL RDENTRIHWD VLSSAGNVQK LSQIANGRDS  600
RNCVSILQGP KTIEKNMIIV QETSTDPAAS FVISAPIDVS AMDLVFKGGD SDYVPMLPSG  660
FAILPDGMAR PGREGGSLLS VAFQILVESV HSAKLTFSSV ATIENLVLST VHK
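
The Start and End columns in the Signature Domain table are 1-based, inclusive positions in the protein sequence above. As a minimal sketch of how those coordinates map onto the sequence, the Python snippet below strips the spacing and position counters from the sequence block and slices out the two signature domains. The helper names (clean_sequence, domain) are illustrative only and are not part of any PlantTFDB tool.

```python
import re

# Protein sequence block copied from this record, including the
# spacing and the running position counters at the end of each line.
RAW = """
MYQPDMLVPV DTNKNGDDDN NENNNNNENN NNNMDGQGDT SFGDEEFDST NTSETQEDGS  60
EQDPRSNKRK RYHRHTQHQI QEMEAFFKEC PHPDDKQRKE LSRQLGLDHL QIKFWFQNKR  120
TQNKNHLERH ENSQLRSENN RLRNENHQYR EAIANALCPK CGGHTAIGEM SFEEHHLRLE  180
NAGLAQEIQQ LSAVATKYAG RPRYALMAPP VPARPSEPGM ATNGRQVHES IQNHLRSIIG  240
VKDADKPLII ELAFRAMDEL IAMAQVGEPL WKGGVNGTSL ALDLDEYTRT FQNGLGPRLN  300
GFRTEASRET AVVYMNHMNI VKRLMDVNLW STMFAGMVAR AMTHEVLYAG VNGTFDGTLH  360
LMTVEYQILS PLVSTRESYF LRYCRQQGEG LWAVVDVSID HLIPNLELRC RRRPSGCLIQ  420
QIQNGFSKVT WVEHVELDDR GAIIHNLYKQ YISSGQALAA NRWVTTLDRQ SERLACMMAT  480
NIPSIEPGAL TNQSKHNILK LAERMGRSFF SGVTASTADL WCNLSGYAGD SIRLMTRTSV  540
NDPGRPPGVI LCAATSFWVP VPPSTVFDFL RDENTRIHWD VLSSAGNVQK LSQIANGRDS  600
RNCVSILQGP KTIEKNMIIV QETSTDPAAS FVISAPIDVS AMDLVFKGGD SDYVPMLPSG  660
FAILPDGMAR PGREGGSLLS VAFQILVESV HSAKLTFSSV ATIENLVLST VHK
"""


def clean_sequence(raw: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


SEQ = clean_sequence(RAW)

# Coordinates taken from the Signature Domain table of this record.
HOMEOBOX = domain(SEQ, 69, 124)
START = domain(SEQ, 251, 473)

print(len(SEQ))        # total protein length
print(HOMEOBOX)        # Homeobox domain residues
print(len(START))      # START domain length
```

The Homeobox slice should reproduce the Thhalv10017809m row of the Homeobox alignment, and the START slice should match the three START alignment rows once their gap characters are removed.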
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  Thhalv10017809m
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_024003849.1  0.0      homeobox-leucine zipper protein HDG2
Swissprot  Q94C37          0.0      HDG2_ARATH; Homeobox-leucine zipper protein HDG2
TrEMBL     V4M9N2          0.0      V4M9N2_EUTSA; Uncharacterized protein (Fragment)
STRING     XP_006410369.1  0.0      (Eutrema salsugineum)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM49128149
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G05230.3  0.0      homeodomain GLABROUS 2