PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thhalv10008672m
Common NameEUTSA_v10008672mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Eutremeae; Eutrema
Family HD-ZIP
Protein Properties Length: 231aa    MW: 26694.8 Da    PI: 4.8759
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thhalv10008672mgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox54.81.6e-1768121356
                      --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
         Homeobox   3 kRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                      k++++t+ ql+ Lee Fe+++ +  e++  LA+klgL+ +qV vWFqNrRa+ k
  Thhalv10008672m  68 KKRKLTPVQLRLLEESFEEEKRLEPERKLWLAEKLGLQPSQVAVWFQNRRARFK 121
                      56699***********************************************98 PP

2HD-ZIP_I/II113.71.1e-3668158292
      HD-ZIP_I/II   2 kkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreelk 92 
                      kkr+l+  q++lLEesFeee++LeperK  la++Lglqp qvavWFqnrRAR+ktkqlE+d+++Lk++y +lk++++ L  ++e+L+++++
  Thhalv10008672m  68 KKRKLTPVQLRLLEESFEEEKRLEPERKLWLAEKLGLQPSQVAVWFQNRRARFKTKQLEQDCDSLKASYAKLKTDRDILFLQNETLKNKVA 158
                      9**************************************************************************************9886 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466892.17E-1849125IPR009057Homeodomain-like
PROSITE profilePS5007116.48763123IPR001356Homeobox domain
SMARTSM003891.7E-1666127IPR001356Homeobox domain
CDDcd000867.32E-1568124No hitNo description
PfamPF000465.8E-1568121IPR001356Homeobox domain
Gene3DG3DSA:1.10.10.609.0E-1970130IPR009057Homeodomain-like
PRINTSPR000317.3E-594103IPR000047Helix-turn-helix motif
PROSITE patternPS00027098121IPR017970Homeobox, conserved site
PRINTSPR000317.3E-5103119IPR000047Helix-turn-helix motif
PfamPF021831.0E-12123164IPR003106Leucine zipper, homeobox-associated
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 231 aa     Download sequence    Send to blast
MENSNTDSEF LFWFQNQKQS HSHNFASSCF PPSSHSAFYG SSSMSNTETN TVDDEDVFES  60
YKMREITKKR KLTPVQLRLL EESFEEEKRL EPERKLWLAE KLGLQPSQVA VWFQNRRARF  120
KTKQLEQDCD SLKASYAKLK TDRDILFLQN ETLKNKVALL KEKLKMKDNL ETQPMEAEKL  180
GEEGSSVKSD DNTQCSEEEE ALGNQYSFPE LAALGFYYDP TLAASNLRLW *
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1115123RRARFKTKQ
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapThhalv10008672m
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0003481e-120AC000348.2 Genomic sequence for Arabidopsis thaliana BAC T7N9 from chromosome I, complete sequence.
GenBankCP0026841e-120CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_024008463.11e-158homeobox-leucine zipper protein ATHB-54 isoform X1
SwissprotP0CJ651e-137ATB54_ARATH; Homeobox-leucine zipper protein ATHB-54
TrEMBLV4K9K41e-168V4K9K4_EUTSA; Uncharacterized protein
STRINGXP_006415992.11e-169(Eutrema salsugineum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM159081314
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G27045.11e-119HD-ZIP family protein