PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Araha.4851s0007.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis
Family HD-ZIP
Protein Properties    Length: 722 aa    MW: 79309.5 Da    pI: 5.7353
Description HD-ZIP family protein
Gene Model
Gene Model ID    Type    Source    Coding Sequence
Araha.4851s0007.1.p    genome    JGI    View CDS
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1    Homeobox    61.5    1.3e-19    65    121    1    57
                          TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHHC CS
             Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakekk 57 
                          +++ +++t+ q++e+e++F+++++p+ ++r++L+++lgL+  qVk+WFqN+R+++k+
  Araha.4851s0007.1.p  65 KKRYHRHTQLQIQEMEAFFKECPHPDDKQRKQLSRELGLEPLQVKFWFQNKRTQMKN 121
                          688999************************************************995 PP

2    START    222    1.8e-69    252    465    2    206
                          HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEE CS
                START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetl 82 
                          la +a++elv++++++ep+W++++  ++++e+ ++f+++ +     +++ea+r+s+vv+m++++ ve+l+d++ qW++ +a    +a tl
  Araha.4851s0007.1.p 252 LAVAAMEELVRMVQVDEPLWKSLV--LDEEEYARTFPRGIGprpagYRSEASRESAVVIMNHVNIVEILMDVN-QWSTIFAgmvsRAITL 338
                          6789********************..************999********************************.**************** PP

                          EEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEE CS
                START  83 evissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvt 165
                          +v+s+g      galq+m+ae+q++splvp R+ +f+Ry++q+g+g+w++vd+S+ds q++p     +R++++ Sg+li++++ng+skvt
  Araha.4851s0007.1.p 339 AVLSTGvagnynGALQVMTAEFQVPSPLVPtRETYFARYCKQQGDGSWAVVDISLDSLQPNPP----ARCRRRASGCLIQEMPNGYSKVT 424
                          **************************************************************8....*********************** PP

                          EEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                START 166 wvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                          wvehv++++r +h+l++++v++g+a+gak+wva l+rqce+
  Araha.4851s0007.1.p 425 WVEHVEVDDRGVHNLYKHMVSTGHAFGAKRWVAILDRQCER 465
                          ***************************************97 PP
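The Start and End columns of the Signature Domain table can be checked against the protein sequence listed on this page. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which matches the alignment above); `domain_slice` and `SEQ_PREFIX` are illustrative names, not part of PlantTFDB.

```python
# First 180 residues of the protein sequence shown on this page,
# with spaces and position counters removed.
SEQ_PREFIX = (
    "MFEPNMLLAAMNNADSNNHNYNHEDNNNEGFLRDDEFDSPNTKSGSENQEGGSGNDQDPL"
    "HPNKKKRYHRHTQLQIQEMEAFFKECPHPDDKQRKQLSRELGLEPLQVKFWFQNKRTQMK"
    "NHHERHENSHLRAENEKLRNDNLRYREALANASCPNCGGPTAIGEMSFDEHQLRLENARL"
)


def domain_slice(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive.

    Assumption: the table's coordinates follow this convention.
    """
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# Homeobox domain, residues 65 to 121 according to the table.
homeobox = domain_slice(SEQ_PREFIX, 65, 121)
print(len(homeobox), homeobox)
```

The extracted string should equal the query row of the Homeobox alignment, which gives a quick sanity check on the coordinate convention.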

Protein Features
3D Structure
Database    Entry ID    E-value    Start    End    InterPro ID    Description
Gene3D    G3DSA:1.10.10.60    1.4E-21    44    120    IPR009057    Homeodomain-like
SuperFamily    SSF46689    4.6E-19    53    122    IPR009057    Homeodomain-like
PROSITE profile    PS50071    16.471    62    122    IPR001356    Homeobox domain
SMART    SM00389    1.6E-18    63    126    IPR001356    Homeobox domain
CDD    cd00086    7.96E-19    65    123    No hit    No description
Pfam    PF00046    3.2E-17    65    120    IPR001356    Homeobox domain
PROSITE pattern    PS00027    0    97    120    IPR017970    Homeobox, conserved site
PROSITE profile    PS50848    42.05    242    468    IPR002913    START domain
SuperFamily    SSF55961    4.23E-33    243    467    No hit    No description
CDD    cd08875    1.35E-123    246    464    No hit    No description
SMART    SM00234    1.5E-62    251    465    IPR002913    START domain
Pfam    PF01852    8.4E-62    252    465    IPR002913    START domain
Gene3D    G3DSA:3.30.530.20    6.7E-5    344    431    IPR023393    START-like domain
SuperFamily    SSF55961    2.2E-26    484    713    No hit    No description
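Several databases report overlapping hits for the same region, so it can help to collapse the Start/End pairs into distinct regions. The sketch below merges overlapping intervals. The coordinates are read from the table above, and the Gene3D START-like hit is assumed to span 344 to 431. `merge_intervals` and `HITS` are illustrative names.

```python
def merge_intervals(hits):
    """Merge overlapping (start, end) pairs into non-overlapping regions."""
    merged = []
    for start, end in sorted(hits):
        if merged and start <= merged[-1][1]:
            # Overlaps the current region: extend its end if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


# (start, end) for each row of the Protein Features table.
HITS = [
    (44, 120), (53, 122), (62, 122), (63, 126), (65, 123), (65, 120),
    (97, 120), (242, 468), (243, 467), (246, 464), (251, 465), (252, 465),
    (344, 431),  # assumed reading of the Gene3D START-like row
    (484, 713),
]

regions = merge_intervals(HITS)
print(regions)  # [(44, 126), (242, 468), (484, 713)]
```

Under these assumptions the hits collapse into three regions: the homeodomain, the START domain, and a C-terminal SuperFamily hit.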
Gene Ontology
GO Term    GO Category    GO Description
GO:0006355    Biological Process    regulation of transcription, DNA-templated
GO:0010090    Biological Process    trichome morphogenesis
GO:0048497    Biological Process    maintenance of floral organ identity
GO:0008289    Molecular Function    lipid binding
GO:0043565    Molecular Function    sequence-specific DNA binding
Sequence
Protein Sequence    Length: 722 aa
MFEPNMLLAA MNNADSNNHN YNHEDNNNEG FLRDDEFDSP NTKSGSENQE GGSGNDQDPL  60
HPNKKKRYHR HTQLQIQEME AFFKECPHPD DKQRKQLSRE LGLEPLQVKF WFQNKRTQMK  120
NHHERHENSH LRAENEKLRN DNLRYREALA NASCPNCGGP TAIGEMSFDE HQLRLENARL  180
REEIDRISAI AAKYVGKPVS NYPLMSPPPL PPRPLELAMG NLGGEAYGNN STDLLKSITA  240
PTESDKPVII DLAVAAMEEL VRMVQVDEPL WKSLVLDEEE YARTFPRGIG PRPAGYRSEA  300
SRESAVVIMN HVNIVEILMD VNQWSTIFAG MVSRAITLAV LSTGVAGNYN GALQVMTAEF  360
QVPSPLVPTR ETYFARYCKQ QGDGSWAVVD ISLDSLQPNP PARCRRRASG CLIQEMPNGY  420
SKVTWVEHVE VDDRGVHNLY KHMVSTGHAF GAKRWVAILD RQCERLASVM ATNISSGEVG  480
VITNQEGRRS MLKLAERMVI SFCAGVSAST AHTWTTLSGT GAEDVRVMTR KSVDDPGRPP  540
GIVLSAATSF WIPVPPKRVF DFLRDENSRN EWDILSNGGV VQEMAHIANG RDTGNCVSLL  600
RVNSANSSQS NMLILQESCT DPTASFVIYA PVDIVAMNIV LNGGDPDYVA LLPSGFAILP  660
DGNANSGAPG GDGGSLLTVA FQILVDSVPT AKLSLGSVAT VNNLIACTVE RIKASMSCET  720
A*
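The sequence above is laid out in blocks of ten residues with a running position counter at the end of each line and a terminal `*`. A minimal sketch for turning that layout back into a plain residue string follows; `parse_sequence_block` is an illustrative helper, and it assumes that every letter on a line is a residue.

```python
import re


def parse_sequence_block(lines):
    """Join formatted sequence lines into one uppercase residue string.

    Everything that is not a letter is dropped: spaces, the trailing
    position counter on each line, and the terminal '*'.
    """
    return "".join(re.sub(r"[^A-Za-z]", "", line) for line in lines).upper()


# The first two lines of the block above, copied as displayed.
sample = [
    "MFEPNMLLAA MNNADSNNHN YNHEDNNNEG FLRDDEFDSP NTKSGSENQE GGSGNDQDPL  60",
    "HPNKKKRYHR HTQLQIQEME AFFKECPHPD DKQRKQLSRE LGLEPLQVKF WFQNKRTQMK  120",
]

sequence = parse_sequence_block(sample)
print(len(sequence))  # 120
```

Applied to all lines of the block, the result should have the length reported in the Protein Sequence header.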
Functional Description
Source    Description
UniProt    Probable transcription factor. {ECO:0000250}.
Cis-element
Source    Link
PlantRegMap    Araha.4851s0007.1.p
Regulation -- PlantRegMap
Source    Upstream Regulator    Target Gene
PlantRegMap    Retrieve    -
Annotation -- Nucleotide
Source    Hit ID    E-value    Description
GenBank    AK316776    0.0    AK316776.1 Arabidopsis thaliana AT1G05230 mRNA, complete cds, clone: RAFL09-78-H10.
Annotation -- Protein
Source    Hit ID    E-value    Description
Refseq    XP_020870784.1    0.0    homeobox-leucine zipper protein HDG2 isoform X1
Refseq    XP_020870785.1    0.0    homeobox-leucine zipper protein HDG2 isoform X1
Swissprot    Q94C37    0.0    HDG2_ARATH; Homeobox-leucine zipper protein HDG2
TrEMBL    B9DFH8    0.0    B9DFH8_ARATH; AT1G05230 protein
STRING    AT1G05230.4    0.0    (Arabidopsis thaliana)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
Malvids    OGEM49128149
Best hit in Arabidopsis thaliana
Hit ID    E-value    Description
AT1G05230.4    0.0    homeodomain GLABROUS 2