PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gh_A10G0143
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family HD-ZIP
Protein Properties Length: 744 aa    MW: 81286.8 Da    pI: 6.037
Description HD-ZIP family protein
Gene Model
Gene Model ID  Type    Source   Coding Sequence
Gh_A10G0143    genome  NAU-NBI  View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  61.8   1e-19    64     119  1          56
                  TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
     Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                  r++ +++t+ q++e+e+lF+++++p+ ++r++L+++lgL+  qVk+WFqN+R++ k
  Gh_A10G0143  64 RKRYHRHTQRQIQEMEALFKECPHPDDKQRKQLSRELGLDPLQVKFWFQNKRTQLK 119
                  789999**********************************************9877 PP

2    START     208    3.7e-65  264    486  1          206
                  HHHHHHHHHHHHHHHHC-TT-EEEE...EXCCTTEEEEEEESSS.........SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEE CS
        START   1 elaeeaaqelvkkalaeepgWvkss...esengdevlqkfeeskv........dsgealrasgvvdmvlallveellddkeqWdetla....kaetle 83 
                  ela +a++el+++a+++ep+Wv++    + +n+ e+l++f+++ +        +++ea+r+ +v +m++++lve+l+d++ qW++ +     +a+tl+
  Gh_A10G0143 264 ELAVTAMEELIRMAQSGEPLWVTDEnsiDVLNENEYLRIFPRGIGskpfanlgFRSEASREAAVIIMNPVNLVEILMDVN-QWSTVFCgivsRAMTLD 360
                  57899*********************99**************999***********************************.******99999****** PP

                  EECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--S CS
        START  84 vissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkg 174
                  v+s+g      galq+m+ae+q++splvp R+ +f+Ry++++ +g w++vdvS+d+ ++ p    + R++++pSg+li++++ng+skv+wve+v++++
  Gh_A10G0143 361 VLSTGvagnynGALQVMTAEFQLPSPLVPtRENYFARYCKRHHDGIWAVVDVSLDNLRHAP----FTRCRRRPSGCLIQELPNGYSKVIWVENVEVDD 454
                  ***********************************************************99....9******************************** PP

                  SXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
        START 175 rlphwllrslvksglaegaktwvatlqrqcek 206
                  r ++ +++ lv+++la+gak+wvatl+rqce+
  Gh_A10G0143 455 RGVSDIYKTLVNTSLAFGAKRWVATLDRQCER 486
                  ******************************97 PP

Protein Features
3D Structure
Database         Entry ID           E-value    Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60   6.9E-22    44     119  IPR009057    Homeodomain-like
SuperFamily      SSF46689           5.01E-19   48     121  IPR009057    Homeodomain-like
PROSITE profile  PS50071            16.811     61     121  IPR001356    Homeobox domain
SMART            SM00389            6.0E-18    62     125  IPR001356    Homeobox domain
CDD              cd00086            8.51E-18   63     121  No hit       No description
Pfam             PF00046            2.6E-17    64     119  IPR001356    Homeobox domain
PROSITE pattern  PS00027            0          96     119  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848            44.819     255    489  IPR002913    START domain
SuperFamily      SSF55961           1.14E-33   256    488  No hit       No description
CDD              cd08875            2.14E-119  259    485  No hit       No description
SMART            SM00234            2.1E-66    264    486  IPR002913    START domain
Pfam             PF01852            1.5E-55    265    486  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  3.8E-6     362    486  IPR023393    START-like domain
SuperFamily      SSF55961           1.35E-22   508    735  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 744 aa
MFNSDLYENP NMFDMFQRPS DSDQTERDDD NNDTKSGTEV DAPSADDDNQ GPASSGPSRR  60
RAKRKRYHRH TQRQIQEMEA LFKECPHPDD KQRKQLSREL GLDPLQVKFW FQNKRTQLKA  120
QTERHENGLL KAENEKLRAE NHRYKEALNN ISCPTCGGPA ALGEMSFEEQ HLRLENARLR  180
EEIERISGVT AKYVGKPIGP SFSRFADRAP ISFGTQPGFL GEYGGPGGAA GGPGVGAGGP  240
GGGLGEVLRP VSVTNEADKP LIVELAVTAM EELIRMAQSG EPLWVTDENS IDVLNENEYL  300
RIFPRGIGSK PFANLGFRSE ASREAAVIIM NPVNLVEILM DVNQWSTVFC GIVSRAMTLD  360
VLSTGVAGNY NGALQVMTAE FQLPSPLVPT RENYFARYCK RHHDGIWAVV DVSLDNLRHA  420
PFTRCRRRPS GCLIQELPNG YSKVIWVENV EVDDRGVSDI YKTLVNTSLA FGAKRWVATL  480
DRQCERLASA MANNIPAGDL GVLNSSDGRK SILKLAERMV NSFCTGVGAS TAHAWTTLTG  540
SDEIRVMTRK SIDDPGRPPG IVLSAATSFW VAVPPRKAFN ILRSEKFRSE WDILSNGGVV  600
DEMAHIANGR DPGNCVSLLR VKGANASQSN MLILQESSND ATGSYVIYAP VDFAAMNIVL  660
NGGDPDYVAL LPSGFAILPD REGPNRGIGI TEIGSGGSLV TLAFQILVDS APNSKISVGS  720
VATVNSLIKC TLERIRTAVM CNDA
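
The domain coordinates in the Signature Domain table can be cross-checked against the protein sequence. The following is a minimal Python sketch, assuming the Start/End values are 1-based and inclusive. The function names `parse_blocked_sequence` and `subsequence` are illustrative and are not part of any PlantTFDB tool. Only the first 120 residues of the record are pasted in to keep the example short.

```python
# First two lines of the blocked protein sequence, copied as displayed
# (10-residue blocks followed by a running residue counter).
EXCERPT = """\
MFNSDLYENP NMFDMFQRPS DSDQTERDDD NNDTKSGTEV DAPSADDDNQ GPASSGPSRR  60
RAKRKRYHRH TQRQIQEMEA LFKECPHPDD KQRKQLSREL GLDPLQVKFW FQNKRTQLKA  120
"""


def parse_blocked_sequence(text):
    """Join the residue blocks into one string, skipping the numeric counters."""
    blocks = []
    for line in text.splitlines():
        for token in line.split():
            if token.isalpha():  # counters such as '60' are not alphabetic
                blocks.append(token.upper())
    return "".join(blocks)


def subsequence(seq, start, end):
    """Return residues start..end, treating both as 1-based and inclusive (assumed)."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


seq = parse_blocked_sequence(EXCERPT)
homeobox = subsequence(seq, 64, 119)  # Homeobox row: Start 64, End 119
print(len(seq), len(homeobox))
print(homeobox)
```

Under that assumption, the extracted segment can be compared by eye with the Gh_A10G0143 row of the Homeobox alignment, which covers the same 64 to 119 range.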
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    58     64   RRRAKRK
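
One way to sanity-check the NLS coordinates is to search for the motif in the protein sequence. The sketch below is a minimal Python example using only the N-terminal 70 residues copied from the Sequence section; the variable and function names are illustrative. It reports zero-based, inclusive offsets, which is an assumption about how the Start/End columns are counted here and not documented PlantTFDB behavior.

```python
# N-terminal 70 residues of Gh_A10G0143, copied from the Sequence section.
N_TERMINUS = (
    "MFNSDLYENPNMFDMFQRPSDSDQTERDDDNNDTKSGTEVDAPSADDDNQGPASSGPSRR"
    "RAKRKRYHRH"
)
NLS_MOTIF = "RRRAKRK"


def locate_motif(seq, motif):
    """Return (start, end) of the first occurrence as zero-based inclusive offsets.

    Returns None when the motif is absent.
    """
    index = seq.find(motif)
    if index == -1:
        return None
    return index, index + len(motif) - 1


span = locate_motif(N_TERMINUS, NLS_MOTIF)
print(span)
```

If the table were counted from 1 instead, the same motif would be reported one position higher at both ends, so the counting convention is worth confirming before reusing these coordinates.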
Expression -- UniGene
UniGene ID  E-value  Expressed in
Ghi.3762    1e-179   ovule
Expression -- Description
Source   Description
UniProt  TISSUE SPECIFICITY: Specifically expressed in the layer 1 (L1) of shoot meristems. {ECO:0000269|PubMed:12505995}.
Functional Description
Source   Description
UniProt  Probable transcription factor that binds to the L1 box DNA sequence 5'-TAAATG[CT]A-3'. Plays a role in maintaining the identity of L1 cells, possibly by interacting with their L1 box or other target-gene promoters. Functionally redundant to ATML1. {ECO:0000269|PubMed:12505995}.
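
The L1 box consensus quoted above, 5'-TAAATG[CT]A-3', maps directly onto a regular expression. The following is a minimal Python sketch for scanning both strands of a DNA sequence for that pattern. The function names and the test sequence are invented for illustration; the test sequence is not a real promoter.

```python
import re

# L1 box consensus from the functional description: TAAATG followed by C or T, then A.
L1_BOX = re.compile(r"TAAATG[CT]A")
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an upper- or lower-case DNA string."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def find_l1_boxes(dna):
    """List (start, strand, site) hits; start is a 0-based offset on the given strand."""
    dna = dna.upper()
    hits = [(m.start(), "+", m.group()) for m in L1_BOX.finditer(dna)]
    length = len(dna)
    for m in L1_BOX.finditer(reverse_complement(dna)):
        # Map the match on the reverse strand back to forward-strand coordinates.
        hits.append((length - m.end(), "-", m.group()))
    return sorted(hits)


# Hypothetical test sequence with one site on each strand.
example = "CCTAAATGTAGGTGCATTTACC"
for hit in find_l1_boxes(example):
    print(hit)
```

A match only shows that the consensus is present in the sequence; it says nothing about whether the factor binds there in vivo.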
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  JX615980  5e-96    JX615980.1 Gossypium hirsutum clone NBRI_GE60293 microsatellite sequence.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_016711277.1      0.0      PREDICTED: homeobox-leucine zipper protein PROTODERMAL FACTOR 2-like
Swissprot  Q93V99              0.0      PDF2_ARATH; Homeobox-leucine zipper protein PROTODERMAL FACTOR 2
TrEMBL     A0A1U8L9J6          0.0      A0A1U8L9J6_GOSHI; homeobox-leucine zipper protein PROTODERMAL FACTOR 2-like
STRING     Gorai.011G016200.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM2482434
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G04890.1  0.0      protodermal factor 2
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]