PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cotton_A_38379_BGI-A2_v1.0
Common Name F383_24730
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family HD-ZIP
Protein Properties Length: 725 aa    MW: 79660.6 Da    pI: 5.8687
Description HD-ZIP family protein
Gene Model
Gene Model ID               Type    Source  Coding Sequence
Cotton_A_38379_BGI-A2_v1.0  genome  BGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  59.9   4e-19    59     113  2          56
                                 T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
                    Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                                 ++ +++t+ q++e+e++F+++++p+ ++r+eL ++lgL+  qVk+WFqN+R+++k
  Cotton_A_38379_BGI-A2_v1.0  59 KRYHRHTQRQIQEMEAFFKECPHPDDKQRKELGRELGLEPLQVKFWFQNKRTQMK 113
                                 567899**********************************************998 PP

2    START     216.7  8e-68    247    466  1          206
                                 HHHHHHHHHHHHHHHHC-TT-EEEE...EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT CS
                       START   1 elaeeaaqelvkkalaeepgWvkss...esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdet 75 
                                 ela +a++el+++a+++ep+Wv      + + +de+l++f+++ +      ++ea+r+s+vv+m++++lve+l+d++ qW+  
  Cotton_A_38379_BGI-A2_v1.0 247 ELAVAAMEELIRMAQSGEPLWVPGDnsiDVLSEDEYLRTFPRGIGpkplgLRSEASRESAVVIMNHVNLVEILMDVN-QWSSV 328
                                 57899*****************76666799************999********************************.***** PP

                                 -S....EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EE CS
                       START  76 la....kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRael 147
                                 +     +a tlev+s+g      galq+m+ae+q++splvp R+ +fvRy++q+ +g+w++vdvS+d+ +++p    + ++++
  Cotton_A_38379_BGI-A2_v1.0 329 FCgivsRAVTLEVLSTGvagnynGALQVMTAEFQVPSPLVPtRENYFVRYCKQHIDGTWAVVDVSLDNLRPNP----MSKCRR 407
                                 *99999******************************************************************9....688999 PP

                                 SSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                       START 148 lpSgiliepksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                                 +pSg+li++++ng+skv+wvehv++++r +h+++r++v+sgla+gak+wvatl+rqce+
  Cotton_A_38379_BGI-A2_v1.0 408 RPSGCLIQELPNGYSKVIWVEHVEVDDRAIHNIYRPVVNSGLAFGAKRWVATLDRQCER 466
                                 *********************************************************97 PP

Protein Features
3D Structure
Database         Entry ID           E-value    Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60   3.4E-21    43     113  IPR009057    Homeodomain-like
SuperFamily      SSF46689           5.01E-18   44     115  IPR009057    Homeodomain-like
PROSITE profile  PS50071            16.05      55     115  IPR001356    Homeobox domain
SMART            SM00389            2.4E-17    56     119  IPR001356    Homeobox domain
CDD              cd00086            2.10E-17   59     116  No hit       No description
Pfam             PF00046            8.5E-17    59     113  IPR001356    Homeobox domain
SuperFamily      SSF55961           5.13E-36   238    468  No hit       No description
PROSITE profile  PS50848            44.108     238    469  IPR002913    START domain
CDD              cd08875            1.33E-126  242    465  No hit       No description
SMART            SM00234            2.2E-69    247    466  IPR002913    START domain
Pfam             PF01852            2.7E-57    248    466  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  2.6E-5     344    466  IPR023393    START-like domain
SuperFamily      SSF55961           6.68E-26   488    717  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0009845  Biological Process  seed germination
GO:0009913  Biological Process  epidermal cell differentiation
GO:0048497  Biological Process  maintenance of floral organ identity
GO:0048825  Biological Process  cotyledon development
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 725 aa
MFSPNLFESP HMFDMSHKTS ESELMGKVRD DDYEIKSVTE TMDAPSGDDQ DPDQRPKMKR  60
YHRHTQRQIQ EMEAFFKECP HPDDKQRKEL GRELGLEPLQ VKFWFQNKRT QMKAQHERHE  120
NAILKAENEK LRAENNRYKE ALSNATCPSC GGPAALGEMS FDEQHLRIEN ARLREEIDRI  180
SGIAAKYVGK PLSSLPHLSS HLHSRSVDLG ASNFGTQSGF VGEMDRSVDL LRSVSGPTEA  240
DKPMIVELAV AAMEELIRMA QSGEPLWVPG DNSIDVLSED EYLRTFPRGI GPKPLGLRSE  300
ASRESAVVIM NHVNLVEILM DVNQWSSVFC GIVSRAVTLE VLSTGVAGNY NGALQVMTAE  360
FQVPSPLVPT RENYFVRYCK QHIDGTWAVV DVSLDNLRPN PMSKCRRRPS GCLIQELPNG  420
YSKVIWVEHV EVDDRAIHNI YRPVVNSGLA FGAKRWVATL DRQCERLASS MASNIPAGDL  480
CVITSLEGRK SMLKLAERMV TSFCTGVGAS TAHAWTTLSA TGSDDVRVMT RKSMDDPGRP  540
PGIVLSAATS FWIPVPPKRV FDFLRDENSR SEWDILSNGG LVQEMAHIAN GRDPGNCVSL  600
LRVNSANSSQ SNMLILQESC TDATGSYVIY APVDIVAMNV VLSGGDPDYL ALLPSGFAIL  660
PDGPGVNGGG ILEIGSGGSL LTVAFQILVD SVPTAKLSLG SVTTVNSLIK CTVERIKAAV  720
MCNNA
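The sequence block above is laid out in ten-residue groups with a running residue count at the end of each full line. A minimal sketch, assuming that layout, of how to recover the plain sequence and cut out a domain using the 1-based, inclusive Start/End coordinates listed in the Signature Domain table. The helper names `parse_numbered_sequence` and `slice_domain` are hypothetical and are not part of any PlantTFDB tool.

```python
import re

def parse_numbered_sequence(block: str) -> str:
    """Join grouped residues into one string (hypothetical helper)."""
    # Keep letters only; running counts (digits) and whitespace are discarded.
    return re.sub(r"[^A-Za-z]", "", block).upper()

def slice_domain(sequence: str, start: int, end: int) -> str:
    """Return residues start..end; both coordinates are 1-based and inclusive."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]

# First two lines of the protein sequence from this record.
block = """
MFSPNLFESP HMFDMSHKTS ESELMGKVRD DDYEIKSVTE TMDAPSGDDQ DPDQRPKMKR  60
YHRHTQRQIQ EMEAFFKECP HPDDKQRKEL GRELGLEPLQ VKFWFQNKRT QMKAQHERHE  120
"""
seq = parse_numbered_sequence(block)

# Homeobox hit: Start 59, End 113 in the Signature Domain table.
homeobox = slice_domain(seq, 59, 113)
print(len(seq), len(homeobox))
print(homeobox)
```

The slice should reproduce the query line of the Homeobox alignment, which is a quick way to confirm that the table coordinates and the sequence agree.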
Functional Description
Source   Description
UniProt  Probable transcription factor that binds to the L1 box DNA sequence 5'-TAAATG[CT]A-3'. Plays a role in maintaining the identity of L1 cells, possibly by interacting with their L1 box or other target-gene promoters. Functionally redundant to ATML1. {ECO:0000269|PubMed:12505995}.
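The L1 box consensus quoted in the description, 5'-TAAATG[CT]A-3', is written in a bracket notation that maps directly onto a regular expression. A minimal sketch of scanning a DNA string for it on both strands follows. The helper name `find_l1_boxes` and the example promoter string are made up for illustration and do not come from this record.

```python
import re

# Lookahead form so that overlapping sites are also reported.
L1_BOX = re.compile(r"(?=(TAAATG[CT]A))")
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(dna: str) -> str:
    return dna.upper().translate(COMPLEMENT)[::-1]

def find_l1_boxes(dna: str) -> list[tuple[int, str, str]]:
    """Return (1-based forward-strand start, strand, matched site) tuples."""
    dna = dna.upper()
    hits = [(m.start() + 1, "+", m.group(1)) for m in L1_BOX.finditer(dna)]
    for m in L1_BOX.finditer(reverse_complement(dna)):
        site = m.group(1)
        # Map the reverse-strand match back to forward-strand coordinates.
        start = len(dna) - (m.start() + len(site)) + 1
        hits.append((start, "-", site))
    return sorted(hits)

# Hypothetical example sequence, not taken from this record.
promoter = "GGTAAATGCATTTCCTGCATTTACC"
for start, strand, site in find_l1_boxes(promoter):
    print(start, strand, site)
```

Reporting minus-strand hits by their leftmost forward-strand position keeps both strands in one coordinate system, which makes the hits easier to compare against annotated promoter regions.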
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  JN585951  0.0      JN585951.1 Gossypium hirsutum HD-1A gene, complete cds.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_017615807.1      0.0      PREDICTED: homeobox-leucine zipper protein MERISTEM L1-like
Swissprot  Q93V99              0.0      PDF2_ARATH; Homeobox-leucine zipper protein PROTODERMAL FACTOR 2
TrEMBL     A0A0B0P3W4          0.0      A0A0B0P3W4_GOSAR; Homeobox-leucine zipper MERISTEM L1-like protein
STRING     Gorai.010G177800.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM491               28           149
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G21750.2  0.0      HD-ZIP family protein
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]