PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cotton_A_24123_BGI-A2_v1.0
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family HD-ZIP
Protein Properties Length: 790aa    MW: 85740.2 Da    PI: 6.0525
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cotton_A_24123_BGI-A2_v1.0genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox66.24.4e-21107162156
                                 TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
                    Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                                 +++ +++t++q++eLe+lF+++++p++++r eL+k+l L++rqVk+WFqNrR+++k
  Cotton_A_24123_BGI-A2_v1.0 107 KKRYHRHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMK 162
                                 688999***********************************************999 PP

2START204.54.2e-643025271206
                                 HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-T CS
                       START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWde 74 
                                 ela++a++elvk+a+ +ep+W +s     e +n de+ + f++  +     + +ea+r +gvv+ ++  lve+l+d++ +W e
  Cotton_A_24123_BGI-A2_v1.0 302 ELALAAMYELVKMAQTDEPLWISSLeggrEVLNHDEYSRMFTPCIGikpagFLTEASRQTGVVIINSLALVETLMDSN-RWAE 383
                                 5899*********************99999999999999999988899******************************.**** PP

                                 T-S....EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--..-TTSEE- CS
                       START  75 tla....kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe.sssvvRa 145
                                 +++    + +t++vissg      ga+qlm aelq+lsplvp R++ f+R+++q+ +g+w++vdvS+d  ++ +     +v++
  Cotton_A_24123_BGI-A2_v1.0 384 MFPcmiaRTSTTDVISSGmggtrnGAIQLMHAELQLLSPLVPvREVNFLRFCKQHAEGVWAVVDVSIDTLRETSGaPTTYVKC 466
                                 ************************************************************************9998899**** PP

                                 EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                       START 146 ellpSgiliepksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                                 ++lpSg+++++++ng+skvtwvehv+++++++h l+r+l++sg+ +ga++wvatlqrqce+
  Cotton_A_24123_BGI-A2_v1.0 467 RRLPSGCVVQDMPNGYSKVTWVEHVEYDESQVHHLYRPLLSSGIGFGAQRWVATLQRQCEC 527
                                 ***********************************************************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466898.36E-2191164IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.602.2E-2197164IPR009057Homeodomain-like
PROSITE profilePS5007117.41104164IPR001356Homeobox domain
SMARTSM003899.9E-18105168IPR001356Homeobox domain
CDDcd000861.24E-18106164No hitNo description
PfamPF000461.3E-18107162IPR001356Homeobox domain
PROSITE patternPS000270139162IPR017970Homeobox, conserved site
PROSITE profilePS5084844.598293530IPR002913START domain
SuperFamilySSF559612.2E-34295527No hitNo description
CDDcd088756.44E-126297526No hitNo description
PfamPF018528.0E-56302527IPR002913START domain
SMARTSM002341.6E-49302527IPR002913START domain
SuperFamilySSF559612.01E-23556783No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 790 aa     Download sequence    Send to blast
MSFGGFLDNS SGGGLGGATI VADIPFSNNM AAAGAMAQNI YNSPGLSLAL QQPSIDNQGD  60
GVRMGENFDA SIGRRSREEE HESRSGSDNI DGVSGDDHDA ADNRPRKKRY HRHTPQQIQE  120
LEALFKECPH PDEKQRLELS KRLCLETRQV KFWFQNRRTQ MKTQLERHEN SLLRQENDKL  180
RAENMSIREA MRNPICTNCG GPAIIGDLSL EEQHLRIENA RLKDELDRVC ALGSKFLGRP  240
LSSLATSIAP PLPNSNLELG VGSNGFGGLS TTLPLGPDFG GGVSNSLPVV PPNGVERSMF  300
LELALAAMYE LVKMAQTDEP LWISSLEGGR EVLNHDEYSR MFTPCIGIKP AGFLTEASRQ  360
TGVVIINSLA LVETLMDSNR WAEMFPCMIA RTSTTDVISS GMGGTRNGAI QLMHAELQLL  420
SPLVPVREVN FLRFCKQHAE GVWAVVDVSI DTLRETSGAP TTYVKCRRLP SGCVVQDMPN  480
GYSKVTWVEH VEYDESQVHH LYRPLLSSGI GFGAQRWVAT LQRQCECLAI LMSSTVPTGD  540
HTAITASGRR SMLKLAQRMT GNFCAGVCAS TVHKWNKLNA GNGEEDVRVM TRKSVDNPGE  600
PPGIVLSAAT SVWLPVSPQR LFDFLRDERL RSEWDILSNG GPMQEIAHIA KGQDHGNCVS  660
LLRSSAMNTN QSSMLILQET CMDAGGSLVV YAPVDIPAVQ VVMNGGDSAY VALLPSGFSI  720
IPDGTASPGP TTSNGNGDSH RVGGSLLTVA FQILVNSLPT AKLTVESVET VNNLISCTVQ  780
KIKAALQCES
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017608938.10.0PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLA0A0D2PSP00.0A0A0D2PSP0_GOSRA; Uncharacterized protein
STRINGGorai.001G171000.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM112827105
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]