PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.004G120700.2
Common NameB456_004G120700, LOC105791195
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family HD-ZIP
Protein Properties Length: 748aa    MW: 82193.7 Da    PI: 5.8455
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.004G120700.2genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox66.73.2e-2154109156
                         TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
            Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                         ++k +++t+ q++eLe++F+++++p++++r eL+++lgL+ +q+k+WFqNrR+++k
  Gorai.004G120700.2  54 KKKYHRHTPRQIQELESFFKECPHPDEKQRMELSRRLGLEGKQIKFWFQNRRTQMK 109
                         79999************************************************999 PP

2START166.91.4e-522554772205
                         HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EE CS
               START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....ka 79 
                         +a++a++el+k+a+ + p+W k      es+n +e+ ++f++  +     + +ea +a+g+v      lve l+d + +W e+++    +a
  Gorai.004G120700.2 255 VALSAMDELIKMAQMDNPLWIKGLgggmESLNVEEYKRNFSSCIGmksssYATEATKATGLVYLRGLALVEALMDAN-RWVEMFPcmisRA 344
                         7899************************************88887999999**************************.************* PP

                         EEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEE CS
               START  80 etlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghsk 163
                         +t++v+ssg      ++lq+m ae+q+lsplvp R + f+R+++q+++g+w++vdvS+d+ q+ + ++ +  +++lpSg++i++++   sk
  Gorai.004G120700.2 345 ATIDVLSSGtgvtrdNELQVMDAEFQVLSPLVPvRQVRFIRFCKQHSEGVWAVVDVSIDPSQDATDTHMFPNCRRLPSGCVIQDVDTKCSK 435
                         ****************************************************************99************************* PP

                         EEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
               START 164 vtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                         +twveh +++++ +h ll++l++sg  +ga +w+atlqrqc 
  Gorai.004G120700.2 436 ITWVEHSEYDDNAVHHLLQPLLSSGFGFGAHRWLATLQRQCD 477
                         ****************************************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.606.1E-2241111IPR009057Homeodomain-like
SuperFamilySSF466893.22E-2042111IPR009057Homeodomain-like
PROSITE profilePS5007117.5451111IPR001356Homeobox domain
SMARTSM003891.3E-1853115IPR001356Homeobox domain
PfamPF000467.8E-1954109IPR001356Homeobox domain
CDDcd000861.77E-1954111No hitNo description
PROSITE patternPS00027086109IPR017970Homeobox, conserved site
PROSITE profilePS5084837.174245481IPR002913START domain
SuperFamilySSF559611.51E-29247477No hitNo description
CDDcd088752.26E-103249476No hitNo description
SMARTSM002343.9E-34254478IPR002913START domain
PfamPF018523.9E-45255477IPR002913START domain
SuperFamilySSF559614.67E-19507741No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0016021Cellular Componentintegral component of membrane
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 748 aa     Download sequence    Send to blast
MDGHGEMGLI GENFDPGFVG RMKEDGYEIR SESDNFDVAS GDDQDAAADG PSKKKKYHRH  60
TPRQIQELES FFKECPHPDE KQRMELSRRL GLEGKQIKFW FQNRRTQMKT QLERHENVIL  120
RQENDKLRAE NDLLKQAMTT PICNSCGGPA VPGEISYEQH QLRIENARLK DELTRICALT  180
NKFLGRPLSS SGSPIPPHSL NSNLELAVGR NGFGGLNNAG TSLPMGFEFG DGSMMPIVKP  240
MVNEMQYDRS AFVDVALSAM DELIKMAQMD NPLWIKGLGG GMESLNVEEY KRNFSSCIGM  300
KSSSYATEAT KATGLVYLRG LALVEALMDA NRWVEMFPCM ISRAATIDVL SSGTGVTRDN  360
ELQVMDAEFQ VLSPLVPVRQ VRFIRFCKQH SEGVWAVVDV SIDPSQDATD THMFPNCRRL  420
PSGCVIQDVD TKCSKITWVE HSEYDDNAVH HLLQPLLSSG FGFGAHRWLA TLQRQCDCMA  480
ILMSQDIPGE NNTGITPAGR KSMIKLAQRM TYNFCAGVCA SSIHKWDKLS VGNVGEDVRV  540
MTRKNINDPG EPHGVVLSAA TSVWMPVTQE RLFDFLRDER MRSEWDILSN GGPMQEMVHV  600
AKGMGHGNCV SLLRGSAINA NENNMLILQE TWSDASGALV VYAPVDISSM SVVMNGGDSA  660
YVALLPSGFA ILPGISPSYH GGRSESNGAL VKPEIDGSIV SGCLLTVGFQ ILVNNVPTAK  720
LTVESVETVN NLISCTIQKI KAALTVT*
Expression -- Description ? help Back to Top
Source Description
UniprotTISSUE SPECIFICITY: Expressed in roots, stems, leaves and floral buds. {ECO:0000269|PubMed:10402424}.
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012474603.10.0PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLA0A0D2MX630.0A0A0D2MX63_GOSRA; Uncharacterized protein
STRINGGorai.004G120700.10.0(Gossypium raimondii)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]