PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Glyma.10G238100.2.p
Common NameGLYMA_10G238100, LOC100794400
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family HD-ZIP
Protein Properties Length: 752aa    MW: 82247.8 Da    PI: 5.8437
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Glyma.10G238100.2.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox64.91.2e-2054109156
                          TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
             Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                          +++ +++t++q++eLe++F+++++p++++r +L+k+l L+ +qVk+WFqNrR+++k
  Glyma.10G238100.2.p  54 KKRYHRHTPHQIQELEAFFKECPHPDEKQRLDLSKRLALENKQVKFWFQNRRTQMK 109
                          688999***********************************************999 PP

2START180.11.2e-562644872206
                          HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....E CS
                START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....k 78 
                          la++a++el+k+ +ae+p+W ks     e+ n +e+ + f++  +     + +ea r++g+v+ ++  lve+l+d + +W e+++    +
  Glyma.10G238100.2.p 264 LALAAMEELLKMTQAESPLWIKSLdgekEMFNHEEYARLFSPCIGpkptgYITEATRETGIVIINSLALVETLMDAN-RWAEMFPsmiaR 352
                          6899********************999999999*******99988********************************.*******99999 PP

                          EEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCE CS
                START  79 aetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksngh 161
                          a  l+vis+g      galq+m ae q+lsplvp R + f+R+++q+ +g+w++vdvS++   +  + ++v+ +++lpSg+++++++ng+
  Glyma.10G238100.2.p 353 AINLDVISNGmggtrnGALQVMHAEVQLLSPLVPvRQVRFIRFCKQHAEGVWAVVDVSIEIGHDAANAQPVMSCRRLPSGCIVQDMPNGY 442
                          999*********************************************************99999999********************** PP

                          EEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                START 162 skvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                          skvtw+eh++++++++h+l+r+l++sg+ +ga +w atlqrqce+
  Glyma.10G238100.2.p 443 SKVTWLEHWEYDENVVHQLYRPLLSSGVGFGAHRWIATLQRQCEC 487
                          *******************************************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.601.7E-2138105IPR009057Homeodomain-like
SuperFamilySSF466891.41E-1940111IPR009057Homeodomain-like
PROSITE profilePS5007117.29751111IPR001356Homeobox domain
SMARTSM003899.5E-1852115IPR001356Homeobox domain
CDDcd000866.47E-1853111No hitNo description
PfamPF000462.6E-1854109IPR001356Homeobox domain
PROSITE patternPS00027086109IPR017970Homeobox, conserved site
PROSITE profilePS5084840.996254490IPR002913START domain
SuperFamilySSF559616.87E-31256487No hitNo description
CDDcd088756.80E-117258486No hitNo description
SMARTSM002341.4E-35263487IPR002913START domain
PfamPF018523.3E-47264487IPR002913START domain
SuperFamilySSF559617.38E-21515745No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 752 aa     Download sequence    Send to blast
MEGPSEIGLI GENFDAGLMG RMRDDEYESR SGSDNFEGAS GDDQDGGDDQ PQRKKRYHRH  60
TPHQIQELEA FFKECPHPDE KQRLDLSKRL ALENKQVKFW FQNRRTQMKT QLERHENIML  120
RQENDKLRAE NSLMKDAMSN PVCNNCGGPA IPGQISFEEH QIRIENARLK DELNRICALA  180
NKFLGKPISS LTNPMALPTS NSGLELGIGR NGIGGSSTLG TPLPMGLDLG DGVLGTQPAM  240
PGIRPALGLM GNEVQLERSM LIDLALAAME ELLKMTQAES PLWIKSLDGE KEMFNHEEYA  300
RLFSPCIGPK PTGYITEATR ETGIVIINSL ALVETLMDAN RWAEMFPSMI ARAINLDVIS  360
NGMGGTRNGA LQVMHAEVQL LSPLVPVRQV RFIRFCKQHA EGVWAVVDVS IEIGHDAANA  420
QPVMSCRRLP SGCIVQDMPN GYSKVTWLEH WEYDENVVHQ LYRPLLSSGV GFGAHRWIAT  480
LQRQCECLAI LMSSSISSDD HTALSQAGRR SMLKLAQRMT SNFCSGVCAS SARKWDSLHI  540
GTLGDDMKVM TRKNVDDPGE PPGIVLSAAT SVWVPVSRQR LFDFLRDERL RSEWDILSNG  600
GPMQEMVHIA KGQGHGNCVS LLRANAVNAN DSSMLILQET WMDASCSVVV YAPVDVQSLN  660
VVMSGGDSAY VALLPSGFAI LPDGHCNDNG CNGTLQKGGG GNDGGGSLLT VGFQILVNSL  720
PTAKLTVESV DTVNNLISCT IQKIKASLRV A*
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Gma.130070.0flower| hypocotyl| leaf| somatic embryo
Expression -- Description ? help Back to Top
Source Description
UniprotTISSUE SPECIFICITY: Expressed in roots, stems, leaves and floral buds. {ECO:0000269|PubMed:10402424}.
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapGlyma.10G238100.2.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAP0150410.0AP015041.1 Vigna angularis var. angularis DNA, chromosome 8, almost complete sequence, cultivar: Shumari.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006589557.10.0homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform X1
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLK7LL440.0K7LL44_SOYBN; Uncharacterized protein
STRINGGLYMA10G38280.20.0(Glycine max)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]