PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GSBRNA2T00094107001
Common Name GSBRNA2T00094107001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Brassiceae; Brassica
Family HD-ZIP
Protein Properties Length: 695 aa    MW: 78002.1 Da    pI: 5.7861
Description HD-ZIP family protein
Gene Model
Gene Model ID        Type    Source     Coding Sequence
GSBRNA2T00094107001  genome  Genoscope  View CDS
Signature Domain
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  54.1   2.7e-17  23     78   1          56
                         TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
             Homeobox  1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
                         +r+ +++t +q+++Le++F+++++p +++r  L ++l+L+ +q+k+WFqN+R++ k
  GSBRNA2T00094107001 23 KRTYHRHTFQQTQRLEAYFKECPNPEESQRLSLGEELNLEPDQIKFWFQNKRTQNK 78
                         68889999*********************************************998 PP

2    START     66.4   8.4e-22  222    395  9          165
                          HHHHHHHHC-TT-EEEE...EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEE CS
                START   9 elvkkalaeepgWvkss...esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevi 85 
                          e++k+  ++e++Wv ss   +  + +e++ kf+ s         ++e ++++++v+ ++++l e +ld+  +W + ++    ka+t+ ++
  GSBRNA2T00094107001 222 EELKRLFVTEDLWVMSSidgTYVIDQESYEKFSHSIKhfrklsARVESSKDVTLVPIEATNLIEMFLDSE-KWERLFPtivaKAMTIHKL 310
                          77888899*********988777788888888776668899999**************************.99999999999******** PP

                          CTT.......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEE...EE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEE CS
                START  86 ssg.......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvS...vdseqkppesssvvRaellpSgiliepksnghskv 164
                           s        + lq++  +l  lsplvp R+f++vR++++  +g w+++dvS   vds+q +p       + ++pSg+li+  ++ hs+ 
  GSBRNA2T00094107001 311 GSElpinencNNLQVIWEKLHILSPLVPpREFMIVRCCQKIDEGIWIVADVSqriVDSDQINPF------CYKRPSGCLIRAFPGAHSEA 394
                          ****************************************************777778888775......788*********99999765 PP

                          E CS
                START 165 t 165
                           
  GSBRNA2T00094107001 395 C 395
                          3 PP
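
The alignments above can be checked against the Start and End columns: once gap characters are removed, each query row should contain exactly End - Start + 1 residues. The following is a minimal Python sketch, assuming HMMER-style output in which `-` marks a deletion in the query row and lowercase letters are insertions relative to the model. The helper name `ungapped_length` is hypothetical and is not part of PlantTFDB.

```python
def ungapped_length(aligned_query: str) -> int:
    """Count residues in an aligned query row, ignoring gap characters."""
    return sum(1 for ch in aligned_query if ch.isalpha())


# Query rows copied from the alignments above.
homeobox_row = "KRTYHRHTFQQTQRLEAYFKECPNPEESQRLSLGEELNLEPDQIKFWFQNKRTQNK"
start_row_1 = (
    "EELKRLFVTEDLWVMSSidgTYVIDQESYEKFSHSIKhfrklsARVESSKDVTLVPIEATNLIEMFLDSE"
    "-KWERLFPtivaKAMTIHKL"
)

# Coordinates are 1-based and inclusive, so a block spans end - start + 1 residues.
assert ungapped_length(homeobox_row) == 78 - 23 + 1   # Homeobox, residues 23..78
assert ungapped_length(start_row_1) == 310 - 222 + 1  # first START block, residues 222..310
```

Counting only alphabetic characters keeps the inserted (lowercase) residues, which belong to the protein, while dropping the `-` placeholders, which do not.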

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60   8.5E-18   8      74   IPR009057    Homeodomain-like
SuperFamily      SSF46689           2.18E-16  9      79   IPR009057    Homeodomain-like
PROSITE profile  PS50071            15.904    20     80   IPR001356    Homeobox domain
SMART            SM00389            1.8E-12   22     84   IPR001356    Homeobox domain
Pfam             PF00046            7.0E-15   23     78   IPR001356    Homeobox domain
CDD              cd00086            1.00E-13  23     80   No hit       No description
PROSITE profile  PS50848            20.095    206    436  IPR002913    START domain
SuperFamily      SSF55961           3.57E-16  208    393  No hit       No description
SMART            SM00234            0.0052    214    433  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  1.0E-4    215    388  IPR023393    START-like domain
Pfam             PF01852            3.0E-16   221    395  IPR002913    START domain
SuperFamily      SSF55961           2.88E-10  456    656  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 695 aa
MDHNGGGNSS SEHTSVNAKK REKRTYHRHT FQQTQRLEAY FKECPNPEES QRLSLGEELN  60
LEPDQIKFWF QNKRTQNKTQ IERNANILLR EENKKIKCEN EAMLEALRIV TCPDCGGPPL  120
GVERDHNFQN LGLANTFLTE KRDELANTVS MNQHQQNMVN LFASVQGQQI FDTHTSYGTI  180
PNSLMNDPSN SVGSSTSQDI QLQLISQMDI TQLSETATRA VEELKRLFVT EDLWVMSSID  240
GTYVIDQESY EKFSHSIKHF RKLSARVESS KDVTLVPIEA TNLIEMFLDS EKWERLFPTI  300
VAKAMTIHKL GSELPINENC NNLQVIWEKL HILSPLVPPR EFMIVRCCQK IDEGIWIVAD  360
VSQRIVDSDQ INPFCYKRPS GCLIRAFPGA HSEACLHCLD LTCFSILKIE LLCGGSGYGA  420
KRWTTTLERM CERMALSSIR TIPATDWSEA IATVEGRMSV MKLGERMVKN FNEMLAMSGK  480
VDFPQQSKCG VRISIRMNHD SGQPSGLVVS AASSFSVPIT PLQVFDSLLN NETRHQWDVL  540
CYRSAVSVTA RILTGSNENN YITILLPTQR EDDVISMSQG PKRNMMMLQE CYMDALGGMI  600
VYAPLDMASM SLATSGEVDP LKIPILSSGF TISNDGRRSM VAEEGGTILT VVFQILVSED  660
KRISGLNEQS VDTVTSLISS TVRNIKLLLN CPLE*
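
To cross-check domain coordinates against the sequence listing, the grouped layout (blocks of 10 residues, a running position count at the end of each line, and a trailing `*`) can be flattened into a plain string. A minimal Python sketch follows. The function names are hypothetical, and only the first two lines of the listing are used as input.

```python
import re


def flatten_sequence(listing: str) -> str:
    """Strip spaces, position counters, and the trailing '*' from a grouped listing."""
    return re.sub(r"[^A-Za-z]", "", listing).upper()


def domain_slice(sequence: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return sequence[start - 1:end]


# First two lines of the protein sequence listing (residues 1..120).
listing = """
MDHNGGGNSS SEHTSVNAKK REKRTYHRHT FQQTQRLEAY FKECPNPEES QRLSLGEELN  60
LEPDQIKFWF QNKRTQNKTQ IERNANILLR EENKKIKCEN EAMLEALRIV TCPDCGGPPL  120
"""

sequence = flatten_sequence(listing)

# Homeobox coordinates (23..78) taken from the Signature Domain table.
homeobox = domain_slice(sequence, 23, 78)
print(len(sequence))
print(homeobox)
```

The extracted slice can then be compared with the query row of the Homeobox alignment in the Signature Domain section.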
Expression -- Description
Source Description
UniProt  TISSUE SPECIFICITY: Expressed in the embryo at early stage and in the endosperm. {ECO:0000269|PubMed:16778018}.
Functional Description
Source Description
UniProt  Probable transcription factor. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  GSBRNA2T00094107001
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_013591191.1  0.0      PREDICTED: homeobox-leucine zipper protein HDG8-like
Swissprot  Q9M9P4          0.0      HDG8_ARATH; Homeobox-leucine zipper protein HDG8
TrEMBL     A0A078IJ28      0.0      A0A078IJ28_BRANA; BnaCnng18470D protein
TrEMBL     A0A0D3AFV3      0.0      A0A0D3AFV3_BRAOL; Uncharacterized protein
STRING     Bo1g154790.1    0.0      (Brassica oleracea)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM127911530
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G03260.1  0.0      homeodomain GLABROUS 8
Publications
  1. Chalhoub B, et al.
    Plant genetics. Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome.
    Science, 2014. 345(6199): p. 950-3
    [PMID:25146293]