PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GSBRNA2T00076109001
Common Name GSBRNA2T00076109001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Brassiceae; Brassica
Family HD-ZIP
Protein Properties Length: 690 aa    MW: 78029.4 Da    pI: 6.0255
Description HD-ZIP family protein
Gene Model
Gene Model ID        Type    Source
GSBRNA2T00076109001  genome  Genoscope
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  51.2   2.1e-16  29     79   6          56
                         S--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
             Homeobox  6 tftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
                         ++t++q++ Le++F+++++p++++re   + l+L  +qVk+WFqN+R++ k
  GSBRNA2T00076109001 29 RHTPQQIQKLEAYFKECPHPNESQREAFCSVLDLGIDQVKFWFQNKRTQSK 79
                         689*********************************************998 PP

2    START     108.8  8.4e-35  210    434  3          206
                          HHHHHHHHHHHHHHC-TT-EEEE...EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EE CS
                START   3 aeeaaqelvkkalaeepgWvkss...esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla....ka 79 
                          a +a +el ++   +e++Wv ss   +  + +e++ kf+ s         ++e +++++vv+ +++ l   +ld+  +W+  ++    ka
  GSBRNA2T00076109001 210 AATAVEELKRLFRTDEALWVMSSidgTYVIDQESYEKFSHSIKhfrnlsARVESSKDITVVPIEATSLIDMFLDSE-KWKMLFPtivnKA 298
                          6788999999999**********988777788888888776668899999****************9999999999.99999988888** PP

                          EEEEEECTT.......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCE CS
                START  80 etlevissg.......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksngh 161
                          +t+ ++ s        ++lq++  +l  lsplvp R+f++vR+++q g+g w+i+dvS ds+   ++++ +  + ++pSg+li++++n h
  GSBRNA2T00076109001 299 KTIHTLGSElpinencNVLQVIWEQLHILSPLVPpREFMIVRCCQQIGEGLWIIADVSQDSQHIFNSDQASPSCYKRPSGCLIRSLPNAH 388
                          **********************************************************999886666777888899************** PP

                          EEEEEEE-EE--SSXX.HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                START 162 skvtwvehvdlkgrlp.hwllrslvksglaegaktwvatlqrqcek 206
                          ++v w+ehv+++     h ++r lv+ +   ga++w  tl+r ce+
  GSBRNA2T00076109001 389 TEVRWIEHVEVDHTADtHKMYRDLVSGSSGYGARRWIVTLERMCER 434
                          ************99988***************************96 PP

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60   1.0E-15   6      79   IPR009057    Homeodomain-like
SuperFamily      SSF46689           4.02E-16  9      81   IPR009057    Homeodomain-like
SMART            SM00389            7.5E-13   21     85   IPR001356    Homeobox domain
PROSITE profile  PS50071            14.819    21     81   IPR001356    Homeobox domain
CDD              cd00086            1.75E-12  28     81   No hit       No description
Pfam             PF00046            6.5E-14   29     79   IPR001356    Homeobox domain
PROSITE profile  PS50848            37.958    199    437  IPR002913    START domain
SuperFamily      SSF55961           2.61E-25  200    435  No hit       No description
CDD              cd08875            5.64E-90  204    433  No hit       No description
SMART            SM00234            3.7E-18   208    434  IPR002913    START domain
Pfam             PF01852            3.2E-28   210    434  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  7.1E-6    210    400  IPR023393    START-like domain
SuperFamily      SSF55961           5.49E-8   456    649  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 690 aa
MDYDGDGSPD NEHYTSVNAM SKEKRICHRH TPQQIQKLEA YFKECPHPNE SQREAFCSVL  60
DLGIDQVKFW FQNKRTQSKT QDERTSNILL REENKKLQLE NAAMIEVLKT VTCPPCGGPP  120
FGRDDRESNL HKMRLENAFL KTERDSLTTT KNKYQQTMLD SLTSVQRQQT FEALTSYGMN  180
LYNQPSSLES QTIQPQLLPQ MDLPQLSETA ATAVEELKRL FRTDEALWVM SSIDGTYVID  240
QESYEKFSHS IKHFRNLSAR VESSKDITVV PIEATSLIDM FLDSEKWKML FPTIVNKAKT  300
IHTLGSELPI NENCNVLQVI WEQLHILSPL VPPREFMIVR CCQQIGEGLW IIADVSQDSQ  360
HIFNSDQASP SCYKRPSGCL IRSLPNAHTE VRWIEHVEVD HTADTHKMYR DLVSGSSGYG  420
ARRWIVTLER MCERMALSSI LIMPATDWSE TIPTVEGRRS VMKLGERMLK IFNEMLIMSG  480
KVEFPQQSKC GVRISIRMNK EPGQLPGLVV TAASCLSIPL TPLQVFNCLR SKDTRHQWDV  540
LCRGNTITET ARIFTGSSGT NCITLLQPTP LWDIGQNMVQ EPQKKMMVLQ ECYMDALGGM  600
IVYSPLDMAT MSIAAAGEVD PLNIPILPSG FTISSDNSEG TLLMLAFQIL NSDENSKTRS  660
VSEIAVDRVS RLISQTVQNI KLMLNCPPE*
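
The listing above groups residues in blocks of ten with a running count at the end of each line, so it needs flattening before the Start/End columns of the Signature Domain table can be applied to it. The following is a minimal Python sketch of that step, not part of the database record. The helper names `parse_sequence_block` and `domain_slice` are hypothetical, and the sketch assumes that the coordinates are 1-based and inclusive and that the trailing `*` marks the stop codon rather than a residue. Only the first two lines of the listing are used as sample input.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join a numbered, space-grouped protein listing into one string."""
    parts = []
    for line in text.splitlines():
        # Drop the running residue count at the end of the line, if present.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces that separate the ten-residue groups.
        parts.append(re.sub(r"\s+", "", line))
    # Assumption: a trailing '*' is a stop marker, not a residue.
    return "".join(parts).rstrip("*")


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


# First two lines of the listing above, enough to cover residues 29-79.
block = """MDYDGDGSPD NEHYTSVNAM SKEKRICHRH TPQQIQKLEA YFKECPHPNE SQREAFCSVL  60
DLGIDQVKFW FQNKRTQSKT QDERTSNILL REENKKLQLE NAAMIEVLKT VTCPPCGGPP  120"""

seq = parse_sequence_block(block)
homeobox = domain_slice(seq, 29, 79)
print(homeobox)
```

With these assumptions the printed slice can be compared against the query row of the Homeobox alignment (residues 29 to 79) as a sanity check on the coordinate convention. Because the sketch drops the trailing `*`, the length of the parsed string can differ by one from a listed length that counts it.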
Expression -- Description
Source   Description
UniProt  TISSUE SPECIFICITY: Expressed in the embryo at early stage and in the endosperm. {ECO:0000269|PubMed:16778018}.
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  GSBRNA2T00076109001
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_013747906.2  0.0      homeobox-leucine zipper protein HDG8-like
Swissprot  Q9M9P4          0.0      HDG8_ARATH; Homeobox-leucine zipper protein HDG8
TrEMBL     A0A078HZ21      0.0      A0A078HZ21_BRANA; BnaA05g32490D protein
STRING     Bra032003.1-P   0.0      (Brassica rapa)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM127911530
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G03260.1  0.0      homeodomain GLABROUS 8
Publications
  1. Chalhoub B, et al.
    Plant genetics. Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome.
    Science, 2014. 345(6199): p. 950-3
    [PMID:25146293]