PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sobic.001G537300.1.p
Common Name Sb01g050000, SORBIDRAFT_01g050000
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family HD-ZIP
Protein Properties Length: 841 aa    MW: 92115.2 Da    pI: 6.1524
Description HD-ZIP family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
Sobic.001G537300.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  57.6   2.2e-18  27     85   3          57
                          --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHC....TS-HHHHHHHHHHHHHHHHC CS
              Homeobox  3 kRttftkeqleeLeelFeknrypsaeereeLAkkl....gLterqVkvWFqNrRakekk 57
                          k  ++t+eq+e+Le+l+  +++ps ++r++L +++    +++ +q+kvWFqNrR ++k+
  Sobic.001G537300.1.p 27 KYVRYTPEQVEVLERLYIDCPKPSSSRRQQLLRECpilsNIEPKQIKVWFQNRRCRDKQ 85
                          5678****************************************************996 PP

2    START     160.4  1.3e-50  165    373  2          205
                           HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-SEEEEEEEECTT.. CS
                 START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetlakaetlevissg.. 88 
                           +aee+ +e+++ka+ ++  Wv+++ +++g++++ +++ s++++g a+ra+g+v  +++++ e+l d++  W +++++ e+      g  
  Sobic.001G537300.1.p 165 IAEETFTEFLSKATGTAIDWVQMPGMKPGPDSVGIVAISHGCRGVAARACGLVNLEPTKVIEILKDRP-SWFRDCRSLEVFTMFPAGng 252
                           688999****************************************************7777777777.***********999999999 PP

                           EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--....-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE-- CS
                 START  89 galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe...sssvvRaellpSgiliepksnghskvtwvehvdlk 173
                           g+++l++++++a+++lvp Rdf+++Ry+ ++++g++v++++S++     p+    +++vRae+lpSg+l++p+++g+s v++v+h dl+
  Sobic.001G537300.1.p 253 GTVELIYMQMYAPTTLVPaRDFWTLRYTTTMEDGSLVVCERSLSGSGGGPNaasAQQFVRAEMLPSGYLVRPCEGGGSIVHIVDHLDLE 341
                           *********************************************999999998999******************************** PP

                           SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
                 START 174 grlphwllrslvksglaegaktwvatlqrqce 205
                           +++++++lr+l++s+++ ++k+++ +l+++++
  Sobic.001G537300.1.p 342 AWSVPEVLRPLYESSRVVAQKMTTVALRHLRQ 373
                           ***********************999998765 PP

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60   1.9E-17   8      85   IPR009057    Homeodomain-like
PROSITE profile  PS50071            15.289    22     86   IPR001356    Homeobox domain
SMART            SM00389            3.9E-15   24     90   IPR001356    Homeobox domain
SuperFamily      SSF46689           8.98E-16  26     88   IPR009057    Homeodomain-like
CDD              cd00086            1.26E-15  27     87   No hit       No description
Pfam             PF00046            6.2E-16   28     85   IPR001356    Homeobox domain
CDD              cd14686            3.77E-6   79     118  No hit       No description
PROSITE profile  PS50848            26.931    155    383  IPR002913    START domain
CDD              cd08875            1.52E-74  159    375  No hit       No description
Gene3D           G3DSA:3.30.530.20  8.5E-22   163    348  IPR023393    START-like domain
SMART            SM00234            5.1E-38   164    374  IPR002913    START domain
SuperFamily      SSF55961           8.93E-37  165    376  No hit       No description
Pfam             PF01852            2.5E-48   165    373  IPR002913    START domain
Pfam             PF08670            1.1E-49   697    839  IPR013978    MEKHLA
Gene Ontology
GO Term     GO Category         GO Description
GO:0009855  Biological Process  determination of bilateral symmetry
GO:0009944  Biological Process  polarity specification of adaxial/abaxial axis
GO:0009956  Biological Process  radial pattern formation
GO:0010014  Biological Process  meristem initiation
GO:0010051  Biological Process  xylem and phloem pattern formation
GO:0010089  Biological Process  xylem development
GO:0030154  Biological Process  cell differentiation
GO:0005634  Cellular Component  nucleus
GO:0008289  Molecular Function  lipid binding
GO:0044212  Molecular Function  transcription regulatory region DNA binding
Sequence
Protein Sequence    Length: 841 aa
MAAAVAMRGG SSDSGGFDKV PGMDSGKYVR YTPEQVEVLE RLYIDCPKPS SSRRQQLLRE  60
CPILSNIEPK QIKVWFQNRR CRDKQRKESS RLQAVNRKLT AMNKLLMEEN ERLQKQVSQL  120
VHENAHMRQQ LQNTSLANDT SCESNVTTPP NPIRDASNPS GLLAIAEETF TEFLSKATGT  180
AIDWVQMPGM KPGPDSVGIV AISHGCRGVA ARACGLVNLE PTKVIEILKD RPSWFRDCRS  240
LEVFTMFPAG NGGTVELIYM QMYAPTTLVP ARDFWTLRYT TTMEDGSLVV CERSLSGSGG  300
GPNAASAQQF VRAEMLPSGY LVRPCEGGGS IVHIVDHLDL EAWSVPEVLR PLYESSRVVA  360
QKMTTVALRH LRQIAQETSG EVVYALGRQP AVLRTFSQRL SRGFNDAISG FNDDGWSVMG  420
GDGIEDVVVA CNSTKKIRNN SNAGITFGAP GGIICAKASM LLQSVPPAVL VRFLREHRSE  480
WADYNMDAYL ASSLKTSACS LPGLRPMRFS GGQMIMPLAH TVENEEILEV VRLEGQPLTH  540
DEALLSRDIH LLQLCTGIDE KSVGSSFQLV FAPIDEHFPD DAPLISSGFR VIPLDMKTDG  600
VASGRTLDLA SSLDVGSAAP QASGDASPDD CNLRSVLTIA FQFPYEMHLQ DSVATMARQY  660
VRGVVSAVQR VSMAISPSQS GLNAGQRMLS GFPEAATLAR WVCQSYHYHL GLELLNQSDE  720
AGEALLKMLW HHPDAVLCCS FKEKPMFTFA NKAGLDMLET SLVALQDLTL DKIFDESGRK  780
AIFSDISKLM EQGYAYLPSG VCMSGMGRHV SFDQAVAWKV LGEDSNVHCL AFCFVNWSFV  840
*
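The Start and End values in the Signature Domain table appear to be 1-based, inclusive positions on this protein sequence: the Homeobox alignment row for Sobic.001G537300.1.p runs from 27 to 85. The following is a minimal sketch, not part of the database, of how a domain segment could be recovered from the numbered sequence block. The helper names `clean_sequence` and `domain_slice` are hypothetical, and only the first two sequence lines are used as sample input.

```python
import re


def clean_sequence(block: str) -> str:
    """Strip position counters, whitespace and a trailing '*' from a numbered block."""
    # Keep only letters and the stop symbol, then drop the stop symbol at the end.
    residues = re.sub(r"[^A-Za-z*]", "", block)
    return residues.rstrip("*")


def domain_slice(sequence: str, start: int, end: int) -> str:
    """Return the residues from start to end, treating both as 1-based and inclusive."""
    return sequence[start - 1:end]


# First two lines of the protein sequence shown above (residues 1 to 120).
block = """
MAAAVAMRGG SSDSGGFDKV PGMDSGKYVR YTPEQVEVLE RLYIDCPKPS SSRRQQLLRE  60
CPILSNIEPK QIKVWFQNRR CRDKQRKESS RLQAVNRKLT AMNKLLMEEN ERLQKQVSQL  120
"""

seq = clean_sequence(block)
homeobox = domain_slice(seq, 27, 85)  # Homeobox row: Start 27, End 85

print(len(seq), len(homeobox))
print(homeobox)
```

The extracted segment begins with KYVRYTPEQ and ends with CRDKQ, which matches the query row of the Homeobox alignment once its lowercase insert residues are uppercased.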
Expression -- UniGene
UniGene ID  E-value  Expressed in
Sbi.13643   0.0      panicle
Expression -- Description
Source   Description
UniProt  TISSUE SPECIFICITY: Expressed in stems, leaf sheaths and blades and panicles. {ECO:0000269|PubMed:17999151}.
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  Sobic.001G537300.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  DQ286526  0.0      DQ286526.1 Zea mays rolled leaf 2 (rld2) mRNA, complete cds.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_002468669.1  0.0      homeobox-leucine zipper protein HOX10 isoform X2
Swissprot  A2XBL9          0.0      HOX10_ORYSI; Homeobox-leucine zipper protein HOX10
TrEMBL     C5WMP7          0.0      C5WMP7_SORBI; Uncharacterized protein
STRING     Sb01g050000.1   0.0      (Sorghum bicolor)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G60690.1  0.0      HD-ZIP family protein