PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.001G217200.1.p
Common NameSb01g019120, SORBIDRAFT_01g019120
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family HD-ZIP
Protein Properties Length: 839aa    MW: 91984.9 Da    PI: 5.9183
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.001G217200.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox58.61.1e-182885457
                          -SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHC....TS-HHHHHHHHHHHHHHHHC CS
              Homeobox  4 RttftkeqleeLeelFeknrypsaeereeLAkkl....gLterqVkvWFqNrRakekk 57
                            ++t+eq+e+Le++++++++ps ++r++L +++    +++ +q+kvWFqNrR ++k+
  Sobic.001G217200.1.p 28 YVRYTPEQVETLERVYAECPKPSSARRQQLLRECpilsNIEPKQIKVWFQNRRCRDKQ 85
                          6789***************************************************996 PP

2START157.51e-491643722205
                           HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-SEEEEEEEECTT.. CS
                 START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetlakaetlevissg.. 88 
                           +aee+++e+++ka+ ++  Wv+++ +++g+++  +++ s++ +g a+ra+g+v  ++++ ve+l d++  W +++++ e+   +  g  
  Sobic.001G217200.1.p 164 IAEETLTEFLSKATGTAIDWVQMPGMKPGPDSFGIVTISHGGRGVAARACGLVNLEPTKIVEILKDRP-SWFRDCRSLEVFTMLPAGng 251
                           789*******************************************************9999999999.****************9999 PP

                           EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--....-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE-- CS
                 START  89 galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe...sssvvRaellpSgiliepksnghskvtwvehvdlk 173
                           g+++l +++++a+++lvp Rdf+++Ry+ ++++g++v++++S++   + ++    +++vRae+lpSg+l++++++g+s v +v+h dl+
  Sobic.001G217200.1.p 252 GTIELVYMQMYAPTTLVPaRDFWTLRYTTTMEDGSLVVCERSLSGSGDGQSaatAQQFVRAEMLPSGYLVRQCEGGGSIVRIVDHLDLD 340
                           ********************************************988888778899********************************* PP

                           SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
                 START 174 grlphwllrslvksglaegaktwvatlqrqce 205
                           +++++++lr+l++s+++ ++k+++a+l+++++
  Sobic.001G217200.1.p 341 AWSVPEVLRPLYESSRVVAQKMTTAALRHLRQ 372
                           ***************************99876 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.601.0E-18785IPR009057Homeodomain-like
PROSITE profilePS5007115.5642286IPR001356Homeobox domain
SMARTSM003891.2E-162490IPR001356Homeobox domain
SuperFamilySSF466892.1E-162788IPR009057Homeodomain-like
CDDcd000864.06E-172787No hitNo description
PfamPF000462.5E-162885IPR001356Homeobox domain
CDDcd146864.41E-679118No hitNo description
PROSITE profilePS5084826.809154382IPR002913START domain
CDDcd088752.70E-75158374No hitNo description
SuperFamilySSF559611.65E-34163375No hitNo description
Gene3DG3DSA:3.30.530.202.6E-20163349IPR023393START-like domain
SMARTSM002346.0E-37163373IPR002913START domain
PfamPF018521.5E-47164372IPR002913START domain
PfamPF086701.3E-49695837IPR013978MEKHLA
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 839 aa     Download sequence    Send to blast
MAAAVAMRSG SSDGGGGYEK GGMDTGKYVR YTPEQVETLE RVYAECPKPS SARRQQLLRE  60
CPILSNIEPK QIKVWFQNRR CRDKQRKESS RLQAVNRKLT AMNKLLMEEN ERLQKQVSQL  120
VHENAYMKQQ LQNPSLANDA SCDSNVTAPA NLRDASNPSG LLSIAEETLT EFLSKATGTA  180
IDWVQMPGMK PGPDSFGIVT ISHGGRGVAA RACGLVNLEP TKIVEILKDR PSWFRDCRSL  240
EVFTMLPAGN GGTIELVYMQ MYAPTTLVPA RDFWTLRYTT TMEDGSLVVC ERSLSGSGDG  300
QSAATAQQFV RAEMLPSGYL VRQCEGGGSI VRIVDHLDLD AWSVPEVLRP LYESSRVVAQ  360
KMTTAALRHL RQIAQETSGE VVYAMGRQPA VLRTFSQRLS RGFNDAISGF NDDGWSIMAG  420
DGIEDVIIAC NSKKIRSGSN PATAFGAPGG IICAKASMLL QSVPPAVLVR FLREHRSEWA  480
DYNFDAYSAS ALKTSPCSLP GLRPMRFSGS QIIMPLAHTV ENEEILEVVR LEGQTLTHDE  540
GLLSRDIHLL QLCTGIDEKS MGSCFQLVFA PIDELFPDDA PLISSGFRVI PLDIKTDGLS  600
SGRTLDLASS LEVGATTQQA SADGSQDACN LRSVLTIAFQ FPYEIHLQDT VAAMARQYVR  660
SIVSAVQRVS MAISPSQSGL NTGQKIISGF PEAATLVRWI CQSYRYHMGV DLVSHSDQAG  720
ESLLRMFWDH QDAVLCCSFK EKPVFTFGNQ MGIDMLETTL IALQDLTLDK IFDEPGRKAL  780
HAEVPKLMEQ GYAYLPAGVC LSGMGRHVSY EEAVAWKVLG EDGNVHCLAF CFVNWSFV*
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Sbi.211070.0panicle
Expression -- Description ? help Back to Top
Source Description
UniprotTISSUE SPECIFICITY: Expressed in seedlings, roots, stems, leaf sheaths and blades and panicles. {ECO:0000269|PubMed:17999151}.
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.001G217200.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021306737.10.0homeobox-leucine zipper protein HOX9 isoform X2
SwissprotA2Z8L40.0HOX9_ORYSI; Homeobox-leucine zipper protein HOX9
TrEMBLC5WYD40.0C5WYD4_SORBI; Uncharacterized protein
STRINGSb01g019120.10.0(Sorghum bicolor)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP37438197
Representative plantOGRP6511671
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G60690.10.0HD-ZIP family protein