PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.006G034900.4.p
Common NameSb06g004510, SORBIDRAFT_06g004510
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family HD-ZIP
Protein Properties Length: 736aa    MW: 81015.6 Da    PI: 5.1133
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.006G034900.4.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox63.62.9e-2075125656
                           S--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   6 tftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           +f  +q++eLe+ F+ +++p+ + r+eLA+k+gL+erqVk+WFqNrR ++k
  Sobic.006G034900.4.p  75 RFAMHQIQELEAQFRVCSHPNPDVRQELATKIGLEERQVKFWFQNRRSQMK 125
                           6999********************************************998 PP

2START76.47e-25334437106205
                           XEEEEEEEEEEE.TTS-EEEEEEEEE....-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHH CS
                 START 106 pRdfvfvRyirqlgagdwvivdvSvd....seqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlphwllrslvksgla 190
                           +R + f+R+++ +  g+w++vdvSvd    +eq+ +++s    ++llpSg+l+e++s+g++kvtwv h+++++ +++ l+r+l++sg+a
  Sobic.006G034900.4.p 334 NRSVKFLRFSKMMANGRWAVVDVSVDgiygVEQEGSSTSYTTGCRLLPSGCLLEDMSGGYCKVTWVVHAEYDETTVPFLFRPLLQSGQA 422
                           59999*********************444456666657777889********************************************* PP

                           HHHHHHHHHTXXXXX CS
                 START 191 egaktwvatlqrqce 205
                            ga +w++ lq+qce
  Sobic.006G034900.4.p 423 LGACRWLRSLQKQCE 437
                           **************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466892.26E-1757125IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.606.2E-1863125IPR009057Homeodomain-like
PROSITE profilePS5007116.45567127IPR001356Homeobox domain
SMARTSM003892.9E-1769131IPR001356Homeobox domain
CDDcd000864.32E-1775125No hitNo description
PfamPF000461.0E-1775125IPR001356Homeobox domain
PRINTSPR000319.1E-598107IPR000047Helix-turn-helix motif
PROSITE patternPS000270102125IPR017970Homeobox, conserved site
PRINTSPR000319.1E-5107123IPR000047Helix-turn-helix motif
SMARTSM002342.6E-4245438IPR002913START domain
SuperFamilySSF559612.69E-14247437No hitNo description
PfamPF018526.9E-20334437IPR002913START domain
PROSITE profilePS5084820.707335441IPR002913START domain
SuperFamilySSF559615.86E-7465706No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 736 aa     Download sequence    Send to blast
MDIMDNGGQL NNNSNEQDND GFTMDEIPDL PWNSHMEYDV DAFLGAEDHV NTNQTTDDVD  60
HRSPVGETPS KGVKRFAMHQ IQELEAQFRV CSHPNPDVRQ ELATKIGLEE RQVKFWFQNR  120
RSQMKVKAYG DDNKGIRQEL AKLKAENEEL KQRRQNPICF MCTNPIAAIQ SENWRLLNDN  180
TRLKDEYVRS KAHMDRLIRE AAAEHPPSAM RSSDHHLASA HMNMDPVALT GNCRTTTNLE  240
ATLTSHAARA MKEFVMLATK GEPMWVLAKD GEKLNHQEYI LQTFPGLLGL CPQGFVEEAT  300
RETDMIKGTA MDLVSILTDV MNVELWVQSP RLLNRSVKFL RFSKMMANGR WAVVDVSVDG  360
IYGVEQEGSS TSYTTGCRLL PSGCLLEDMS GGYCKVTWVV HAEYDETTVP FLFRPLLQSG  420
QALGACRWLR SLQKQCEYIT VLPSSHVLPS SSSSSAISTL GVGRRSVMEL AGQMMVSFYA  480
AVSGPVIVPA TSSVNEWRLV SNGNGTERVE AFVRLVTWNC ADIMPGEPSV TVLSATTTVW  540
LPGTPPLCVF EYLCDLQRRG EWDTHVDAGE VKELSSVATS PQLPGNNVVS VLEPTTVVTD  600
ETESSKVLIL QETSTDVSCF LVVYSLIEES LMRGIMDGRE RSNIFVLPSG FAILPDGHGK  660
AHADHTAANS SNSAPIDSRN NNAGSIVSVA FQTLLPGNLS SNLDNTGAFE DARLQVCHAI  720
TKIKAAVGAS NIIPA*
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.006G034900.4.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_020404892.10.0homeobox-leucine zipper protein ROC6
TrEMBLA0A1Z5RCT80.0A0A1Z5RCT8_SORBI; Uncharacterized protein
STRINGSb06g004510.10.0(Sorghum bicolor)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G61150.11e-104homeodomain GLABROUS 1