PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.006G034900.6.p
Common NameSb06g004510, SORBIDRAFT_06g004510
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family HD-ZIP
Protein Properties Length: 733aa    MW: 80656.2 Da    PI: 5.1754
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.006G034900.6.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox63.62.9e-2072122656
                           S--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   6 tftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           +f  +q++eLe+ F+ +++p+ + r+eLA+k+gL+erqVk+WFqNrR ++k
  Sobic.006G034900.6.p  72 RFAMHQIQELEAQFRVCSHPNPDVRQELATKIGLEERQVKFWFQNRRSQMK 122
                           6999********************************************998 PP

2START76.46.9e-25331434106205
                           XEEEEEEEEEEE.TTS-EEEEEEEEE....-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHH CS
                 START 106 pRdfvfvRyirqlgagdwvivdvSvd....seqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlphwllrslvksgla 190
                           +R + f+R+++ +  g+w++vdvSvd    +eq+ +++s    ++llpSg+l+e++s+g++kvtwv h+++++ +++ l+r+l++sg+a
  Sobic.006G034900.6.p 331 NRSVKFLRFSKMMANGRWAVVDVSVDgiygVEQEGSSTSYTTGCRLLPSGCLLEDMSGGYCKVTWVVHAEYDETTVPFLFRPLLQSGQA 419
                           59999*********************444456666657777889********************************************* PP

                           HHHHHHHHHTXXXXX CS
                 START 191 egaktwvatlqrqce 205
                            ga +w++ lq+qce
  Sobic.006G034900.6.p 420 LGACRWLRSLQKQCE 434
                           **************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466892.21E-1754122IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.606.2E-1860122IPR009057Homeodomain-like
PROSITE profilePS5007116.45564124IPR001356Homeobox domain
SMARTSM003892.9E-1766128IPR001356Homeobox domain
CDDcd000864.29E-1772122No hitNo description
PfamPF000461.0E-1772122IPR001356Homeobox domain
PRINTSPR000319.0E-595104IPR000047Helix-turn-helix motif
PROSITE patternPS00027099122IPR017970Homeobox, conserved site
PRINTSPR000319.0E-5104120IPR000047Helix-turn-helix motif
SMARTSM002342.6E-4242435IPR002913START domain
SuperFamilySSF559612.69E-14244434No hitNo description
PfamPF018526.9E-20331434IPR002913START domain
PROSITE profilePS5084820.707332438IPR002913START domain
SuperFamilySSF559615.86E-7462703No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 733 aa     Download sequence    Send to blast
MDNGGQLNNN SNEQDNDGFT MDEIPDLPWN SHMEYDVDAF LGAEDHVNTN QTTDDVDHRS  60
PVGETPSKGV KRFAMHQIQE LEAQFRVCSH PNPDVRQELA TKIGLEERQV KFWFQNRRSQ  120
MKVKAYGDDN KGIRQELAKL KAENEELKQR RQNPICFMCT NPIAAIQSEN WRLLNDNTRL  180
KDEYVRSKAH MDRLIREAAA EHPPSAMRSS DHHLASAHMN MDPVALTGNC RTTTNLEATL  240
TSHAARAMKE FVMLATKGEP MWVLAKDGEK LNHQEYILQT FPGLLGLCPQ GFVEEATRET  300
DMIKGTAMDL VSILTDVMNV ELWVQSPRLL NRSVKFLRFS KMMANGRWAV VDVSVDGIYG  360
VEQEGSSTSY TTGCRLLPSG CLLEDMSGGY CKVTWVVHAE YDETTVPFLF RPLLQSGQAL  420
GACRWLRSLQ KQCEYITVLP SSHVLPSSSS SSAISTLGVG RRSVMELAGQ MMVSFYAAVS  480
GPVIVPATSS VNEWRLVSNG NGTERVEAFV RLVTWNCADI MPGEPSVTVL SATTTVWLPG  540
TPPLCVFEYL CDLQRRGEWD THVDAGEVKE LSSVATSPQL PGNNVVSVLE PTTVVTDETE  600
SSKVLILQET STDVSCFLVV YSLIEESLMR GIMDGRERSN IFVLPSGFAI LPDGHGKAHA  660
DHTAANSSNS APIDSRNNNA GSIVSVAFQT LLPGNLSSNL DNTGAFEDAR LQVCHAITKI  720
KAAVGASNII PA*
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.006G034900.6.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_020404892.10.0homeobox-leucine zipper protein ROC6
TrEMBLA0A1Z5RC050.0A0A1Z5RC05_SORBI; Uncharacterized protein
STRINGSb06g004510.10.0(Sorghum bicolor)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G61150.11e-104homeodomain GLABROUS 1