PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.003G321500.1.p
Common NameSb03g036820, SORBIDRAFT_03g036820
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family HD-ZIP
Protein Properties Length: 761aa    MW: 82534.2 Da    PI: 5.5384
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.003G321500.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox50.43.9e-16111165256
                           T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           r+ ++ t++qle+Le +F  + +p+ ++r++L++++gL   qVk+WFqN+R++ k
  Sobic.003G321500.1.p 111 RSLHRVTSQQLETLEGFFSICAHPDDNQRRQLSESTGLLLHQVKFWFQNKRTQVK 165
                           556789*********************************************9987 PP

2START882e-282774922206
                           HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S... CS
                 START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla... 77 
                           la+ aaqel+ +a  ++++W  ++    e +n   ++ +f+  +        ++ea ras+vv  ++  lve l+d    + ++++   
  Sobic.003G321500.1.p 277 LAQNAAQELLILANPSSALWLNVPggsfETLNMAAYTETFPG-QMsadtitMNTEATRASAVVMLDPKSLVEFLMDAE-SYGTMFPglv 363
                           68899***********************99**********55.555999*9*****************9999999999.********** PP

                           .EEEEEEEECTT........EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEE CS
                 START  78 .kaetlevissg........galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliep 156
                            +a+t++v s          ga q m+ el+++splv+ R+ +fvR+ ++l++g  ++vdvS+d          ++R+++ pSg++i+p
  Sobic.003G321500.1.p 364 sAAATTKVYSCPtgreecydGAMQMMTVELVFPSPLVAaRKCTFVRCVKKLEQGAVAVVDVSLDD---------GARCRKMPSGLVIQP 443
                           97777776654345777888****************************************99984.........479************ PP

                           ECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                 START 157 ksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                              + +kvt ++hv ++g + h l+ + + sgl +ga++w+  + rqc++
  Sobic.003G321500.1.p 444 IRYNTCKVTAIDHVVVDGTITHDLFAPCL-SGLLFGARRWLTSMARQCAR 492
                           ***************************98.699***************86 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.608.8E-1795161IPR009057Homeodomain-like
SuperFamilySSF466892.59E-15103167IPR009057Homeodomain-like
SMARTSM003899.2E-13107171IPR001356Homeobox domain
PROSITE profilePS5007114.787107167IPR001356Homeobox domain
CDDcd000863.92E-13110166No hitNo description
PfamPF000461.2E-13111165IPR001356Homeobox domain
PROSITE patternPS000270142165IPR017970Homeobox, conserved site
PROSITE profilePS5084822.545267495IPR002913START domain
SuperFamilySSF559611.33E-19270492No hitNo description
CDDcd088752.18E-72272491No hitNo description
SMARTSM002343.8E-8276492IPR002913START domain
PfamPF018521.1E-19277492IPR002913START domain
SuperFamilySSF559611.37E-6511692No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 761 aa     Download sequence    Send to blast
MERGEPSGGD PSLMGLTGGI DDPSLMGPLY VIDYSSLMGP VDGVGGGLES QMNDTPENAP  60
SNQSVEVQAN NGNEESPGDG AARITEIGSS TAGKNKKKRG DRQDGPQPNK RSLHRVTSQQ  120
LETLEGFFSI CAHPDDNQRR QLSESTGLLL HQVKFWFQNK RTQVKHLNGR EENYKLKVEN  180
ETLKEENNRL KQLQNNIIAP APCAKCIIDP GRLLLEKEVE RLKELNQMLQ QELQLLKSMD  240
DGIPPMAMDS AVGNFHLDPL LENIFVVQHD EQMLANLAQN AAQELLILAN PSSALWLNVP  300
GGSFETLNMA AYTETFPGQM SADTITMNTE ATRASAVVML DPKSLVEFLM DAESYGTMFP  360
GLVSAAATTK VYSCPTGREE CYDGAMQMMT VELVFPSPLV AARKCTFVRC VKKLEQGAVA  420
VVDVSLDDGA RCRKMPSGLV IQPIRYNTCK VTAIDHVVVD GTITHDLFAP CLSGLLFGAR  480
RWLTSMARQC ARIRDVFQVT NCTLNVTSRG RKTIMKLADN LLASFTSSVT AYPEDAWNFQ  540
CGLGTEQDIK IMYKTQNEST SSGSPTAVVC ASASFLVPLH MGKAFELLKN NMLRAKWDVL  600
VNGGTVKEEV RVADGVGSGD AVSILHVKHG HGANRDTVMI LQNTFYDASG AFMVYSSLDK  660
QLLEIIGDNQ AMSNISLFPA GFSLVPLTDP AGHDGAGIAQ PGATVMTAGF QILMKLARGT  720
GLCSRSVTSV INIMTDNIAN IKDALLNSHP VFYKSIQPVN *
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.003G321500.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002456467.10.0homeobox-leucine zipper protein TF1
TrEMBLC5XMC40.0C5XMC4_SORBI; Uncharacterized protein
STRINGSb03g036820.10.0(Sorghum bicolor)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP104752833
Representative plantOGRP1438133
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G04890.11e-77protodermal factor 2