PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sobic.001G286700.1.p
Organism Sorghum bicolor
Taxonomic ID
Taxonomic Lineage cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family HD-ZIP
Protein Properties Length: 883aa    MW: 94806 Da    PI: 5.4691
Description HD-ZIP family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
Sobic.001G286700.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  64.6   1.3e-20  141    196  1          56
                           TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           +++ +++t++q++++e+lF+++++p+ ++r +L+++lgL+ rqVk+WFqNrR+++k
  Sobic.001G286700.1.p 141 KKRYHRHTAHQIQQMEALFKECPHPDDKQRLKLSQELGLKPRQVKFWFQNRRTQMK 196
                           688999***********************************************998 PP

2    START     128.5  8.3e-41  362    598  2          206
                           HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS...................SCEEEEEEEECCSCHHHHHHHHHCCCGGCT CS
                 START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv..................dsgealrasgvvdmvlallveellddkeqW 72 
                           la +aa+ l k+  a+ep+W + +    + ev+   e ++                    ++e +r+s+vv+m++ +lv  +ld + +W
  Sobic.001G286700.1.p 362 LAATAADTLAKMCRAGEPLWLRRR--GASSEVMVADEHARMfswpvdggqqgsastgaaARTEGSRDSAVVIMNSITLVDAFLDAN-KW 447
                           577899999999999999999999..444444444444444577777778899999999999************************.** PP

                           -TT-S....EEEEEEEECTT........EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE..TTS-EEEEEEEEE-TTS--..-TTSEE-E CS
                 START  73 detla....kaetlevissg........galqlmvaelqalsplvp.RdfvfvRyirq.lgagdwvivdvSvdseqkppe.sssvvRae 146
                            e ++    ka t++vi+ g        g l lm+ae q++splvp R++vf Ry+     +g+w +vd   d  q     +ssvv++ 
  Sobic.001G286700.1.p 448 MELFPsivsKARTIQVINHGarsghmgsGSLLLMQAEVQFPSPLVPaREVVFFRYCMHnGDEGTWSVVDFPADGFQLEGLqTSSVVKCC 536
                           **********************************************************777899*****988877776668******** PP

                           ESSEEEEEEEECTCEEEEEEEE-EE--SSXX..HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                 START 147 llpSgiliepksnghskvtwvehvdlkgrlp..hwllrslvksglaegaktwvatlqrqcek 206
                           ++pSg++i++++ng+s v+wveh+++ g     h++++  v sg+a+ga +wv+ lqrqce+
  Sobic.001G286700.1.p 537 RRPSGCIIQDMPNGYSSVVWVEHMEMVGEEKplHQVFKDYVASGYAFGATRWVSLLQRQCER 598
                           *************************98765449***************************97 PP
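
The middle line of each alignment block above can be tallied programmatically. The sketch below assumes the usual HMMER display convention (a letter marks a residue identical to the model consensus, `+` marks a positively scoring substitution, and a space marks anything else). The function name `classify_match_line` is illustrative and is not part of PlantTFDB or HMMER; the string is the Homeobox match line copied from this record.

```python
from collections import Counter

# Middle ("match") line of the Homeobox alignment, copied from the record.
# Assumption: letter = identical to consensus, '+' = positive score,
# space = neither (standard HMMER-style alignment display).
HOMEOBOX_MATCH_LINE = "+++ +++t++q++++e+lF+++++p+ ++r +L+++lgL+ rqVk+WFqNrR+++k"


def classify_match_line(match_line):
    """Count identical, positive and other columns in a match line."""
    counts = Counter(identical=0, positive=0, other=0)
    for ch in match_line:
        if ch.isalpha():
            counts["identical"] += 1
        elif ch == "+":
            counts["positive"] += 1
        else:
            counts["other"] += 1
    return counts


counts = classify_match_line(HOMEOBOX_MATCH_LINE)
aligned_columns = sum(counts.values())
print(counts, aligned_columns)
```

The column total should equal the aligned span reported in the table (residues 141 to 196, HMM positions 1 to 56), which gives a quick consistency check on a copied alignment.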

Protein Features
3D Structure
Database         Entry ID          E-value    Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  6.0E-21    126    192  IPR009057    Homeodomain-like
SuperFamily      SSF46689          7.1E-19    132    199  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.07      138    198  IPR001356    Homeobox domain
SMART            SM00389           1.1E-19    139    202  IPR001356    Homeobox domain
Pfam             PF00046           3.7E-18    141    196  IPR001356    Homeobox domain
CDD              cd00086           6.99E-19   141    199  No hit       No description
PROSITE pattern  PS00027           0          173    196  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848           44.231     352    601  IPR002913    START domain
SuperFamily      SSF55961          1.19E-27   355    600  No hit       No description
SMART            SM00234           2.5E-22    361    598  IPR002913    START domain
Pfam             PF01852           4.8E-33    362    598  IPR002913    START domain
CDD              cd08875           6.55E-100  372    597  No hit       No description
SuperFamily      SSF55961          1.79E-15   615    858  No hit       No description
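
The hits listed above overlap heavily, so it can help to collapse them into distinct regions of the protein. The following is a minimal sketch of a standard interval merge over the Start/End columns of this record; `merge_intervals` is a hypothetical helper, not a PlantTFDB or InterPro function, and it treats coordinates as 1-based inclusive.

```python
# (start, end) pairs copied from the Protein Features table of this record.
HITS = [
    (126, 192), (132, 199), (138, 198), (139, 202), (141, 196),
    (141, 199), (173, 196), (352, 601), (355, 600), (361, 598),
    (362, 598), (372, 597), (615, 858),
]


def merge_intervals(hits):
    """Merge overlapping inclusive intervals into non-overlapping regions."""
    merged = []
    for start, end in sorted(hits):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


regions = merge_intervals(HITS)
for start, end in regions:
    print(f"{start}-{end} ({end - start + 1} aa)")
```

On this record the merge yields three regions: one covering the homeodomain hits, one covering the START domain hits, and a C-terminal region supported only by the second SSF55961 hit.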
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0048497  Biological Process  maintenance of floral organ identity
GO:0005634  Cellular Component  nucleus
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 883 aa
MFGDCQVLSS MAAMAGASSS ADALFIPNPG ALAGFMSSSA AAMPFHHFST TAASLIPKEE  60
GGIMGALQVA KDEDMDQLEM DMELSGGSGS AHLDGLLSFA DVDDDRTEQK PQHSGLELQT  120
TVDAAGQQQQ QQQLATANGK KKRYHRHTAH QIQQMEALFK ECPHPDDKQR LKLSQELGLK  180
PRQVKFWFQN RRTQMKAQQD RADNVLLRAE NESLKSDNYR LQAAIRNVVC PNCGHAAVLG  240
EMSYEEQQLR IENARLKDEL DRLACIATRY GGGRQPSMSS ALGCLSAPPP VLMPPLDLDM  300
NVYARHFTDQ SSVMGCGDLI QSVLAPQQQI PVGGAEHHAT SSFMGAAIGP VQEQDRQLVL  360
DLAATAADTL AKMCRAGEPL WLRRRGASSE VMVADEHARM FSWPVDGGQQ GSASTGAAAR  420
TEGSRDSAVV IMNSITLVDA FLDANKWMEL FPSIVSKART IQVINHGARS GHMGSGSLLL  480
MQAEVQFPSP LVPAREVVFF RYCMHNGDEG TWSVVDFPAD GFQLEGLQTS SVVKCCRRPS  540
GCIIQDMPNG YSSVVWVEHM EMVGEEKPLH QVFKDYVASG YAFGATRWVS LLQRQCERLA  600
SELARNIADL GVIRTPEART NMMKLSQRMI TTFSANISAS GSQSWTALSE TTEDTIRVTT  660
RKNTDPGQPS GVILTAVSTS WLPFSHQQVF ELLADEQQRC QLEILSNGGS LHEVAHIANG  720
SHPRNCISLL RINAASNSSQ NVELMLQETS THPDGGSLVV FATVDVDAIQ VTMSGEDPSY  780
IPLLPLGFAI FPATNPSPAA TSTSSGNGES SPGNTDEPTS GCLLTVGMQV LASAVPSAKL  840
NLSSITAINS HVCNAIHQIT TALKGQGAGV SGVEPVAAAG SD*
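
The sequence block above is formatted in groups of ten residues with a running position counter at the end of each row, so it needs cleaning before domain coordinates can be applied to it. The sketch below is one way to do that, using two rows copied from this record (residues 121 to 240) and the Homeobox coordinates from the Signature Domain table. The helper names `clean_sequence` and `slice_1based` are hypothetical.

```python
import re

# Two rows copied from the record; they cover residues 121-240.
# The trailing numbers are position counters, not sequence.
BLOCK = """
TVDAAGQQQQ QQQLATANGK KKRYHRHTAH QIQQMEALFK ECPHPDDKQR LKLSQELGLK  180
PRQVKFWFQN RRTQMKAQQD RADNVLLRAE NESLKSDNYR LQAAIRNVVC PNCGHAAVLG  240
"""
BLOCK_OFFSET = 121  # 1-based position of the first residue in BLOCK


def clean_sequence(text):
    """Strip spaces, position counters and any stop symbol ('*')."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


def slice_1based(seq, start, end, offset=1):
    """Return residues start..end (1-based, inclusive).

    `offset` is the 1-based position of seq[0] in the full protein.
    """
    return seq[start - offset:end - offset + 1]


seq = clean_sequence(BLOCK)
homeobox = slice_1based(seq, 141, 196, offset=BLOCK_OFFSET)
print(homeobox)
```

The extracted string can be compared against the target line of the Homeobox alignment (residues 141 to 196) to confirm that the coordinates and the sequence agree.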
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  Sobic.001G286700.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AJ250986  0.0      AJ250986.2 Zea mays mRNA for OCL4 protein.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_002464880.2  0.0      homeobox-leucine zipper protein ROC3
Swissprot  A2ZAI7          0.0      ROC3_ORYSI; Homeobox-leucine zipper protein ROC3
TrEMBL     A0A1B6QLR7      0.0      A0A1B6QLR7_SORBI; Uncharacterized protein
STRING     Sb01g028160.1   0.0      (Sorghum bicolor)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MonocotsOGMP78463441
Representative plantOGRP94751113
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G46880.1  0.0      homeobox-7