PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID OPUNC01G30780.1
Organism Oryza punctata
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Oryzoideae; Oryzeae; Oryzinae; Oryza
Family HD-ZIP
Protein Properties Length: 832 aa    MW: 90504.8 Da    pI: 7.0079
Description HD-ZIP family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
OPUNC01G30780.1  genome  OGE     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  64     2.1e-20  94     149  1          56
                      TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
         Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                      r++ +++t+eq++++e+lF+++++p++++r++ +k+lgL+ rqVk+WFqNrR++ k
  OPUNC01G30780.1  94 RKNYHRHTAEQIRIMEALFKESPHPDERQRQQVSKQLGLSARQVKFWFQNRRTQIK 149
                      788999**********************************************9877 PP

2    START     174.1  8.7e-55  307    534  1          206
                      HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.........SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....E CS
            START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv........dsgealrasgvvdmvlallveellddkeqWdetla....k 78 
                      ela +a++elv + +++ep+Wv+ +    +++n+de+++ f  +++         s+ea+r++g+v  ++++lv  ++d+  +W+  ++    k
  OPUNC01G30780.1 307 ELAGRALDELVGMCSSGEPLWVRGVetgrDILNYDEYVRLFRHDHGgsgdqppgWSVEASRECGLVYLDTVQLVHAFMDVD-KWKALFPtmisK 399
                      57899************************************99999***********************************.99988888888* PP

                      EEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEE CS
            START  79 aetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvt 165
                      a+tlevi++g      g+lqlm+a+lq l+p+vp R+ +f+Ry+++  a++w+ivdvS d+ +    +ss vR+ + pSg+lie++ ng++kvt
  OPUNC01G30780.1 400 AATLEVINNGekdgrdGVLQLMYAQLQTLTPMVPtRELYFARYCKKVAAERWAIVDVSFDESETGVHESSPVRCWKNPSGCLIEEENNGRCKVT 493
                      ************************************************************99999889************************** PP

                      EEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
            START 166 wvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                      w+eh+ ++  + + ++r++  sg+a+ga++wva+lq qce+
  OPUNC01G30780.1 494 WLEHTRCRRCTAPPVYRVVTASGVAFGARRWVAALQLQCER 534
                      ***************************************97 PP

Protein Features
Database         Entry ID           E-value    Start  End  InterPro ID  Description
SuperFamily      SSF46689           8.77E-20   79     152  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60   3.0E-22    81     145  IPR009057    Homeodomain-like
PROSITE profile  PS50071            17.605     91     151  IPR001356    Homeobox domain
SMART            SM00389            3.5E-17    93     155  IPR001356    Homeobox domain
Pfam             PF00046            8.8E-18    94     149  IPR001356    Homeobox domain
CDD              cd00086            6.19E-17   94     149  No hit       No description
PROSITE pattern  PS00027            0          126    149  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848            39.232     298    537  IPR002913    START domain
SuperFamily      SSF55961           2.75E-31   299    534  No hit       No description
CDD              cd08875            2.81E-106  302    533  No hit       No description
Pfam             PF01852            4.5E-45    307    534  IPR002913    START domain
SMART            SM00234            3.2E-47    307    534  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  6.3E-4     377    497  IPR023393    START-like domain
SuperFamily      SSF55961           1.1E-5     581    696  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0009957  Biological Process  epidermal cell fate specification
GO:0010062  Biological Process  negative regulation of trichoblast fate specification
GO:0005634  Cellular Component  nucleus
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 832 aa
MGTNRPPPRT KDFFAAPALS LTLAGVFGRK NGPAASGGDE VEEGDEEVQA AGEAVEISSE  60
NAGPGCSQSQ SGGGSGEDGG HDDDDGEGSK KKRRKNYHRH TAEQIRIMEA LFKESPHPDE  120
RQRQQVSKQL GLSARQVKFW FQNRRTQIKA VQERHENSLL KSELEKLQDE HRAMRELAKK  180
PSRCPNCGVA AASSDAAAAD AAADTREQRL RLENAKLKTE VCMHRLARPF RCATCKALTP  240
AEWRCWDSFQ IERLRRGTPG KAAADGVASP TSPPCSTGAV QASNRSPLHE NDGGFVCHDD  300
DKPRILELAG RALDELVGMC SSGEPLWVRG VETGRDILNY DEYVRLFRHD HGGSGDQPPG  360
WSVEASRECG LVYLDTVQLV HAFMDVDKWK ALFPTMISKA ATLEVINNGE KDGRDGVLQL  420
MYAQLQTLTP MVPTRELYFA RYCKKVAAER WAIVDVSFDE SETGVHESSP VRCWKNPSGC  480
LIEEENNGRC KVTWLEHTRC RRCTAPPVYR VVTASGVAFG ARRWVAALQL QCERVVFAVA  540
TNVPTRDSTG KTINDTQRRV IHHPWPSHGS RLYAGVSTLA GRRSVLKLAH RMTSSLCRAV  600
GASRAMAWRR APKGGSGGND DDIWLTSREN AGDDPGEPQG LIACAAASTW LPVNPTALLD  660
LLRDESRRPE WDVMLPAKSV QSCVNLAKGK DRTNCVTAYA ARPEEEEEGG GKWVLQDICT  720
NPCESMIAYA AIDAAALQPV IAGHDSSGVH FLPCGFITVM PDGLESKPAV ITVSRRGGEA  780
WGAGSLVTVA FQVPASSSAA GTLSSDSVEA VTGLVSSTLR NIRKALGCEE DF
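The sequence block above is displayed in groups of ten residues with a running position counter at the end of each line. The following is a minimal sketch, not part of the database record, of how such a block might be cleaned and how a domain could be cut out of it. It uses only the first three displayed lines, and it assumes that the Signature Domain coordinates (94 to 149 for the Homeobox row) are 1-based and inclusive; the function names `clean_sequence` and `slice_domain` are hypothetical helpers introduced for illustration.

```python
import re

# First three lines of the sequence block as displayed on the page
# (groups of ten residues followed by a running position counter).
RAW_BLOCK = """\
MGTNRPPPRT KDFFAAPALS LTLAGVFGRK NGPAASGGDE VEEGDEEVQA AGEAVEISSE  60
NAGPGCSQSQ SGGGSGEDGG HDDDDGEGSK KKRRKNYHRH TAEQIRIMEA LFKESPHPDE  120
RQRQQVSKQL GLSARQVKFW FQNRRTQIKA VQERHENSLL KSELEKLQDE HRAMRELAKK  180
"""


def clean_sequence(block: str) -> str:
    """Drop whitespace and position counters, keeping residue letters only."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW_BLOCK)
homeobox = slice_domain(seq, 94, 149)  # Homeobox row of the Signature Domain table

print(len(seq))       # 180 residues in this three-line excerpt
print(len(homeobox))  # 56
print(homeobox)
```

Under that assumption the slice reproduces the OPUNC01G30780.1 row of the Homeobox alignment (RKNYHRH...RRTQIK), which suggests the 1-based reading is the right one for the domain table.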
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    89     93   KKKRR
2    89     94   KKKRRK
3    90     94   KKRRK
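The coordinate convention of the NLS table is not stated in the record. The sketch below, an illustration rather than anything the database documents, checks the three rows against the first 120 residues of the protein sequence under the assumption that Start and End are 0-based and inclusive. The helper `matches_zero_based` is a hypothetical name.

```python
# First 120 residues of the protein sequence, written in the page's
# ten-residue groups and joined by removing the spaces.
SEQ_PREFIX = (
    "MGTNRPPPRT KDFFAAPALS LTLAGVFGRK NGPAASGGDE VEEGDEEVQA AGEAVEISSE "
    "NAGPGCSQSQ SGGGSGEDGG HDDDDGEGSK KKRRKNYHRH TAEQIRIMEA LFKESPHPDE"
).replace(" ", "")

# Rows of the NLS table: (No., Start, End, Sequence).
NLS_ROWS = [
    (1, 89, 93, "KKKRR"),
    (2, 89, 94, "KKKRRK"),
    (3, 90, 94, "KKRRK"),
]


def matches_zero_based(seq: str, start: int, end: int, motif: str) -> bool:
    """True if seq[start..end] (0-based, inclusive) equals the motif."""
    return seq[start:end + 1] == motif


for no, start, end, motif in NLS_ROWS:
    print(no, motif, matches_zero_based(SEQ_PREFIX, start, end, motif))
```

All three rows match under the 0-based reading. In 1-based terms the motifs therefore cover residues 90 to 95, immediately upstream of and overlapping the start of the Homeobox domain at residue 94.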
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  CP012609  0.0      CP012609.1 Oryza sativa Indica Group cultivar RP Bio-226 chromosome 1 sequence.
Annotation -- Protein
Source     Hit ID           E-value  Description
Refseq     XP_015644459.1   0.0      homeobox-leucine zipper protein ROC9
Swissprot  Q5JMF3           0.0      ROC9_ORYSJ; Homeobox-leucine zipper protein ROC9
TrEMBL     A0A0E0JP01       0.0      A0A0E0JP01_ORYPU; Uncharacterized protein
STRING     OPUNC01G30780.1  0.0      (Oryza punctata)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MonocotsOGMP79938147
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G79840.1  0.0      HD-ZIP family protein