PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PH01000841G0210
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Bambusoideae; Arundinarodae; Arundinarieae; Arundinariinae; Phyllostachys
Family HD-ZIP
Protein Properties Length: 838aa    MW: 94240.6 Da    PI: 8.7858
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PH01000841G0210genomeICBRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox58.21.4e-18162216256
                      T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
         Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                      rk+ +++k+q  +Lee F+++++++ ++++ LAk+l+L+ rqV vWFqNrRa+ k
  PH01000841G0210 162 RKKLRLSKDQSAVLEESFKEHNTLNPKQKAALAKQLNLKPRQVEVWFQNRRARTK 216
                      788899***********************************************98 PP

2HD-ZIP_I/II126.41.2e-40162251191
      HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreel 91 
                      +kk+rlsk+q+++LEesF+e+++L+p++K++la++L+l+prqv+vWFqnrRARtk+kq+E+d+e+Lkr++++l+een+rL+kev+eLr +l
  PH01000841G0210 162 RKKLRLSKDQSAVLEESFKEHNTLNPKQKAALAKQLNLKPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTEENRRLHKEVQELR-TL 251
                      69*************************************************************************************9.55 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF046187.4E-2920137IPR006712HD-ZIP protein, N-terminal
Gene3DG3DSA:1.10.10.607.4E-19143216IPR009057Homeodomain-like
SuperFamilySSF466893.59E-18151219IPR009057Homeodomain-like
PROSITE profilePS5007117.184158218IPR001356Homeobox domain
SMARTSM003892.6E-17160222IPR001356Homeobox domain
CDDcd000862.96E-16162219No hitNo description
PfamPF000465.9E-16162216IPR001356Homeobox domain
PROSITE patternPS000270193216IPR017970Homeobox, conserved site
SMARTSM003402.0E-23218261IPR003106Leucine zipper, homeobox-associated
PfamPF021832.3E-10218251IPR003106Leucine zipper, homeobox-associated
PfamPF142232.2E-10351403No hitNo description
PfamPF139764.9E-16508574IPR025724GAG-pre-integrase domain
CDDcd092726.00E-20735819No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 838 aa     Download sequence    Send to blast
MEMVLNGRDE QYPHHHLGLG LGLGLSLSIA DTAEAPQRAL SVAPISTLPA APQTQCWNGA  60
GLFFSSSSSG DQQRTGACMH ADQRAAACHE MPFLRGIDVN RAPTGERPGS CSEEDEDPGA  120
SSPNSTLSSL SGKRGAPARS GGEQEMRAGS DDEDSGAGGG SRKKLRLSKD QSAVLEESFK  180
EHNTLNPKQK AALAKQLNLK PRQVEVWFQN RRARTKLKQT EVDCEFLKRC CETLTEENRR  240
LHKEVQELRT LKLVPPQLYM RMPPPTTLTM CPSCERLVSG KPDTADEGRP VPRGPWGPVP  300
APAMFVDRPA QRIFSKGLRE GKPKTVEDDD WEELKEQAAA IRGGKNPAGR LEKKIVDEDK  360
TIILLCSLPP SYEHLVTTLT YGKDTIKVGE ITAALLAHNQ RKLNMWESSQ VSNGRLIDAW  420
ILDLGCSYHI TPNREWFTSY RSVEIKMDDG IAKTMHDVRH IPGLKKNQIS LDSLHNCGLT  480
YEADNDKETM KNCKCALTVM KGRGTARNIY ILFWSTVVGG VNSVESHNDT AKLWHMRLGH  540
LSVRGMTKLH NKDLLARIKS CELGLCKSCV LGKQSKVWVY FMKQKSEVFA KFKFQRYCEE  600
HGIQWHFTMR KTPQKNADER SKLDAKSTKC NFMGFKKGVK GYKLWDPISK KVVMSRYVVF  660
DEKSILQQNK EEVAQEIEDK EFEEMAKISY ANAVGYLLYA MKAILRDSQV DIQLLERYNK  720
HGIIFGGQQG NPLVVGYVDS NYAGDLDNRR STTGTKHIHE RYNRIKEFAI YLAKNQVYHV  780
RTKHIHERYN RIKELINSEE IDLMKMHTDD NAADMLTKPA TANKFEHCLD LLGVTGC*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1160166SRKKLRL
2210218RRARTKLKQ
Cis-element ? help Back to Top
SourceLink
PlantRegMapPH01000841G0210
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAK3669661e-140AK366966.1 Hordeum vulgare subsp. vulgare mRNA for predicted protein, complete cds, clone: NIASHv2049C06.
GenBankAK3685491e-140AK368549.1 Hordeum vulgare subsp. vulgare mRNA for predicted protein, complete cds, clone: NIASHv2075M23.
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP25873889
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G16780.18e-64homeobox protein 2