PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PCP026730.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family HD-ZIP
Protein Properties Length: 941aa    MW: 103597 Da    PI: 5.1398
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PCP026730.1genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox66.73e-21107162156
                  TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
     Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                  r+k +++t+eq++e+e+lF+++++p++++r++L+k+lgL  rqVk+WFqNrR++ k
  PCP026730.1 107 RKKYHRHTTEQIREMEALFKESPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIK 162
                  7999************************************************9877 PP

2START2261.1e-702764992206
                  HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEE CS
        START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevi 85 
                  + ++a++el k+a+a+ep+Wv+s+    e++n+de++++f  +        +s+ea+r++g+v+ +l++lv++++d++ qW+e+++    ka+t++vi
  PCP026730.1 276 IVNQAMEELKKMATAGEPLWVRSVetgrEILNYDEYIKEFNIEVPgngrpkRSIEASRETGLVFVDLPRLVQSFMDVN-QWKEMFPcmisKAATVDVI 372
                  56789***********************************77544999******************************.******************* PP

                  CTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSX CS
        START  86 ssg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrl 176
                  ++g      ga+qlm+aelq+l+plvp R+++fvR+++ql+ ++w+ivdvS+d  +++  ++s+v+++++pSg++ie+k ngh+kv+wveh +++ ++
  PCP026730.1 373 NNGegdnrnGAVQLMFAELQMLTPLVPtREVYFVRCCKQLSPEQWAIVDVSIDKVEDNI-DASLVKCRKRPSGCIIEDKTNGHCKVIWVEHLECQRST 469
                  *********************************************************98.9************************************* PP

                  XHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
        START 177 phwllrslvksglaegaktwvatlqrqcek 206
                  ++ ++r +v+sgla+ga++wvatlq qce+
  PCP026730.1 470 IQTMYRTIVNSGLAFGARHWVATLQLQCER 499
                  ****************************97 PP

Sequence ? help Back to Top
Protein Sequence    Length: 941 aa     Download sequence    
MGVDMSNNPP TSRTKDFFAS PALSLSLAGI FRDAGAAASR EVEEGDEGSG GGASAAVGSV  60
RRREDTTEIS SENSGPGRSR SEDEFDGEGE HDEDDVDGDN KNKKKKRKKY HRHTTEQIRE  120
MEALFKESPH PDEKQRQQLS KQLGLAPRQV KFWFQNRRTQ IKAIQERHEN SLLKGEMEKL  180
RDENKAMREQ INKACCPNCG TATTSRDATL TTEEQQLRIE NARLKSEVEK LRAALVKYPP  240
GTSSPSCSAG QDQENRSSLD FYTGIFGLEE SRIMEIVNQA MEELKKMATA GEPLWVRSVE  300
TGREILNYDE YIKEFNIEVP GNGRPKRSIE ASRETGLVFV DLPRLVQSFM DVNQWKEMFP  360
CMISKAATVD VINNGEGDNR NGAVQLMFAE LQMLTPLVPT REVYFVRCCK QLSPEQWAIV  420
DVSIDKVEDN IDASLVKCRK RPSGCIIEDK TNGHCKVIWV EHLECQRSTI QTMYRTIVNS  480
GLAFGARHWV ATLQLQCERL VFFMATNVPM KDSTGVATLA GRKSILKLAQ RMTASFCRAI  540
GASSYHTWTK ISSKTGDDIR IASRKNLNDP GEPLGVILCA VSSVWLPVSP YLLFDFLRDE  600
TRRNEWDIML NGGPAQTIAN LSKGQDRGNA VTIQSMKSKE NSMWILQDTC INSYESMVVY  660
APVDITGMQS VMTGCDASNI AILPSGFSIL PDGLESRPMV LTSSQEDRSS EGGTLLTAAF  720
QVLTNSSPTA KLTMESVESV NTLISCTLRN IKTSLQLVLE DESFVTLFIR FGWNGGLGTE  780
AQLTNAQSKQ QHRLNLQRLQ GNPELIRRVA FTGLALRFLG RNRHSKKSLG LANGSAFFKL  840
DNVPNLELVI WVVGLVLLLL PDPPLVLGVR GEAGDFDGDG LVAGGADDAA LEVLQGSDGR  900
EESEALWGGE GEGGSGFGEG GEGGGGAMEE RSGELGRGVE E
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1103108KKKKRK
2103109KKKKRKK
3105109KKRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G79840.10.0HD-ZIP family protein