PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pav_sc0001710.1_g870.1.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family HD-ZIP
Protein Properties Length: 577aa    MW: 64578.8 Da    PI: 6.1187
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pav_sc0001710.1_g870.1.mkgenomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox67.61.6e-21104159156
                                TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
                   Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                                r+k +++t+eq++e+e+lF+++++p++++r++L+k+lgL  rqVk+WFqNrR++ k
  Pav_sc0001710.1_g870.1.mk 104 RKKYHRHTTEQIREMEALFKESPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIK 159
                                7999************************************************9877 PP

2START234.13.8e-732734962206
                                HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT CS
                      START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdet 75 
                                + ++a++el k+a+a+ep+Wv+s+    e++n+de++++f  +        +s+ea+r++gvv+ ++++lv++++d++ qW+e+
  Pav_sc0001710.1_g870.1.mk 273 IVNQAMEELKKMATAGEPLWVRSVetgrEILNYDEYIKEFNIEIPsngrpkRSIEASRETGVVFVDMPRLVQSFMDVN-QWKEM 355
                                56789***********************************99888999******************************.***** PP

                                -S....EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EES CS
                      START  76 la....kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaell 148
                                ++    ka+t++vis+g      ga+qlm+aelq+l+plvp R+++fvR+++ql+a++w+ivdvS+d  +++  ++s+v+++++
  Pav_sc0001710.1_g870.1.mk 356 FPcmisKAATVDVISNGegdnrnGAVQLMFAELQMLTPLVPtREVYFVRCCKQLSAEQWAIVDVSIDKVEDNI-DASLVKCRKR 438
                                ***********************************************************************98.9********* PP

                                SEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                      START 149 pSgiliepksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                                pSg++ie+ksngh+kv+wveh ++++++++ ++r +v+sgla+ga++wvatlq qce+
  Pav_sc0001710.1_g870.1.mk 439 PSGCIIEDKSNGHCKVIWVEHLECQKSTIQTMYRTIVNSGLAFGARHWVATLQLQCER 496
                                ********************************************************97 PP

Sequence ? help Back to Top
Protein Sequence    Length: 577 aa     Download sequence    
MGVDMSNNPP TSRTKDFFAS PALSLSLAGI FRDAGEAAVA SREVEEGDEG SGGAGSVRRR  60
EDTAEISSEN SGPARSRSED EFDGEGEHDE DDGDGDNKNK KKKRKKYHRH TTEQIREMEA  120
LFKESPHPDE KQRQQLSKQL GLAPRQVKFW FQNRRTQIKA IQERHENSLL KGEMEKLRDE  180
NKAMREQINK SCCPNCGTAT TSRDASLTTE EQQLRIENAR LKSEVEKLRA ALVKNPPGTS  240
SPSCSSGHDQ ENRSSLDFYT GIFGLEKSRI MEIVNQAMEE LKKMATAGEP LWVRSVETGR  300
EILNYDEYIK EFNIEIPSNG RPKRSIEASR ETGVVFVDMP RLVQSFMDVN QWKEMFPCMI  360
SKAATVDVIS NGEGDNRNGA VQLMFAELQM LTPLVPTREV YFVRCCKQLS AEQWAIVDVS  420
IDKVEDNIDA SLVKCRKRPS GCIIEDKSNG HCKVIWVEHL ECQKSTIQTM YRTIVNSGLA  480
FGARHWVATL QLQCERLVFF MATNVPMKDS TGPAQTIANL SKGQDRGNAV TIQVAAFQVL  540
TNSSPTAKLT MESVESVNTL ISCTLRNIKT SLQCEDG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1100105KKKKRK
2100106KKKKRKK
3102106KKRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G79840.10.0HD-ZIP family protein