PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Pav_sc0001405.1_g720.1.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family HD-ZIP
Protein Properties Length: 808 aa    MW: 88162.2 Da    pI: 6.5496
Description HD-ZIP family protein
Gene Model
Gene Model ID              Type    Source  Coding Sequence
Pav_sc0001405.1_g720.1.mk  genome  GDR     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  63.1   4.2e-20  117    172  1          56
                                TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
                   Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                                ++k +++t++q++eLe++F+++++p++++r e++++l+L+++qVk+WFqNrR+++k
  Pav_sc0001405.1_g720.1.mk 117 KKKYHRHTPSQIQELENFFKECPHPDEKQRLEMSRRLSLETKQVKFWFQNRRTQMK 172
                                79999************************************************999 PP

2    START     175.1  4.3e-55  327    548  2          206
                                HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS....SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S. CS
                      START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv...dsgealrasgvvdmvlallveellddkeqWdetla. 77 
                                la +a++elvk+a+a++p+Wvk s    e++n +e++     +     + +e  r++ +v+ ++  lve+l+d + +W e++  
  Pav_sc0001405.1_g720.1.mk 327 LAMAAMDELVKMAQADSPLWVKTSdggtEILNHEEYRAFSCIGTKpsnFVTEGTRDTCMVIINSLALVETLMDAN-RWAEMFSc 409
                                57799*******************9*999999999987655544466679*************************.******** PP

                                ...EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEE CS
                      START  78 ...kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSg 151
                                   +a++++ issg      galq+m ae q+lsplvp R   f+R+++q+++g+w++vdvS+d +q+ +++  +  ++++pSg
  Pav_sc0001405.1_g720.1.mk 410 lvaRASVIDMISSGmggtrnGALQVMHAEHQVLSPLVPvRPLKFLRFCKQHQEGVWAVVDVSIDINQEGSNTNAFLNCRRFPSG 493
                                ************************************************************************************ PP

                                EEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                      START 152 iliepksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                                +++++++n+ skvtw+eh +++++++h l+ +l++sg+ +ga++w+atlqrqce+
  Pav_sc0001405.1_g720.1.mk 494 CIVQDMPNNCSKVTWIEHSEYDENTVHHLFWQLLRSGMGFGAQRWLATLQRQCEC 548
                                *****************************************************96 PP
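
The middle line of each alignment above can be summarised numerically. The sketch below tallies the Homeobox middle line, assuming the usual HMMER-style convention (an assumption, not stated in this record) that a letter marks a residue identical to the profile consensus, `+` marks a conservative substitution, and a blank marks no match. The function name `classify_midline` is illustrative only.

```python
from collections import Counter

# Middle line of the Homeobox alignment, copied from the record (one character per column).
MIDLINE = "++k +++t++q++eLe++F+++++p++++r e++++l+L+++qVk+WFqNrR+++k"


def classify_midline(midline: str) -> Counter:
    """Tally alignment columns by the symbol shown on the middle line.

    Assumed convention: letter = identical to consensus,
    '+' = conservative substitution, anything else = no match.
    """
    counts = Counter()
    for ch in midline:
        if ch.isalpha():
            counts["identical"] += 1
        elif ch == "+":
            counts["similar"] += 1
        else:
            counts["mismatch"] += 1
    return counts


counts = classify_midline(MIDLINE)
aligned = sum(counts.values())            # number of aligned columns
identity = counts["identical"] / aligned  # fraction identical to the consensus

print(f"columns={aligned} identical={counts['identical']} "
      f"similar={counts['similar']} mismatch={counts['mismatch']}")
print(f"identity to consensus: {identity:.1%}")
```

The same function could be applied to the START alignment by concatenating the middle lines of its three blocks, provided the gap columns are handled as intended for the analysis at hand.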

Sequence
Protein Sequence    Length: 808 aa
MSFGGLISGG SGNSGGGGAR VVADIAPHTP YMSSGAIAQP HFLTSPVPNS MRSSSALSLS  60
IKNMDGHSEL GLIAENFDPG IIGRMRDDEY ESRSGSDNFE GASGDDQDAG DERRPRKKKY  120
HRHTPSQIQE LENFFKECPH PDEKQRLEMS RRLSLETKQV KFWFQNRRTQ MKTQLERHEN  180
IILRQENDKL RAENGVMKDA MANPVCNSCG GPAIPGQISF EEHQLRIENA RLKDELNRIC  240
TLANKFLGRP ISSLASPISL PNSTSGLELG VGRNGIGSLS AGGSGLPMGL NLGDGVSSSS  300
PMMPLIKSST GILGNEMPYE RSMYIDLAMA AMDELVKMAQ ADSPLWVKTS DGGTEILNHE  360
EYRAFSCIGT KPSNFVTEGT RDTCMVIINS LALVETLMDA NRWAEMFSCL VARASVIDMI  420
SSGMGGTRNG ALQVMHAEHQ VLSPLVPVRP LKFLRFCKQH QEGVWAVVDV SIDINQEGSN  480
TNAFLNCRRF PSGCIVQDMP NNCSKVTWIE HSEYDENTVH HLFWQLLRSG MGFGAQRWLA  540
TLQRQCECLA FLISSTNSIE DHTGLGTNGK KSMLKLAQRM IDNFCAGVSA SSVRKWDKLC  600
VNNVSEDLRV MARKSVDDPG EPAGIVLSGS TSVWLPVSRH RLFDFLRDEQ LRDQWDVLSK  660
THKMQLMLRI AKSQGGGNCV SLLRANVIDA NENTMLMLQE SWSDASGSLV VYAPVDPASM  720
SAVMRGGDSA YVALLPSGFA ILPGGPPGYG MVKTEGNGCD DGGCFLTVGF QILGSNYPAA  780
KLDVQSINTV NTLVSHTIEK IKSALQVP
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    113    119  RRPRKKK
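
The Start and End values in this record appear to be 1-based and inclusive. A minimal sketch of how that can be checked against the sequence listing, using only its first two lines (residues 1 to 120); the helper names `clean_sequence` and `subsequence` are hypothetical and not part of any PlantTFDB tooling:

```python
# First 120 residues, copied from the record's sequence listing,
# including the spacing and the trailing position counters.
RAW = """
MSFGGLISGG SGNSGGGGAR VVADIAPHTP YMSSGAIAQP HFLTSPVPNS MRSSSALSLS  60
IKNMDGHSEL GLIAENFDPG IIGRMRDDEY ESRSGSDNFE GASGDDQDAG DERRPRKKKY  120
"""


def clean_sequence(raw: str) -> str:
    """Drop spaces, newlines and position counters, keeping only residue letters."""
    return "".join(ch for ch in raw if ch.isalpha())


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"range {start}-{end} outside sequence of length {len(seq)}")
    return seq[start - 1:end]


seq = clean_sequence(RAW)
print(len(seq))                    # residues available in this excerpt
print(subsequence(seq, 113, 119))  # reported NLS range
```

With the full 808-residue listing pasted into `RAW`, the same helper could be used to pull out the Homeobox (117 to 172) and START (327 to 548) regions.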
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G00730.1  0.0      HD-ZIP family protein