PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID RcHm_v2.0_Chr1g0352671
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family HD-ZIP
Protein Properties Length: 830aa    MW: 90368.2 Da    PI: 6.5083
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
RcHm_v2.0_Chr1g0352671genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox66.63.2e-21134189156
                             TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
                Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                             +++ +++t++q++eLe+lF+++++p++++r eL+++l+L++rqVk+WFqNrR+++k
  RcHm_v2.0_Chr1g0352671 134 KKRYHRHTPQQIQELEALFKECPHPDEKQRLELSRRLNLETRQVKFWFQNRRTQMK 189
                             688999***********************************************999 PP

2START200.86e-633385611206
                             HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S. CS
                   START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla. 77 
                             ela++a++elvk+a+  +++W +s     e +n +e++++f++  +     + +ea+r++g+v+ ++  lve+l+d++ +W e+++ 
  RcHm_v2.0_Chr1g0352671 338 ELALAAMDELVKLAQT-DELWLRSLeggrEVLNHEEYMRSFTPCIGlkpngFVTEASRETGMVIINSLALVETLMDSN-RWLEMFPc 422
                             57899******99985.68*********99***********9999899999***************************.******** PP

                             ...EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEE CS
                   START  78 ...kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgili 154
                                + +t++vissg      galqlm aelq+lsplvp R++ f+R+++q+ +g+w++vdvSvd  ++++  + +v +++lpSg+++
  RcHm_v2.0_Chr1g0352671 423 miaRTSTTDVISSGmggtrnGALQLMHAELQVLSPLVPvREVNFLRFCKQHAEGVWAVVDVSVDTIRDNSGAPTFVNCRRLPSGCVV 509
                             **********************************************************************9**************** PP

                             EEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                   START 155 epksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                             ++++ng+skvtwveh++++++++h l+r+l++sg+ +ga++wvatlqrqc++
  RcHm_v2.0_Chr1g0352671 510 QDMPNGYSKVTWVEHAEYDESQVHHLYRPLLSSGMGFGAQRWVATLQRQCQC 561
                             **************************************************85 PP

Sequence ? help Back to Top
Protein Sequence    Length: 830 aa     Download sequence    
MSFGGFLDNS TGSSGGARIV SDIPYNNNHH HHHHNANHTS MPSSAIAQPR LVTQSLTKSM  60
FNNSPGLSLA LQTNADGGGD APRMAENFEA NNNVGGRRSR EEENEISRSG SDNMDGAGSG  120
DEGDAADNSN PRKKKRYHRH TPQQIQELEA LFKECPHPDE KQRLELSRRL NLETRQVKFW  180
FQNRRTQMKT QLERHENSLL RQENDKLRAE NMSIRDAMRN PICTNCGGPA MIGDISIEEQ  240
HLRIDNARLK DELDRVCALA GKFLGRPISS LGPSMGPPLP SSALELGVGN NGFGGMSSVA  300
TTMPLGPDFG AGLGGGMPIV AHTRPVAGGL DERTMFLELA LAAMDELVKL AQTDELWLRS  360
LEGGREVLNH EEYMRSFTPC IGLKPNGFVT EASRETGMVI INSLALVETL MDSNRWLEMF  420
PCMIARTSTT DVISSGMGGT RNGALQLMHA ELQVLSPLVP VREVNFLRFC KQHAEGVWAV  480
VDVSVDTIRD NSGAPTFVNC RRLPSGCVVQ DMPNGYSKVT WVEHAEYDES QVHHLYRPLL  540
SSGMGFGAQR WVATLQRQCQ CLAILMSSTV PARDHANTIT QSGRKSMLKL AQRMTDNFCA  600
GVCASTVHKW NKLHAGNVDE DVRYMTRESM DDPGEPPGIV LSAATSVWLP VSPQRLFNFL  660
RDERLRSEWD ILSNGGPMQE MAHIAKGQDQ GNCVSLLRAR AMNANQNSML ILQETCIDAA  720
GSLVVYAPVD IPAMHVVMNG GDSAYVALLP SGFAIVPDGP GSRGSEAKAG QGSSNGGGEA  780
RVSGSLLTMT FQILVNSLPS AKLTVESVET VNNLISCTVQ KIKAALQCES
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein