PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID RcHm_v2.0_Chr2g0138951
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family HD-ZIP
Protein Properties Length: 763 aa    MW: 84833.7 Da    pI: 5.7096
Description HD-ZIP family protein
Gene Model
Gene Model ID    Type    Source
RcHm_v2.0_Chr2g0138951    genome    GDR
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1    Homeobox    67.1    2.3e-21    108    163    1    56
                             TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
                Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                             r+k +++t+eq++e+e+lF+++++p++++r++L+k+lgL  rqVk+WFqNrR++ k
  RcHm_v2.0_Chr2g0138951 108 RKKYHRHTTEQIREMEALFKESPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIK 163
                             7999************************************************9877 PP

2    START    232.2    1.4e-72    277    500    2    206
                             HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S. CS
                   START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla. 77 
                             + ++a++el+k+a+a+ep+Wv+s+    e++n+de++++f  +        +s+ea+r++gvv+ +l++lv++++d++ qW+e+++ 
  RcHm_v2.0_Chr2g0138951 277 IVNQAMEELQKMATAGEPLWVRSVetgrEILNYDEYIKEFNIEIPasgrpkRSIEASRDTGVVFVDLPRLVQSFMDVN-QWKEMFPs 362
                             56789***********************************99888*********************************.******** PP

                             ...EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEE CS
                   START  78 ...kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgili 154
                                ka+t++vis+g      ga+qlm+aelq+l+plvp R+++fvR+++ql+ ++w++vdvS+d  +++  ++s+v+++++pSg++i
  RcHm_v2.0_Chr2g0138951 363 misKAATVDVISNGegdnrnGAVQLMFAELQMLTPLVPtREVYFVRCSKQLSPEQWAVVDVSIDKVEDNI-DASLVKCRKRPSGCII 448
                             ********************************************************************98.9*************** PP

                             EEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                   START 155 epksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                             e+ksngh+kv+wveh ++++++++ ++rs+v+sgla+ga++wvatlq qce+
  RcHm_v2.0_Chr2g0138951 449 EDKSNGHCKVIWVEHLECQKSTVQTMYRSIVNSGLAFGARHWVATLQLQCER 500
                             **************************************************97 PP
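
The middle line of each alignment above is the match line between the HMM consensus and the protein. As a minimal sketch, assuming the usual HMMER convention (a letter marks a residue identical to the consensus, `+` marks a positively scoring substitution, a space marks anything else), the Homeobox match line can be tallied as follows. The function name `classify_match_line` is hypothetical and is not part of PlantTFDB or HMMER.

```python
def classify_match_line(match_line: str) -> dict:
    """Tally the symbols of an HMMER-style match line.

    Assumed convention: letter = identical to consensus,
    '+' = conservative substitution, anything else = no match.
    """
    counts = {"identical": 0, "conservative": 0, "other": 0}
    for ch in match_line:
        if ch.isalpha():
            counts["identical"] += 1
        elif ch == "+":
            counts["conservative"] += 1
        else:
            counts["other"] += 1
    return counts


# Match line of the Homeobox alignment (HMM positions 1-56), copied from the record.
homeobox_match = "r+k +++t+eq++e+e+lF+++++p++++r++L+k+lgL  rqVk+WFqNrR++ k"

counts = classify_match_line(homeobox_match)
print(counts)
```

The three counts always sum to the length of the aligned region, which gives a quick sanity check that the line was copied without losing spaces.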

Sequence
Protein Sequence    Length: 763 aa
MGVDMSNNPP TSRTKDFFAS PALSLSLAGI FRDAGTPAET AANREGEEGD EGSVGGRRRS  60
REDTAEISSE NSGPARSRSA EDIDFDAELG EPDEDDGDGD NKNKKKKRKK YHRHTTEQIR  120
EMEALFKESP HPDEKQRQQL SKQLGLAPRQ VKFWFQNRRT QIKAIQERHE NSLLKGEMEK  180
LRDENKAMRE QINKACCPNC GSATTSRDAT LTTEEQQLRI ENARLKSEVE KLRAALVKYP  240
IGTSSPSSST GQDQESRSSL DFYTGIFGLE KSRIMEIVNQ AMEELQKMAT AGEPLWVRSV  300
ETGREILNYD EYIKEFNIEI PASGRPKRSI EASRDTGVVF VDLPRLVQSF MDVNQWKEMF  360
PSMISKAATV DVISNGEGDN RNGAVQLMFA ELQMLTPLVP TREVYFVRCS KQLSPEQWAV  420
VDVSIDKVED NIDASLVKCR KRPSGCIIED KSNGHCKVIW VEHLECQKST VQTMYRSIVN  480
SGLAFGARHW VATLQLQCER LVFFMATNVP MKDSTGVATL AGRKSILKMA QRMTSSFCRA  540
IGASSYHTWT KISSKTGDDI RISSRKNLND PGEPLGVILC AVSSVWLPVS PYALFDFLRD  600
ENRRNEWDIM LNGGPAQTIA NLSKGQDRGN AVTIQTMKSK ENSMWILQDS CINAYESMVV  660
YAPVDITGMQ SVMTGCDSSN MAVLSSGFSI LPDGLESRPM VITSRQEDRG SDSEGGTLLT  720
VAFQVLTNTS PTAKLTVESV ESANTLISCT LRNIKTSLQC EDG
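
The sequence block is printed in groups of ten residues with a running position count at the end of each line, so it has to be cleaned before coordinates from this record can be applied to it. The sketch below shows one way to do that, using only the second and third lines of the block (residues 61 to 180). The helper names `parse_sequence_block` and `slice_1based` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which is consistent with the Homeobox alignment shown in this record.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join a grouped, numbered sequence block into one contiguous string."""
    # Keep letters only: drops spaces, line breaks, and the running position counts.
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_1based(seq: str, start: int, end: int, first_residue: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    first_residue is the protein position of seq[0], for use with excerpts.
    """
    lo = start - first_residue
    hi = end - first_residue + 1
    if lo < 0 or hi > len(seq) or lo >= hi:
        raise ValueError("coordinates fall outside the supplied sequence")
    return seq[lo:hi]


# Lines 2 and 3 of the sequence block above, i.e. residues 61-180.
excerpt = """
REDTAEISSE NSGPARSRSA EDIDFDAELG EPDEDDGDGD NKNKKKKRKK YHRHTTEQIR  120
EMEALFKESP HPDEKQRQQL SKQLGLAPRQ VKFWFQNRRT QIKAIQERHE NSLLKGEMEK  180
"""

fragment = parse_sequence_block(excerpt)
homeobox = slice_1based(fragment, 108, 163, first_residue=61)  # Homeobox domain
nls = slice_1based(fragment, 104, 109, first_residue=61)       # first NLS entry

print(len(fragment))
print(homeobox)
print(nls)
```

With the full 763-residue block pasted in place of the excerpt, `first_residue` can be left at its default of 1.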
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1    104    109    KKKKRK
2    104    110    KKKKRKK
3    106    110    KKRKK
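
The NLS entries sit immediately upstream of the Homeobox domain (residues 108 to 163) and run into its first few residues. A small interval check makes the shared positions explicit. This is only an illustrative sketch: the `overlap` helper is hypothetical, and the coordinates are taken from this record and treated as 1-based and inclusive.

```python
from typing import Optional, Tuple

Interval = Tuple[int, int]  # (start, end), 1-based and inclusive


def overlap(a: Interval, b: Interval) -> Optional[Interval]:
    """Return the shared region of two inclusive intervals, or None if disjoint."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return (start, end) if start <= end else None


homeobox = (108, 163)
nls_entries = [(104, 109), (104, 110), (106, 110)]

# Report which residues of each predicted NLS fall inside the Homeobox domain.
shared = [overlap(nls, homeobox) for nls in nls_entries]
for nls, region in zip(nls_entries, shared):
    print(nls, "->", region)
```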
Best hit in Arabidopsis thaliana
Hit ID    E-value    Description
AT1G79840.1    0.0    HD-ZIP family protein