PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID RcHm_v2.0_Chr4g0441321
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family HD-ZIP
Protein Properties Length: 208aa    MW: 23447.8 Da    PI: 9.6513
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
RcHm_v2.0_Chr4g0441321genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox58.61e-1858112256
                             T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
                Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                             rk+ ++tkeq + Lee F+++++++ +++e LA +l+L  rqV vWFqNrRa+ k
  RcHm_v2.0_Chr4g0441321  58 RKKLRLTKEQSRLLEESFRQHHTLNPRQKEALALQLKLRPRQVEVWFQNRRARSK 112
                             78889************************************************98 PP

2HD-ZIP_I/II118.34.3e-3858146189
             HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeL 87 
                             +kk+rl+keq++lLEesF+++++L+p++K++la +L+l+prqv+vWFqnrRAR k+kq+E+++e+Lkr++ +l+e+n+rL++eveeL
  RcHm_v2.0_Chr4g0441321  58 RKKLRLTKEQSRLLEESFRQHHTLNPRQKEALALQLKLRPRQVEVWFQNRRARSKLKQTEMECEYLKRWFGSLTEQNRRLQREVEEL 144
                             69************************************************************************************* PP

             HD-ZIP_I/II  88 re 89 
                             r+
  RcHm_v2.0_Chr4g0441321 145 RA 146
                             94 PP

Sequence ? help Back to Top
Protein Sequence    Length: 208 aa     Download sequence    
MASLELTMSV PGLSSYPSLP SSVRDLDINQ VPLAQRPDQE EILEDEEDSS NGSGGPPRKK  60
LRLTKEQSRL LEESFRQHHT LNPRQKEALA LQLKLRPRQV EVWFQNRRAR SKLKQTEMEC  120
EYLKRWFGSL TEQNRRLQRE VEELRAMKVG PPTVISPHTC EPLPASTLTM CPRCERITTT  180
STTTGPTKTK ISAAANTALS SKVATPAL
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1107115RRARSKLKQ
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G01430.14e-70HD-ZIP family protein