PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID RcHm_v2.0_Chr6g0275551
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family Trihelix
Protein Properties Length: 448aa    MW: 51417.2 Da    PI: 6.7226
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
RcHm_v2.0_Chr6g0275551genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix85.56.4e-27121217185
                trihelix   1 rWtkqevlaLiearremeerlrrgk...........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tses 75 
                             +Wt+++v++Li+a+++++e++++++           +kk++W++vsk+m+erg+++sp+qC++k+++lnkrykk++++ +++ ++++
  RcHm_v2.0_Chr6g0275551 121 KWTDKMVRLLITAVSYIGEDIGSDCggggrrkfsalQKKGKWKSVSKVMAERGYHVSPQQCEDKFNDLNKRYKKLNDMLGRGtSCQV 207
                             7**********************9888999999999**********************************************55898 PP

                trihelix  76 sstcpyfdql 85 
                             +++  ++d +
  RcHm_v2.0_Chr6g0275551 208 VENPTLLDVI 217
                             8888777655 PP

Sequence ? help Back to Top
Protein Sequence    Length: 448 aa     Download sequence    
MEGHLSQGGR IPGGGSYVGL DLQGSVRAHH QTQHPHTLHQ QHHPISRQGS VVHPSIHEGF  60
PVKMGTMHNC DRTLSMVDYN KGEKCKNSAS DEDEPSYTEE GVDSHIEAQR GKKGSPWQRV  120
KWTDKMVRLL ITAVSYIGED IGSDCGGGGR RKFSALQKKG KWKSVSKVMA ERGYHVSPQQ  180
CEDKFNDLNK RYKKLNDMLG RGTSCQVVEN PTLLDVIDYL TEKEKDDVRK ILSSKHLFYE  240
EMCSYHNGNR LHLPHDPALQ HSLQEALRNR DDHDTDDLRR HHHDDLDEDD QDMETDERDD  300
FEENNASHGD NRGIFGGLGD SVKRLRQGQG REDFNFGGSL NAQDCNQSSY SHPQIAQGDL  360
NQVLPDSTKA AWLQKQWIES RSVQLEEQKL QIQVEMLELE KQRLKWQKFS KKRDRELEKL  420
KMENERMKLE NERMALELKR KEMGAGFS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1815GGRIPGGG
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21200.11e-166Trihelix family protein