PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID RcHm_v2.0_Chr7g0188161
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family Trihelix
Protein Properties Length: 761aa    MW: 82167.1 Da    PI: 5.9492
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
RcHm_v2.0_Chr7g0188161genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix95.16.8e-3092176187
                trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                             rW++qe+laL+++r+em+ ++r++ lk+plWe+vs+k++e g++r++k+Ckek+en++k+yk++keg+ +r++++s  +++f++lea
  RcHm_v2.0_Chr7g0188161  92 RWPRQETLALLKIRSEMDVAFRDATLKGPLWEDVSRKLAELGYKRNAKKCKEKFENVHKYYKRTKEGRAGRQDGKS--YKFFSELEA 176
                             8*********************************************************************866665..******985 PP

2trihelix106.81.4e-33491576187
                trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                             rW+k evlaLi++r+ +e+r++++  k+plWee+s+ m++ g++r+pk+Ckekwen+nk++kk+ke++k r +e+ +tcpyf++l+a
  RcHm_v2.0_Chr7g0188161 491 RWPKAEVLALIKLRSGLETRYQEAGPKGPLWEEISAGMQRMGYKRNPKRCKEKWENINKYFKKVKESNKAR-PEDAKTCPYFHELDA 576
                             8********************************************************************98.99999********85 PP

Sequence ? help Back to Top
Protein Sequence    Length: 761 aa     Download sequence    
MQGGQSHYGA SAPAPAPAAE GAAATTSMAA EAAAADQTHV VEEASPISSR PPAGGAISAV  60
NLDELMTLSG AAADVAADQG GGGGGGSGGG NRWPRQETLA LLKIRSEMDV AFRDATLKGP  120
LWEDVSRKLA ELGYKRNAKK CKEKFENVHK YYKRTKEGRA GRQDGKSYKF FSELEALHGT  180
TANVSASPPV HVANTPVSIG FGSISSPMPI SSFRMTSGSG TTSTIPGMTS QGGGAGAGAI  240
PVMPSSQPPP PPAPPMDMNF SSNSSSSSSH DEYEDDDEVA GEPSTNTSRK RKRGLGTRSS  300
SSREGGGSTR RMMEFFEILM KQVMQKQETM QQRFLEVIEK REQDRNIRED AWKRQEMARL  360
TREHELMTQE RAISASRDAA IIAFLQKITG QTIQLPPPLN VHNAPPPPVP PSVSSVHVTP  420
VSAPPPQPSV QQFHPHATPT AQVSRHQIQT AIPPHPQTEV VMAVPEQQVA PPQENVVGSG  480
GGFEATTSSS RWPKAEVLAL IKLRSGLETR YQEAGPKGPL WEEISAGMQR MGYKRNPKRC  540
KEKWENINKY FKKVKESNKA RPEDAKTCPY FHELDALYRK RVLGGGPSGG GSSSSLGNQN  600
IQQQPPATQG TVAASVPAPQ TQGTVAATDQ SGNKNGDHSP NLQKNLFGDA PEEAAKKPED  660
IVKELMGQQQ QQHHHHPQQL LNQQGVEQQL VVEDYDRVEE ADSDINLDQD EEEDEDDEED  720
EEMDEESRKM DYKIEFQKQQ NTGPSSNGGG NGAPSFLAMV Q
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1135143KRNAKKCKE
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G33240.19e-49Trihelix family protein