PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Ro05_G01799
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rubus
Family Trihelix
Protein Properties Length: 769 aa    MW: 83629.6 Da    pI: 5.6314
Description Trihelix family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
Ro05_G01799      genome    GDR       View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  95.2   6.3e-30  93     177  1          87
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                  rW++qe+laL+++r+em+ ++r++ lk+plWe+vs+k++e g++r++k+Ckek+en++k+yk++keg+ +r++++s  +++f++lea
  Ro05_G01799  93 RWPRQETLALLRIRSEMDVAFRDATLKGPLWEDVSRKLAELGYKRNAKKCKEKFENVHKYYKRTKEGRAGRQDGKS--YKFFSELEA 177
                  8*********************************************************************866665..******985 PP

2    trihelix  106.8  1.5e-33  495    580  1          87
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                  rW+k evlaLi++r+ +e+r++++  k+plWee+s+ m++ g++r+pk+Ckekwen+nk++kk+ke++k r +e+ +tcpyf++l+a
  Ro05_G01799 495 RWPKAEVLALIKLRSGLETRYQEAGPKGPLWEEISAGMQRMGYKRNPKRCKEKWENINKYFKKVKESNKAR-PEDAKTCPYFHELDA 580
                  8********************************************************************98.99999********85 PP
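
The middle line of each alignment block can be summarized column by column. The sketch below assumes the usual HMMER-style convention, which this record does not state explicitly: a letter marks a residue identical to the HMM consensus, "+" marks a positive-scoring substitution, and a blank marks neither. The function name summarize_match_line is hypothetical, and the input is only the first ten columns of domain 1.

```python
def summarize_match_line(match: str) -> dict:
    """Count column types in an HMMER-style alignment match line."""
    identical = sum(ch.isalpha() for ch in match)  # letter: same as consensus
    positive = match.count("+")                    # '+': positive-scoring substitution
    other = len(match) - identical - positive      # blank or anything else
    return {"identical": identical, "positive": positive, "other": other}


# First ten alignment columns of domain 1 (query residues 93-102),
# copied from the three alignment lines above.
consensus = "rWtkqevlaL"
match = "rW++qe+laL"
query = "RWPRQETLAL"

counts = summarize_match_line(match)
print(counts)  # {'identical': 7, 'positive': 3, 'other': 0}
```

Passing the full match line of a domain gives the same breakdown over the whole hit, provided the leading indentation is trimmed so that the string spans only the aligned columns.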

Sequence
Protein Sequence    Length: 769 aa
MQQQHGSQSH YGGGGGVVSS AEAATAAPPS MAAEADQTHV VEEASPISSR PPAGAISAVN  60
LDELMTLSGA AADVAADQGG GSGGGGGGGG GNRWPRQETL ALLRIRSEMD VAFRDATLKG  120
PLWEDVSRKL AELGYKRNAK KCKEKFENVH KYYKRTKEGR AGRQDGKSYK FFSELEALHG  180
TATPNVSASP PVHVTTAAAT NNPASIGFGS ISNPMPISSF RMTPPTTTTI PVIPSQVAGT  240
IPIMPSSQQP STTAAPMDIN FSSNSSSSSH GDEDDYEDDD DVAGEPSSTS RKRKRGTSSS  300
RDSGGSTRRM MEFFEILMKQ VMQKQETMQQ RFLEVIEKRE QDRNIREEAW KRQEMARLTR  360
EHDLMTQERA ISASRDAAII AFLQKITGQT IQLPPPLNVH NAPPPPVPPS VSVHVTPVSA  420
PPPPPPPQQP VQLHQHQHQQ QTPIAQISRH HQIQPSITPN PTEVVMVVPE QQMAPPQENV  480
AGSGGGFEAT TSSSRWPKAE VLALIKLRSG LETRYQEAGP KGPLWEEISA GMQRMGYKRN  540
PKRCKEKWEN INKYFKKVKE SNKARPEDAK TCPYFHELDA LYRKRILGGG SSGGGGSSSS  600
LGNQNSQQQP PEHPKLDSAT QGTVAASVPA PQTQGTVAAT DQSVNKSGDQ STNLQKNLFG  660
DGPEEAAKKP EDIVKELMGQ QQHHHQEVLN HQGPQQLLVE DYDRVEEADS DINLDQEEDE  720
EDDEDEEDEE MDDESRKMDY KIEFQKQQNT GPSTNGGGNG ATSFLAMVQ
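
The sequence above is printed in ten-residue groups with a running position count at the end of each full line. As a minimal sketch, the block can be turned back into a plain string and checked against the length and domain coordinates reported in this record. The helper names parse_block_sequence and span are hypothetical. Coordinates are treated as 1-based and inclusive, which is an assumption that the checks below are consistent with.

```python
import re

# Sequence block copied verbatim from this record (groups of ten residues,
# position counter at the end of each full line).
RAW = """
MQQQHGSQSH YGGGGGVVSS AEAATAAPPS MAAEADQTHV VEEASPISSR PPAGAISAVN  60
LDELMTLSGA AADVAADQGG GSGGGGGGGG GNRWPRQETL ALLRIRSEMD VAFRDATLKG  120
PLWEDVSRKL AELGYKRNAK KCKEKFENVH KYYKRTKEGR AGRQDGKSYK FFSELEALHG  180
TATPNVSASP PVHVTTAAAT NNPASIGFGS ISNPMPISSF RMTPPTTTTI PVIPSQVAGT  240
IPIMPSSQQP STTAAPMDIN FSSNSSSSSH GDEDDYEDDD DVAGEPSSTS RKRKRGTSSS  300
RDSGGSTRRM MEFFEILMKQ VMQKQETMQQ RFLEVIEKRE QDRNIREEAW KRQEMARLTR  360
EHDLMTQERA ISASRDAAII AFLQKITGQT IQLPPPLNVH NAPPPPVPPS VSVHVTPVSA  420
PPPPPPPQQP VQLHQHQHQQ QTPIAQISRH HQIQPSITPN PTEVVMVVPE QQMAPPQENV  480
AGSGGGFEAT TSSSRWPKAE VLALIKLRSG LETRYQEAGP KGPLWEEISA GMQRMGYKRN  540
PKRCKEKWEN INKYFKKVKE SNKARPEDAK TCPYFHELDA LYRKRILGGG SSGGGGSSSS  600
LGNQNSQQQP PEHPKLDSAT QGTVAASVPA PQTQGTVAAT DQSVNKSGDQ STNLQKNLFG  660
DGPEEAAKKP EDIVKELMGQ QQHHHQEVLN HQGPQQLLVE DYDRVEEADS DINLDQEEDE  720
EDDEDEEDEE MDDESRKMDY KIEFQKQQNT GPSTNGGGNG ATSFLAMVQ
"""


def parse_block_sequence(text: str) -> str:
    """Keep residue letters only; drop spaces, newlines and position counters."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


def span(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    return sequence[start - 1:end]


seq = parse_block_sequence(RAW)

print(len(seq))            # 769
print(span(seq, 93, 100))  # RWPRQETL, start of trihelix domain 1
print(span(seq, 495, 502)) # RWPKAEVL, start of trihelix domain 2
```

The two printed fragments match the first residues of the query lines in the domain alignments, which supports reading the Start and End columns as 1-based inclusive positions.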
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    136    144  KRNAKKCKE
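
The NLS coordinates can be checked against the protein sequence without loading the whole record. The sketch below uses only the sequence line covering residues 121-180, with spaces removed. The helper name slice_1based is hypothetical, and the coordinates are assumed to be 1-based and inclusive.

```python
# Residues 121-180 of Ro05_G01799, copied from the Protein Sequence block.
WINDOW_START = 121
WINDOW = "PLWEDVSRKLAELGYKRNAKKCKEKFENVHKYYKRTKEGRAGRQDGKSYKFFSELEALHG"


def slice_1based(window: str, window_start: int, start: int, end: int) -> str:
    """Extract residues start..end (1-based, inclusive) from a sequence window
    whose first residue sits at position window_start in the full protein."""
    lo = start - window_start
    hi = end - window_start + 1
    if lo < 0 or hi > len(window) or lo >= hi:
        raise ValueError("requested range lies outside the window")
    return window[lo:hi]


nls = slice_1based(WINDOW, WINDOW_START, 136, 144)
print(nls)  # KRNAKKCKE

# Count the basic residues (lysine and arginine) in the motif.
print(sum(res in "KR" for res in nls))  # 5
```

Positions 136-144 fall inside the first trihelix domain (93-177), so the predicted signal overlaps that domain.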
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G33240.1  8e-52    Trihelix family protein