PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Ro02_G04547
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rubus
Family Trihelix
Protein Properties Length: 555aa    MW: 63532.8 Da    PI: 6.1871
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Ro02_G04547genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix994e-31363447186
     trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                  rW+++ev  Li +r+++e+++++  lk+plWe+vs+ m++ g++rs+k+Ckekwen+nk+++k+k+++kkr +++s+tc+yf+ql+
  Ro02_G04547 363 RWPQSEVESLILLRSKIESKFQEPGLKGPLWEQVSASMASLGYQRSAKRCKEKWENINKYFRKTKDNAKKR-PQQSKTCSYFNQLN 447
                  8********************************************************************98.99999*******97 PP

Sequence ? help Back to Top
Protein Sequence    Length: 555 aa     Download sequence    
MQSNFEIGEI SNQQHFMDSN IGSPVLPIFF NPQKQNNFQQ QLQPQHHLSH LVHPIPITHE  60
LFQQFQEQQS GMEHWQMGPV NFKLGLNENS GTGEVALDGG DHLLHENNPI FLQPRPQNLG  120
FKSWQSQEFG ARKEPFWKPH ETEGIKQKQT IEGEICKELE SKYRLYAFQR LVTKQTGSGS  180
ALTNENSPKD VDLPMPFGDS HGLNVGPAAT VGVDNVSEAS IEEEISYRKV QKRKRKRRMK  240
EQLSSSTTRF FEGLVKRVMD HQESLHKKYL ALIEKMEKER RDREASWRCQ EAEKHQREAL  300
AKLHEQALAS NREALIVSYI EKITGQKVNL PSRETPFLLQ CDNSNEAMEE ELAPIRVDHT  360
NSRWPQSEVE SLILLRSKIE SKFQEPGLKG PLWEQVSASM ASLGYQRSAK RCKEKWENIN  420
KYFRKTKDNA KKRPQQSKTC SYFNQLNQLY SRAPITNPSS SSYCSSNPSV STGRQGYSEL  480
LEAVIGGRDT GVSQNLSSGN FELIYEMGSN RLDFDGITNG KVEHLQEDRG KVKENQEDDE  540
SMEEEQDTDG DDSDE
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1232238KRKRKRR
2232239KRKRKRRM
3233238RKRKRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.21e-49Trihelix family protein