PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID OMO95153
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus
Family Trihelix
Protein Properties Length: 705aa    MW: 75958.7 Da    PI: 8.4254
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
OMO95153genomeEnsemblPlantsView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix96.42.7e-3090174187
  trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
               rW++qe+laL+++r++m+ ++r++  k+plWe+vs+k++e g++rs+k+Ckek+en++k+yk++keg+ +r++++s  +++f++lea
  OMO95153  90 RWPRQETLALLKIRSDMDATFRDATVKGPLWEDVSRKLAELGYKRSAKKCKEKFENVHKYYKRTKEGRAGRQDGKS--YKFFSELEA 174
               8*********************************************************************866665..******985 PP

2trihelix110.21.3e-34533618187
  trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
               rW+k evlaLi++r+ +e+r++++  k+plWee+s+ m++ g++r+pk+Ckekwen+nk++kk+ke++kkr +e+ +tcpyf+ql+a
  OMO95153 533 RWPKAEVLALINLRSGLESRYQEAGPKGPLWEEISAGMARMGYKRNPKRCKEKWENINKYFKKVKESNKKR-PEDAKTCPYFHQLDA 618
               8*********************************************************************8.99999********85 PP

Sequence ? help Back to Top
Protein Sequence    Length: 705 aa     Download sequence    
MQQGGGGGGG AHQSQYGEMG APPPVTTGVS SSHMESEQLV EAASPISSRP PATGSLDEFM  60
RLASAGGDDG GDDGDRTGGG GGGGVASGNR WPRQETLALL KIRSDMDATF RDATVKGPLW  120
EDVSRKLAEL GYKRSAKKCK EKFENVHKYY KRTKEGRAGR QDGKSYKFFS ELEALHTTSA  180
TAGANVSTPV TPVTAAAAAS LDVAPVSVGI PMPISSVRIN PPVSTIPMSS SILPLPGSTA  240
PPPMSAPAPV PPPATVLQTP ITTATAATFG IRFSSDSSSS SQGFEDDDDD DDDDEGIGGE  300
PSSVAGTSRK RKRQASRGGG GGTTRRMMEF FEGLMKQVMQ KQEAMQQRFL EAIEKREQDR  360
MIREEAWKRQ EMARLTREHE LMAQERAIAA SRDSAIISFL QKITGQTIHL PTTVTIPTAP  420
PPPTQPTVTV VPPAAPISKA AAPPLQHHPP QQHQQQQQQR NSQQQQQSQS RNQHQPPPPP  480
HPQQQQQTQP QNTEVVRYQQ QQQPISSEIV MAIPEQQVPL QEIGGSGEPS SSRWPKAEVL  540
ALINLRSGLE SRYQEAGPKG PLWEEISAGM ARMGYKRNPK RCKEKWENIN KYFKKVKESN  600
KKRPEDAKTC PYFHQLDALY KRKILGGGTS GGSSSFSDQN RAEEETSTQQ HQDTSDAPPP  660
VTAAPQLMQQ PTDHSANKTG PTAEEGLPGS LFGEGNGGAA KKAVS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1133141KRSAKKCKE
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G33240.17e-53Trihelix family protein