PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_022158201.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Momordiceae; Momordica
Family Trihelix
Protein Properties Length: 643 aa    MW: 73413.2 Da    pI: 6.7421
Description Trihelix family protein
Gene Model
Gene Model ID     Type     Source   Coding Sequence
XP_022158201.1    genome   NCBI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  47.2   5.8e-15  123    183  2          66
        trihelix   2 WtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkike 66 
                     W+++e laL+++r++m++ + ++    + We+vs+k+ e gf+r++ +Ckek+e+ ++++  i+ 
  XP_022158201.1 123 WSNDELLALLRIRSNMDNCFPES----TNWEHVSRKLGEVGFRRTADKCKEKFEEESRYFNHINY 183
                     *********************99....9********************************98875 PP

2    trihelix  91     1.3e-28  502    604  1          86
        trihelix   1 rWtkqevlaLiearr............emeerlrrgk......lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsesss 77 
                     rW+++evlaL+++r             + +++ ++g+      +k+plWe++s+ m + g++rs+k+Ckekwen+nk+++k+k+ +kkr s +s+
  XP_022158201.1 502 RWPRDEVLALVNVRCslynngvcggggDQDQSGGSGSgeqgasSKAPLWERISQGMLQLGYKRSAKRCKEKWENINKYFRKTKDVNKKR-SLDSR 595
                     8**************7888888888775555555432445555*********************************************8.9999* PP

        trihelix  78 tcpyfdqle 86 
                     tcpyf+ql 
  XP_022158201.1 596 TCPYFHQLS 604
                     *******96 PP
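
The Start and End columns for each domain refer to positions in the protein sequence, while the aligned query row may contain gap characters (`-`). As a minimal sketch, assuming the domain 1 query row and the matching stretch of the protein sequence are copied by hand from this record, the following Python checks that the ungapped alignment row equals residues 123 to 183. The variable names are illustrative and not part of PlantTFDB.

```python
# Aligned query row for domain 1, copied from the alignment above.
# '-' marks gap positions relative to the trihelix HMM.
ALIGNED = "WSNDELLALLRIRSNMDNCFPES----TNWEHVSRKLGEVGFRRTADKCKEKFEEESRYFNHINY"
START, END = 123, 183  # 1-based, inclusive, from the domain table

# Residues 121-190 of the protein sequence from this record
# (the row ending at 180 plus the first group of the next row).
SEQ_121_190 = (
    "HHWSNDELLALLRIRSNMDNCFPESTNWEHVSRKLGEVGFRRTADKCKEKFEEESRYFNH"
    "INYNKTCRFL"
)
OFFSET = 121  # position of the first residue in SEQ_121_190

# Drop the gap characters to recover the plain query residues.
ungapped = ALIGNED.replace("-", "")

# Convert 1-based inclusive coordinates to a Python slice.
expected = SEQ_121_190[START - OFFSET:END - OFFSET + 1]

print(len(ungapped), ungapped == expected)
```

The same check applies to domain 2 once its two alignment blocks (502 to 595 and 596 to 604) are concatenated.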

Sequence
Protein Sequence    Length: 643 aa
MFEGSVSEQL HQFLTPRTTT TPTPNSSSLP LPLNFAPNLI NFHHPFFDSY NITNPLHPHL  60
LHSPNPPHQN NGSGHDEEKP DATPTTTATE LHAVAMDLEA GRDNNNNTNR SILMDDHHQQ  120
HHWSNDELLA LLRIRSNMDN CFPESTNWEH VSRKLGEVGF RRTADKCKEK FEEESRYFNH  180
INYNKTCRFL THELNYPPPH HHQDQDRDDQ DHDHHHLLFP GGDEKPDDPS VVVGAPEVEE  240
GEEENGANFR DRDEEGEDMK IESTAMRSRK KRQNRKRRRM MRQKEFEILK EYCEEIVKKM  300
MVQQEEIHSK LLHDMLKREE EKVAKEESWK KQQMERLHRE LEVMAHEQAM AGDRQATIIE  360
ILNQITNSTP FSSSQAKKEL QNLILSLNNN NTNNDNNATN YNNSPSSSSL IQTQTNSSPN  420
KNQEVIANLA SSSSLMAALP PHENSSSFTS QTDPKNPKNP YSLTKILAPQ DPNSHPPPAS  480
LQKLPQNPKT RDHKELDDLG KRWPRDEVLA LVNVRCSLYN NGVCGGGGDQ DQSGGSGSGE  540
QGASSKAPLW ERISQGMLQL GYKRSAKRCK EKWENINKYF RKTKDVNKKR SLDSRTCPYF  600
HQLSNLYNQG GGIKRLENCP AVSPENHSDH SENLPNSSQR IAC
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    270    279  KKRQNRKRRR
2    275    279  RKRRR
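
The NLS coordinates are 1-based and inclusive, and they index into the protein sequence block shown earlier, where each row ends with the position of its last residue. As a rough sketch, assuming a single row of that block is copied verbatim, the Python below recovers both NLS segments from their Start and End values. The helper names `parse_row` and `slice_1based` are hypothetical and only serve this illustration.

```python
# One row of the protein sequence block (residues 241-300); the trailing
# number is the position of the last residue on the row.
ROW = "GEEENGANFR DRDEEGEDMK IESTAMRSRK KRQNRKRRRM MRQKEFEILK EYCEEIVKKM  300"


def parse_row(row):
    """Return (first_position, residues) for one numbered sequence row."""
    *groups, end = row.split()
    residues = "".join(groups)
    return int(end) - len(residues) + 1, residues


def slice_1based(row_start, residues, start, end):
    """Extract residues start..end (1-based, inclusive) from a parsed row."""
    return residues[start - row_start:end - row_start + 1]


row_start, residues = parse_row(ROW)
nls1 = slice_1based(row_start, residues, 270, 279)
nls2 = slice_1based(row_start, residues, 275, 279)
print(row_start, nls1, nls2)
```

Both predicted signals fall on the same row, and the second lies entirely within the first.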
Best hit in Arabidopsis thaliana
Hit ID        E-value  Description
AT5G28300.1   1e-46    Trihelix family protein