PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG76344.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1081 aa    MW: 117271 Da    pI: 6.4606
Description Trihelix family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
GBG76344.1       genome    NCBI      View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      trihelix    39.3     1.7e-12    376      447    2            66
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkike 66 
                 W+ +++ aLi+a+r+ + ++r       r k ++ +W++v++ ++  g+ r++++C +kw+nl + +kk+ +
  GBG76344.1 376 WSVEHIIALIRAKRDQDAHMRgmghayaRMKPREWKWQDVAQGLKNVGLYRNAEKCGKKWDNLMQPFKKVHH 447
                 9*************7777777444433366899************************************976 PP

Sequence
Protein Sequence    Length: 1081 aa
MADVFRRLPL LLLVVLLLLL VVLFRVCILG NVPPRRRKTR SSMKDARMEA RQAIPRCGGE  60
QVGGRRCGGS LSTIGRASQR VGCDALPPHL QPLPGSSDEE EEVERRPQTV SLGSGSTQEW  120
SATELCGTGG GVYEQSFTEL LRPGLGEDEG DGRVNMSFGL STGRSTTPSR TVLVRPHPGD  180
EGGQLTVVDR SARTRALASE TVGANRNSST AQPRAASLSK GAPGRPEWMQ LPSPLSAASE  240
VARGRGIGVD GGTDFLDVGD GRDGREVWRD LPRDHRLRRE EYITRGVEHL NVGDRENENE  300
TDDPPAEADD DDEGNDVECG EGGGGQASPS LQTDMAGKGG KSKPSGRNAH SRAKKGQGKG  360
SGGEGDGDAE EKRNFWSVEH IIALIRAKRD QDAHMRGMGH AYARMKPREW KWQDVAQGLK  420
NVGLYRNAEK CGKKWDNLMQ PFKKVHHFQS PSGGADFFQL TTKERASRGF NLTMDRAVYD  480
EIEGSTGMNH TIHPRNLADT GASGGVRPPL TSYVDLDSVA DGEGGAGRED DEEGSTRGSS  540
QTTGTPGGSG KRKNTRQQTF KALTECMEKH GELMASTMES ASKRQCSIQV RQCEALEAEV  600
EVQRKHYVPR IAGNSEGHSG RDVVEVAVVE EEMTNDDDDF EDDDNEPLLR KARVGSARGI  660
RINEGGEGTP TARRGGGVAA ANQPVFVDVA RDVGRDVARR DVAGRAKEGA ASHETSQCVL  720
APVNRPRTPA ADVAGGSQAA VEGGTSRSPA VAARGGAVAI PGEAVDVPKG GDGAAGGEDD  780
EALVHRLRGQ RAATHAMDAA AKLWEDDNRF WNDTQGSVVV RIIQEARAYL VAVARGVQPP  840
AIRRSISLPH NSIPQHKIED ESELNAAKER ALKVQTISLR AIHSWVFKSE SRERGYHMAY  900
QYALNHAATD IARAMWSPED WSSLVSPMLF CTTLNADMKL PLWFVGVNIV DRHEDDECAA  960
YQEACVQRLV RDFTSTVGTA EAMDGGRVSY ERLKGMAEVM TYLLAVTMWI MRMAGDDPRS  1020
HYDAWVFVQL TAKTTLLASM NRQFDARRHI TQSAQVMTDK LGRPPPTFAP QPVNIPDWAS  1080
K
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      34       38     PRRRK
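The NLS coordinates can be checked against the protein sequence listed in this record, where positions are counted from 1 and the end position is inclusive. The following is a minimal Python sketch, not part of PlantTFDB itself: the helper names clean_sequence and find_motif are hypothetical, and the input is only the first 60-residue line of the sequence block as printed above.

```python
import re


def clean_sequence(block: str) -> str:
    """Drop spaces, line breaks, and position counters from a sequence block
    that is printed in groups of 10 residues, leaving only the letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def find_motif(sequence: str, motif: str) -> list[tuple[int, int]]:
    """Return 1-based, inclusive (start, end) pairs for every occurrence
    of motif in sequence, including overlapping occurrences."""
    hits = []
    pos = sequence.find(motif)
    while pos != -1:
        hits.append((pos + 1, pos + len(motif)))
        pos = sequence.find(motif, pos + 1)
    return hits


# First line of the protein sequence as shown in this record,
# including the trailing position counter (60).
first_line = (
    "MADVFRRLPL LLLVVLLLLL VVLFRVCILG NVPPRRRKTR SSMKDARMEA RQAIPRCGGE  60"
)

seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(find_motif(seq, "PRRRK"))   # [(34, 38)]
```

Applying clean_sequence to the whole sequence block in the same way should give a string whose length can be compared with the 1081 aa stated in the record.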
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT2G35640.1    8e-11      Trihelix family protein