PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GAY57194.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Rutaceae; Aurantioideae; Citrus
Family Trihelix
Protein Properties Length: 791aa    MW: 86292.5 Da    PI: 6.0733
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GAY57194.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix95.74.4e-3091175187
    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                 rW+ qe+laL+++r++m+ ++r++  k+plWe+vs+k++e g++rs+k+Ckek+en++k+yk++keg+ +r++++s  +++f+qlea
  GAY57194.1  91 RWPSQETLALLKIRSDMDAAFRDATVKGPLWEDVSRKLAELGYKRSAKKCKEKFENVHKYYKRTKEGRAGRQDGKS--YKFFSQLEA 175
                 8*********************************************************************866665..*******85 PP

2trihelix103.61.4e-32519604187
    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                 rW+k evlaLi++r+ +e+r++++  k+plWee+s  m++ g++r++k+Ckekwen+nk++kk+ke++k+r +e+ +tcpyf++l+a
  GAY57194.1 519 RWPKVEVLALIKLRSGLEHRYQEAGPKGPLWEEISVGMQRMGYNRNAKRCKEKWENINKYFKKVKESNKRR-PEDAKTCPYFHELDA 604
                 8********************************************************************97.99999********85 PP

Sequence ? help Back to Top
Protein Sequence    Length: 791 aa     Download sequence    
MQQGGGGGGD GEGGGGGGGT QSQYGEMATT STATAAAAAA HIDMGQQPVE AASPISSRPP  60
ASASNLDELM RLSGGDDDEG DRGGGVSSGN RWPSQETLAL LKIRSDMDAA FRDATVKGPL  120
WEDVSRKLAE LGYKRSAKKC KEKFENVHKY YKRTKEGRAG RQDGKSYKFF SQLEALYSSP  180
TSTSTSTATT SNVSASLPKP VTTVADTSTL DVAPVSVGIP MPISSSVRIP TSPITLTCFP  240
YHDLRSTLIP APSSAVNVPG SVTTPVPPTA TTSTTPVGIS FSSKSSSSPE TEDDDDDVMD  300
FEGQPSNTAG TSNRKRKRQT SSSHRMMAFF EGLMKQVMQK QEAMQQSFLE VIEKRERDRM  360
IREEAWKRQE MSRLAREHEL MAQERAISAS RDASIINFLQ KITGQTIQLP PAITIPAAPP  420
PPPPQPQSPS QAVPVATNTT QSHHMPPPER RDIQQHHHRH QQIQSSAAEA VTARHQQPSG  480
TVSTSIPSQV VMAVPEQQVP PSDHQEIGSG GNLEPASSRW PKVEVLALIK LRSGLEHRYQ  540
EAGPKGPLWE EISVGMQRMG YNRNAKRCKE KWENINKYFK KVKESNKRRP EDAKTCPYFH  600
ELDALYRKKI IGVGGTSTSS QNRPEERHQS QSEQQHQQEN VNPVTNPQES SINVLPAPLL  660
ITQAHSDSEN KNGNAQASNV GVTGSLFGEG NLGASKKPED IVKELMNQQG TQQKQQQPQA  720
SIVDDQFDKV EESNMGSESD NMEYEEEDER EDDEESEEDS NKMANYKVEF QRQNTSNGGG  780
NGAPSFLAMV Q
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1134142KRSAKKCKE
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76880.14e-49Trihelix family protein