PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GAY44682.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Rutaceae; Aurantioideae; Citrus
Family Trihelix
Protein Properties Length: 688aa    MW: 78727.8 Da    PI: 9.605
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GAY44682.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix41.43.6e-13191253269
    trihelix   2 WtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegek 69 
                 W+ +e  aL+++r+++e+++ +       We+vs++++e g++rs+++Ckek+e+ ++ + +i+ +++
  GAY44682.1 191 WSLDELIALLRIRSSLENWFPEL-----TWEHVSRRLEELGYKRSAEKCKEKFEEESRNFSNINYNKN 253
                 ********************987.....9*******************************99987766 PP

2trihelix91.77.8e-29540627186
    trihelix   1 rWtkqevlaLiearremeerlrrgk...lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                 rW+++ev aLi++r ++ ++ ++++   ++ plWe++s+ m+e g++rs+k+Ckekwen+nk+++k+k+ +kkr s +s+tcpyf+ql 
  GAY44682.1 540 RWPRDEVFALINLRCNLYNNGEDKEgaaSRVPLWERISQGMSELGYKRSAKRCKEKWENINKYFRKTKDANKKR-SIDSRTCPYFHQLS 627
                 8**************999999985333499*******************************************8.78889*******95 PP

Sequence ? help Back to Top
Protein Sequence    Length: 688 aa     Download sequence    
MPTNNFKRCD VTLLSSNGKG RPHWVPQKLM IRRLYNASLL APQKHQRGVR KKRKAKEKKR  60
GKLIKLVHKI NTMFDEVPAD QLHQFIAAAS RASVVPFPLS FSTSSSSSSL HLHHHHHHHH  120
HGSSPFPASF DPFNSTTTSQ QVQQPHQLLH SLQQQKINEE KEENRSSSSS LVGMNLEVVQ  180
REGLIQKDPA WSLDELIALL RIRSSLENWF PELTWEHVSR RLEELGYKRS AEKCKEKFEE  240
ESRNFSNINY NKNYRTLFSE FDDEEQELYH RGQNSPHGHN VADEANEKLD QQQPREEEEE  300
VEEGDKIMEQ EAEHPRNDQT VANKSSCYHH QDKEKLSKGK KKRKFREKNF ERFKGFCEDI  360
VKKLMAQQEE MHNKLIEDLV KRDEEKVARE EAWKKQQIDR FNKELEIRAS EQAITSNRQA  420
TIIKFLTRFS SSSSSSSSST SEESGVNKHK VPNYSIPNPL TTSSSLILAQ NPNQTQNPRS  480
NLAPTSVPKK QTSSTIAISP QNPSSAATQN KPLAPTSTPI QNSDSQKLIT SDGKDDIGKR  540
WPRDEVFALI NLRCNLYNNG EDKEGAASRV PLWERISQGM SELGYKRSAK RCKEKWENIN  600
KYFRKTKDAN KKRSIDSRTC PYFHQLSTLY NQGTLVAPSD GTENRPAALP ENHHSSSQGG  660
NSSTHSTMPV AQGEKNLVQV SPALDFEF
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
14654QRGVRKKRK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G28300.16e-94Trihelix family protein