PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG59021.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 980aa    MW: 106465 Da    PI: 8.5242
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG59021.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix38.62.7e-12310384269
    trihelix   2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegek 69 
                 W+ +  l+L++ +r        ++++++r + k+ +We+++k+m+  g+ r++  C +kw+nl + ykki++ ++
  GBG59021.1 310 WSTDDQLLLVRCKReqdmhlvGLGHNYGRMRTKEWKWEDIAKRMANVGCPREADDCMKKWDNLFQNYKKIQRFQN 384
                 8888888999999955555555566667889**************************************987665 PP

Sequence ? help Back to Top
Protein Sequence    Length: 980 aa     Download sequence    
MADVLRRLPL LVLVVVVFLF VVFLHVCLSC RPWNTRRNRG ASTRRPRTDR PMANMQAGAP  60
QVAGGSASKE GGEFTSLLEA GLDHDDDGEV DLRFGLSSGS AREASRTFII EDQPSPRSLQ  120
RPRGEHTEQS TLRGGASLTA RAGPSSTTRQ SGASTSSVDP LRKTFPARSG VSAAAARISG  180
VAAAPARNSV GRSNTPNPAP TPGDELRGDP ACRPVVRSQP TVENITKGVS NMRAHNDGVD  240
ENGGAGEDVD DGYREDVEAV DDDGDAPIRP LGKVGGRGRG RGRGGGRGRS ARRGSRVADV  300
DDGGKSAAYW STDDQLLLVR CKREQDMHLV GLGHNYGRMR TKEWKWEDIA KRMANVGCPR  360
EADDCMKKWD NLFQNYKKIQ RFQNASGEAD FFRLSNEERR DHNFKFRMER ALYNEIHGGM  420
LGNHTIFHPN IADTGNPDDV QLPRRGAGGG ESVGSEAVGD GWPEERSSPR EFDSNVGSGA  480
RSMKRKNTRQ QALESISTAM IFIHCWGRHT TRTKRSRQDA ISVSGAHLDV DGWGRPNDGE  540
GENETGYPAM EEVARMEHDQ VTPVTTPQGR HREDAGTGTS AATPRVEKAL GERGSGGAAT  600
PCVQQAVGDR GSAGVVGDAA MLGDQAQVCG AAAVVGEAAG TPRETGGGGR KAEEAARTLE  660
IPRAKKRKAA EDDEPLVNRV RKGGVVKELA NRAKLWVDDK LFWTSGKGRR LYNIVNDARE  720
YLVAIAKGVP LPKGPRRVAL PRSNVTVTRL TDSAQLQGAT NRASKCQNVV MRVLHGWVFK  780
SGSRERGYTL AFQYTNRWPR TSRAYIEDRP RDDDMAAYQE ATVKRLIAAF SVAVEIGNAM  840
DGGCISHDRL SRVAESFKVL LTVAMWIMRL AGDDPRSHYE AFYFCNLVAK PVMVASMHRI  900
FDHRRHTLAA ANAATERLGK AQLTLGHAPN FIPDWAKCGV KFGHDASLSG PDEARGLEWL  960
GTGPPTCEDD DDDDDDAEDS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1279288RGRGRGGGRG
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.12e-09Trihelix family protein