PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG64179.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1099 aa    MW: 120517 Da    pI: 8.909
Description Trihelix family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
GBG64179.1     genome  NCBI    View CDS
Signature Domain
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  36.7   1.1e-11  317    392  2          70
    trihelix   2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekk 70 
                 W+   + aL++arr        m++++ r k ++ +We+v +++++ g++r ++ C +kw+nl +++kk+ + ++ 
  GBG64179.1 317 WSVGDTIALVRARRdqdlyiaGMGTSFARMKTREWKWEDVRARLQSMGVTRDAVDCGKKWDNLMQQFKKVHKFQNL 392
                 899999********9999999999*********************************************9987765 PP

Sequence
Protein Sequence    Length: 1099 aa
MDRRLGNGRP AGTTSGASNR QHSAGSVAKR PYDPRLYAGL PSHEIPLPLS DDEGGDARSS  60
TLPLGSGSTQ EWAATESCRG GRVETPWTYT SLLNEGXCDD DDNAAVDLSF QLSSSSGAAA  120
THTRIINPHP GVDCADNTHG GVCGPRDGGL PQSLREGGGK RNESTSSIGG GVGARERPEW  180
MRLSPSLRSG SGAPCARQWP EVLHQEGADV QRDGRQLCVE CRQALHQRGT ETITRGVQRL  240
HVDEGDEAAA GEARGCDDVD CDDDCNSDDL PDIRPLGRKA TKGGATARQV PAAKSRRNKK  300
MDDDTGRSDG EGGRNFWSVG DTIALVRARR DQDLYIAGMG TSFARMKTRE WKWEDVRARL  360
QSMGVTRDAV DCGKKWDNLM QQFKKVHKFQ NLSDGKDYFK LASKERRSEG FSFVMDWSMP  420
AAAGAAGDTM GSEGGGDAAD EEQGSTRDST FSAGSGGGYG KRKNMRQQTF EAVADVMEKH  480
GALMANTMDS ASKRQCSMMS RQCEILESEV EVQRKHYAAA NEANRMMCQA LMEIARNRRR  540
GPSVCSTLSV HRPVVHHKGL PLAPSPALKV CTHGAQRRHP GRTPSSFSPR RRSTKGPRSR  600
SFLRSRRRLG CLRRRRFFAV DLFDLLLAAE PSCRGSASRL QRSAVHQSPR SSSRRGAVYA  660
AYVSFTRGTS SVSSSIVALD PVALLLAVDP SSRGYVSPLR RSVVAYSRSS SSDRGEVSAW  720
FISFSRSRRR GRTVRGEGED ELLPRAAKAL VRRGREERRS HKRSLTEAGA NEEEDDDVFT  780
TEEEAAEENV TATRGSTLQR SGDQSGARRL ATPPPEAQQV RAHNTQKAKE VVVDVGGEDD  840
EPLESRRQRN VTQAAGSSGN VGVVARAREE VPVVEREATR IDNKGEREDE DPLLSRVRRG  900
GLARDVADRA RLWVDDKAFW TTGEGRRLYN IVHETREYFV TIASGMQTPP LPRSVVMPKS  960
STTVTRIADP AQLQRAIARA TTAENIALRV LHRWVFKSEN RPRGFNVAFQ YALESVATDI  1020
ARIMWNGEEW SNVVSAPVCT HTIDLNMDMP LWFAGTNIED RPEDDDMAAH QESTVRFAPR  1080
CKLVASSTVV SSRTSVCLE
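
The listing above is laid out in blocks of 10 residues with a running position counter at the end of each full line. A minimal sketch of turning that layout back into one contiguous string is shown below; the helper name `parse_blocked_sequence` is hypothetical, and the layout assumptions (space-separated blocks, trailing integer counter) are read off this record rather than taken from any documented PlantTFDB format. Only the last two lines of the listing are used, which is enough to check the stated length of 1099 aa.

```python
import re


def parse_blocked_sequence(text: str) -> str:
    """Join a blocked sequence listing into one contiguous string.

    Assumes (hypothetically) that each line holds space-separated
    residue blocks, optionally followed by a running position counter.
    """
    residues = []
    for line in text.splitlines():
        # Drop the trailing position counter, if the line has one.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces that separate the residue blocks.
        residues.append("".join(line.split()))
    return "".join(residues)


# Last two lines of the listing above, copied verbatim.
tail = """ARIMWNGEEW SNVVSAPVCT HTIDLNMDMP LWFAGTNIED RPEDDDMAAH QESTVRFAPR  1080
CKLVASSTVV SSRTSVCLE"""

tail_seq = parse_blocked_sequence(tail)

# The first of the two lines ends at residue 1080 and carries 60 residues,
# so the total length is 1080 plus the residues on the final line.
total_length = 1080 + (len(tail_seq) - 60)
print(total_length)
```

The same helper could be applied to the full listing, in which case `len()` of the result gives the length directly.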
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    590    609  RRRSTKGPRSRSFLRSRRRL
2    591    610  RRRSTKGPRSRSFLRSRRRL
3    758    764  RRSHKRS
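
The Start and End columns appear to be 1-based, inclusive positions in the protein sequence. That reading is an assumption, but it is consistent with this record: the sketch below recovers the third NLS from the sequence line that covers residues 721 to 780. The helper `slice_region` and its parameter names are hypothetical.

```python
def slice_region(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive).

    `fragment` is a piece of the protein whose first residue sits at
    position `fragment_start` in the full sequence.
    """
    if start > end:
        raise ValueError("start must not exceed end")
    lo = start - fragment_start      # 0-based index of the first residue
    hi = end - fragment_start + 1    # exclusive upper bound for slicing
    if lo < 0 or hi > len(fragment):
        raise ValueError("region lies outside the fragment")
    return fragment[lo:hi]


# Residues 721-780 of GBG64179.1, copied from the sequence listing
# with the block spaces removed.
fragment = "FISFSRSRRRGRTVRGEGEDELLPRAAKALVRRGREERRSHKRSLTEAGANEEEDDDVFT"

nls3 = slice_region(fragment, fragment_start=721, start=758, end=764)
print(nls3)  # RRSHKRS
```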
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G35640.1  9e-07    Trihelix family protein