PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG78156.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 891aa    MW: 98900 Da    PI: 9.2368
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG78156.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix34.55.3e-11101175269
    trihelix   2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegek 69 
                 W+   + aL++arr        m++++ r k ++ +We+v +++++ g++r  + C +kw+nl +++kk+ + ++
  GBG78156.1 101 WSVGDTIALVRARRdqdlyfaGMGTSFARMKTREWKWEDVRARLQSMGVTRDVVDCGKKWDNLMQQFKKVHKFQN 175
                 899999********999988889999999****************************************987665 PP

Sequence ? help Back to Top
Protein Sequence    Length: 891 aa     Download sequence    
MWAECRQALH QRGTETITRG VQRLHEDEGD EAAVEEAQGC DDGDGDDDCN SDDLPDIRPL  60
GRRATKGGAT ARKGPAAKSR RSKKMDDDRG RSDGEGGRNF WSVGDTIALV RARRDQDLYF  120
AGMGTSFARM KTREWKWEDV RARLQSMGVT RDVVDCGKKW DNLMQQFKKV HKFQNLSGGK  180
DYFKLASKDR RSEGFSFVMD RSVYDEMEAM TKGDHTIHPK NLADTGATGG VQMPAGAGAA  240
GDTMATEAQS HYEVARFFAV VVGLVCLRRR RFFAVDLFDL LLAVEPSRRG SVSLLEWSGV  300
HQSPRSSSRR GAVHVGFVSF TCGTSSVSSS IVALALRRSP SRRGYVSPLR RSAVASQRSS  360
SRRGEVFAGF ISFSRSRRRG RGVHGKGEDE LLPKAAKALV RRRREQRRGH KRSLTEAGNM  420
SMRSGARGKK RENIPANSQG RERGRRHVPK AKRLRSEEAS ASLPHRRGRS WTAANEEEDD  480
DVFTTEEEAV EENVAAPPGS TLQRSCDQSG ARRLATPPPE AQQVRAHNTQ KAKEVVVDVG  540
AAVSSSNVGV VARAREEVPV VEREAARVDN KGEREDEDPL LSRVRRGGMA RDLADRARLW  600
VDDKAFWTTR ERRRLSVVMP KSSTTLTRIA DPPQLHEAIA HATAAENIAL HVLHGWVFKS  660
GNRPRGFNVA FQYALELVAT DIVRVMWNGE EWSNVVSAPV CAHTIDLNMD LPLWFAGTNI  720
EGRPEDDDMA AHQESTVICI AHAFRATVQM GGIVDGDFIS HERLCRIADC FRLLLAACMW  780
LMRMAGDAAR SHYEAFYFAK LVAKPTLVAS MHRAFDHRQS IIRATNAVTE RLGKANATLG  840
EYPKYIPDWA SCGIVFGQDA SITGPEDAKR RDWLGFSPLE DEDDDDGKED A
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.11e-07Trihelix family protein