PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG72774.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1023aa    MW: 110372 Da    PI: 7.3727
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG72774.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix33.98.4e-11427499267
    trihelix   2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikeg 67 
                 W+   + aL++arr        m++++ r k ++ +We+v +++++ g++r  + C +kw+nl +++kk+ + 
  GBG72774.1 427 WSVGDTIALVRARRdqdlyiaGMGTSFARMKTREWKWEDVRARLQSMGVTRDVVDCGKKWDNLMQQFKKVHKF 499
                 899999********9999999999*********************************************9875 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1023 aa     Download sequence    
MLFPPGALRT ATVSLDKIVV FFDCRHASHV GAQNFHRSLF CRRASSLNGR PPTSHCSPLN  60
GPYMADLLHR LPLLVLVVVL LLLVVFFHVC FVGLVPIAAR RRRSSKKAHR MDRRLGNGRP  120
AGTTFGASNC QHSAGSVAKR PYDPRLYAGL PSHEIPLPPS DDEGGDARSS TLPLGSGSTQ  180
EWAATQSCGG GRVETPWTYM SLLNEGLCDD DDNAAVDLSF QLSSSSRAAA THTRIINPHP  240
GADCAHNTHG GVCGPRDGGL PQSLCEGGGK RNESTSTIDG GVGARERSEW MRLSPLSRSG  300
SGAPCARQRP EVLHQEGADI QRDSRQLWAE CRQASHQRGT ETITRGVQRL HVDEGNEAAA  360
EEARGCDDGD GDNDCNSDDL PDIRPLGRKA TKGGATARKG PAAKSRRNKK MDDDTGRSDG  420
EGGQNFWSVG DTIALVRARR DQDLYIAGMG TSFARMKTRE WKWEDVRARL QSMGVTRDVV  480
DCGKKWDNLM QQFKKVHKFL NLSGGKDYFK LASKERRSEG FNFVMDRSVY NEMEATTKGD  540
HTIHPKNLAD TGAAGGVQMP AGAGAAGDTM GSEGGGDAAN EEQGSTRDST FSAGSGDGAV  600
TFSSAASSSV VNTSSSSSSL AAEGGSMERG AGDGAQQEAR VAAEVAIAAA AAGSSGNVGV  660
VARAREEVPV VEREATRVDN KGEREDEDPL LSRVRRGGMA RDLADRARLW VDDKAFWTRG  720
EGRRLYNIVH ETWEYFVAIA GGMQTPPVPR SVVMPKSSTT VTRIADPAQL QQAIARAMAA  780
GNIALRVLHG WVFKSGNHPR GFNVAFQYAL ESVATDIARV MWNGEEWSNV VSAPVCAHTI  840
DPNMDLPLWF AGTNIEDRPE DDDMAAHQES TVICIARAFL AAVQMGGIVD GGFISHERLS  900
RITDCFRLML AACMWLMRIA GDNARNHHEA FYFAKLVAKP TLVASIHRAF DHRRSIIRAT  960
NAVTERLGKA NATLGEYPKY IPDRASCGIV FGQDASITGP EDAKRRDWLG SGPLKDDDAK  1020
EDA
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.13e-07Trihelix family protein