PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG67075.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 838aa    MW: 90378.9 Da    PI: 6.8322
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG67075.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix301.4e-09261346269
    trihelix   2 WtkqevlaLiearr..................emeerlrrgk........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegek 69 
                 Wt++e+ aLi+ +                            +             We+++  mr++g +r+++qC+++w+ l   +k  ++ + 
  GBG67075.1 261 WTNDETVALIDSKGmheagrllelsptgsisgA--------RrtwettyeESRRGWEHIAGLMRSEGHKRGATQCRSRWRCLWSSFKTARRYMV 346
                 ************998888877766666655442........02223333378899********************************9887665 PP

Sequence ? help Back to Top
Protein Sequence    Length: 838 aa     Download sequence    
MSSASCEQPA NRPIPTISCN SERTPVHAGG ADSAPIDFCT QQLVGVGISL SPLTRREVAN  60
AGSQKASLFI DLRSNERDDG DIAGWMPALF QSPGTESLRV PNVVSPLASS DIQEITADEY  120
QQSLRRKAVE NGHDVRANDV EPAMSASQAA AVLECLRSTS GERKVRGVEA HIVGAKRMES  180
KKIGSSTTVG GGDGNSGETV PVHVGGVNEV EHHVGRSAAW ADASDGGGEF FKSRHQYPTS  240
NPHDANDIES IDSPTQCCGG WTNDETVALI DSKGMHEAGR LLELSPTGSI SGARRTWETT  300
YEESRRGWEH IAGLMRSEGH KRGATQCRSR WRCLWSSFKT ARRYMVRDDQ PPYWEMSVED  360
RRDCGLETHF TREWFELIQS YSQQWDSCIP HRGIRISKVG ANGEESNHNQ HAEHRGQGTP  420
KKPRTVVEEL EKTVKDLCNT ASTCTARQVE LLDLHSERIA SLQRESAKNF VATMEAAWRV  480
VSDIMRDERH LLLPLLDRGI AVVSTEDGSR TSYTMSIFRG FTGALNPSRT IYYVGTYRCI  540
KYSVMDGDSP ASFGNEFRSI SCPTVSGLLF SRRNFLQDGS YLYAWEKSNR TVLGIDLISG  600
AVTAIPQIRD QAVGDFALTQ DGCNMFAIAG KSIMRADFDK PGGKVVKVEY VTTYASQHYD  660
IRTASLDNDG SHLYVATSNG QLLQFPINKS ALGECSGALP TPAAPTGAAS AYQSPSPAPS  720
TGASPAAGAL SSAGHDGSST GVSGRLADTL SSPSPTTVRT PPRGRVNVGV QACSFVAACL  780
VSAVLGGALV LLVSRSRKSA KAESGVMTAM STALEHPVER CSALEHPVQR CSALERPS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1420426PKKPRTV
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G33550.19e-06Trihelix family protein