PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG81186.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1250 aa    MW: 136866 Da    pI: 6.8958
Description Trihelix family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
GBG81186.1       genome    NCBI      View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      trihelix    43.3     9.7e-14    764      835    2            66
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkike 66 
                 W+ +++ aLi+a+r+ + +++       r k ++ +W++v+++++  g+ r++++C +kw+nl +++kk+ +
  GBG81186.1 764 WSVEHIIALIRAKRDQDAHMQgmghayaRMKPREWKWQDVAQRLKNVGVDRNAEKCGKKWDNLMQQFKKVHH 835
                 9*************7777777444333367899************************************976 PP
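
The coordinates in the domain table can be cross-checked against the alignment rows. The sketch below assumes the HMMER-style reading of such alignments: dots in the HMM row mark insert columns, and dashes in the target row (none occur here) would mark deletions. The function name `summarize_alignment` is hypothetical and not part of PlantTFDB.

```python
# Alignment rows copied from the record above. Reading them HMMER-style
# (dots in the HMM row = insert columns) is an assumption of this sketch.
HMM_ROW = "WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkike"
SEQ_ROW = "WSVEHIIALIRAKRDQDAHMQgmghayaRMKPREWKWQDVAQRLKNVGVDRNAEKCGKKWDNLMQQFKKVHH"


def summarize_alignment(hmm_row, seq_row, hmm_start, seq_start):
    """Derive end coordinates and the insert count from two aligned rows."""
    if len(hmm_row) != len(seq_row):
        raise ValueError("aligned rows must have the same number of columns")
    # Residues consumed on each side: every column except that side's gaps.
    hmm_residues = sum(1 for c in hmm_row if c != ".")
    seq_residues = sum(1 for c in seq_row if c != "-")
    # Columns where the protein carries residues the model does not have.
    insert_columns = sum(1 for c in hmm_row if c == ".")
    return {
        "hmm_end": hmm_start + hmm_residues - 1,
        "seq_end": seq_start + seq_residues - 1,
        "insert_columns": insert_columns,
    }


summary = summarize_alignment(HMM_ROW, SEQ_ROW, hmm_start=2, seq_start=764)
print(summary)
```

With the start positions printed on the alignment rows (2 and 764), the derived end positions agree with the HMM End and End columns of the table (66 and 835), and the seven insert columns correspond to the lowercase stretch `gmghaya` in the protein row.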

Sequence
Protein Sequence    Length: 1250 aa
MECIGCRESW LGGFTANLEL DFSSSSSCSQ RSACRELDGR ILFRSRKLGR IGYGGYGGFC  60
LRRWKLSSSS SLAAMGERSW ENRRRSPARH YMGGRRREWE DGDGGDVRER ERIEWPMHVD  120
RISSVTNHFV RQCVRLRQSR SYRNERGVVL VVGRIPLSEI ISAWGQDGED GELKGGGVEI  180
EVLMLLEGLP MPEDMPARQT VYVTPSVMRK LSGVESAEGV EMVAILRKPA SFTAFGPATI  240
SSFDKAERWC PSANRLLALD GVQDPGNLGT LLRTAAALGW DGVFLLPGCC DPFNEKAIRA  300
SRGAAFHIPV AVGDWSQLRA MLSRRRVACY SGEPHQPLTS SSMRSSPDAM RKKIDRASLE  360
DCVCLILGSE GQGLSQEARE VSQPLAVPME GHLESLNVAV AGGILMLLLL ILVVLLLLLV  420
LLFRVCILGN VPPRQRKRRS SMKDARMEAR QAIPRSGGEQ VGGRRCGGSL STVGRASQRV  480
GYDALPPHLQ PLPGSSDEEE VVERRPQTVS LGSGSTQEWT ATELCGTSGD MYEQSFTELL  540
RPGLGEDEGD GRVNLSFGLS TRRSTTPSRT VLVRPHPDDE GGQLTVVDRS ARTRALASET  600
AGTNRNSSMP QPRAASLSKG AQGRGIGVDG GTDFLDVGNG RDGREVWRDL RRDHRLRREE  660
YITRGVERLH VGDRENENET DDPPAEADDD YDDDDDDDND DDNDVECGEG GSGHASPSLQ  720
SDMAGKGGKS KPSGRNARPR AKKGQGKGSG GEGDGDAEEK RNFWSVEHII ALIRAKRDQD  780
AHMQGMGHAY ARMKPREWKW QDVAQRLKNV GVDRNAEKCG KKWDNLMQQF KKVHHFQSPS  840
GGADFFQLTS KERASRGFNF TMDRAVYDEI EGSTGMNHTI HPKNVADTGE SGGVRPPSTS  900
YVDPESVADG EGGAGREDDE EGSTRGSLQT TGTPGGSGKR KSTRQQTFEA LTECMEKHDE  960
LMPSTMESAS KRQCSIQEGA TSHETAQRVL APLNRPRTPA ANVAGSSHAA VEGGTLRSPA  1020
VVARGGAVAV SGEAVEVPKG GGGAAAGEDD EALVHRLRGQ RAATHAMDAA AKLWEDDNRF  1080
WNDTQGSAIV RIIQEARAYL VAVARRVQPP AIRRSISLPH NSIPQHKIED ESELNAAKER  1140
ALKVQTISLR AIHGWVFKSE SWQRGYHLAY QYALNHAATD IARAMWSAED WRSLVSPMLF  1200
RTTLDVDMKL PLWFVGVNIV DRHEDDECGI SQAQLARQKL WTAVECRTSV
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      435      440    QRKRRS
2      737      745    ARPRAKKGQ
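
The Start and End values in the NLS table read as 1-based, inclusive positions in the protein sequence. The sketch below illustrates that reading on the two sequence rows that contain the signals (the rows ending at residues 480 and 780). It assumes every row ends with the number of its last residue, which holds for all rows of this record except the final one. The helper names `index_rows` and `fetch` are hypothetical.

```python
# Two rows copied from the protein sequence above; the trailing number is
# the position of the last residue in the row (assumed row layout).
ROWS = [
    "LLFRVCILGN VPPRQRKRRS SMKDARMEAR QAIPRSGGEQ VGGRRCGGSL STVGRASQRV  480",
    "SDMAGKGGKS KPSGRNARPR AKKGQGKGSG GEGDGDAEEK RNFWSVEHII ALIRAKRDQD  780",
]


def index_rows(rows):
    """Map each 1-based residue position to its amino acid."""
    residues = {}
    for row in rows:
        *blocks, end = row.split()
        seq = "".join(blocks)
        start = int(end) - len(seq) + 1  # position of the row's first residue
        for offset, aa in enumerate(seq):
            residues[start + offset] = aa
    return residues


def fetch(residues, start, end):
    """Return the residues from start to end, both inclusive (1-based)."""
    return "".join(residues[pos] for pos in range(start, end + 1))


residues = index_rows(ROWS)
print(fetch(residues, 435, 440))  # QRKRRS
print(fetch(residues, 737, 745))  # ARPRAKKGQ
```

Both slices reproduce the Sequence column of the table, so a zero-based tool would need `sequence[start - 1:end]` to extract the same spans.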
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT2G35640.1    5e-09      Trihelix family protein