PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG82441.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1328aa    MW: 144947 Da    PI: 5.83
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG82441.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix41.82.8e-13559630266
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkike 66 
                 W+ +++ aLi+a+r+ + +++       r k ++ +W++v+++++  g+ r++++C +kw+nl +++kk+ +
  GBG82441.1 559 WSVEHIIALIRAKRDEDAHMQgmghtyaRMKPREWKWQDVAQRLKNVGVYRNAEKCGKKWDNLMQQFKKVHH 630
                 9*************6666666433333367899************************************976 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1328 aa     Download sequence    
MLEVIWVLEE GPAPELDEFV DWRQVDALSS DCYTLLDERL EGGADWAKRS GGELPITMLA  60
MVAPAAPAPT ISPTPATMFS SQVTTNNNNN YTNNNNNVHD SGMEGVILSS CASGSDNNHN  120
NHNNHNNNNS NNNNNNSNNN YSNNNNNNNN NNNTDGVHCA PLLCWRPRCL PFAAVVVDCF  180
MADVFCRLPL LLLVVLLLLL VVLFRVCILG NVAPRQRKTR TSMKYARMEA RQAIPRSGGE  240
QVGGRRCGGT LSTIGRVSQR VGCDALPPHL QPLPGSSDEE EXVERRPQTV SLGSGSTXEX  300
SATELXGXGG XVYEQSFTEL LRPGLGEDEG DGRVNLSFGL STGRSTTPSR TVLVRPHPXD  360
EGGZLTAVDR SARTRALASE TAGANRNSST AQPRAASLSK GAQGRPEWMQ LPSPLSAASE  420
VARGRGVGVD GGTDFLDVGD GRDGREVWRD LRRDHRLPRE EYITHGVERL HVGDRKNENE  480
TDDPPAEADD DDDDDKDNDI ECGEGGGGHA SPSLQTDMAG KGGKSKPSGR NARSRVKKGQ  540
GKRSGGEGDG DAEEKRNFWS VEHIIALIRA KRDEDAHMQG MGHTYARMKP REWKWQDVAQ  600
RLKNVGVYRN AEKCGKKWDN LMQQFKKVHH FQSPSGGANF FQFTSKERAI RGFNFTMDRA  660
VYDEIEGSTG MNHTIHPRNV ADTGVSGGVR PPSTSYVDPD SVADGEGGAG REDDEEGSTR  720
GSSRTTGTPG GSGKRKNTRQ QTFEALTECM EKHGELMAST MESASKRQCS IQVRQCEALE  780
AEVEVQRKHY AASDEVSKLM FCELPTAIEG AFVGSAMSSY GGGRAKAALK QIVEGATPAK  840
KGRHQAKRQR KGVQAVAAGS ARDVVEEAVV EEEMTNDDDD FEDDDDEPLP RKARVGSAGG  900
IRINEGGEGT PTARRGGGVA AANQLVFVDV ARDDGARRRK EGRRWLWDVA TPAVAAHGGT  960
VAVPGEAVEV PKGGDGAAGG EDDEALVHRL RGQRVATHAM DAAAKLWEDD NRLWNDTQGS  1020
AVVRIIQEAR AYLVAVARGV QSPTIRRSIS LLHNSIPQHK IEDESELNVV KERALKVQTI  1080
SLKAIHGWVF KSESRQRGYH MAYQYALNHA TTDIARAMWS AEDWRSLVSP MLFRTTLNAD  1140
MKLPLWFMGV NIVDRHEDDE CAAYQEACVQ RLVRDFTSAV GMIEAMDGGR ESYERLKGMA  1200
EAMRYLLAAT MRIMRMARDD PRSHYDASVF VQLTAKTTLL ASMNRQFDAR QHITQSAQVM  1260
TDKLGRPPPT FAPPPVYIPD WASKCGVTFS HDATLASPME AKRLDWLGTG PPEDDDDDAE  1320
GDDKGEGG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1532540ARSRVKKGQ
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.14e-10Trihelix family protein