PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG86101.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1149aa    MW: 125830 Da    PI: 9.3402
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG86101.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix30.78.1e-109651049180
    trihelix    1 rWtkqevlaLiearr......emeerlr.rgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcp 80  
                  rW ++e+ + ++a++       ++++++ +gk   + Wee+ + +r+   +rs+++Ck +w+ +++ y ++ +++ +  s+++s + 
  GBG86101.1  965 RWGEDETVVFLQAKSeqlaarALDSEVKgNGKDPTKVWEEIKADVRRANWNRSAIECKRRWNTVKRWYSRVVDNDTR--SGRQSYWT 1049
                  8**************4444443333333345788999***********************************99998..45565665 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1149 aa     Download sequence    
MESRGPTFVT GEIDVLNVVR ALDHCIPLLI GHLLSISEQA NERMLQHCKA NRKEFALART  60
ANTKGKAPAQ DDNPNPASTS KPIRLGLIQK DYHFLRSKAI SWKSAECDIN VWGIPYNAII  120
DSGASVLAIL QRVVERAGRR KNLIMLTERD HLVSTDEEKI KTVGRMTNVA FRLGKVHALG  180
DVVVLDVSTY DVLLGLPVLT ALRANLDFEK RSIVLRNTGG KPYAIPMRLT LRTVAEATQK  240
TPPVPGGALR MIAWKKLAER SDHQTEEPNL SSDTNNSDED GLVIWELVQQ RVRYPIRQAT  300
TKTSKLTEGN IQRTRALISG EPLMQISRMA DSLEPPRTLY EGTTPLLARF RDKRAFCDIT  360
ELPCSLLTSR KEIRLLRLGA EGKALEPPAR RDTGPHDLGI KILKNNVRWQ DVCDGITPEK  420
HVAIREEDAQ MMATVKFDIR HGFHHILVKE EDRPKTAFVL FEGTCQWVRC PMGICNAPAT  480
FQRAMNVTFQ NFVNKTRLTQ GMISFCVIVY MDDILVYSES FHGHAQHIKW TLGALRDAGF  540
KIALEKSEFF LYEISFLGYV VTRGGLRPNW RKVAVIRDAP TPTSLTQRAF LGLASYYRRF  600
IKGFAAIARP LTNLLRKDQP LSWDAECEQA FSTLKGALAT APILIPPDLA KQFILITDWQ  660
PEAISAILVQ KGNDGREHVI EYTSRTVSDE RRNDSAPQGE WYTVVWGLQH FHFHAQQPSQ  720
GVSLPMWTPC ISPPASATAF PYAQDASVEL VRHGGHLTGL LEWEKLGDRG ICAAGDPVMD  780
AVAEHANSDT QQGRNQAWTA EFPLRSEAND GVQSCMQGAI SEPLAGALAA GTATVAAIAV  840
PSVRVVPAVH VLPPTGAVAV GGVVGAARAL EGGALTRGCG VPGVEDLAAG CALTGGGSRP  900
AAEAVAVGGV APPCGNVAAA EAMPGVAAPP AAASAGTAGG AVPSPARVDK QNTEDVPAKR  960
ATSSRWGEDE TVVFLQAKSE QLAARALDSE VKGNGKDPTK VWEEIKADVR RANWNRSAIE  1020
CKRRWNTVKR WYSRVVDNDT RSGRQSYWTM TPKERAAANL YFNLRKNWFD ILESYNVRNR  1080
VPGSRRRSGG GGSSGACRGS GTMVEEEKRR MRRGQHRITR VVAGGVPLVN VPTSHQLLVL  1140
LLPTDAGPM
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
111081113KRRMRR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76880.12e-08Trihelix family protein