PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID GBG74302.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1937 aa    MW: 216479 Da    pI: 6.4889
Description Trihelix family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
GBG74302.1       genome    NCBI      View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End   HMM Start  HMM End
1    trihelix  37.8   4.9e-12  1033   1109  2          71
    trihelix    2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr 71  
                  W+ +   aLi+a+r+ +++l+       r k ++ +W +v +++++ g+ r++++C +kw+nl +++kk+ + ++  
  GBG74302.1 1033 WSVDNMIALIRAKRDQDSHLQgmgtafaRMKPREWKWLDVEQRLKKVGVDREAERCGKKWDNLMQKFKKVHHLQGLS 1109
                  9999**********7777776444333266899999**********************************9888763 PP

Sequence
Protein Sequence    Length: 1937 aa
MEGGGNMLAT TKEAAELIPL DQYVTLGYPG LGMIGTIRPA PEQRDVAEWQ PSLGEMKLGG  60
PTFVTGEIDV LNIIRALDHQ IPLPIGHLLS ISEQANERML QHCKANRKRF GLARTANTKT  120
KGPMPYENAA GASDPIHTGL IQKDDNFLRI KPIPWKSAEC DIEVWGIPYN AIIDSRASVL  180
AILLRVVEKA GRKNDLIMLT ERDQLFSADK ENIKTVGRMT NVAFRLGKVH ALGDVVVLDV  240
NTYDVLFGLP ALVALRANLD FERRSIILRN TGGKPYVVPM RLTLRTTANI APRVSPAMTG  300
TLCMITWDNR QRSKDADSSD DDDDPVILEM AQERIQYPVQ RTNVKTMSRT NQDLQRTQAM  360
IMGEPLVQIS RMVDSLEPPR TMYEGISPLL ARYNDKRHYC DITDLPKSLL TSAKEIRLLR  420
LGAEINSLEP PERLENGLGI KIATKPVPWQ DIRDGVTPEE HVAIREQDAQ MMATVSSWRS  480
DNSFISTPPP GNTKRWVRCP MGICNAPATF QRAMNVTFQN FVNKKQLTQG MINFCVIVYM  540
DDILVYSETY HGHAQHIEWT LGAPRDAGFK ITLEKSEFFL SEILFLGYVV TRGGLRPDSR  600
KVAAVKEALV PTLLTQVRAF LGLASYYRPF IKGFAAIARP LTNLLWKDQP LSWDAECAHA  660
FVALKETLAT TPILIRPDPS KQFILITDWQ PEAISAILAF LTERALNDHL LAPRGYISAI  720
RQVHHPRLNL DLLRLAQFFC TQGVYAAAEF RFAGSRQAVV LAEAVTVQQW SALGISHVST  780
VVAFNGDIST VSALRHTAIR TKLREWGQQL VAEWNHWLTR HRDNLALTDD IDLGGIRALQ  840
HEPAVVLPEI VPFQPPVREA SGNNRDPPRQ QYRTPSLSRG TSARPLWIQS PSPLSTTSTA  900
ARRRGEFGET DCGIDDVGDA HDGREVWAEQ RQTLHPRREE SITHGVQRLR VGEHEDDGNA  960
PAVGPNDQEW NNEHGEGGEE DAGHVPPFKQ SSMKGRGGKT RACAHNGRWG EIAVGKGSDA  1020
EADANAEGGR QFWSVDNMIA LIRAKRDQDS HLQGMGTAFA RMKPREWKWL DVEQRLKKVG  1080
VDREAERCGK KWDNLMQKFK KVHHLQGLSG KQDFFQFFGK ERLSKGFNFN MDRAVYDEIL  1140
GSTAKSHTIN PKNVADTGVP GGVRLPSASS ADHESVGDGY AAAGHDDDDD GSTRGSSQTT  1200
GNPAGFAKRK STRQQTFEAM TECMEKHGAL VASSMDNSSK RQCSIQIRQC EALEAEVEVQ  1260
KKHYAASDEV SKLMCHALLE IAKAIRERQS SCRLCCAPRR FHRPHDAKVI AVPSSMYPSP  1320
ARRGRGGFRL AVLAREVGGG PIGWMSPPVT SPYPSASTAG AEEKDEVESK AEHGQIHIPR  1380
RLLPAATVSH HPLEREVTLS PSPDNIDEPN CAPSPNQGEF IHPRVVKEAH LPMTRSPSPI  1440
SVTLGDDKTQ RFFDQTVTDL LFFLTLEPTD RSPASHRHRS STHLDVMERG YDFILGTPWS  1500
RRFRSTEADW ATNTLVLKTK XGQTYRVPFI GTXXIPRPDP PPPEPSVPTP SPSITVTSPR  1560
QFAHFIQQDD VNFFMVNVTD LLHYDPPCPD VEIISLEPDP PSISMALIST SVPPPSVEST  1620
PSSRADADAE ELARYTTDLE PAVRDLIREY HDVFPSSFSY AGIPPMRNVE HSIQLVPDYR  1680
VHHLAPYRLS IPEATELKHL LEELLRLGFI KPSNSPWGAP VLFARKADGT LRLCIDYRGL  1740
NRYTVKNTEQ LKTAFRSRFG HYEFTVMPFG LTNAPATFQR AMNDIRDILE RYVLVYLDDI  1800
LVYSRTLEEH LRHLRDVLDR LRRHGFYAKL SKCRFAQHKV DFLGHYVSDQ GLHMDDVKIT  1860
AIAECPTPTS AKQLRSFLGL TSYYSQGGGG GGGGGEDMAD EEEEEEEKDM AEEEEEKEQK  1920
EEEEEEEEED EEEQEVG
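
The sequence above is laid out in groups of ten residues with a running residue count at the end of each full line, while the Signature Domain table gives 1-based start and end coordinates. The following is a minimal sketch, not part of PlantTFDB, of how such a block could be joined into one string and a domain segment cut out of it. The function names `parse_numbered_blocks` and `slice_domain` are hypothetical, and the sketch assumes the trailing counters are the only purely numeric tokens on a line.

```python
import re


def parse_numbered_blocks(text: str) -> str:
    """Join a block-formatted protein sequence into one contiguous string.

    Assumption: each line holds whitespace-separated residue groups,
    optionally followed by a running residue count (for example "60").
    """
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # trailing position counter, not sequence data
            if not re.fullmatch(r"[A-Za-z*]+", token):
                raise ValueError(f"unexpected token: {token!r}")
            residues.append(token.upper())
    return "".join(residues)


def slice_domain(sequence: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# First two lines of the record above, used as a small worked example.
example = """\
MEGGGNMLAT TKEAAELIPL DQYVTLGYPG LGMIGTIRPA PEQRDVAEWQ PSLGEMKLGG  60
PTFVTGEIDV LNIIRALDHQ IPLPIGHLLS ISEQANERML QHCKANRKRF GLARTANTKT  120
"""

seq = parse_numbered_blocks(example)
print(len(seq))                   # number of residues parsed
print(slice_domain(seq, 61, 70))  # first ten residues of the second line
```

Applied to the full 1937-residue block with the coordinates 1033 and 1109 from the Signature Domain table, `slice_domain` should return the segment shown on the GBG74302.1 line of the alignment (with the lower-case insert residues upper-cased).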
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT2G35640.1    7e-08      Trihelix family protein