PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG63227.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 985aa    MW: 107402 Da    PI: 6.0767
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG63227.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix38.72.6e-12329400266
    trihelix   2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkike 66 
                 W+ +++ aLi+a+r        me+++ r k ++ +W++v+++++  g+ r++++C +kw+nl +++ k+ +
  GBG63227.1 329 WSVEHIIALIRAKRdqdahmqGMEHAYARMKPREWKWQDVAQRLKNVGVYRNAEKCGKKWDNLMQQFMKVHH 400
                 9*************555555455555557789***********************************99865 PP

Sequence ? help Back to Top
Protein Sequence    Length: 985 aa     Download sequence    
MADVFRRLPL LLLVVLLLLL VVLFRLCILG NMPPRRRKIR SSMKDARMEA RQEIPRSDGE  60
QVGGRRCGGS LSTVGRASQR VGCDALPPHL QPLPGSSDEE EEVERRPQTV SLGSGSMQPR  120
SCAGRAAACT SSRSLNSFDM VLAKTKEMAT LTFVDRSART RALASETAGA NRNSSTAQPG  180
AASLSKSAQG RPEWMQLPSP LSAASEVARG RGVGVDGGTD FLDVGDGRDG REVWRDLRRD  240
HRLRREEYIT QGVECLHVGD RENKNETDDP PADAEDDDDD DEDNNTDMAG KGGKSKPSGR  300
NARSRAKKRQ GKGSGGEGDR DAKEKRNFWS VEHIIALIRA KRDQDAHMQG MEHAYARMKP  360
REWKWQDVAQ RLKNVGVYRN AEKCGKKWDN LMQQFMKVHH FQSPSGGVGF FMLTSKERAS  420
RSFNFTMDRA VYDEIEGSTG MNHTIHPRNV ADTGASGGVH PPSTSYVDPD SVVDEEGGAG  480
REDDKEGSTR GSSQTTGTPG VSGKRKNTRQ QTFEALTECM EKQWELMAST MESASKRQCS  540
IQVRQCEALE AEVEVQRKHY AASDERKGVK VVAAGSARDV VEEVVVEEEM TNDDDGFEDD  600
DDEPQPRKAR VGSAVGIRIN EGGEGTPAAR RGGGVAAANQ PVFVDVARDV ARRDVAGRAK  660
EGAASHETAQ RVLVPVNRPR TPAADVAGGS QAAVEGGTSR SPAVAARGGA VAVPGEAVDV  720
PKGGDGVAGG EDDEALVHRL RGQRAATHAM DAAAKLWEDD NRVWNDTQGS AVVRIIQEAP  780
RISLVSPMLF YTTLNADMKL PFWFMGVNIV DRHEDDECTA YQEACIQRLV RDFTSVVGTA  840
EAMDGGRVSY ERLKGMAKAM RYLLAATMWI MRMAGEDPRS HYDAWVFVQL MAKTMLLASV  900
NCQFVARRHI TQSAQVMTDK LGRPPPTFAP PPVYIPDWAS KCGVTFWHDA TLASPTEAKR  960
LDWLGTGPPE DDDDDAEGDD KGEGG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
13438PRRRK
2302310ARSRAKKRQ
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.18e-10Trihelix family protein