PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG84136.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 939aa    MW: 102852 Da    PI: 7.9871
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG84136.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix31.83.7e-1079153269
    trihelix   2 WtkqevlaLiearr.......emeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegek 69 
                 W+ +  ++L++ +r         +++++r + k+ +W++++k+m+  g  r +  C +kw+nl + ykki++ ++
  GBG84136.1  79 WSPEDQMLLVRCKRkqdmhlvGFGHNYGRMRTKEWKWDDIAKRMANAGRPRDADDCMKKWDNLFQNYKKIQRFQN 153
                 7777777888888833333333345555778**************************************998665 PP

Sequence ? help Back to Top
Protein Sequence    Length: 939 aa     Download sequence    
MQAHSDGGDD DGGGGDNADE RLMEDVEAGD DNDDIHIRPL GKTGGRGRGR SRGAVRGRSI  60
GRGGRGGGDD DGGKSATYWS PEDQMLLVRC KRKQDMHLVG FGHNYGRMRT KEWKWDDIAK  120
RMANAGRPRD ADDCMKKWDN LFQNYKKIQR FQNASGQPDF FRLSNEERKE HNFKFRMERV  180
LYNEIHAGMV GNHTIFPPNV ADTGSPDGVQ VPRQGAGGGE SVGSEAGGEG FPEERSSARD  240
SDNNAGSGAG GGKRKNARQQ ALEWIADVMD LHGELMSSTV ESSSKRQCSI FTRQYDILEK  300
EVAVQKAHYA ASDETQRMMC HMPMEIAAAI REKCCMLATD KCRLLPADIF NVVEDRRRHH  360
SSHLVTWTIF AGVVVLLACT WTSVVNRWWS RSTVTVGLLF FSPPIFSPRS TLAGVVVLLS  420
TACTSSSSSR GGQYFCVVRV FVRVMPRQHG RHFSRQISGT SNGGRRNTSR SSDRTAKLAR  480
RSMPTRGSAR GKKRDGVPHD SQGQGRGRRH VPKVKRVRSD DASARVPPRG AQGWAAAVGG  540
DYDKDFTTEE EQGEASASAV RESGRQRSSD HSAPKRMLTP PPEAQQLCAR KRRREKAAIV  600
DLGCDDDDPL EKRRLRTRTT TTPPPAVVAC SAVDERPVTG RLPATPSQPR QPNPGDDGGS  660
VQHGGGGEVV ADARKVGGES AGAAGAGALG TVAPVATARE EAALVFKSGN CPRGYNVAFQ  720
YSLESVATDI ARAMWYGEEW YNVVSPTVCA HTIDLSMDLP LWFSGTNIEE RPDGDDMVAH  780
QDSTVIYVAH AFRAAVQMGA LIDGDFISYD PLCRVADCFR LLLEACMWIM RMAGDDPRSH  840
YEAFYFASLM AKPTLIAAMH RSFDHRRSVV HAANIVTEWL GKANATLGEY PDYIPQWAPC  900
GIGFRHNAST TGPEDAKKLD WLGSAPLDDD NNDDGKYDA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1590594RKRRR
2590596RKRRREK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.16e-07Trihelix family protein