PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG84806.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 1279aa    MW: 139504 Da    PI: 4.1076
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG84806.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix36.11.7e-1111501225270
    trihelix    2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekk 70  
                  W+ +++laLi+a+ + + +++       r k ++ +W +v++++++ gf r   +C + w+nl +++kk+ + ++ 
  GBG84806.1 1150 WSVDHILALIRAKHDQDAHMQgmghayaRIKPQEWKWLDVAQRLKKVGFDRDTDKCGKTWDNLMQQFKKVHHFNGL 1225
                  **************888888843332224478999999********************************998876 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1279 aa     Download sequence    
MSSGRNNDEE MEPEEEQEEV EPGPLDRLRA GEEWIIKEEF GNRICFSLAW RSPRGGCVDM  60
DMSSVAFRKD GTLLDAIFHM HLKSGHQFAR HMGDDREGTF GEELNDCERI HISVDKIPEE  120
ANSILFLVSQ FGEKVPEKKP EKKKRKLSTV PEEEQLGQPL TEAEALADDA LLQPPPAATE  180
PAPPGSEADQ AATAPADKDA ASALETGSGE PGEAKEGEVE GDLEDAEGKK GVSFSDTNQE  240
YSPTATGEKS SDESKREEGT GEESADEASP TGSAEEGTEP TKEEGEDQEV QDQVPPPATA  300
DEVAEPEAQQ QEEELSSEGE EKGETAPEGE GEETGVATTS QDPPESGEEG PDTTEAPAED  360
DAGGPRGSSG EEPGNEGGQD SAPSGGEEPA SEDEAKGASD KEDGDGDGPQ TDEEKEPTTQ  420
NAKSMDDQAD QDQDQDHAKS SGEEGDDDTG GEGSAEGESG DEEGSKPPDG EQSPPPEPEP  480
EDPYPPPTEE QLAAAAAADA SGASLETEGL GETMIVIPVT TFGTRAESLT FRVLSEEMEE  540
LIALSIGEDF ADPTVGSVAI CVIRRMVNGW CIRGLHLPAG SARTWGEALV ALPRAQAEIL  600
IPSYKRRLLP QWDILKPIPK FEHYNFEEMS KIVEAWKPVK KEEEGGAEEK EEGEGEEEEE  660
DEVPVKPPKG RRRSVESVDT AADGDSEEPE LPPLDWKVFE IRVDLGWVMQ DDANDDMEWG  720
CFLFDSHGEL VDEVSRKVKE IDGVSVEDEE EDLAVFNHYC KPPEPPEEEE EEDEEGAAAA  780
AEGAETEKKE GAEGEKPQAE GEEGGEGEGE EGEEGENGEE EEQEQEEEPE QKDEDEENED  840
GEQEAEPEPE PAPEQTAIIL FTLVREAPNK LFVLWKNAGK PDPTTFIEEI RKQKRDLTKE  900
EVDLKVAVPF ETNFWRIRLD AIPCTGESME ESMERLVDVA SYTELLQQGL SGDEGDGGVN  960
LSFGLCSGRS SATSRMVVVC PHPDDDGGQV MAVVRSSKSL APIREASGNN RDPPRQQFQS  1020
PFVSTGASAR PQWMQSLSPL SAGSSAARRR GNAGRRILAS TILVTNGTEG RKRRHQFWTP  1080
MITRTMTTTE KVGRRTKATF RRPSKTAWEA GAERRRRLRP ARNGRRGKKA AGKGSDADGD  1140
DDAEGGRHFW SVDHILALIR AKHDQDAHMQ GMGHAYARIK PQEWKWLDVA QRLKKVGFDR  1200
DTDKCGKTWD NLMQQFKKVH HFNGLSGEQD FFQLSAKETT SKGFNFNMDR AVFGGRGHGR  1260
AQSHDGGHCW HSPLVIGAT
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1142147KKKRKL
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.15e-07Trihelix family protein