PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG64121.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Trihelix
Protein Properties Length: 781aa    MW: 85313.7 Da    PI: 6.8739
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG64121.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix44.25.1e-14432504267
    trihelix   2 WtkqevlaLiearremeerlr.......rgklkkplWeevskkmrergferspkqCkekwenlnkrykkikeg 67 
                 W+ +++ aLi+a+r+ + +++       r k ++ +W++v+++++  g+ r++++C +kw+nl +++kk+ + 
  GBG64121.1 432 WSVEHIIALIRAKRDQDAHMQgmghayaRMKPREWKWQDVAQRLKNVGVDRNAEKCGKKWDNLMQQFKKVHHF 504
                 9*************7777777444333367899************************************9875 PP

Sequence ? help Back to Top
Protein Sequence    Length: 781 aa     Download sequence    
MSQRSADTNI SGRTPAPNYA LSVGDRRRPQ DDRLQHCAPL LCWRPRCLPF AAVVIDCFMA  60
DVFRRLPLLI LVVLLLLLVV LFRVCILGNV PPRRRKRSSM KDARMEARQA IPRSGGEQVG  120
GRRCGGSLST VGRASQRVGY DALPPHLQPL PGSSDEEEVV ERRPQTVSLG SGSTQEWTAT  180
ELCGTGAGVY EQSFTELLRP GLGEDEGDGR VNLSFGLSTG RSTTPSRTVL VRPHPGDEGG  240
QLTVVDRSAR TRALASETAG TNRNSSTPQP RVGSLSKGAK GRPEWMQLPS PLSAASEVAR  300
GRGVGVDGGT DFLDVGNGRD GREVCRDLRR DHLLRREEYI TRGVERLHVG DRENENETDD  360
PPADADDDDD NDVECGEGGV GHASPSLQSD MAGKGGKSKP SGRNARLRAK KGQGKGSGGE  420
GDGDAEEKRN FWSVEHIIAL IRAKRDQDAH MQGMGHAYAR MKPREWKWQD VAQRLKNVGV  480
DRNAEKCGKK WDNLMQQFKK VHHFQSPSRG ADFFKLTLKE RASKGFNFTM DRAVYDETEG  540
STGMNHTIHP KNVEDIGASG GVRPPSTSYV DPESVADGEG GAAREHDEEG STRGFSQTTG  600
TPDGSGKRKR TRQQTFEALT ECMEKHGELM TSTMESASKR QCSIQVRQCE ALEAKVEVQR  660
KHYAASDEVS KLMCHTLLEI ANPGNVIGKE VELVDESHPS SNGVVDVVLG LVEDQRIVVR  720
YPAKLALTKE EVPPPLVDEG EELLLACIVN GMHDGHFLAK VLDRVPVHTI MLLEDCTNAI  780
A
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
19296PRRRK
2405413ARLRAKKGQ
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.16e-10Trihelix family protein