PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GBG64168.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Charophyceae; Charales; Characeae; Chara
Family Dof
Protein Properties Length: 985aa    MW: 98463.8 Da    PI: 5.3825
Description Dof family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GBG64168.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-Dof122.81.1e-38326386262
      zf-Dof   2 kekalkcprCdstntkfCyynnyslsqPryfCkaCrryWtkGGalrnvPvGggrrknkkss 62 
                 ++ka++cprC+s +tkfCyynny+++qPr+fCk+C+ryWt+GG+lrnvPvG+grrknk+s+
  GBG64168.1 326 PDKAVACPRCESFDTKFCYYNNYNVNQPRHFCKGCQRYWTAGGTLRNVPVGAGRRKNKHSQ 386
                 6899******************************************************986 PP

Sequence ? help Back to Top
Protein Sequence    Length: 985 aa     Download sequence    
MEAEVKSPGS GDQQGGQQQN SDPGIKLFGR MIAVNFYNRS DIDYYSQEGG RGGGGVSGGQ  60
STAAVADCRA VSEEETPEAA QATEAAVGAG VGTATIEQGM EDGRVSGGDA GMTTAPCSLT  120
MSFARARAQG AGYNMVPSGG GGGGGCGGMG WGCGGGGGSA EKVAVGKGQP EGCGELHDEE  180
RRPNDRSFDR DRRMGSGVLP SEEAGSSSGS NSGSDDGNFE AGNGTQSMAP GGVTVMSGGH  240
AIEGRGENIP TTSMGQGRGE EMTVNGLGLD SSGGKGCGRS EGLTGSTGMR MVGMDGEGEE  300
GGSEKVDGGI SSGGGLCTSQ EKTKRPDKAV ACPRCESFDT KFCYYNNYNV NQPRHFCKGC  360
QRYWTAGGTL RNVPVGAGRR KNKHSQQKSA NGQANVSSGG NPARNSCGTA SAGIGSVSGT  420
VPGPGSSSGF PNSSVPGCVR LEVEDEVSPS GMTSELGRAD GAPTRFPAGT NCGVARPIPT  480
SALPAGLGIM GLGRGLVAEE TTSLQYGVFQ QLQGLTAPPV LPGGAVEGFG VPGLGIPGSG  540
VEGLLGPDVR RKKQKTRRLQ QGVTPTGLAT PVSETLGSNE EEESAAVARG FCAVRETTDA  600
STCTSSMANT SVCMSSVSSL NADGERMNDL TGPSMMPSQV GVQGSSVWGP PPPPSVGLVR  660
GPGGMQVDSL GVSMGMFKTP GMEAGELASA AASAGAGVWP SVTPTAAAMH ASPGTWGPAS  720
TGSPYGYFSG AWPFGYHLGW NGHHGGAMAG AGPAASGAAG WASTGLWPGG WAGLSPSSPW  780
GAAAGWGSLW GMPWANAAAA AAVANASAGV SPASVLGKHM RDGGAAVMMG KQTVDDANCL  840
VGPSVGNPPA GMLSPTGQFK PKGLHEATEG RNRGVGEDGG SEEGVMEGGG DMGGMVWAPK  900
LLRTGDGVHG DGRIGGWSAG KGLSGPTSGL GSMFRAFNGG MPKAPRFENS SSPVSFEKRL  960
FSPDQNARLA NPASIPRSMA FQESN
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
14955GGRGGGG
2547554PDVRRKKQ
3551557RKKQKTR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G39660.17e-35Dof family protein