PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID GAY37239.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Rutaceae; Aurantioideae; Citrus
Family HB-PHD
Protein Properties Length: 704aa    MW: 79132.7 Da    PI: 9.6972
Description HB-PHD family protein
Gene Model
Gene Model ID Type Source Coding Sequence
GAY37239.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox17.95.5e-064965272253
                 SSS--HHHHHHHHHHCTS-HHHHHHHHHHHHH CS
    Homeobox  22 nrypsaeereeLAkklgLterqVkvWFqNrRa 53 
                 n ++ +  +e L+k+l+L+ ++V  WF+N R+
  GAY37239.1 496 NAKIRRIVKENLSKELSLEPEKVNKWFKNARY 527
                 55666667899*******************97 PP

2PHD34.64.4e-12240295151
                 SBTTTSS..TCTTSSEEEBS.SSSSEEETTTSTSSSSHHSHHSS..TBSSHHHHTT CS
         PHD   1 rCkvCgk..sdeegelvlCd.gCkewfHlkClglkleseekpeg..ewlCeeCkek 51 
                 +C+ C+   +  ++++vlCd +C+ +fH+kCl+++l++e++p g   w+C+ C++k
  GAY37239.1 240 ICAKCKLreAFPDNDIVLCDgTCNCAFHQKCLDPPLDTESIPPGdqGWFCKFCECK 295
                 588888854455********56********************9999*******985 PP

Sequence ? help Back to Top
Protein Sequence    Length: 704 aa     Download sequence    
MPGDGKKVVK HSESGKSCFP KKDSGSELIA SLKFKKGSKI SKSKKLRSKS KCQSKTVNST  60
LLKRTVAESH SKGAGDDFAK SKSISQKNLH IKIDRKGSKN WASSKHKGKN SALVISKGNG  120
EVADGDGETK KLRKGRSKKR RKEKVELDEA SRLQRRTRYL LIKMKLEQNL IDAYSGEGWK  180
GHSREKIRPE KELQRAKKQI LKCKIGIRDA IRQLDSLSSV GCIEGSVIAT DGSVHHEHII  240
CAKCKLREAF PDNDIVLCDG TCNCAFHQKC LDPPLDTESI PPGDQGWFCK FCECKMEIIE  300
SMNAHIGTSF SVNSNWQDIF KEEAAFPDGC SALLNQEEEW PSDDSEDDDY NPERRENSCS  360
ISRAGTDDDP SSSTSLSWFS DSETFSESMR WEMESNGYKN YSVDSSIGSD ETSDGEIICG  420
RRQRRTVDYK KLYDEMFGKD ASAFEQLSED EDWGPAKRRR KEKESDAVNS LMTLYGSEEK  480
CSKVKTAEVK KKLPSNAKIR RIVKENLSKE LSLEPEKVNK WFKNARYLAL KARKVESARQ  540
VPGSPRISKE SSLETEKQNA DVLTFKNSLE ETLVCSPKSL KKIHPKKDSK SVSSGSGFEK  600
NQQKRASLES PVNSNQASIE LSDDVSLKKL LKAKSKKTKK VKFVATGESQ AAEAEMERLC  660
RAKGRLECLK QTLSRFQIEN SKKSNKSHLY DMVYVPIAEL REKL
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1133143RKGRSKKRRKE
2138142KKRRK
3138165KKRRKEKVELDEASRLQRRTRYLLIKMK
4139166KKRRKEKVELDEASRLQRRTRYLLIKMK
5457484KKRRKEKVELDEASRLQRRTRYLLIKMK
6458485KKRRKEKVELDEASRLQRRTRYLLIKMK
7458463RRRKEK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G29940.11e-152HB-PHD family protein