PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_022974200.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Cucurbiteae; Cucurbita
Family HB-PHD
Protein Properties Length: 716 aa    MW: 81209 Da    pI: 9.2578
Description HB-PHD family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
XP_022974200.1   genome  NCBI    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  32.4   1.5e-10  508    550  12         54
                     HHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHH CS
        Homeobox  12 leeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRak 54 
                     +e L ++F++n  ps++ +e L+ +lgL+ ++V+ WF+N R+ 
  XP_022974200.1 508 VEKLRQVFAENELPSRDVKENLSIELGLDAEKVSKWFKNARYS 550
                     789**************************************85 PP

2    PHD       35.8   1.9e-12  239    294  1          51
                     SBTTTSS..TCTTSSEEEBS.SSSSEEETTTSTSSSSHHSHHSS..TBSSHHHHTT CS
             PHD   1 rCkvCgk..sdeegelvlCd.gCkewfHlkClglkleseekpeg..ewlCeeCkek 51 
                     +C+ C+   +  +++++lCd +C+ +fH+kCl+++l++e++p g   w+C+ C++k
  XP_022974200.1 239 FCAKCKLreAFPDNDIILCDgTCNCAFHQKCLDPPLDTENIPPGdqGWFCKFCECK 294
                     699999854455********56********************9999*******985 PP

Sequence
Protein Sequence    Length: 716 aa
MRGTGRRLTQ KESGKCSHSK METGSELILP LKLKRCSKIS HSKQKKSRTK SHAQEIGSTL  60
KRRPFPKSLS KGNKNVTIRQ LAGKKFLLKK LNSKYTKDLL LSKLQGGKSL PSPSTEGNAE  120
KVEPVTKINQ QRKRRKNKGK REKVELDEAS RLQRRTRYLI IKIKLEQNLI DAYSGEGWKG  180
QSREKIRPEK ELQRAMEQIL QCKLGIRDAI RQLDLLGSVG CIEDSVIGPD GSVYHEHIFC  240
AKCKLREAFP DNDIILCDGT CNCAFHQKCL DPPLDTENIP PGDQGWFCKF CECKMEILEG  300
MNAHLGTRFS MNVSWEDIFK EEAAFPDGRN ASLNHEEDWP SDDSADDDYD PDKKEIGYDN  360
PSGEENDKDV FEESSSSTSL SWSLDGEDLT DRDSIGCEDH FGASSSIVSD GSNEEGITGG  420
RRQRQAVDYK KLYVEMFGKD STAHEQVSED EDWGPAKRRR REKECDAAST LMSLCESEKK  480
SKKIDVEAEK RPLNSQSRSF FRIPLYAVEK LRQVFAENEL PSRDVKENLS IELGLDAEKV  540
SKWFKNARYS ALRTRKAEGA TQSHSPNKTL NEPRLADSKE MSACPPSSED APIKELQLKS  600
RNSHYKKKQH RKSSLVSSNN NKDALDSGDD ISLKNLLKNR KAKVKKRVKF VARGGDRGQE  660
AEVEMERLCK IKGRLEIMKQ KLLRLSNKKE DGVLDRSHMF EQSIVYVPVA VLKEKV
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    45     51   KKSRTKS
2    135    142  RKNKGKRE
3    457    483  KRRRREKECDAASTLMSLCESEKKSKK
4    458    463  RRRREK
5    639    646  NRKAKVKK
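
The Start and End values in the NLS table read as 1-based, inclusive residue positions in the protein sequence shown above. The following is a minimal Python sketch of that coordinate convention, not part of the database record: it strips the spacing and position counters from the first three rows of the sequence block and checks the first two NLS entries against the resulting string. The helper names `parse_blocked_sequence` and `residues` are hypothetical and introduced only for illustration.

```python
import re

# First three rows (residues 1-180) of the protein sequence block, copied as
# displayed, including the 10-residue grouping and trailing position counters.
SEQUENCE_BLOCK = """
MRGTGRRLTQ KESGKCSHSK METGSELILP LKLKRCSKIS HSKQKKSRTK SHAQEIGSTL  60
KRRPFPKSLS KGNKNVTIRQ LAGKKFLLKK LNSKYTKDLL LSKLQGGKSL PSPSTEGNAE  120
KVEPVTKINQ QRKRRKNKGK REKVELDEAS RLQRRTRYLI IKIKLEQNLI DAYSGEGWKG  180
"""


def parse_blocked_sequence(text: str) -> str:
    """Drop whitespace and position counters, leaving the bare residue string."""
    return re.sub(r"[\s\d]", "", text)


def residues(seq: str, start: int, end: int) -> str:
    """Return residues start..end, both 1-based and inclusive (hypothetical helper)."""
    return seq[start - 1:end]


seq = parse_blocked_sequence(SEQUENCE_BLOCK)

# NLS entries 1 and 2 as (start, end, reported sequence).
for start, end, reported in [(45, 51, "KKSRTKS"), (135, 142, "RKNKGKRE")]:
    print(start, end, residues(seq, start, end) == reported)
```

The same slicing applies to the Start and End columns of the signature domain table, provided the full 716-residue sequence is supplied instead of the excerpt used here.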
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G29940.1  1e-158   HB-PHD family protein