PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID OMO94393
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus
Family HB-PHD
Protein Properties Length: 723 aa    MW: 81460.5 Da    pI: 10.0223
Description HB-PHD family protein
Gene Model
Gene Model ID  Type    Source         Coding Sequence
OMO94393       genome  EnsemblPlants  View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  34.2   4.4e-11  503    548  8          53
               -HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHH CS
  Homeobox   8 tkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRa 53 
                ++ +e+L ++F++n  ps+  r  L+k+lgL+ ++V  WF+N R+
  OMO94393 503 PPTAVERLRQVFAENELPSKVIRDNLSKELGLEPEKVNKWFKNARY 548
               47889***************************************97 PP

2    PHD       40.8   5.1e-14  239    294  1          51
               SBTTTSS..TCTTSSEEEBS.SSSSEEETTTSTSSSSHHSHHSS..TBSSHHHHTT CS
       PHD   1 rCkvCgk..sdeegelvlCd.gCkewfHlkClglkleseekpeg..ewlCeeCkek 51 
               +C+ C+   +  ++++vlCd +C+++fH+kCl+++l++e++p g   w+C+ C++k
  OMO94393 239 FCAKCKLreAFPDNDIVLCDgTCNRAFHQKCLDPPLDTENIPPGeqGWFCKFCECK 294
               699999854455********56********************9999*******985 PP

Sequence
Protein Sequence    Length: 723 aa
MRGNGKKVTD HGSAKSSSSK KEAGSKLIAS LQFKKRSKIS HGRVRKPKCH VKKVGSALLK  60
RKVSASVSKG NLTGNNVSSA KKAGNKLNLQ KSNKKGCSKK LNSSKQHGND AAVGSSEENG  120
KKANGDLRTK KLTKKKKKKQ KDKVEVDEAS RLQRRTRYLL IKMKLEQNLI DAYSGEGWKG  180
QSREKIKPEK ELQRAEKQIL DCKLGIRDAI RQLDSLSSVG SIEGSVIAPD GSVYHEHIFC  240
AKCKLREAFP DNDIVLCDGT CNRAFHQKCL DPPLDTENIP PGEQGWFCKF CECKMEIIEA  300
MNAHIGTHFS VDSHWQDIFK DEAAFPDGAI ALLNPEEEWP SDDSEDDDYD PERTEKSCSI  360
SGAATDGDQS DDTDSSTSLS WSLDGEDFSE SGRRENHSVD SAADSCETSD GEIISGRRRR  420
REVDYRQLYD EMFGKDAPPY EQVSEDEDWG PSKRKRREKE SDAASTLMTL YESETKFPNV  480
ETTEMRRQLP SNLKSKRTFF RIPPTAVERL RQVFAENELP SKVIRDNLSK ELGLEPEKVN  540
KWFKNARYLA LKSRKVEKAD QLQSSPRVSK ESGVESPKSK DPDIVALEDM SKATLSPTSK  600
ILKKKVRKSP KSNSLHSSLK RSLRDQVSPA KSSKVSKDLS DDVILKKLLK VKKKRDKKRI  660
IVAGGLQEFE LEMERLCRAK VRLERMQQTL LRLESGKARK MNKKRLHEES VIYIPIAELK  720
EKA
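The sequence block above groups residues in tens and ends each full line with a running residue count. As a minimal sketch, assuming the block has been copied into a Python string, the following strips the spacing and counters and cuts out the two signature domains by their 1-based, inclusive Start/End coordinates. The names `parse_sequence_block` and `region` are illustrative and not part of any PlantTFDB tool.

```python
import re

# Protein sequence block as displayed for OMO94393
# (groups of 10 residues, running residue count at the end of each full line).
SEQUENCE_BLOCK = """
MRGNGKKVTD HGSAKSSSSK KEAGSKLIAS LQFKKRSKIS HGRVRKPKCH VKKVGSALLK  60
RKVSASVSKG NLTGNNVSSA KKAGNKLNLQ KSNKKGCSKK LNSSKQHGND AAVGSSEENG  120
KKANGDLRTK KLTKKKKKKQ KDKVEVDEAS RLQRRTRYLL IKMKLEQNLI DAYSGEGWKG  180
QSREKIKPEK ELQRAEKQIL DCKLGIRDAI RQLDSLSSVG SIEGSVIAPD GSVYHEHIFC  240
AKCKLREAFP DNDIVLCDGT CNRAFHQKCL DPPLDTENIP PGEQGWFCKF CECKMEIIEA  300
MNAHIGTHFS VDSHWQDIFK DEAAFPDGAI ALLNPEEEWP SDDSEDDDYD PERTEKSCSI  360
SGAATDGDQS DDTDSSTSLS WSLDGEDFSE SGRRENHSVD SAADSCETSD GEIISGRRRR  420
REVDYRQLYD EMFGKDAPPY EQVSEDEDWG PSKRKRREKE SDAASTLMTL YESETKFPNV  480
ETTEMRRQLP SNLKSKRTFF RIPPTAVERL RQVFAENELP SKVIRDNLSK ELGLEPEKVN  540
KWFKNARYLA LKSRKVEKAD QLQSSPRVSK ESGVESPKSK DPDIVALEDM SKATLSPTSK  600
ILKKKVRKSP KSNSLHSSLK RSLRDQVSPA KSSKVSKDLS DDVILKKLLK VKKKRDKKRI  660
IVAGGLQEFE LEMERLCRAK VRLERMQQTL LRLESGKARK MNKKRLHEES VIYIPIAELK  720
EKA
"""


def parse_sequence_block(block: str) -> str:
    """Join the residue groups, dropping whitespace and the running counters."""
    return "".join(re.findall(r"[A-Z]+", block))


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


protein = parse_sequence_block(SEQUENCE_BLOCK)
homeobox = region(protein, 503, 548)  # Signature Domain row 1
phd = region(protein, 239, 294)       # Signature Domain row 2

print(len(protein))
print(homeobox)
print(phd)
```

For this record the parsed length agrees with the stated 723 aa, and the two slices reproduce the OMO94393 rows of the domain alignments once the alignment's lowercase letters are uppercased.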
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    130    139  KKLTKKKKKK
2    134    164  KKKKKKQKDKVEVDEASRLQRRTRYLLIKMK
3    135    165  KKKKKKQKDKVEVDEASRLQRRTRYLLIKMK
4    136    166  KKKKKKQKDKVEVDEASRLQRRTRYLLIKMK
5    454    459  RKRREK
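The Start and End columns use the same 1-based, inclusive numbering as the sequence block. The sketch below checks each NLS row against the residues it should cover. To stay short it embeds only the two 60-residue sequence lines that contain the signals, keyed by the position of their first residue. The helper name `slice_residues` is illustrative.

```python
# Sequence lines from the record, keyed by the 1-based position of their first residue.
SEGMENTS = {
    121: "KKANGDLRTKKLTKKKKKKQKDKVEVDEASRLQRRTRYLLIKMKLEQNLIDAYSGEGWKG",
    421: "REVDYRQLYDEMFGKDAPPYEQVSEDEDWGPSKRKRREKESDAASTLMTLYESETKFPNV",
}

# (No., Start, End, Sequence) rows as listed in the NLS table.
NLS_ROWS = [
    (1, 130, 139, "KKLTKKKKKK"),
    (2, 134, 164, "KKKKKKQKDKVEVDEASRLQRRTRYLLIKMK"),
    (3, 135, 165, "KKKKKKQKDKVEVDEASRLQRRTRYLLIKMK"),
    (4, 136, 166, "KKKKKKQKDKVEVDEASRLQRRTRYLLIKMK"),
    (5, 454, 459, "RKRREK"),
]


def slice_residues(start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from the embedded segments."""
    for first, residues in SEGMENTS.items():
        last = first + len(residues) - 1
        if first <= start and end <= last:
            return residues[start - first:end - first + 1]
    raise ValueError(f"positions {start}-{end} are not covered by SEGMENTS")


# Map each row number to whether its listed sequence equals the slice at its coordinates.
agreement = {no: slice_residues(s, e) == listed for no, s, e, listed in NLS_ROWS}
for no, ok in agreement.items():
    print(no, "matches" if ok else "differs")
```

With the values above, rows 1, 2 and 5 agree with the protein sequence, while rows 3 and 4 repeat the sequence of row 2 under coordinates shifted by one and two residues.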
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G29940.2  1e-168   HB-PHD family protein