PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cla016237
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Citrullus
Family HD-ZIP
Protein Properties Length: 877aa    MW: 97973 Da    PI: 6.1238
Description HD-ZIP family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
Cla016237      genome  ICuGI   View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  63.4   3.2e-20  93     148  1          56
                TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
   Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                +++ +++t+ q++e+e+lF+++++p+ ++r +L+++lgL+ rqVk+WFqNrR+++k
  Cla016237  93 KKRYHRHTARQIQEMEALFKECPHPDDKQRLKLSQELGLKPRQVKFWFQNRRTQMK 148
                688899***********************************************998 PP

2    START     100.6  2.7e-32  293    476  2          163
                HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS...............SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEE CS
      START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv..............dsgealrasgvvdmvlallveellddkeqWdetla....kaetle 83 
                la +  +elvk+  ++ep+Wv++s  e g+e l  +e ++               +++ea r+++vv+m++ +lv  +ld + +W e ++    ka+t++
  Cla016237 293 LAVSSIAELVKMCRSTEPLWVRDS--ESGKEILNVEEHGRMfpwplnlkqhlineFRTEATRDTAVVIMNSITLVDAFLDAN-KWMELFPsivaKAKTVQ 389
                6788899*****************..99999999998888899***************************************.******99999****** PP

                EECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE..TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEE CS
      START  84 vissg......galqlmvaelqalsplvp.RdfvfvRyirq.lgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghsk 163
                viss       + lqlm+aelq lsplvp R+  f+R+++q   +g+w++vd  +ds  +   ++s+ R ++ pSg++i++++ng+s+
  Cla016237 390 VISSSvsghatSSLQLMYAELQTLSPLVPtREAHFLRCCQQnADEGSWTVVDFPIDSFHDSL-QHSFPRYRRKPSGCIIQDMPNGYSR 476
                *****************************************9999*************9987.9**********************97 PP

3    START     106.1  5.8e-34  477    590  94         206
                EEEXXTTXX-SSX.EEEEEEEEEEE..TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHH CS
      START  94 mvaelqalsplvp.RdfvfvRyirq.lgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlphwllrslvksglae 191
                m+aelq lsplvp R+  f+R+++q   +g+w++vd  +ds  +   ++s+ R ++ pSg++i++++ng+s+vtwveh++ +++ +h++++ +v+sg+a+
  Cla016237 477 MYAELQTLSPLVPtREAHFLRCCQQnADEGSWTVVDFPIDSFHDSL-QHSFPRYRRKPSGCIIQDMPNGYSRVTWVEHAEIEEKPIHQIFNNFVHSGMAF 575
                89***********************9999*************9987.9**************************************************** PP

                HHHHHHHHTXXXXXX CS
      START 192 gaktwvatlqrqcek 206
                ga +w+a lqrqce+
  Cla016237 576 GAHRWLAILQRQCER 590
                *************97 PP
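The alignment rows can be sanity-checked against the coordinates in the domain table. The following is a minimal Python sketch, not part of PlantTFDB itself: it takes the consensus and target rows of the Homeobox alignment (domain 1) as copied from this record, confirms that the number of ungapped target residues matches End - Start + 1, and counts the columns where the target residue equals the consensus residue. The helper names `ungapped_length` and `identities` are hypothetical and chosen only for illustration.

```python
# Rows copied from the Homeobox alignment (domain 1) in this record.
CONSENSUS = "rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek"  # HMM 1-56
TARGET = "KKRYHRHTARQIQEMEALFKECPHPDDKQRLKLSQELGLKPRQVKFWFQNRRTQMK"  # Cla016237 93-148
START, END = 93, 148  # protein coordinates from the domain table


def ungapped_length(row: str) -> int:
    """Count residues in an alignment row, ignoring gap characters."""
    return sum(1 for ch in row if ch not in "-.")


def identities(consensus: str, target: str) -> int:
    """Count aligned columns where target and consensus residues are equal."""
    return sum(
        1
        for c, t in zip(consensus, target)
        if c not in "-." and c.upper() == t.upper()
    )


# The ungapped target length should equal the reported span.
span_ok = ungapped_length(TARGET) == END - START + 1
n_ident = identities(CONSENSUS, TARGET)

print(f"span consistent with table: {span_ok}")
print(f"identical columns: {n_ident} of {len(TARGET)}")
```

The same two checks apply to the START alignments, provided the gap characters (`-` and `.`) in the wrapped rows are kept when the rows are joined.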

Protein Features
3D Structure
Database         Entry ID           E-value    Start  End  InterPro ID  Description
SuperFamily      SSF46689           5.85E-19   81     151  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60   1.0E-20    89     155  IPR009057    Homeodomain-like
PROSITE profile  PS50071            17.394     90     150  IPR001356    Homeobox domain
SMART            SM00389            6.0E-19    91     154  IPR001356    Homeobox domain
CDD              cd00086            3.79E-18   93     151  No hit       No description
Pfam             PF00046            9.6E-18    93     148  IPR001356    Homeobox domain
PROSITE pattern  PS00027            0          125    148  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848            43.642     283    593  IPR002913    START domain
SuperFamily      SSF55961           3.43E-23   285    481  No hit       No description
CDD              cd08875            7.69E-104  287    589  No hit       No description
SMART            SM00234            2.2E-25    292    590  IPR002913    START domain
Pfam             PF01852            1.2E-25    293    476  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  4.9E-4     375    474  IPR023393    START-like domain
Pfam             PF01852            3.7E-28    477    590  IPR002913    START domain
SuperFamily      SSF55961           1.65E-19   477    592  No hit       No description
Gene3D           G3DSA:3.30.530.20  9.9E-5     480    574  IPR023393    START-like domain
SuperFamily      SSF55961           2.47E-15   627    845  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 877 aa
MYGDCQVMTS NMGGNMVSSE SLFSSPIQNP NFNFMSNFQH FPSVVPKEEN GLMMRGKEDM  60
ESGSGSEQLV EENQGIEMES NNNNDIIQQN QKKKRYHRHT ARQIQEMEAL FKECPHPDDK  120
QRLKLSQELG LKPRQVKFWF QNRRTQMKAQ QDRSDNVILR AENESLKNEN YRLQTALRNI  180
ICPSCGGQGI LGEPSLDEQQ LRLENARLRD QLEQVCSLTT RYTGRPIQGM PSTAPLMQPS  240
LDLDMNIYSR QYTEAIVSSS EMMSLPSMLP PETAHFPEGG LLIEEEKTLA MELAVSSIAE  300
LVKMCRSTEP LWVRDSESGK EILNVEEHGR MFPWPLNLKQ HLINEFRTEA TRDTAVVIMN  360
SITLVDAFLD ANKWMELFPS IVAKAKTVQV ISSSVSGHAT SSLQLMYAEL QTLSPLVPTR  420
EAHFLRCCQQ NADEGSWTVV DFPIDSFHDS LQHSFPRYRR KPSGCIIQDM PNGYSRMYAE  480
LQTLSPLVPT REAHFLRCCQ QNADEGSWTV VDFPIDSFHD SLQHSFPRYR RKPSGCIIQD  540
MPNGYSRVTW VEHAEIEEKP IHQIFNNFVH SGMAFGAHRW LAILQRQCER IASLMARNIS  600
DLGVIPSPEA RQNLMKLAQR MIRTFSVNIS TSGGQSWTAL SDSPDDTVRI TTRKVVEPGQ  660
PNGVILSAVS TTWLPYPHYR VFDLLRDERR RSQLEVLSNG NSLHEVAHIA NGSHPGNCIS  720
LLRINVASNS SQHVELMLQE SCTDQSGSLV IYATIDVDSI QLAMSGEDPS CIPLLPIGFS  780
IVPAIGSTVG GHPAPPPEDG TTNANSSCLL TVGLQVLAST IPSAKLNLSS VTAINNHLCN  840
TVHQINIALG SPGRLENGNN VAEPNNAPTP PPPPPKQ
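The sequence block groups residues in tens and ends each full line with a running position count, so it has to be cleaned before the domain coordinates can be applied to it. The following is a minimal Python sketch under that assumption about the layout. The helper names `parse_sequence_block` and `region` are hypothetical. The sketch strips the counts and whitespace, checks the reported length, and slices out the three signature-domain regions using the 1-based, inclusive coordinates from the domain table. It also compares residues 406-476 with residues 477-547. In the sequence as printed here these two stretches are identical, and the boundary between them coincides with the boundary between the two START hits (476/477).

```python
import re

# Sequence block copied from this record (ten-residue groups, position counts).
RAW = """
MYGDCQVMTS NMGGNMVSSE SLFSSPIQNP NFNFMSNFQH FPSVVPKEEN GLMMRGKEDM  60
ESGSGSEQLV EENQGIEMES NNNNDIIQQN QKKKRYHRHT ARQIQEMEAL FKECPHPDDK  120
QRLKLSQELG LKPRQVKFWF QNRRTQMKAQ QDRSDNVILR AENESLKNEN YRLQTALRNI  180
ICPSCGGQGI LGEPSLDEQQ LRLENARLRD QLEQVCSLTT RYTGRPIQGM PSTAPLMQPS  240
LDLDMNIYSR QYTEAIVSSS EMMSLPSMLP PETAHFPEGG LLIEEEKTLA MELAVSSIAE  300
LVKMCRSTEP LWVRDSESGK EILNVEEHGR MFPWPLNLKQ HLINEFRTEA TRDTAVVIMN  360
SITLVDAFLD ANKWMELFPS IVAKAKTVQV ISSSVSGHAT SSLQLMYAEL QTLSPLVPTR  420
EAHFLRCCQQ NADEGSWTVV DFPIDSFHDS LQHSFPRYRR KPSGCIIQDM PNGYSRMYAE  480
LQTLSPLVPT REAHFLRCCQ QNADEGSWTV VDFPIDSFHD SLQHSFPRYR RKPSGCIIQD  540
MPNGYSRVTW VEHAEIEEKP IHQIFNNFVH SGMAFGAHRW LAILQRQCER IASLMARNIS  600
DLGVIPSPEA RQNLMKLAQR MIRTFSVNIS TSGGQSWTAL SDSPDDTVRI TTRKVVEPGQ  660
PNGVILSAVS TTWLPYPHYR VFDLLRDERR RSQLEVLSNG NSLHEVAHIA NGSHPGNCIS  720
LLRINVASNS SQHVELMLQE SCTDQSGSLV IYATIDVDSI QLAMSGEDPS CIPLLPIGFS  780
IVPAIGSTVG GHPAPPPEDG TTNANSSCLL TVGLQVLAST IPSAKLNLSS VTAINNHLCN  840
TVHQINIALG SPGRLENGNN VAEPNNAPTP PPPPPKQ
"""


def parse_sequence_block(block: str) -> str:
    """Drop position counts and whitespace, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


SEQ = parse_sequence_block(RAW)

# Domain coordinates taken from the Signature Domain table of this record.
homeobox = region(SEQ, 93, 148)
start_first = region(SEQ, 293, 476)
start_second = region(SEQ, 477, 590)

# Compare the two adjacent 71-residue stretches around the 476/477 boundary.
repeat_identical = region(SEQ, 406, 476) == region(SEQ, 477, 547)

print(f"length: {len(SEQ)}")
print(f"Homeobox region starts with: {homeobox[:10]}")
print(f"START regions: {len(start_first)} and {len(start_second)} residues")
print(f"406-476 identical to 477-547: {repeat_identical}")
```

Whether the repeated stretch reflects the underlying gene or an artifact of the gene model cannot be decided from this record alone.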
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  LN681824  1e-168   LN681824.1 Cucumis melo genomic scaffold, anchoredscaffold01596.
GenBank  LN713258  1e-168   LN713258.1 Cucumis melo genomic chromosome, chr_4.
Annotation -- Protein
Source     Hit ID                 E-value  Description
Refseq     XP_018681950.1         0.0      PREDICTED: homeobox-leucine zipper protein ROC3-like isoform X1
Swissprot  Q9FJS2                 1e-177   HDG5_ARATH; Homeobox-leucine zipper protein HDG5
TrEMBL     M0SQ64                 0.0      M0SQ64_MUSAM; Uncharacterized protein
STRING     GSMUA_Achr4P19010_001  0.0      (Musa acuminata)
Orthologous Group
Lineage Orthologous Group ID Taxa Number Gene Number
FabidsOGEF109463036
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G46880.1  1e-179   homeobox-7
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Lung SC, et al.
    Arabidopsis ACYL-COA-BINDING PROTEIN1 interacts with STEROL C4-METHYL OXIDASE1-2 to modulate gene expression of homeodomain-leucine zipper IV transcription factors.
    New Phytol., 2018. 218(1): p. 183-200
    [PMID:29288621]