PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cla006925
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Citrullus
Family HD-ZIP
Protein Properties Length: 752 aa    MW: 80599.3 Da    pI: 6.6653
Description HD-ZIP family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
Cla006925      genome  ICuGI   View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  65.2   9.2e-21  134    189  1          56
                TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
   Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                +++ +++t++q++eLe++F+++++p++++r eL+++l L++rqVk+WFqNrR+++k
  Cla006925 134 KKRYHRHTPQQIQELEAVFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQMK 189
                688999***********************************************999 PP

2    START     160.1  1.7e-50  332    484  67         206
                CCGGCT-TT-S....EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--...-TTSEE-EESSEEEE CS
      START  67 ddkeqWdetla....kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe..sssvvRaellpSgil 153
                d++ +W e+++    + +t++vissg      galqlm aelq+lsplvp R++ f+R+++q+ +g+w++vdvSvd  ++ p+   +s+  +++lpSg++
  Cla006925 332 DRSNRWAEMFPcmiaRTTTTDVISSGmggtrnGALQLMHAELQVLSPLVPvREVNFLRFCKQHAEGVWAVVDVSVDTMRETPTsgGPSFGNCRRLPSGCV 431
                56669************************************************************************999999999************** PP

                EEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
      START 154 iepksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                +++++ng+skvtwveh++++++++h+l+r+l++sg+ +ga++wvatlqrqce+
  Cla006925 432 VQDMPNGYSKVTWVEHAEYDDSQVHQLYRPLLSSGMGFGAQRWVATLQRQCEC 484
                ***************************************************96 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SuperFamily      SSF46689          7.7E-21   116    191  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  3.9E-22   121    191  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.2      131    191  IPR001356    Homeobox domain
SMART            SM00389           6.3E-18   132    195  IPR001356    Homeobox domain
CDD              cd00086           7.32E-19  133    191  No hit       No description
Pfam             PF00046           2.5E-18   134    189  IPR001356    Homeobox domain
PROSITE pattern  PS00027           0         166    189  IPR017970    Homeobox, conserved site
SMART            SM00234           1.9E-26   270    484  IPR002913    START domain
CDD              cd08875           3.41E-85  331    483  No hit       No description
PROSITE profile  PS50848           30.607    333    487  IPR002913    START domain
Pfam             PF01852           4.3E-44   334    484  IPR002913    START domain
SuperFamily      SSF55961          2.51E-23  335    484  No hit       No description
SuperFamily      SSF55961          1.41E-23  512    745  No hit       No description
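Several databases report hits over nearly the same residues, so the table can be read as a small number of distinct regions. The following is a minimal Python sketch, not something PlantTFDB itself provides: it assumes the Start and End columns are 1-based inclusive coordinates and merges overlapping hits. The name `merge_regions` and the `HITS` list are illustrative only.

```python
# (database, start, end) copied from the Protein Features table.
# Coordinates are assumed to be 1-based and inclusive.
HITS = [
    ("SuperFamily", 116, 191),
    ("Gene3D", 121, 191),
    ("PROSITE profile", 131, 191),
    ("SMART", 132, 195),
    ("CDD", 133, 191),
    ("Pfam", 134, 189),
    ("PROSITE pattern", 166, 189),
    ("SMART", 270, 484),
    ("CDD", 331, 483),
    ("PROSITE profile", 333, 487),
    ("Pfam", 334, 484),
    ("SuperFamily", 335, 484),
    ("SuperFamily", 512, 745),
]


def merge_regions(hits):
    """Merge overlapping intervals into contiguous (start, end) regions."""
    regions = []
    for _, start, end in sorted(hits, key=lambda h: h[1]):
        if regions and start <= regions[-1][1]:
            # Overlaps the current region: extend its end if needed.
            regions[-1][1] = max(regions[-1][1], end)
        else:
            # No overlap: open a new region.
            regions.append([start, end])
    return [tuple(r) for r in regions]


REGIONS = merge_regions(HITS)
print(REGIONS)
```

Under these assumptions the hits collapse into three regions: one covering the Homeobox hits, one covering the START hits, and one C-terminal SuperFamily hit with no InterPro assignment.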
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 752 aa
MSFGGFLDGG GGGGGGGARI LADLPYTNNS TTNANNNPTG GIGVGGNMSS GAIAPPRLIT  60
QSLTKSMFNS PGLSLALTNM DGGQGDLAAR LPEGFEHNVG RRGREEEHES RSGSDNMDGG  120
SGDDQDAADN PPRKKRYHRH TPQQIQELEA VFKECPHPDE KQRLELSRRL CLETRQVKFW  180
FQNRRTQMKT QLERHENTLL RQENDKLRAE NMSIRDAMRN PICSNCGGPA IIGEISLEEQ  240
QLRIENARLK DELDRVCALA GKFLGRPISS LANSIAPPLP SSSLELGVGS NGFGSLTMAT  300
SMPIGPDFGG GLSGNLAVVQ PPARPTPGMG LDRSNRWAEM FPCMIARTTT TDVISSGMGG  360
TRNGALQLMH AELQVLSPLV PVREVNFLRF CKQHAEGVWA VVDVSVDTMR ETPTSGGPSF  420
GNCRRLPSGC VVQDMPNGYS KVTWVEHAEY DDSQVHQLYR PLLSSGMGFG AQRWVATLQR  480
QCECLAILMS SAVPIRDHTA ITAGGRRSML KLAQRMTANF CAGVCASTVH KWNKLNAGSV  540
DEDVRVMTRK SVDDPGEPPG IVLSAATSVW LPVSPQRLFD FLRDERLRSE WDILSNGGPM  600
QEMAHIAKGQ DHGNCVSLLR ASAMNANQSS MLILQETCID AAGSLVVYAP VDIPAMHVVM  660
NGGDSAYVAL LPSGFAIVPD GAVTVTNGSS PSGGEGPQSQ RATGGGSLLT VAFQILVNSL  720
PTAKLTVESV ETVNNLISCT VQKIKAALQC ET
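The Start and End values in the Signature Domain table appear to be 1-based, inclusive positions in this protein sequence. A minimal Python sketch, under that assumption, strips the spacing and position counters from the sequence block and slices out the two domains. The helper names `clean_sequence` and `domain` are illustrative and not part of PlantTFDB.

```python
import re

# Sequence block copied from the record, including spacing and position counters.
RAW = """
MSFGGFLDGG GGGGGGGARI LADLPYTNNS TTNANNNPTG GIGVGGNMSS GAIAPPRLIT  60
QSLTKSMFNS PGLSLALTNM DGGQGDLAAR LPEGFEHNVG RRGREEEHES RSGSDNMDGG  120
SGDDQDAADN PPRKKRYHRH TPQQIQELEA VFKECPHPDE KQRLELSRRL CLETRQVKFW  180
FQNRRTQMKT QLERHENTLL RQENDKLRAE NMSIRDAMRN PICSNCGGPA IIGEISLEEQ  240
QLRIENARLK DELDRVCALA GKFLGRPISS LANSIAPPLP SSSLELGVGS NGFGSLTMAT  300
SMPIGPDFGG GLSGNLAVVQ PPARPTPGMG LDRSNRWAEM FPCMIARTTT TDVISSGMGG  360
TRNGALQLMH AELQVLSPLV PVREVNFLRF CKQHAEGVWA VVDVSVDTMR ETPTSGGPSF  420
GNCRRLPSGC VVQDMPNGYS KVTWVEHAEY DDSQVHQLYR PLLSSGMGFG AQRWVATLQR  480
QCECLAILMS SAVPIRDHTA ITAGGRRSML KLAQRMTANF CAGVCASTVH KWNKLNAGSV  540
DEDVRVMTRK SVDDPGEPPG IVLSAATSVW LPVSPQRLFD FLRDERLRSE WDILSNGGPM  600
QEMAHIAKGQ DHGNCVSLLR ASAMNANQSS MLILQETCID AAGSLVVYAP VDIPAMHVVM  660
NGGDSAYVAL LPSGFAIVPD GAVTVTNGSS PSGGEGPQSQ RATGGGSLLT VAFQILVNSL  720
PTAKLTVESV ETVNNLISCT VQKIKAALQC ET
"""


def clean_sequence(block: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based inclusive."""
    return seq[start - 1:end]


SEQ = clean_sequence(RAW)
homeobox = domain(SEQ, 134, 189)      # Homeobox coordinates from the record
start_domain = domain(SEQ, 332, 484)  # START coordinates from the record

print(len(SEQ))
print(homeobox)
print(start_domain)
```

The Homeobox slice can be compared against the Cla006925 row of the Homeobox alignment, which covers the same residues 134 to 189.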
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  LN681925  1e-179   LN681925.1 Cucumis melo genomic scaffold, anchoredscaffold00052.
GenBank  LN713265  1e-179   LN713265.1 Cucumis melo genomic chromosome, chr_11.
Annotation -- Protein
Source     Hit ID                 E-value  Description
Refseq     XP_022924365.1         0.0      homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X2
Swissprot  A2YR02                 0.0      ROC7_ORYSI; Homeobox-leucine zipper protein ROC7
TrEMBL     A0A0B0ML56             0.0      A0A0B0ML56_GOSAR; Homeobox-leucine zipper ROC6
STRING     GSMUA_Achr5P20050_001  0.0      (Musa acuminata)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Fabids   OGEF1192              34           106
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G52170.1  0.0      homeodomain GLABROUS 7