PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cucsa.086030.1
Common NameCsa_1G064670, LOC101212604
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Cucumis
Family HD-ZIP
Protein Properties Length: 742aa    MW: 82041.9 Da    PI: 5.9532
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cucsa.086030.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox60.62.5e-1964119156
                     TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                     r++ +++t+ q++e+e++F+++++p+ ++r++L+++lgL+  qVk+WFqN+R++ k
  Cucsa.086030.1  64 RKRYHRHTQLQIQEMEAFFKECPHPDDKQRKQLSRELGLEPLQVKFWFQNKRTQIK 119
                     789999**********************************************9877 PP

2START188.14.4e-592574761206
                     HHHHHHHHHHHHHHHHC-TT-EEEE...EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEE CS
           START   1 elaeeaaqelvkkalaeepgWvkss...esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetle 83 
                     ela +a++e+ ++a+++ep+Wv      e++n+de+l++ ++  +      ++ea+r + ++  ++ +lv +l+d++ qW++ +     +a tle
  Cucsa.086030.1 257 ELAVSAMEEVCRMAQEGEPLWVVGEnsmEMLNEDEYLRTYSTRIGprivgLTSEASRQTSILAFNHLKLVHILMDVN-QWSTIFCgivsRALTLE 350
                     57899*****************988889************99988********************************.******99999****** PP

                     EECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE CS
           START  84 vissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvd 171
                     v+ssg      galq+m+ae+q++splvp R+ +fvRy++q+g+g+w++vdvS+d  ++ p+     R +++pSg+li++++ng+skvtwvehv+
  Cucsa.086030.1 351 VLSSGvggdynGALQVMTAEFQVPSPLVPtRENYFVRYCKQQGEGSWAVVDVSLDYLRPTPT----SRTRRRPSGCLIQELPNGYSKVTWVEHVE 441
                     ***********************************************************995....78889************************ PP

                     --SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
           START 172 lkgrlphwllrslvksglaegaktwvatlqrqcek 206
                     +++r +h+l++ +v  gla+gak+w+atl+rqc++
  Cucsa.086030.1 442 VDDRAVHSLYKGVVTCGLAFGAKRWMATLGRQCQR 476
                     *********************************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.608.3E-2245119IPR009057Homeodomain-like
SuperFamilySSF466895.01E-1951122IPR009057Homeodomain-like
PROSITE profilePS5007116.7361121IPR001356Homeobox domain
SMARTSM003891.1E-1762125IPR001356Homeobox domain
CDDcd000863.05E-1863122No hitNo description
PfamPF000466.3E-1764119IPR001356Homeobox domain
PROSITE patternPS00027096119IPR017970Homeobox, conserved site
SuperFamilySSF559613.72E-33248477No hitNo description
PROSITE profilePS5084842.172248479IPR002913START domain
CDDcd088752.19E-111252475No hitNo description
SMARTSM002343.1E-59257476IPR002913START domain
PfamPF018522.1E-49258476IPR002913START domain
Gene3DG3DSA:3.30.530.201.1E-4360443IPR023393START-like domain
SuperFamilySSF559615.5E-23503731No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 742 aa     Download sequence    Send to blast
MFGAHGFEDH HHQDDLLLEM TQKNFETELE KFGEDEFESR SVTDAMDAPL GEEQCDLLNQ  60
RNKRKRYHRH TQLQIQEMEA FFKECPHPDD KQRKQLSREL GLEPLQVKFW FQNKRTQIKA  120
QQERHENAIL KAQNEKLRAE NMRYKEALSN TSCPNCGGPA ALGEMSFDAQ HLRIDNAHLR  180
DEIERLNGNN KYGGKGWGSH SSHIVSCGGQ VGRSSLKPQQ LQGDDHLLGD MYGETTTGMM  240
LKSSSVTTEI DKPVIVELAV SAMEEVCRMA QEGEPLWVVG ENSMEMLNED EYLRTYSTRI  300
GPRIVGLTSE ASRQTSILAF NHLKLVHILM DVNQWSTIFC GIVSRALTLE VLSSGVGGDY  360
NGALQVMTAE FQVPSPLVPT RENYFVRYCK QQGEGSWAVV DVSLDYLRPT PTSRTRRRPS  420
GCLIQELPNG YSKVTWVEHV EVDDRAVHSL YKGVVTCGLA FGAKRWMATL GRQCQRLTNS  480
SSTNIPALDI CVVTGQEGRK SVMKLAERMV RSFCSGVGAA TAHNWTTLST IDSDDVRVMA  540
RKSLDDPGRP PGIVLNAATS FWIPIPPNRV FNFLRDQNTR NQWDILSNGG LVQEMARIGN  600
DRNSGNCVSL LRVNSANSSQ SNMLILQESC SDDISGSYII YAPVDTAAMN MVLSGGDPDY  660
VALLPSGFAI LPDGPPIGPE GPPGILEFGA GGSLLTVAFQ ILVDSVPTAK LSLGSVATVN  720
SLIKCTVERI RAALMCDQPI N*
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds to the L1 box DNA sequence 5'-TAAATG[CT]A-3'. Plays a role in maintaining the identity of L1 cells, possibly by interacting with their L1 box or other target-gene promoters. Functionally redundant to ATML1. {ECO:0000269|PubMed:12505995}.
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankLN6819320.0LN681932.1 Cucumis melo genomic scaffold, anchoredscaffold00001.
GenBankLN7132660.0LN713266.1 Cucumis melo genomic chromosome, chr_12.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_004154226.20.0PREDICTED: homeobox-leucine zipper protein MERISTEM L1
SwissprotQ93V990.0PDF2_ARATH; Homeobox-leucine zipper protein PROTODERMAL FACTOR 2
TrEMBLA0A0A0LRM50.0A0A0A0LRM5_CUCSA; Uncharacterized protein
STRINGXP_008441375.10.0(Cucumis melo)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF2480333
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G21750.20.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Ren Y, et al.
    An integrated genetic and cytogenetic map of the cucumber genome.
    PLoS ONE, 2009. 4(6): p. e5795
    [PMID:19495411]
  3. Guo S, et al.
    Transcriptome sequencing and comparative analysis of cucumber flowers with different sex types.
    BMC Genomics, 2010. 11: p. 384
    [PMID:20565788]
  4. Li Z, et al.
    RNA-Seq improves annotation of protein-coding genes in the cucumber genome.
    BMC Genomics, 2011. 12: p. 540
    [PMID:22047402]
  5. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]