PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cc07_g12840
Common Name GSCOC_T00036825001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Gentianales; Rubiaceae; Ixoroideae; Coffeeae; Coffea
Family HD-ZIP
Protein Properties Length: 692aa    MW: 77978.9 Da    PI: 6.0786
Description HD-ZIP family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
Cc07_g12840    genome  CGSC    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  61.7   1.1e-19  26     81   1          56
                 TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
     Homeobox  1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
                 r++ +++++eq+++Lee+F+ +++p++++r++L+++lgL+ rq+k+WFqN+R++ k
  Cc07_g12840 26 RKQYHRHSAEQIQRLEEFFKDCPHPDENQRRQLSRELGLEPRQIKFWFQNKRTQTK 81
                 788999**********************************************9988 PP

2    START     122.3  6.4e-39  209    428  3          206
                  HHHHHHHHHHHHHHC-TT-EEEE.......EXCCTTEEEEEEESSS.....SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEE CS
        START   3 aeeaaqelvkkalaeepgWvkss.......esengdevlqkfeeskv....dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevi 85 
                  a  a++el+ +   +ep+Wvks+         e++d+ + k++        + +e +r+sg+v+ ++   + ++++d+e+W   ++    k+ t+ev+
  Cc07_g12840 209 AVNAMDELLELFRGNEPLWVKSPtneryliHRETYDKLYPKISH--InsssSWIESSRDSGLVPITAR-HLIDIFQDPEKWMDFFPtivtKVRTIEVL 303
                  6789*******************777666666666666666644..1568899*************99.8889999999***99999999******** PP

                  CTT...EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXX.H CS
        START  86 ssg...galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlp.h 178
                  ++g   g l +m  +l +lsplv+ R+  f+Ry rql   +wviv vS ds ++ + ++s+    ++pSg+li++++ng+ +v wvehv +++r+  h
  Cc07_g12840 304 DTGkrgGSLFVMHEKLHVLSPLVApRELAFLRYVRQLDSTTWVIVNVSYDSLKELE-DASSSQTWMFPSGCLIQDMPNGKANVAWVEHVQVDDRSLtH 400
                  ***********************99******************************9.9************************************999* PP

                  HHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
        START 179 wllrslvksglaegaktwvatlqrqcek 206
                   l++ +v  ++a gak+w  tlqr ce+
  Cc07_g12840 401 PLYKDMVCDSQAYGAKRWIVTLQRMCER 428
                  **************************97 PP

Protein Features
3D Structure
Database         Entry ID            E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60    9.6E-21   11     77   IPR009057    Homeodomain-like
SuperFamily      SSF46689            2.92E-18  13     83   IPR009057    Homeodomain-like
PROSITE profile  PS50071             17.459    23     83   IPR001356    Homeobox domain
SMART            SM00389             6.8E-17   25     87   IPR001356    Homeobox domain
Pfam             PF00046             3.8E-17   26     81   IPR001356    Homeobox domain
CDD              cd00086             1.79E-17  26     84   No hit       No description
PROSITE pattern  PS00027             0         58     81   IPR017970    Homeobox, conserved site
PROSITE profile  PS50848             43.226    198    431  IPR002913    START domain
SuperFamily      SSF55961            1.1E-29   199    429  No hit       No description
CDD              cd08875             3.39E-87  202    427  No hit       No description
Gene3D           G3DSA:3.30.530.20   1.0E-8    207    394  IPR023393    START-like domain
SMART            SM00234             8.8E-20   207    428  IPR002913    START domain
Pfam             PF01852             1.2E-31   209    428  IPR002913    START domain
SuperFamily      SSF55961            7.03E-5   450    622  No hit       No description
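The feature hits above can be grouped by which signature domain they overlap. The following is a minimal sketch, not part of the database output: the `Hit` type and the `overlap` and `assign` helpers are hypothetical names, the hit coordinates are transcribed from the Protein Features table (the split of run-together digits, for example 11 and 77 for the first Gene3D hit, is an assumption), and the signature domain ranges come from the Signature Domain table.

```python
from typing import NamedTuple, Optional

class Hit(NamedTuple):
    database: str
    entry: str
    start: int  # first residue, 1-based
    end: int    # last residue, inclusive

# Signature domain ranges from the Signature Domain table.
SIGNATURE = {"Homeobox": (26, 81), "START": (209, 428)}

# Transcribed from the Protein Features table (column splits are assumed).
HITS = [
    Hit("Gene3D", "G3DSA:1.10.10.60", 11, 77),
    Hit("SuperFamily", "SSF46689", 13, 83),
    Hit("PROSITE profile", "PS50071", 23, 83),
    Hit("SMART", "SM00389", 25, 87),
    Hit("Pfam", "PF00046", 26, 81),
    Hit("CDD", "cd00086", 26, 84),
    Hit("PROSITE pattern", "PS00027", 58, 81),
    Hit("PROSITE profile", "PS50848", 198, 431),
    Hit("SuperFamily", "SSF55961", 199, 429),
    Hit("CDD", "cd08875", 202, 427),
    Hit("Gene3D", "G3DSA:3.30.530.20", 207, 394),
    Hit("SMART", "SM00234", 207, 428),
    Hit("Pfam", "PF01852", 209, 428),
    Hit("SuperFamily", "SSF55961", 450, 622),
]

def overlap(a_start: int, a_end: int, b_start: int, b_end: int) -> int:
    """Number of residues shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)

def assign(hit: Hit) -> Optional[str]:
    """Name of the signature domain with the largest overlap, or None."""
    scores = {name: overlap(hit.start, hit.end, s, e)
              for name, (s, e) in SIGNATURE.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None

for hit in HITS:
    label = assign(hit) or "outside both domains"
    print(f"{hit.database:16} {hit.entry:18} {hit.start:>3}-{hit.end:<3} {label}")
```

Under these assumptions, only the last SuperFamily hit (residues 450 to 622) falls outside both signature domains.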
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 692 aa
MDFAPEGSGS ADEHDDSSNS RRERERKQYH RHSAEQIQRL EEFFKDCPHP DENQRRQLSR  60
ELGLEPRQIK FWFQNKRTQT KSHNERADND ALRVENDRFY YENLVMREAL RNLVCPKCED  120
ASSGEEARQR NLERLRAENA WLKQELERAS RVVSTFSRRS GVLESYLPPS FSPLSYLGEN  180
IPGTSTTTLL PQGLSEIQEM EKSVMVETAV NAMDELLELF RGNEPLWVKS PTNERYLIHR  240
ETYDKLYPKI SHINSSSSWI ESSRDSGLVP ITARHLIDIF QDPEKWMDFF PTIVTKVRTI  300
EVLDTGKRGG SLFVMHEKLH VLSPLVAPRE LAFLRYVRQL DSTTWVIVNV SYDSLKELED  360
ASSSQTWMFP SGCLIQDMPN GKANVAWVEH VQVDDRSLTH PLYKDMVCDS QAYGAKRWIV  420
TLQRMCERFA FSLGPIPTPG HELEGVIDAP EGRKSLAKLS HKMVKNFCHI LSMPERIDLP  480
QLSELNRNGF RVSVHRSDTS GQPNNMIVCV AASLRLPTSF ENLFDFFKDE HARDQWDVLS  540
EGNPVHEIAH ISTGTHPGNS ISLMQPVNPK ENMLILQESS IDLLGANLIY APVPVSTITS  600
AISGQDTTDT NVLPSGFIIS SDGVDGGTRA GASSSSSMIG SNSSLLTVAF QIMVRPDTFS  660
DRLITDSVAT IHALISSTVQ KIRLAVGSRF G*
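The Start and End columns of the Signature Domain table can be checked against this sequence block. The sketch below is illustrative only: `clean_sequence` and `domain_slice` are hypothetical helper names, only the first two lines of the block (residues 1 to 120) are copied in to keep it short, and the coordinates are assumed to be 1-based and inclusive, which is what the Homeobox alignment row for Cc07_g12840 suggests.

```python
import re

# First two lines of the Protein Sequence block, copied verbatim
# (residues 1-120); the trailing numbers are running position counters.
BLOCK = """\
MDFAPEGSGS ADEHDDSSNS RRERERKQYH RHSAEQIQRL EEFFKDCPHP DENQRRQLSR  60
ELGLEPRQIK FWFQNKRTQT KSHNERADND ALRVENDRFY YENLVMREAL RNLVCPKCED  120
"""

def clean_sequence(block: str) -> str:
    """Drop whitespace, position counters and a trailing '*' stop symbol."""
    return re.sub(r"[\s\d]", "", block).rstrip("*")

def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    return seq[start - 1:end]

seq = clean_sequence(BLOCK)
homeobox = domain_slice(seq, 26, 81)  # Homeobox Start/End from the table
print(len(seq), len(homeobox))
print(homeobox)
```

The extracted slice matches the Cc07_g12840 row of the Homeobox alignment. Counting the full block the same way gives 691 residues plus the terminal `*`, so the stated length of 692 appears to include the stop symbol.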
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    20     27   RRERERKQ
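The NLS coordinates can be compared with the protein sequence by locating the motif directly. This is a minimal sketch under stated assumptions: `SEQ_START` holds only the first 60 residues copied from the Protein Sequence block, and the variable names are illustrative. The reading that the table uses a 0-based start is an observation from this one record, not documented behaviour.

```python
# First 60 residues of the protein, copied from the Protein Sequence block.
SEQ_START = "MDFAPEGSGSADEHDDSSNSRRERERKQYHRHSAEQIQRLEEFFKDCPHPDENQRRQLSR"
NLS = "RRERERKQ"

offset = SEQ_START.find(NLS)                    # 0-based index of the motif
zero_based = (offset, offset + len(NLS) - 1)    # inclusive, counting from 0
one_based = (offset + 1, offset + len(NLS))     # inclusive, counting from 1

print("0-based:", zero_based)
print("1-based:", one_based)
```

The motif sits at residues 21 to 28 when counting from 1, so the table values of 20 and 27 appear to count from 0, unlike the Signature Domain coordinates.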
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID            E-value  Description
Refseq  XP_027075663.1    0.0      homeobox-leucine zipper protein HDG11-like
TrEMBL  A0A068TZK7        0.0      A0A068TZK7_COFCA; Uncharacterized protein
STRING  Migut.H02245.1.p  0.0      (Erythranthe guttata)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA10211732
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G73360.1  0.0      homeodomain GLABROUS 11