PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID CCG001343.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Salicaceae; Saliceae; Populus
Family HD-ZIP
Protein Properties Length: 762 aa    MW: 84155.5 Da    pI: 5.365
Description HD-ZIP family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
CCG001343.1    genome  LZU     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  62.4   6.7e-20  54     109  1          56
                  TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
     Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                  r+k  ++t++q++eLe +F+++++p++++r eL+++lgL+ +q+k+WFqNrR+++k
  CCG001343.1  54 RKKYNRHTANQIQELEFFFKECPHPDEKQRSELSRRLGLESKQIKFWFQNRRTQMK 109
                  79999************************************************999 PP

2    START     172    3.9e-54  259    491  2          205
                  HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.....SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEECT CS
        START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv....dsgealrasgvvdmvlallveellddkeqWdetla....kaetleviss 87 
                  la++a++el+k+a+ e+p W ks     e +n +e++++f++ +     + +ea r+sgvv  +   lve+l+d++  W e+++    +a+t++ iss
  CCG001343.1 259 LALAAMDELIKMAQIESPIWIKSLdggkEVLNHEEYMRTFPRIGMkpsnFVTEATRESGVVLVNISALVETLMDVN-GWVEMFPsliaRAATTDIISS 355
                  6899**********************9999**********8866699999**************************.********************* PP

                  T.................EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEE CS
        START  88 g.................galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwv 167
                  g                 galq + ae+q++sp vp R ++f+R ++ql++g+w++vdvS+d +q++ + +    +++lpSg++i+++ ng skvtwv
  CCG001343.1 356 GmggttdiissgmggtksGALQMIHAEFQLISPFVPvRQVTFIRLCKQLTEGVWAVVDVSIDANQENLNAQAPETCKRLPSGCIIQDMNNGCSKVTWV 453
                  *******************************************************************9966666679********************* PP

                  E-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
        START 168 ehvdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                  eh +++++ +h+l+r++++sg+ +ga++w+atlqr+ e
  CCG001343.1 454 EHSEYDESAVHQLYRPILGSGRGFGAQRWLATLQRYYE 491
                  **********************************9865 PP
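
The Start and End values in the Signature Domain table read as 1-based, inclusive positions on the protein, which is consistent with the Homeobox alignment above (residues 54 to 109, 56 residues). A minimal Python sketch of cutting a domain out by those coordinates follows. The helper name `domain_slice` is hypothetical, and only the first 120 residues from the Sequence section are included to keep the example short.

```python
# First 120 residues of CCG001343.1, copied from the Sequence section
# with the spaces and position counters removed.
N_TERMINAL_120 = (
    "MDGRGDMGLFGEHFDPCLVGRIKEDGYYESRSGSDNIEGASGEDQDVGDDQRPRKKYNRH"
    "TANQIQELEFFFKECPHPDEKQRSELSRRLGLESKQIKFWFQNRRTQMKTQLERHENVIL"
)


def domain_slice(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive). Hypothetical helper."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


# Homeobox domain coordinates taken from the Signature Domain table.
homeobox = domain_slice(N_TERMINAL_120, 54, 109)
print(len(homeobox), homeobox)
```

The same call with 259 and 491 would extract the START domain, provided the full 762-residue sequence is supplied instead of the 120-residue excerpt.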

Protein Features
3D Structure
Database         Entry ID          E-value    Start  End  InterPro ID  Description
SuperFamily      SSF46689          3.47E-19   40     111  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  4.4E-21    41     111  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.848     51     111  IPR001356    Homeobox domain
SMART            SM00389           1.7E-16    53     115  IPR001356    Homeobox domain
Pfam             PF00046           1.9E-17    54     109  IPR001356    Homeobox domain
CDD              cd00086           7.32E-18   54     111  No hit       No description
PROSITE pattern  PS00027           0          86     109  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848           35.63      249    495  IPR002913    START domain
SuperFamily      SSF55961          2.75E-28   250    491  No hit       No description
CDD              cd08875           2.89E-107  253    488  No hit       No description
SMART            SM00234           2.0E-32    258    492  IPR002913    START domain
Pfam             PF01852           3.7E-46    259    490  IPR002913    START domain
SuperFamily      SSF55961          6.59E-14   523    757  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 762 aa
MDGRGDMGLF GEHFDPCLVG RIKEDGYYES RSGSDNIEGA SGEDQDVGDD QRPRKKYNRH  60
TANQIQELEF FFKECPHPDE KQRSELSRRL GLESKQIKFW FQNRRTQMKT QLERHENVIL  120
RQENDKLRLE NELLKQNMSD PICNNCGGPV VPGPVSYEQQ QLRIENARLT DELGRVCALA  180
NKFLGRPLTS SANPIPPLSS KSKLDLAVGI NGYGNLGHTD NMLPMVLDNN RAIMMSLMKP  240
IGNAVGKEVP HDRSIFVDLA LAAMDELIKM AQIESPIWIK SLDGGKEVLN HEEYMRTFPR  300
IGMKPSNFVT EATRESGVVL VNISALVETL MDVNGWVEMF PSLIARAATT DIISSGMGGT  360
TDIISSGMGG TKSGALQMIH AEFQLISPFV PVRQVTFIRL CKQLTEGVWA VVDVSIDANQ  420
ENLNAQAPET CKRLPSGCII QDMNNGCSKV TWVEHSEYDE SAVHQLYRPI LGSGRGFGAQ  480
RWLATLQRYY EGMAMIMSPS ILGEDQTVIN LGGKKSMLKL ARRMVDNFCS GVCASSLHKW  540
GNPVAGNVSE DVRILTRKSI NEPGEPDGIV LSAATSVWLP VSRQRLFDFL RDEKSRSHWD  600
ILSNGGILQE IIQIPKGQGQ GQWNRVSLLR STAVDADAVE NNMLILQETW NDVSGSLVVY  660
APVDLQSMSV VTSGGDSTYV ALLPSGFVIL PDNSFSNGEP SNSDGNPVKR DSDSNNGGGS  720
FFTVGFQILA SNLPSAELTV ESVETIHNLI SCTMHRIRTV FN
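
The sequence block above is laid out in groups of ten residues with a running position counter at the end of each full line. A minimal Python sketch of turning such a block back into a plain residue string follows. The function name `clean_sequence_block` is hypothetical, and the excerpt uses only the first and last lines of the block, so it is not a contiguous stretch of the protein.

```python
import re


def clean_sequence_block(block: str) -> str:
    """Drop position counters and whitespace from a formatted sequence block.

    Hypothetical helper: it assumes the block contains only residue letters,
    spaces, line breaks, and the numeric counters at the line ends.
    """
    return re.sub(r"[\d\s]+", "", block)


# First and last lines of the block, used as a short, non-contiguous excerpt.
excerpt = """MDGRGDMGLF GEHFDPCLVG RIKEDGYYES RSGSDNIEGA SGEDQDVGDD QRPRKKYNRH  60
FFTVGFQILA SNLPSAELTV ESVETIHNLI SCTMHRIRTV FN"""

residues = clean_sequence_block(excerpt)
print(len(residues))
```

Applied to all 13 lines of the block, the same function should return 762 residues (12 full lines of 60 plus a final line of 42), matching the Length field.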
Functional Description
Source   Description
UniProt  Probable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_011012369.1      0.0      PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
Swissprot  Q0WV12              0.0      ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBL     B9I4A9              0.0      B9I4A9_POPTR; Uncharacterized protein
STRING     POPTR_0012s13390.1  0.0      (Populus trichocarpa)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Fabids   OGEF1192              34           106
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G00730.1  0.0      HD-ZIP family protein
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]