PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Csa02g045180.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HD-ZIP
Protein Properties Length: 684 aa    MW: 75895.4 Da    pI: 6.5117
Description HD-ZIP family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
Csa02g045180.1   genome  CSGP    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  57.5   2.3e-18  61     115  2          56
                     T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                      + +++t++q++eLe++F+ +++p++++r eL kkl L+ +q+k+WFqNrR+++k
  Csa02g045180.1  61 TRYHRHTSYQIQELESFFKVCPHPNEKQRLELGKKLTLESKQIKFWFQNRRTQMK 115
                     334567999*******************************************999 PP

2    START     158.6  5e-50    228    428  4          206
                     HHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEECTT...... CS
           START   4 eeaaqelvkkalaeepgWvkssesengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetla....kaetlevissg...... 88 
                      ea++el+k+a+++ p+W++ s  e++ +v+              r++g+v  +++ lv +l++++ +W e ++     a+t+evis+g      
  Csa02g045180.1 228 MEAMDELLKLAELDNPLWSSKS--EKEAIVRPAC----------TREIGLVLINSVALVDSLMETN-KWAEIFEcivaVASTVEVISNGsdgsrn 309
                     689******************9..7777777777..........69********************.**************************** PP

                     EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHH CS
           START  89 galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlphwllr 182
                     g lqlm+ae+q++splvp R   f+Ry++q+g+g w++vdvS d +++++  +s+  ++++pSg++i++ +ng skvtw+eh +++++++h+l++
  Csa02g045180.1 310 GSLQLMQAEFQVMSPLVPiRQKKFLRYCKQHGDGLWAVVDVSYDINRENEHLKSYGGSKRFPSGCIIQDIGNGCSKVTWIEHLEYEESHIHSLYQ 404
                     **************************************************9******************************************** PP

                     HHHHHHHHHHHHHHHHHTXXXXXX CS
           START 183 slvksglaegaktwvatlqrqcek 206
                     +l +s++  ga +w+atlqrqce+
  Csa02g045180.1 405 PLFGSSVGLGATKWLATLQRQCES 428
                     **********************95 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SuperFamily      SSF46689          1.09E-18  41     117  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  2.1E-19   44     117  IPR009057    Homeodomain-like
SMART            SM00389           3.4E-15   56     121  IPR001356    Homeobox domain
CDD              cd00086           1.26E-16  57     117  No hit       No description
PROSITE profile  PS50071           16.811    57     117  IPR001356    Homeobox domain
Pfam             PF00046           4.1E-16   61     115  IPR001356    Homeobox domain
PROSITE profile  PS50848           39.134    216    431  IPR002913    START domain
CDD              cd08875           3.65E-99  222    427  No hit       No description
SuperFamily      SSF55961          5.05E-28  223    429  No hit       No description
SMART            SM00234           2.6E-37   225    428  IPR002913    START domain
Pfam             PF01852           2.5E-44   228    428  IPR002913    START domain
SuperFamily      SSF55961          5.6E-18   453    675  No hit       No description
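Several databases report the same domain with slightly different boundaries. As a rough illustration (not part of the PlantTFDB record itself), the hits can be collapsed into non-redundant regions by merging overlapping intervals. The function name `merge_hits` is hypothetical; the coordinates are the Start and End values listed above.

```python
# Hit coordinates (start, end) taken from the Start/End columns of the
# Protein Features table; all values are 1-based and inclusive.
hits = [
    (41, 117), (44, 117), (56, 121), (57, 117), (57, 117), (61, 115),
    (216, 431), (222, 427), (223, 429), (225, 428), (228, 428),
    (453, 675),
]


def merge_hits(intervals):
    """Merge overlapping intervals into a sorted list of distinct regions."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if this hit is longer.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


regions = merge_hits(hits)
print(regions)
```

Under this simple overlap rule the twelve hits reduce to three regions: one covering the homeodomain hits, one covering the START domain hits, and one C-terminal SuperFamily region.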
Gene Ontology
GO Term     GO Category         GO Description
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 684 aa
MNGDLDVDMS RGDFNPSFYL GRLKDDEFES RSLCDDDSFD AMSGDENKQE QRPNKKKKKR  60
TRYHRHTSYQ IQELESFFKV CPHPNEKQRL ELGKKLTLES KQIKFWFQNR RTQMKTQLER  120
HENVILRQEN EKLRLENSFL KESMRGSLCI DCGGAVIPGE VSFEQHQLRI ENAKLKDELD  180
RICALANRFI GGSISLEQPS NGGNGSQHLP IGNGFSGGTS LMLMDLTMEA MDELLKLAEL  240
DNPLWSSKSE KEAIVRPACT REIGLVLINS VALVDSLMET NKWAEIFECI VAVASTVEVI  300
SNGSDGSRNG SLQLMQAEFQ VMSPLVPIRQ KKFLRYCKQH GDGLWAVVDV SYDINRENEH  360
LKSYGGSKRF PSGCIIQDIG NGCSKVTWIE HLEYEESHIH SLYQPLFGSS VGLGATKWLA  420
TLQRQCESFT NLLSSQDHTG LSLAGTKSIL KLAQRMKVNF YSGITASSVH KWEKLNAENV  480
GQDTRILTRK SLEPSGIVLS AATSLWLPVT QQRLFEFLCD GKCRNQWDIL SNGASMETTL  540
LVPKGQREGS CVSLLRAAGK DQNESSMLIL QETWNDASGA LVVYAPVDIP SMNVVMSGGD  600
SAYVALLPSG FSILPDGSSL SDQINTNGGL VNQESKGCLL TVGFQILVNS LPSAKLNVES  660
VETVNNLIAC TIHKIRAALR IPA*
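The sequence is printed in blocks of ten residues with a running position counter at the end of each line. A minimal sketch, assuming that layout, shows how the block can be flattened to a plain string and how the 1-based inclusive coordinates from the Signature Domain table map onto it. The helper names `clean_sequence` and `subsequence` are hypothetical, and only the first two lines of the sequence are used to keep the example short.

```python
import re

# First two lines of the protein sequence block, copied verbatim.
raw = """
MNGDLDVDMS RGDFNPSFYL GRLKDDEFES RSLCDDDSFD AMSGDENKQE QRPNKKKKKR  60
TRYHRHTSYQ IQELESFFKV CPHPNEKQRL ELGKKLTLES KQIKFWFQNR RTQMKTQLER  120
"""


def clean_sequence(block):
    """Drop whitespace, position counters, and any '*' stop marker."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def subsequence(seq, start, end):
    """Return residues start..end using 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(raw)
# Homeobox domain: Start 61, End 115 in the Signature Domain table.
homeobox = subsequence(seq, 61, 115)
print(len(seq))
print(homeobox)
```

The extracted residues should match the Csa02g045180.1 row of the Homeobox alignment shown earlier in the record, which is a quick way to confirm that the coordinates are being read as 1-based and inclusive.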
Functional Description
Source   Description
UniProt  Probable transcription factor that binds to the DNA sequence 5'-GCATTAAATGC-3'. {ECO:0000269|PubMed:16778018}.
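To look for the reported binding sequence in a stretch of DNA, both strands need to be checked, which means searching for the motif and for its reverse complement. The sketch below is an illustrative exact-match scan only; the function names and the two test sequences are hypothetical and do not come from the database, and a real analysis would more likely use the position weight matrix from the Binding Motif section.

```python
MOTIF = "GCATTAAATGC"  # binding sequence quoted in the UniProt description

# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return dna.translate(COMPLEMENT)[::-1]


def find_sites(dna, motif):
    """Return sorted (1-based start, strand) pairs for exact motif matches."""
    dna = dna.upper()
    patterns = {"+": motif}
    rc = reverse_complement(motif)
    if rc != motif:
        # Only search the minus strand separately if the motif is not
        # identical to its own reverse complement.
        patterns["-"] = rc
    hits = []
    for strand, pattern in patterns.items():
        start = dna.find(pattern)
        while start != -1:
            hits.append((start + 1, strand))
            start = dna.find(pattern, start + 1)
    return sorted(hits)


# Hypothetical test sequences, made up for this example.
print(reverse_complement(MOTIF))
print(find_sites("TTGCATTAAATGCAA", MOTIF))
print(find_sites("AAGCATTTAATGCTT", MOTIF))
```

The motif is not a perfect palindrome: its two five-base halves (GCATT and AATGC) are reverse complements of each other, but the central base differs between strands, so the minus-strand pattern has to be searched explicitly.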
Binding Motif
Motif ID  Method  Source                   Motif file
MP00559   DAP     Transfer from AT5G52170  Download
Cis-element
Source       Link
PlantRegMap  Csa02g045180.1
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AB025603  1e-166   AB025603.1 Arabidopsis thaliana genomic DNA, chromosome 5, BAC clone:F17P19.
GenBank  CP002688  1e-166   CP002688.1 Arabidopsis thaliana chromosome 5 sequence.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_019091179.1  0.0      PREDICTED: homeobox-leucine zipper protein HDG7-like isoform X2
Swissprot  Q9LTK3          0.0      HDG7_ARATH; Homeobox-leucine zipper protein HDG7
TrEMBL     R0F0C9          0.0      R0F0C9_9BRAS; Uncharacterized protein
STRING     XP_010444530.1  0.0      (Camelina sativa)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM112827105
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G52170.1  0.0      homeodomain GLABROUS 7
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]