PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Csa16g050670.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HD-ZIP
Protein Properties Length: 1128aa    MW: 126357 Da    PI: 6.2805
Description HD-ZIP family protein
Gene Model
Gene Model ID | Type | Source | Coding Sequence
Csa16g050670.1 | genome | CSGP | View CDS
Signature Domain
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End
1 | Homeobox | 65.3 | 8.6e-21 | 98 | 153 | 1 | 56
                     TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                     r+k +++t++q++ +e+lF+++++p++++r++L+k+lgL  rqVk+WFqNrR++ k
  Csa16g050670.1  98 RKKYHRHTTDQIRHMEALFKETPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIK 153
                     7999************************************************9877 PP

2 | Homeobox | 65.3 | 8.6e-21 | 481 | 536 | 1 | 56
                     TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                     r+k +++t++q++ +e+lF+++++p++++r++L+k+lgL  rqVk+WFqNrR++ k
  Csa16g050670.1 481 RKKYHRHTTDQIRHMEALFKETPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIK 536
                     7999************************************************9877 PP

3 | START | 110.4 | 2.9e-35 | 256 | 398 | 1 | 122
                     HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEE CS
           START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla....kaet 81 
                     e+a++a+ el+k+a+++ep+W +s+    e++n+de+l++f+++++      +++ea+r++g+v+m++++l ++++d++ qW+e++a    ka+t
  Csa16g050670.1 256 EIANRATLELQKMATSGEPLWLRSVetgrEILNYDEYLKEFPQAQAsslpgrKTIEASRDVGIVFMDAHKLAQSFMDVG-QWKEMFAclvsKAAT 349
                     578999**************************************99*********************************.*************** PP

                     EEEECTT.......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS- CS
           START  82 levissg.......galqlmvaelqalsplvp.RdfvfvRyirqlgagd 122
                     ++vi++g       ga+qlm+ e+q+l+p+vp R+++fvR++rql  +d
  Csa16g050670.1 350 VDVIRQGegpsridGAIQLMFGEMQLLTPVVPtREVYFVRSCRQLXTKD 398
                     ********************************************77665 PP

4 | START | 232.8 | 9.5e-73 | 639 | 866 | 1 | 206
                     HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEE CS
           START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla....kaet 81 
                     e+a++a+ el+k+a+++ep+W +s+    e++n+de+l++f+++++      +++ea+r++g+v+m++++l ++++d++ qW+e++a    ka+t
  Csa16g050670.1 639 EIANRATLELQKMATSGEPLWLRSVetgrEILNYDEYLKEFPQAQAsslpgrKTIEASRDVGIVFMDAHKLAQSFMDVG-QWKEMFAclvsKAAT 732
                     578999**************************************99*********************************.*************** PP

                     EEEECTT.......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--..-TTSEE-EESSEEEEEEEECTCEEEEEEE CS
           START  82 levissg.......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe.sssvvRaellpSgiliepksnghskvtwv 167
                     ++vi++g       ga+qlm+ e+q+l+p+vp R+++fvR++rql+ ++w+ivdvSv++e++++e ++s+ ++++lpSg++ie++snghskvtwv
  Csa16g050670.1 733 VDVIRQGegpsridGAIQLMFGEMQLLTPVVPtREVYFVRSCRQLTPEKWAIVDVSVSVEDSNTEkEASLLKCRKLPSGCIIEDTSNGHSKVTWV 827
                     *********************************************************************************************** PP

                     E-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
           START 168 ehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                     eh d+++++++ l+rslv++gla+ga++wvatlq +ce+
  Csa16g050670.1 828 EHLDVSASTVQPLFRSLVNTGLAFGARHWVATLQLHCER 866
                     *************************************97 PP
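
The four signature domain hits above can be handled as simple coordinate records. The following is a minimal sketch, assuming 1-based inclusive Start/End coordinates as shown in the alignments; the `DomainHit` class and `overlaps` helper are hypothetical names for illustration and are not part of PlantTFDB.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One row of the Signature Domain table (hypothetical helper type)."""
    name: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive

    @property
    def length(self) -> int:
        # Inclusive coordinates, so add 1.
        return self.end - self.start + 1


def overlaps(a: DomainHit, b: DomainHit) -> bool:
    """True if the two inclusive ranges share at least one residue."""
    return a.start <= b.end and b.start <= a.end


# Coordinates copied from the Signature Domain table of this record.
hits = [
    DomainHit("Homeobox", 98, 153),
    DomainHit("Homeobox", 481, 536),
    DomainHit("START", 256, 398),
    DomainHit("START", 639, 866),
]

# Order the hits along the protein and report their lengths.
ordered = sorted(hits, key=lambda h: h.start)
for h in ordered:
    print(f"{h.name}\t{h.start}-{h.end}\t{h.length} aa")

# Check whether any pair of hits overlaps.
any_overlap = any(
    overlaps(a, b) for i, a in enumerate(ordered) for b in ordered[i + 1:]
)
print("overlapping hits:", any_overlap)
```

Sorting by start position makes the domain order along the protein (Homeobox, START, Homeobox, START) explicit.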

Protein Features
3D Structure
Database | Entry ID | E-value | Start | End | InterPro ID | Description
Gene3D | G3DSA:1.10.10.60 | 7.2E-23 | 84 | 149 | IPR009057 | Homeodomain-like
SuperFamily | SSF46689 | 5.85E-20 | 87 | 156 | IPR009057 | Homeodomain-like
PROSITE profile | PS50071 | 17.815 | 95 | 155 | IPR001356 | Homeobox domain
SMART | SM00389 | 9.6E-18 | 97 | 159 | IPR001356 | Homeobox domain
Pfam | PF00046 | 4.4E-18 | 98 | 153 | IPR001356 | Homeobox domain
CDD | cd00086 | 8.70E-16 | 102 | 153 | No hit | No description
PROSITE pattern | PS00027 | 0 | 130 | 153 | IPR017970 | Homeobox, conserved site
PROSITE profile | PS50848 | 19.85 | 247 | 400 | IPR002913 | START domain
SuperFamily | SSF55961 | 1.92E-11 | 250 | 395 | No hit | No description
Pfam | PF01852 | 1.1E-29 | 256 | 398 | IPR002913 | START domain
SMART | SM00234 | 5.8E-28 | 256 | 483 | IPR002913 | START domain
Gene3D | G3DSA:1.10.10.60 | 7.2E-23 | 467 | 532 | IPR009057 | Homeodomain-like
SuperFamily | SSF46689 | 5.85E-20 | 470 | 539 | IPR009057 | Homeodomain-like
PROSITE profile | PS50071 | 17.815 | 478 | 538 | IPR001356 | Homeobox domain
SMART | SM00389 | 9.6E-18 | 480 | 542 | IPR001356 | Homeobox domain
Pfam | PF00046 | 4.4E-18 | 481 | 536 | IPR001356 | Homeobox domain
CDD | cd00086 | 8.70E-16 | 485 | 536 | No hit | No description
PROSITE pattern | PS00027 | 0 | 513 | 536 | IPR017970 | Homeobox, conserved site
PROSITE profile | PS50848 | 43.716 | 630 | 869 | IPR002913 | START domain
SuperFamily | SSF55961 | 4.63E-31 | 633 | 866 | No hit | No description
CDD | cd08875 | 1.52E-107 | 634 | 865 | No hit | No description
Pfam | PF01852 | 2.2E-66 | 639 | 866 | IPR002913 | START domain
SMART | SM00234 | 1.5E-84 | 639 | 866 | IPR002913 | START domain
Gene3D | G3DSA:3.30.530.20 | 1.1E-7 | 680 | 865 | IPR023393 | START-like domain
SuperFamily | SSF55961 | 2.35E-15 | 895 | 1113 | No hit | No description
Gene Ontology
GO Term | GO Category | GO Description
GO:0006355 | Biological Process | regulation of transcription, DNA-templated
GO:0008289 | Molecular Function | lipid binding
GO:0043565 | Molecular Function | sequence-specific DNA binding
Sequence
Protein Sequence    Length: 1128 aa
MSMAVEMSSK QPTKDFFSSP ALSLSLAGIF RNASSGNTDP ADEDFLSRRV VDDEDRTVEM  60
SSENSGPTRS RSEEDLEGED HDEDLEDDDG NKGNKRKRKK YHRHTTDQIR HMEALFKETP  120
HPDEKQRQQL SKQLGLAPRQ VKFWFQNRRT QIKAIQERHE NSLLKAELEK LREENKAMRE  180
SFSKAANSSC PNCGGGPDDL HLENSKLKAE LDKLRAALGR TPYPLQASCS EDQEQRLGSL  240
DFYTGVFALE KSRIAEIANR ATLELQKMAT SGEPLWLRSV ETGREILNYD EYLKEFPQAQ  300
ASSLPGRKTI EASRDVGIVF MDAHKLAQSF MDVGQWKEMF ACLVSKAATV DVIRQGEGPS  360
RIDGAIQLMF GEMQLLTPVV PTREVYFVRS CRQLXTKDFF SSPALSLSLA GIFRNASSGN  420
TDPADEDFLS RRVVDDEDRT VEMSSENSGP TRSRSEEDLE GEDHDEDLED DDGNKGNKRK  480
RKKYHRHTTD QIRHMEALFK ETPHPDEKQR QQLSKQLGLA PRQVKFWFQN RRTQIKAIQE  540
RHENSLLKAE LEKLREENKA MRESFSKAAN SSCPNCGGGP DDLHLENSKL KAELDKLRAA  600
LGRTPYPLQA SCSEDQEQRL GSLDFYTGVF ALEKSRIAEI ANRATLELQK MATSGEPLWL  660
RSVETGREIL NYDEYLKEFP QAQASSLPGR KTIEASRDVG IVFMDAHKLA QSFMDVGQWK  720
EMFACLVSKA ATVDVIRQGE GPSRIDGAIQ LMFGEMQLLT PVVPTREVYF VRSCRQLTPE  780
KWAIVDVSVS VEDSNTEKEA SLLKCRKLPS GCIIEDTSNG HSKVTWVEHL DVSASTVQPL  840
FRSLVNTGLA FGARHWVATL QLHCERLVFF MATNVPTKDS LGVTTLAGRK SVLKMAQRMT  900
QSFYRAIAAS SYHQWTKITT KTGQDMRVSS RKNLHDPGEP TGVIVCASSS LWLPVSPTLL  960
FDFFRDEARR HEWDALSNGA HVQSIASLSK GQDRGNSVAI QTVKTREKSI WVLQDSCTNS  1020
YESVVVYAPV DINTTQLVLA GHDSSSIQIL PCGFSIIPDG VESRPLVITT TQDDRNSQGG  1080
SLLTLALQTL INPSPAAKLN MESVDSVTNL VSVTLHNIKR SLQIEDC*
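
The sequence block above is laid out in groups of 10 residues with a running position count at the end of each full line and a trailing `*` marking the stop. A minimal sketch, assuming exactly that layout, for converting such a block into a plain string and slicing a domain by its 1-based inclusive coordinates; `parse_sequence_block` and `slice_domain` are hypothetical helper names, not a PlantTFDB API.

```python
def parse_sequence_block(block: str) -> str:
    """Join residue groups, dropping position counters, whitespace and '*'."""
    residues = []
    for line in block.splitlines():
        for token in line.split():
            if token.isdigit():
                # Running position counter at the end of a line.
                continue
            residues.append(token.rstrip("*"))
    return "".join(residues)


def slice_domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"range {start}-{end} outside sequence of length {len(seq)}")
    return seq[start - 1:end]


# First three lines of the protein sequence from this record.
example = """\
MSMAVEMSSK QPTKDFFSSP ALSLSLAGIF RNASSGNTDP ADEDFLSRRV VDDEDRTVEM  60
SSENSGPTRS RSEEDLEGED HDEDLEDDDG NKGNKRKRKK YHRHTTDQIR HMEALFKETP  120
HPDEKQRQQL SKQLGLAPRQ VKFWFQNRRT QIKAIQERHE NSLLKAELEK LREENKAMRE  180
"""

seq = parse_sequence_block(example)
homeobox_1 = slice_domain(seq, 98, 153)  # first Homeobox hit in the domain table
print(len(seq))
print(homeobox_1)
```

The slice for residues 98 to 153 reproduces the query line of the first Homeobox alignment, which is a quick way to confirm that the table coordinates are 1-based and inclusive.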
Nuclear Localization Signal
NLS
No. | Start | End | Sequence
1 | 478 | 482 | RKRKK
Functional Description
Source | Description
UniProt | Probable transcription factor required for correct morphological development and maturation of trichomes as well as for normal development of seed coat mucilage. Regulates the frequency of trichome initiation and determines trichome spacing. {ECO:0000269|PubMed:11844112}.
Cis-element
Source | Link
PlantRegMap | Csa16g050670.1
Regulation -- Description
Source | Description
UniProt | INDUCTION: Down-regulated by GEM. {ECO:0000269|PubMed:17450124}.
Regulation -- PlantRegMap
Source | Upstream Regulator | Target Gene
PlantRegMap | Retrieve | -
Annotation -- Nucleotide
Source | Hit ID | E-value | Description
GenBank | AF360294 | 0.0 | AF360294.1 Arabidopsis thaliana putative homeobox protein GLABRA2 (At1g79840) mRNA, complete cds.
GenBank | BT001956 | 0.0 | BT001956.1 Arabidopsis thaliana clone U09291 putative homeobox protein GLABRA2 (At1g79840) mRNA, complete cds.
Annotation -- Protein
Source | Hit ID | E-value | Description
Refseq | XP_010430051.1 | 0.0 | PREDICTED: homeobox-leucine zipper protein GLABRA 2-like
Swissprot | P46607 | 0.0 | HGL2_ARATH; Homeobox-leucine zipper protein GLABRA 2
TrEMBL | A0A178WBY6 | 0.0 | A0A178WBY6_ARATH; GL2
STRING | XP_010473020.1 | 0.0 | (Camelina sativa)
Orthologous Group
Lineage | Orthologous Group ID | Taxa Number | Gene Number
Malvids | OGEM12370 | 27 | 31
Best hit in Arabidopsis thaliana
Hit ID | E-value | Description
AT1G79840.1 | 0.0 | HD-ZIP family protein
Publications
  1. Wu R, Citovsky V
    Adaptor proteins GIR1 and GIR2. I. Interaction with the repressor GLABRA2 and regulation of root hair development.
    Biochem. Biophys. Res. Commun., 2017. 488(3): p. 547-553
    [PMID:28526410]