PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | |||||||||
---|---|---|---|---|---|---|---|---|---|
TF ID | Csa07g060030.1 | ||||||||
Organism | |||||||||
Taxonomic ID | |||||||||
Taxonomic Lineage | cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina | ||||||||
Family | HD-ZIP | ||||||||
Protein Properties | Length: 1305 aa; MW: 145710 Da; pI: 7.0586 | ||||||||
Description | HD-ZIP family protein | ||||||||
Gene Model |
|
Signature Domain | |||||||
---|---|---|---|---|---|---|---|
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End |
1 | Homeobox | 65 | 1e-20 | 98 | 153 | 1 | 56 |
TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS Homeobox 1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 r+k +++t++q++ +e+lF+++++p++++r++L+k+lgL rqVk+WFqNrR++ k Csa07g060030.1 98 RKKYHRHTTDQIRHMEALFKETPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIK 153 7999************************************************9877 PP | |||||||
2 | Homeobox | 49.8 | 5.8e-16 | 675 | 713 | 18 | 56 |
HHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS Homeobox 18 lFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 lF+++++p++++r++L+k+lgL rqVk+WFqNrR++ k Csa07g060030.1 675 LFKETPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIK 713 7**********************************9877 PP | |||||||
3 | START | 232.5 | 1.2e-72 | 256 | 483 | 1 | 206 |
HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEE CS START 1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla....kaet 81 e+a++a+ el+k+a+++ep+W +s+ e++n+de+l++f+++++ +++ea+r++g+v+m++++l ++++d++ qW+e++a ka+t Csa07g060030.1 256 EIANRATLELQKMATSGEPLWLRSVetgrEILNYDEYLKEFPQAQAssfpgrKTIEASRDVGIVFMDAHKLAQSFMDVG-QWKEMFAclvsKAAT 349 578999*************************************999*********************************.*************** PP EEEECTT.......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--..-TTSEE-EESSEEEEEEEECTCEEEEEEE CS START 82 levissg.......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe.sssvvRaellpSgiliepksnghskvtwv 167 ++vi++g ga+qlm+ e+q+l+p+vp R+++fvR++rql+ ++w+ivdvSv++e++++e ++s+ ++++lpSg++ie++snghskvtwv Csa07g060030.1 350 VDVIRQGegpsridGAIQLMFGEMQLLTPVVPtREVYFVRSCRQLTPEKWAIVDVSVSVEDSNTEkEASLLKCRKLPSGCIIEDTSNGHSKVTWV 444 *********************************************************************************************** PP E-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS START 168 ehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206 eh d+++++++ l+rslv++gla+ga++wvatlq +ce+ Csa07g060030.1 445 EHLDVSASTVQPLFRSLVNTGLAFGARHWVATLQLHCER 483 *************************************97 PP | |||||||
4 | START | 232.5 | 1.2e-72 | 816 | 1043 | 1 | 206 |
HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EE CS START 1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla....ka 79 e+a++a+ el+k+a+++ep+W +s+ e++n+de+l++f+++++ +++ea+r++g+v+m++++l ++++d++ qW+e++a ka Csa07g060030.1 816 EIANRATLELQKMATSGEPLWLRSVetgrEILNYDEYLKEFPQAQAssfpgrKTIEASRDVGIVFMDAHKLAQSFMDVG-QWKEMFAclvsKA 907 578999*************************************999*********************************.************* PP EEEEEECTT.......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--..-TTSEE-EESSEEEEEEEECTCEEE CS START 80 etlevissg.......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe.sssvvRaellpSgiliepksnghsk 163 +t++vi++g ga+qlm+ e+q+l+p+vp R+++fvR++rql+ ++w+ivdvSv++e++++e ++s+ ++++lpSg++ie++snghsk Csa07g060030.1 908 ATVDVIRQGegpsridGAIQLMFGEMQLLTPVVPtREVYFVRSCRQLTPEKWAIVDVSVSVEDSNTEkEASLLKCRKLPSGCIIEDTSNGHSK 1000 ********************************************************************************************* PP EEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS START 164 vtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206 vtwveh d+++++++ l+rslv++gla+ga++wvatlq +ce+ Csa07g060030.1 1001 VTWVEHLDVSASTVQPLFRSLVNTGLAFGARHWVATLQLHCER 1043 *****************************************97 PP |
Protein Features | ||||||
---|---|---|---|---|---|---|
Database | Entry ID | E-value | Start | End | InterPro ID | Description |
Gene3D | G3DSA:1.10.10.60 | 8.6E-23 | 84 | 149 | IPR009057 | Homeodomain-like |
SuperFamily | SSF46689 | 7.1E-20 | 88 | 156 | IPR009057 | Homeodomain-like |
PROSITE profile | PS50071 | 17.815 | 95 | 155 | IPR001356 | Homeobox domain |
SMART | SM00389 | 9.6E-18 | 97 | 159 | IPR001356 | Homeobox domain |
Pfam | PF00046 | 5.2E-18 | 98 | 153 | IPR001356 | Homeobox domain |
CDD | cd00086 | 7.08E-16 | 102 | 153 | No hit | No description |
PROSITE pattern | PS00027 | 0 | 130 | 153 | IPR017970 | Homeobox, conserved site |
PROSITE profile | PS50848 | 43.618 | 247 | 486 | IPR002913 | START domain |
SuperFamily | SSF55961 | 6.59E-31 | 250 | 483 | No hit | No description |
CDD | cd08875 | 6.87E-106 | 251 | 482 | No hit | No description |
SMART | SM00234 | 1.4E-84 | 256 | 483 | IPR002913 | START domain |
Pfam | PF01852 | 2.7E-66 | 256 | 483 | IPR002913 | START domain |
Gene3D | G3DSA:3.30.530.20 | 1.4E-7 | 297 | 482 | IPR023393 | START-like domain |
SuperFamily | SSF55961 | 2.98E-13 | 497 | 668 | No hit | No description |
Gene3D | G3DSA:3.30.530.20 | 2.5E-8 | 518 | 600 | IPR023393 | START-like domain |
SMART | SM00389 | 1.9E-8 | 664 | 719 | IPR001356 | Homeobox domain |
Pfam | PF00046 | 2.6E-13 | 675 | 713 | IPR001356 | Homeobox domain |
SuperFamily | SSF46689 | 1.55E-12 | 675 | 720 | IPR009057 | Homeodomain-like |
Gene3D | G3DSA:1.10.10.60 | 3.9E-14 | 676 | 723 | IPR009057 | Homeodomain-like |
PROSITE profile | PS50071 | 14.333 | 676 | 715 | IPR001356 | Homeobox domain |
CDD | cd00086 | 2.98E-12 | 676 | 713 | No hit | No description |
PROSITE pattern | PS00027 | 0 | 690 | 713 | IPR017970 | Homeobox, conserved site |
PROSITE profile | PS50848 | 43.618 | 807 | 1046 | IPR002913 | START domain |
SuperFamily | SSF55961 | 6.59E-31 | 810 | 1043 | No hit | No description |
CDD | cd08875 | 6.87E-106 | 811 | 1042 | No hit | No description |
SMART | SM00234 | 1.4E-84 | 816 | 1043 | IPR002913 | START domain |
Pfam | PF01852 | 2.7E-66 | 816 | 1043 | IPR002913 | START domain |
Gene3D | G3DSA:3.30.530.20 | 2.5E-8 | 926 | 1041 | IPR023393 | START-like domain |
SuperFamily | SSF55961 | 2.9E-15 | 1072 | 1290 | No hit | No description |
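Several entries in the table above occur more than once along the protein. As a small illustration, the sketch below groups the four Pfam rows by entry ID so the repeated regions are easy to see. The tuple layout and the `group_by_entry` helper are assumptions made for this example. Note that the PROSITE rows report profile scores rather than E-values in the same column, so they are left out here.

```python
from collections import defaultdict

# (entry ID, E-value, start, end) for the Pfam rows of the Protein Features table.
PFAM_HITS = [
    ("PF00046", 5.2e-18, 98, 153),
    ("PF01852", 2.7e-66, 256, 483),
    ("PF00046", 2.6e-13, 675, 713),
    ("PF01852", 2.7e-66, 816, 1043),
]


def group_by_entry(hits):
    """Map each entry ID to its (start, end, E-value) regions, sorted by start."""
    grouped = defaultdict(list)
    for entry_id, evalue, start, end in hits:
        grouped[entry_id].append((start, end, evalue))
    return {entry_id: sorted(regions) for entry_id, regions in grouped.items()}


for entry_id, regions in group_by_entry(PFAM_HITS).items():
    print(entry_id, regions)
```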
Gene Ontology | ||||||
---|---|---|---|---|---|---|
GO Term | GO Category | GO Description | ||||
GO:0006355 | Biological Process | regulation of transcription, DNA-templated | ||||
GO:0008289 | Molecular Function | lipid binding | ||||
GO:0043565 | Molecular Function | sequence-specific DNA binding |
Sequence |
---|
Protein Sequence Length: 1305 aa |
MSMAVEMSSK QPTKDFFSSP ALSLSLAGIF RNASSGNTDP AEEDFLSRRV VEDEDRTVEM 60 SSENSGPTRS RSEEDLEGED HDEDLEDDDG NKGNKRKRKK YHRHTTDQIR HMEALFKETP 120 HPDEKQRQQL SKQLGLAPRQ VKFWFQNRRT QIKAIQERHE NSLLKAELEK LREENKAMRE 180 SFSKAANSSC PNCGGGPDDL HLENSKLKAE LDKLRAALGR TPYPLQASCS EDQEQRLGSL 240 DFYTGVFALE KSRIAEIANR ATLELQKMAT SGEPLWLRSV ETGREILNYD EYLKEFPQAQ 300 ASSFPGRKTI EASRDVGIVF MDAHKLAQSF MDVGQWKEMF ACLVSKAATV DVIRQGEGPS 360 RIDGAIQLMF GEMQLLTPVV PTREVYFVRS CRQLTPEKWA IVDVSVSVED SNTEKEASLL 420 KCRKLPSGCI IEDTSNGHSK VTWVEHLDVS ASTVQPLFRS LVNTGLAFGA RHWVATLQLH 480 CERLVFFMAT RVTTLAGRKS VLKMAQRMTQ SFYRAIAASS YHQWTKITTK TGQDMRVSSR 540 KNLHDPGEPT GVIVCASSSL WLPVSPTLLF DFFRDEARRH EWDALSNGAH VQSIASLSKG 600 QDRGNSVAIQ TVKTREKSIW VLQDSCTNSY ESVVVYAPVD INTTQLVLAG HDSSSIQILP 660 CGFSIIPDGV EXFMLFKETP HPDEKQRQQL SKQLGLAPRQ VKFWFQNRRT QIKAIQERHE 720 NSLLKAELEK LREENKAMRE SFSKAANSSC PNCGGGPDDL HLENSKLKAE LDKLRAALGR 780 TPYPLQASCS EDQEQRLGSL DFYTGVFALE KSRIAEIANR ATLELQKMAT SGEPLWLRSV 840 ETGREILNYD EYLKEFPQAQ ASSFPGRKTI EASRDVGIVF MDAHKLAQSF MDVGQWKEMF 900 ACLVSKAATV DVIRQGEGPS RIDGAIQLMF GEMQLLTPVV PTREVYFVRS CRQLTPEKWA 960 IVDVSVSVED SNTEKEASLL KCRKLPSGCI IEDTSNGHSK VTWVEHLDVS ASTVQPLFRS 1020 LVNTGLAFGA RHWVATLQLH CERLVFFMAT NVPTKDSLGV TTLAGRKSVL KMAQRMTQSF 1080 YRAIAASSYH QWTKITTKTG QDMRVSSRKN LHDPGEPTGV IVCASSSLWL PVSPTLLFDF 1140 FRDEARRHEW DALSNGAHVQ SIASLSKGQD RGNSVAIQTV KTREKSIWVL QDSCTNSYES 1200 VVVYAPVDIN TTQLVLAGHD SSSIQILPCG FSIIPDGVES RPLVITTTQD DRNSQGGSLL 1260 TLALQTLINP SPAAKLNMES VDSVTNLVSV TLHNIKRSLQ IEDC* |
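The sequence above is printed in blocks of ten residues, with a running position count after every sixty residues and a trailing `*` marking the stop. A minimal Python sketch for turning such a block into a plain residue string follows. The function name `clean_sequence` is illustrative, and only the first 120 residues are reproduced in the example.

```python
import re


def clean_sequence(block: str) -> str:
    """Strip position counters, whitespace, and a trailing '*' from a formatted block."""
    no_counters = re.sub(r"\d+", "", block)       # drop the running position numbers
    residues = re.sub(r"\s+", "", no_counters)    # drop spaces and line breaks
    return residues.rstrip("*")                   # drop the stop marker, if present


# First two rows of the sequence as displayed in the record.
EXCERPT = (
    "MSMAVEMSSK QPTKDFFSSP ALSLSLAGIF RNASSGNTDP AEEDFLSRRV VEDEDRTVEM 60 "
    "SSENSGPTRS RSEEDLEGED HDEDLEDDDG NKGNKRKRKK YHRHTTDQIR HMEALFKETP 120"
)

seq = clean_sequence(EXCERPT)
print(len(seq))      # 120
print(seq[97:100])   # RKK, i.e. residues 98-100 in 1-based numbering
```

A 1-based residue position `i` corresponds to Python index `i - 1`, so residues 98 to 100 are `seq[97:100]`; these match the first three residues of the Homeobox alignment that starts at position 98.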
Nuclear Localization Signal | |||
---|---|---|---|
No. | Start | End | Sequence |
1 | 95 | 99 | RKRKK |
Functional Description | ||||||
---|---|---|---|---|---|---|
Source | Description | |||||
UniProt | Probable transcription factor required for correct morphological development and maturation of trichomes as well as for normal development of seed coat mucilage. Regulates the frequency of trichome initiation and determines trichome spacing. {ECO:0000269|PubMed:11844112}. |
Cis-element | |
---|---|
Source | Link |
PlantRegMap | Csa07g060030.1 |
Regulation -- Description | ||||||
---|---|---|---|---|---|---|
Source | Description | |||||
UniProt | INDUCTION: Down-regulated by GEM. {ECO:0000269|PubMed:17450124}. |
Regulation -- PlantRegMap | ||||||
---|---|---|---|---|---|---|
Source | Upstream Regulator | Target Gene | ||||
PlantRegMap | Retrieve | - |
Annotation -- Nucleotide | ||||||
---|---|---|---|---|---|---|
Source | Hit ID | E-value | Description | |||
GenBank | AF360294 | 0.0 | AF360294.1 Arabidopsis thaliana putative homeobox protein GLABRA2 (At1g79840) mRNA, complete cds. | |||
GenBank | BT001956 | 0.0 | BT001956.1 Arabidopsis thaliana clone U09291 putative homeobox protein GLABRA2 (At1g79840) mRNA, complete cds. |
Annotation -- Protein | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-value | Description | ||||
Refseq | XP_010430051.1 | 0.0 | PREDICTED: homeobox-leucine zipper protein GLABRA 2-like | ||||
Swissprot | P46607 | 0.0 | HGL2_ARATH; Homeobox-leucine zipper protein GLABRA 2 | ||||
TrEMBL | A0A178WBY6 | 0.0 | A0A178WBY6_ARATH; GL2 | ||||
STRING | XP_010473020.1 | 0.0 | (Camelina sativa) |
Orthologous Group | |||
---|---|---|---|
Lineage | Orthologous Group ID | Taxa Number | Gene Number |
Malvids | OGEM12370 | 27 | 31 |
Best hit in Arabidopsis thaliana | ||||||
---|---|---|---|---|---|---|
Hit ID | E-value | Description | ||||
AT1G79840.1 | 0.0 | HD-ZIP family protein |