PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | |
---|---|
TF ID | Vocar.0017s0086.2.p |
Organism | |
Taxonomic ID | |
Taxonomic Lineage | cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Volvocaceae; Volvox |
Family | CPP |
Protein Properties | Length: 2239 aa; MW: 221808 Da; pI: 7.0325 |
Description | CPP family protein |
Gene Model | |
Signature Domain | |||||||
---|---|---|---|---|---|---|---|
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End |
1 | TCR | 44.3 | 3.5e-14 | 1070 | 1109 | 2 | 42 |
2 | TCR | 46.3 | 8.3e-15 | 1141 | 1179 | 1 | 39 |

Domain 1 alignment:

                    TCR    2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkeek 42
                             ++k+C+Ckks+Clk+YC+Cfaag++C++ C+C +C+N+ e+
    Vocar.0017s0086.2.p 1070 SSKSCRCKKSQCLKLYCDCFAAGQYCGS-CSCISCHNRPEH 1109
                             689*************************.********9875 PP

Domain 2 alignment:

                    TCR    1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39
                             k+k+gCnC+ks+ClkkYCeC++ g+kC+ +C+C +C+N
    Vocar.0017s0086.2.p 1141 KHKRGCNCRKSHCLKKYCECYQGGVKCGIQCTCMECENM 1179
                             589***********************************7 PP
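The Start and End columns of the Signature Domain table can be cross-checked against the aligned query rows: once gap characters are removed, each row should contain exactly End - Start + 1 residues. The following is a minimal Python sketch of that check, assuming the aligned rows are copied by hand from the alignments in this entry. The helper names `ungapped` and `coordinates_consistent` are hypothetical and are not part of any PlantTFDB tool.

```python
# Domain hits copied from the Signature Domain table of this entry:
# (start, end, aligned query row, gap characters included).
hits = [
    (1070, 1109, "SSKSCRCKKSQCLKLYCDCFAAGQYCGS-CSCISCHNRPEH"),
    (1141, 1179, "KHKRGCNCRKSHCLKKYCECYQGGVKCGIQCTCMECENM"),
]


def ungapped(aligned: str) -> str:
    """Drop alignment gap characters ('-' and '.') from an aligned row."""
    return aligned.replace("-", "").replace(".", "")


def coordinates_consistent(start: int, end: int, aligned: str) -> bool:
    """True when the ungapped row length equals the inclusive span end - start + 1."""
    return len(ungapped(aligned)) == end - start + 1


# One boolean per domain hit.
results = [coordinates_consistent(*hit) for hit in hits]

# Cysteine count per hit; the CXC domain is cysteine-rich, so this is a quick sanity check.
cysteines = [ungapped(aligned).count("C") for _, _, aligned in hits]

print(results, cysteines)
```

The same check could be applied to any other hit whose aligned row and coordinates are available.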
Protein Features | ||||||
---|---|---|---|---|---|---|
Database | Entry ID | E-value | Start | End | InterPro ID | Description |
SMART | SM01114 | 9.5E-14 | 1069 | 1109 | IPR033467 | Tesmin/TSO1-like CXC domain |
PROSITE profile | PS51634 | 30.722 | 1070 | 1181 | IPR005172 | CRC domain |
Pfam | PF03638 | 1.3E-11 | 1072 | 1106 | IPR005172 | CRC domain |
SMART | SM01114 | 1.4E-13 | 1141 | 1182 | IPR033467 | Tesmin/TSO1-like CXC domain |
Pfam | PF03638 | 4.7E-11 | 1143 | 1179 | IPR005172 | CRC domain |
Gene Ontology | ||||||
---|---|---|---|---|---|---|
GO Term | GO Category | GO Description | ||||
GO:0044212 | Molecular Function | transcription regulatory region DNA binding |
Sequence |
---|
Protein Sequence Length: 2239 aa |
MRRSSPPREP GVGPEGAKAS ARPGLRTVPT MPNKRQAIQG YDVDNSLGSP LFPRDYDQLR 60
NLLGSPLPPL HSPPLFQPSP RRSILTSPAR PAANQRPSNQ IPSNNSHRHN PDSMDALASF 120
FAPSPALPVP LFSPSAAAPN SLFDTSVNRA RADVFTPTPC KSGALDHVTA LLHDVGRGGQ 180
SGAGSDPIGV HLQHQRCNND SAVPTNAATD PHHHEGGSGG SGSGSGSGGT AIPVNVASAS 240
ASAGISCTAQ LAAAALRSQA NKASSSHGFH MIHDGGFGFG FGRSQATPGL MLSLSGQGGF 300
GPIHHPVPGY PSDLLGISGP GSHMFGGSGG AAAGIDGGSG NGGGLYGSFC GIANGGGGGC 360
AGLQLNVKDM LLAHHDDDDR SGGLLTSMPS FCGGGGSGGG GGGFGSGLGL GRPRSRYLDF 420
CTTPRHSADA AASGGAAAAP DATSGDGARA GGSGSSGGVA SNVGAGGGSG SSVGIGEGVG 480
VVGSVEGGRC GGGGIYPGGL SSRHHGGGLL LAPQLPAPPL LQTETNTATT GSNTGQHSRG 540
ADGAAESPPE SLDEPQKPVL SYPSPNEETR IAIAGGCVKV VESGASRTTV ETPGVELGCG 600
PSGGGGGASG GITSTEVAAG PAAAATGTER MATVSASAAG GGGGGGSNGV LVPSSGGSFG 660
LRTESLLVLP GTSVAVPGGG GGGLMVPVSG SGRGGGGNDG GGCDLSSSME ADTMQYSAGG 720
RGSGAGGVCL DRDLGMSSPS LLPPPPLSLG GLMPSSFSVP PHLQPNHHHL QHHHHHHHHQ 780
QQQQQQHYQS SAMTSSMMDL SMARGGSGAG TGASTLMMVS EGGGAMPGLQ PLSQMQMPLC 840
EDDKSFVKRQ IQQQQMQQQQ MQQQQMQQQQ PLVRGGSGTF AVRMASGGSG AGGGGGGGVN 900
GSSGGLLQGR DAAAAVAASG GCERPGVGLP PRSGGGGGGG INVMPGGGMP GAAVAAASVG 960
GAGGGGAGNG GASTPSTLQR PQRTRTASSY GGGGAGGAMN EGTVMSMRGA PSMDFDVVVP 1020
ELELSPDFPG RGGINANANP NAHRSSGAGG GLTQIQGGGP NRGRRTSENS SKSCRCKKSQ 1080
CLKLYCDCFA AGQYCGSCSC ISCHNRPEHA DRVLQRREDI AARDPQAFTR KIQLAPNGNG 1140
KHKRGCNCRK SHCLKKYCEC YQGGVKCGIQ CTCMECENMD VGSSQEGAGA RGALKRGGAA 1200
AKGAGGRAGG GGGGGGSRAG SRRSSATGMY DDYAPSPPLP STSGCSDGPS PTPSQGTVPG 1260
SVMLQPPPPL ASMPSLTVAA AAAAAAAATT AATTNHFAMS LGSGGDGAAG CTASMPYGGG 1320
HALSAGVVQF SEDGTVRRNS TNSLSHSQAP PVAQPPQLLP PSQQQQQQLQ SQMQSMPAPL 1380
PPNFLRQQQQ QEVQLQPMHS HPHHQQQQQE QQESPSLICG EMLQQQQQQQ QQQQQQQQQQ 1440
QQQQQQQQQQ QQQQQQQQQY YQQQQMVKRS LPPELYGSGS GSDAVARDTC CRGDGGDGDG 1500
EILPGNLRDF QGVVRDEMDE DAEEEEEEEE GDGPSQEQLG PLKRRRKQEL GRRTAATPLP 1560
LPSDHPTAPT SESSALATGG TWAEEARNSA GCNNRRGAAA TVAAAHASDN NADNAYPRTE 1620
GGGMGDMTLA AVGTAGGDLG PSGAGAAAAM AAAAAMPPPP SGQDLRFSLG PEPPGFTPRG 1680
LGISSLDVVS PPPLSMLTHL ESDTDSDGGG LEGAGGGLGC RPRRRSAQHQ YRQQYMNTGG 1740
AAVAAAAGGG GGGGGGAADV PHPSALRRNG GSRHQSHGMV GLDVGVMDFD DSSSALADAM 1800
ITAIADEASR GPMAGGGECV ATTAGAAGTA AAAAAAAAAA GRHQHRQSGE AAAAAGSDAA 1860
TAREGGGVLL CGELLGSGCL LDDGSNDMFL AGFEPNSVEG ARGCGSFGFG SGGSGGGFLS 1920
PRFGGMGGGN SSFGLATSPT AFPRSGGVNG SLGLCAVSPQ WRVRPPGLGP MGEVAAAVGP 1980
PLGLMSSNSW LHLPHRRPSR FAPTRVNGGS GGGGSSAMSY DPSQLPPWPP VASVTTCTVG 2040
GPLVPELCGL DSGLALKGGH LDMGRAGAAA AAAASVHTLV SPVRTSAMSA AALARRRETG 2100
GPEGDASRAP SYQGAPSGCG DEQRESGPWV VPAAHHHLEA ASSPSKQQRC FMATAAGGGG 2160
NLAAPLQLPQ PSAMTPGEQP QFDILTGGSG GGHPRTQPPG GGRGGSRNGG GCAGAAGGGA 2220
SANRPPRAGS FVADNGAA*
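The sequence above is displayed in blocks of ten residues with running position counters, so it must be cleaned before it can be used as plain residue text. The following is a minimal Python sketch of one way to do that, assuming the display contains only residue letters, digits, whitespace, pipe characters and a trailing `*` terminator. The function name `clean_sequence` is hypothetical, and only the first and last displayed rows are used as sample input.

```python
import re


def clean_sequence(display_text: str) -> str:
    """Keep residue letters only.

    Removes position counters, whitespace, table pipes and the
    trailing '*' terminator from a displayed sequence row.
    """
    letters = re.sub(r"[^A-Za-z]", "", display_text)
    return letters.upper()


# First and last displayed rows of this entry, copied verbatim.
first_row = "MRRSSPPREP GVGPEGAKAS ARPGLRTVPT MPNKRQAIQG YDVDNSLGSP LFPRDYDQLR 60"
last_row = "SANRPPRAGS FVADNGAA* |"

print(len(clean_sequence(first_row)))
print(clean_sequence(last_row))
```

Passing the whole displayed block through the same function would yield the residue string in one piece. Note that this sketch discards the `*` terminator, which matters when comparing the result against a stated length.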
3D Structure | ||||||
---|---|---|---|---|---|---|
PDB ID | Evalue | Query Start | Query End | Hit Start | Hit End | Description |
5fd3_A | 3e-23 | 1067 | 1180 | 4 | 122 | Protein lin-54 homolog |
5fd3_B | 3e-23 | 1067 | 1180 | 4 | 122 | Protein lin-54 homolog |
Nuclear Localization Signal | |||
---|---|---|---|
No. | Start | End | Sequence |
1 | 1204 | 1212 | GGRAGGGGG |
Binding Motif | |||
---|---|---|---|
Motif ID | Method | Source | Motif file |
MP00624 | PBM | Transfer from PK22848.1 | Download |
Regulation -- PlantRegMap | ||||||
---|---|---|---|---|---|---|
Source | Upstream Regulator | Target Gene | ||||
PlantRegMap | - | Retrieve |
Annotation -- Protein | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-value | Description | ||||
Refseq | XP_002953265.1 | 0.0 | hypothetical protein VOLCADRAFT_94037 | ||||
TrEMBL | D8U3R6 | 0.0 | D8U3R6_VOLCA; Uncharacterized protein | ||||
STRING | XP_002953265.1 | 0.0 | (Volvox carteri) |
Best hit in Arabidopsis thaliana | ||||||
---|---|---|---|---|---|---|
Hit ID | E-value | Description | ||||
AT3G22760.1 | 5e-29 | Tesmin/TSO1-like CXC domain-containing protein |
Link Out | |
---|---|
Phytozome | Vocar.0017s0086.2.p |