PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information

Field | Value |
---|---|
TF ID | IGS.gm_5_00384 |
Common Name | CHLNCDRAFT_143055 |
Organism | |
Taxonomic ID | |
Taxonomic Lineage | cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Chlorellales; Chlorellaceae; Chlorella |
Family | GATA |
Protein Properties | Length: 2459 aa; MW: 258021 Da; pI: 5.0252 |
Description | GATA family protein |
Gene Model | |
Signature Domain

No. | Domain | Score | E-value | Start | End | HMM Start | HMM End |
---|---|---|---|---|---|---|---|
1 | GATA | 22.1 | 2.1e-07 | 1124 | 1153 | 1 | 29 |

Alignment of domain 1 (HMM consensus, match line, query sequence, posterior probabilities):

              GATA    1 CsnCgttkTplWRrgp..dgnktLCnaCGly 29
                        C  Cgtt++++WR+ p   g++ LC  CG +
    IGS.gm_5_00384 1124 CPVCGTTSSTQWRKPPgeAGHI-LCDDCGKR 1153
                        888**********99999****.******65 PP
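The Start and End values of the domain hit are 1-based, inclusive residue positions in the protein. As a minimal sketch of how such coordinates map onto the sequence, the snippet below cuts the GATA domain out of a fragment covering residues 1081 to 1200, copied from this record's protein sequence. The helper name `slice_1based` and its `offset` parameter are hypothetical and are not part of PlantTFDB.

```python
# Residues 1081-1200 of IGS.gm_5_00384, copied from this record's sequence.
FRAGMENT_START = 1081
FRAGMENT = (
    "LFGKLELSAGSRLQLRVAAPKRELPLVECTLLMPEAVAAIGRKCPVCGTTSSTQWRKPPG"
    "EAGHILCDDCGKRVWQQRAESLQQQQQQPLQPQQQPLHEQPRDQPAVGTAAVRQCTHCGF"
)


def slice_1based(fragment: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the position in the full protein of the fragment's
    first residue (hypothetical helper, for illustration only).
    """
    if start < offset or end < start or end - offset >= len(fragment):
        raise ValueError("requested range lies outside the fragment")
    return fragment[start - offset : end - offset + 1]


# Coordinates taken from the Signature Domain row (Start 1124, End 1153).
gata_domain = slice_1based(FRAGMENT, 1124, 1153, offset=FRAGMENT_START)
print(gata_domain)
```

The extracted string should correspond to the query line of the alignment once the gap character is dropped and the lowercase insert residues are uppercased.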
Protein Features

Database | Entry ID | E-value | Start | End | InterPro ID | Description |
---|---|---|---|---|---|---|
Gene3D | G3DSA:3.30.50.10 | 1.8E-4 | 773 | 823 | IPR013088 | Zinc finger, NHR/GATA-type |
PROSITE profile | PS50114 | 9.684 | 1118 | 1151 | IPR000679 | Zinc finger, GATA-type |
SMART | SM00401 | 0.0029 | 1118 | 1167 | IPR000679 | Zinc finger, GATA-type |
SuperFamily | SSF57716 | 3.52E-6 | 1120 | 1155 | No hit | No description |
Gene3D | G3DSA:3.30.50.10 | 1.2E-6 | 1121 | 1156 | IPR013088 | Zinc finger, NHR/GATA-type |
SMART | SM00401 | 0.069 | 1189 | 1246 | IPR000679 | Zinc finger, GATA-type |
Gene3D | G3DSA:3.30.50.10 | 6.6E-4 | 1193 | 1246 | IPR013088 | Zinc finger, NHR/GATA-type |
SMART | SM00401 | 0.24 | 1781 | 1838 | IPR000679 | Zinc finger, GATA-type |
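Several of the feature rows cover overlapping coordinates. One rough way to see how many distinct regions they span is to merge overlapping intervals. This is only a sketch: `merge_intervals` is a hypothetical helper, and it treats all eight rows equally, regardless of source database or score.

```python
# (start, end) pairs copied from the Protein Features rows, 1-based inclusive.
hits = [
    (773, 823), (1118, 1151), (1118, 1167), (1120, 1155),
    (1121, 1156), (1189, 1246), (1193, 1246), (1781, 1838),
]


def merge_intervals(intervals):
    """Merge overlapping (start, end) intervals into non-overlapping regions."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


regions = merge_intervals(hits)
print(regions)
```

Under these assumptions the hits collapse into four regions, one of which contains the GATA signature domain at 1124 to 1153.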
Gene Ontology

GO Term | GO Category | GO Description |
---|---|---|
GO:0006355 | Biological Process | regulation of transcription, DNA-templated |
GO:0003700 | Molecular Function | transcription factor activity, sequence-specific DNA binding |
GO:0008270 | Molecular Function | zinc ion binding |
GO:0043565 | Molecular Function | sequence-specific DNA binding |
Sequence

Protein Sequence Length: 2459 aa

MSAEVAHQGG FEADATAGGA APAPTSSGEA AQRLQQQAGA WVGGFLEDVQ ELGEGMAPLP 60
PELPHEAGSG GQDAGVLARP LRGKAQASQH DAESYAEAAV AAARRRTGAA DVASAAGAAD 120
GGQSGAPPSG QHAAVAEAEQ EQEQPAACTA AVLLPTPDMP LAEALAPHSA AAADQPLLEA 180
GRAASEQEAA AVEAVPAQPA VPGAEHEVED VTAAAQQQQY DTLLGMSGAA QQQLVPGVYD 240
PAVAQQQAFD ALVIGPLRTA QAAGGTGGAV APGATGAAGT TYTFEASGSG PAGRPPSQQA 300
AASGGRGRPG RPASQLAGAS NDAVASGMAS ALEQAQQEVE DQHPEQGVTE GHALPVVGVP 360
ATAAGEPVEQ SVQAPKHGGT KFRVRVPSNA RLGAAAAPPA APHGSRGVSE APPGSGKYKV 420
QARDVRIAGY QSEALAAAAH DHARIWWALR QQQPEGPGTA AQAASQDQVQ AAVLGLQGEL 480
NFPPVKFSSD AALLALLSGA TRQQLEEQLQ PFRLQQWEAA ARSAGQQQLQ AGVGSAAALN 540
QDNTTRFVGV LRITGSVKLQ AAITVDGTRE HLGAFDTPVE AAVVRDIGII WKQLHGLAGD 600
AMLNLPHSNL DKDAELVQEL RDAANPNALR AVLVRWVPKR LQDVVASLPG AEAGSQRAAG 660
TTLARVGSGA VHGSELPASP PPSAAGAAQG PPAAEDVGQP GAAHGAGALA CNGTSAGGGS 720
SGNSQQTSQQ QAGDGPEGGG GSSAGEGSED RNEEGDGDEE EQPGDSSDVS GSQLCSHCGD 780
DQCTGYQRHP SSGKLLCANC FSYLQRTGLD RPADVIQAYF ANRAAGMHAR TGRGNGHGSG 840
RGSGRDNSSS RRAAAGLDVV RGPGGSNRQA AAGRARQAAA RHPEDTVLSS DEGAAAADDE 900
SSEEESDLKE RRLRRQAQRH LDVQLQLRRQ PIAASAQGRV LPRRRNAAAG VAAAVAAAGA 960
SSSEEDEGRP ATSSDYEQQE ASESEEEFAE SEAEPVQQQQ HAAASAGGDG ASAFEWEDAG 1020
HSRMLKTINV AEAEARAFIF PESTWDDLAG GSNREVLVRH AYSGEEWRGY LSGTTLRLPG 1080
LFGKLELSAG SRLQLRVAAP KRELPLVECT LLMPEAVAAI GRKCPVCGTT SSTQWRKPPG 1140
EAGHILCDDC GKRVWQQRAE SLQQQQQQPL QPQQQPLHEQ PRDQPAVGTA AVRQCTHCGF 1200
LQPGPAGRQY YWRRHPTTRE QLCAPCGRYA DKHGGELPEL LEEEEQEDSE EQQDEDSEED 1260
EEEHPQRQPP QRQCLQCGST SKGSAKWAGW HRHPASGEEW LCHPCYRKVH NAIKRQQRQA 1320
EKQAQQEVES EVEPDEEEQS EEEEEEAAIA AAVKQCTHCG SLRPGPPGTK YSWRRHPTSK 1380
EQLCGRCWGH AHRHGGELPA LSDAEGEQEQ SEEEEEEQPP QQPQERQCLQ CASPTKGSGR 1440
WATWHRHPAT GEEWLCQPCF NKARAAIKRQ QRQPEKHPQE VESEEEEAQS EEEEEEEQLQ 1500
QQPPQRQCLQ CGSTSKGSSK SAGWHRHPAT GEEWLCQPCF NKARTAAMRQ QRQPEKHPQE 1560
VESEEEEAQS EEEEEEEQLQ QQPQQRQCLQ CGSTSKGSTK SAGWHRHPAT GEEWLCHPCY 1620
RKVHNAIKRQ QRQAEKQAQQ EEKSEVEPEG EEQSEEEEAA VAAAVKQCTH CGSLRPGPPG 1680
TSYSWRWHPT SKEQLCGPCC GYSDRHGGEL PVVAWKEAAA AATNQPKKKQ QGKRKQHGRQ 1740
PPAAPAVDEE PPVSPRQHKQ QRVADAATAV GAEGQSDAGA AGEQRLCSHC GADHAGPTPT 1800
YPWRKHATTG ARLCKDCWDF SRKHYGFLPE LPQPAEPPQP SQPEGQEEQP AAGKGRQRSG 1860
TRWASGQEQG DAAAGDAEAL QAPVLLSLPG RRVTRHQQHQ QEEQQHGAVE VEAQAATAAG 1920
QASRRRRRLS EVEGEAAAEE QQQPPSASEH KRGRRQLVQA EQEQTALLGS MHKRGRRQQE 1980
EQAEDTAEAE AGEKQKEQAA EPAAKRRRGK QQEVPEPPAA AVAAAPAEPV GASQASDSAG 2040
ATGRGGGECR WVNADSTALE VEVNGTCILS ETLVLPAAMA AKLTGLQAGG HLRLRDARTA 2100
NNRTAWNASL SDGRIEGKKL FSKLDVQEGD RLLLHRQAPA AAARAPGGRP PLLVAAKVAR 2160
EGPGEQALQQ ALAMAAAAAG VSAPAAVPGS GGEGAEPADG AGAGAPAAQQ EQPPPQDVAP 2220
AAAAAEAAMD ADEARQEQEP GAAAAEAQQQ AGEAEPAAAA VAAVQPAPAE GAGPRPELEG 2280
SMAGSGGGQQ EQAQHPGVAL RMFKQAREEP ARSSAPRGAG GNAAAGQRGP AAGGPPPAEP 2340
AGEPPAAAAA PAEPPPAAAG APPAVGVAAP GGSPASGASK EANDVLERLI LVLGSDAAAR 2400
QAGLTIDLIT DHQAQFERMD LKQQHDRAKP LTMLLEMRMF AEAVRFMEKL VANNNLSA*
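The sequence above is displayed in blocks of ten residues with a running position counter and a trailing `*` stop marker, so it needs cleaning before it can be passed to other tools. A minimal sketch of that cleanup follows; `clean_sequence` is a hypothetical helper, and the sample is just the first two display rows of this record.

```python
import re


def clean_sequence(block: str) -> str:
    """Drop position counters, whitespace and a trailing '*' stop marker."""
    residues = re.sub(r"[\d\s]", "", block)
    return residues.rstrip("*")


# First two display rows of the protein sequence, copied from this record.
sample = """
MSAEVAHQGG FEADATAGGA APAPTSSGEA AQRLQQQAGA WVGGFLEDVQ ELGEGMAPLP 60
PELPHEAGSG GQDAGVLARP LRGKAQASQH DAESYAEAAV AAARRRTGAA DVASAAGAAD 120
"""

seq = clean_sequence(sample)
print(len(seq), seq[:10])
```

Comparing the cleaned length against the running counter at the end of each row is a cheap check that no residues were lost when copying.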
Annotation -- Protein

Source | Hit ID | E-value | Description |
---|---|---|---|
Refseq | XP_005849601.1 | 0.0 | hypothetical protein CHLNCDRAFT_143055 |
TrEMBL | E1Z9D6 | 0.0 | E1Z9D6_CHLVA; Uncharacterized protein |
STRING | XP_005849601.1 | 0.0 | (Chlorella variabilis) |
Link Out

Database | Entry ID |
---|---|
Entrez Gene | 17357168 |