PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Vocar.0002s0221.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Volvocaceae; Volvox
Family GATA
Protein Properties Length: 1698aa    MW: 177932 Da    PI: 7.8287
Description GATA family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
Vocar.0002s0221.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GATA    46.2   6.4e-15  28     60   1          35
                 GATA  1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35
                         C nC+ttkTplWR++  g+  LCnaCG+y +++g+
  Vocar.0002s0221.1.p 28 CNNCNTTKTPLWRKEA-GEL-LCNACGIYLKSRGV 60
                         *************776.***.***********997 PP

2    GATA    40.1   5e-13    736    769  1          34
                 GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkg 34 
                          C+nCgtt+TplWR++ +  +t CnaCG+y + +g
  Vocar.0002s0221.1.p 736 CANCGTTQTPLWRKDRETGCTMCNACGIYKQTHG 769
                          *****************************98886 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:3.30.50.10  4.3E-17   22     77   IPR013088    Zinc finger, NHR/GATA-type
SMART            SM00401           4.4E-15   22     71   IPR000679    Zinc finger, GATA-type
PROSITE profile  PS50114           19.151    22     76   IPR000679    Zinc finger, GATA-type
SuperFamily      SSF57716          4.75E-12  23     73   No hit       No description
CDD              cd00202           7.76E-15  27     66   No hit       No description
Pfam             PF00320           1.0E-11   28     60   IPR000679    Zinc finger, GATA-type
PROSITE profile  PS50114           18.9      730    787  IPR000679    Zinc finger, GATA-type
SMART            SM00401           2.3E-9    730    781  IPR000679    Zinc finger, GATA-type
SuperFamily      SSF57716          2.52E-11  730    782  No hit       No description
Gene3D           G3DSA:3.30.50.10  2.1E-15   733    776  IPR013088    Zinc finger, NHR/GATA-type
CDD              cd00202           7.43E-17  736    787  No hit       No description
Pfam             PF00320           3.9E-11   736    769  IPR000679    Zinc finger, GATA-type
PROSITE pattern  PS00344           0         736    761  IPR000679    Zinc finger, GATA-type
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0008270  Molecular Function  zinc ion binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 1698 aa
MDTDAAIVPH TAPQSSSEEP SEKPDKACNN CNTTKTPLWR KEAGELLCNA CGIYLKSRGV  60
HRPVELIRAQ LNQSSRRTCA NSGGPGGDSS RTRALPGQVT RKSTRKIHER VIYGNEPADA  120
RQQRKPYSTA AVGVKIARTP SCSDAAAAYH KEMLASFLRY NGSGADRQAG GAAGYTYGYG  180
FGQCYYLSPD EDQRHEREYG AYDDDCAQPS GSRGPLPGLS PGAASERGGI RGGGTERDDG  240
SCGGDIRQDE SEDANIGPLS PAFSLGPELN EDDDAGTVAA FLIKMRRGGR RSRSRQQQRR  300
SPQRPQRVDD GPDGEELRSR QGMSRGVVPG DCCDDCDTGD ELHDKAVDAA AKPGTREAGA  360
GALLVAPWSL GVNGARQFSA VLASLAAAGQ VRGTGQTEAG AKSGGPWIKV EEAAKAGTAT  420
TARSVSPFAA LAHGTGSSGA TAAIKPDADD TWRRKHDGEE ASDPQSPESD RKLVPSPPSP  480
LNAGSGDASE TRQAKERDHL AKTAPPAGAG VSGGSGGPPG SQAALLRLLT QPDAARLLLA  540
MDPLQRLRFF AAAAVQGQAR PVAAGGGNDS GGEHEGGGGA CTDTDKYGSR HLPLGPDKDM  600
LGSARQRHHP HQDAGEAAEA LPGSSDPDRQ QGLRLRHEQP AYGARTPGAA LPEKHHRAPL  660
SRGYAAGSAD GTERVHTREQ LQGWGPAGGG DSSDPAAAME LVAAAAGGGG SSGRHASGCP  720
PPPPRPPRNH KGPLLCANCG TTQTPLWRKD RETGCTMCNA CGIYKQTHGF DRPVGGRNQL  780
PQPVSKRCAA LRTMPMRGSA AGEPHVTGRA SSSHVEPAIG AVAASLATGG ADSSLCAPAP  840
QPPQPAVVTS AMPWGVSPSA STTATNTTAA ATATAVARKC TSTDGEDRCI TTSVTGRSEA  900
VSAVGGAAGW QHQQQEQPCS VKRESGSATV GLESQQLRAD SDQGDKTEGG QAMTAACHGG  960
SAVGALEPSV QSGPWRPVSE LKPPGESDSH HQTSVPSPQT SGRGASPSPS PQQVSVGAVD  1020
DEAAGGDAAE PHQRCAWRDG ARCSASLQRP SYERESLERR GLADEAGDHA PGSDGASDGA  1080
PLAQDLGPRT AGGRAISPPP STGRGGSGDR PAGSNSSNNS SSGGAAPSSD MAIGEHLDAV  1140
DAVAGVAKVL ESNHSSVTLE AMVADEAVFR SRIPKTPQRL QLKKRQRVQP PHADWEEAEG  1200
CKGGDERQSQ HPRLCASSPA LARYFGGDAE GAGGVGSRDT LFMEKLAQQA AWLAQIAAVG  1260
GHDGEDGPLG GGGGPFQQRQ HHEDELDRQR QLLLLLRRDW NLKRGRDQVD CLSQQQQDRS  1320
QQQVPVVSQS TLLQRRISSA EVRAAGLAVG AEAAPSPSAA VAGPLQNQEQ QQKQQRPSNQ  1380
QREFSPPSSS PAPSGPSRRL TAQEQQDSHR GPSEQQLERA AGAMRQAAPT QSSLHVHVPL  1440
TQQQQPLQSR VTDNTPHLRV HDYKPRHHTH PFPAHNSQPP RQHHQQEEHP PARLRPTTVI  1500
VSGSGTQQPQ CVRIHAKPVP VHQLQQLQHL RAAGVSLGHL VVGQQQQQPA PQEQDPSRSG  1560
PPPPQSKQPH LYTHRQFPYV HHNQQQKQRN GISSAQPSTQ SQEAEPRQQH QPRWQVQVRH  1620
GPEQHEQLGH PTAVAAASFF GSPGPYPEAL SGVAAAQRPQ RASSISHPAV VPLRVVSPSA  1680
VVATEQLAKS IPTLAWI*
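
The domain coordinates in the Signature Domain table can be checked against the numbered sequence block above. The following is a minimal sketch, assuming the Start and End columns are 1-based and inclusive (which agrees with the first GATA alignment, residues 28 to 60). The helper names `parse_sequence_block` and `slice_domain` are hypothetical and not part of PlantTFDB; only the first 60-residue row is used as sample input.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join a numbered, space-grouped sequence block into one string.

    Position counters, spaces, and the trailing '*' stop marker are dropped
    because only letters are kept.
    """
    rows = [re.sub(r"[^A-Za-z]", "", line) for line in block.splitlines()]
    return "".join(rows).upper()


def slice_domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


# First row of the protein sequence, copied from the record.
first_row = (
    "MDTDAAIVPH TAPQSSSEEP SEKPDKACNN CNTTKTPLWR KEAGELLCNA CGIYLKSRGV  60"
)

seq = parse_sequence_block(first_row)
domain1 = slice_domain(seq, 28, 60)  # first GATA domain, Start=28, End=60
print(domain1)
```

The printed fragment can be compared with the query row of the first GATA alignment after removing its gap characters.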
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1178   1187  RLQLKKRQRV
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_002946273.1  0.0      hypothetical protein VOLCADRAFT_120238
TrEMBL  D8TIM1          0.0      D8TIM1_VOLCA; Uncharacterized protein
STRING  XP_002946273.1  0.0      (Volvox carteri)
Orthologous Group
Lineage       Orthologous Group ID  Taxa Number  Gene Number
Chlorophytae  OGCP4830              8            10