PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Vocar.0002s0223.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Volvocaceae; Volvox
Family GATA
Protein Properties Length: 2027aa    MW: 209248 Da    PI: 7.1332
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Vocar.0002s0223.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA44.62e-14700733134
                 GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkg 34 
                          C+nCgtt+TplWR++     t+CnaCG+y + +g
  Vocar.0002s0223.1.p 700 CANCGTTQTPLWRKDRVTGDTVCNACGIYKQTHG 733
                          ****************76666********98886 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM004012.2E-9694751IPR000679Zinc finger, GATA-type
PROSITE profilePS5011418.861694752IPR000679Zinc finger, GATA-type
SuperFamilySSF577161.38E-12694747No hitNo description
Gene3DG3DSA:3.30.50.101.6E-16699738IPR013088Zinc finger, NHR/GATA-type
PfamPF003209.3E-12700733IPR000679Zinc finger, GATA-type
PROSITE patternPS003440700725IPR000679Zinc finger, GATA-type
CDDcd002029.32E-17700752No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0008270Molecular Functionzinc ion binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 2027 aa     Download sequence    Send to blast
MAYLNRQRGP RDEAAQVDPN IAQAILLRML APELRLDRTS EDGISPSSVV SSIGQGPSSL  60
RCDHGHPQVL PGWASYTAAF SALTNQQYQV QSQRTVHRTG PDQQALTLTS DNHMRSPGAC  120
SSPAAYAETS QLHPNYEPFQ ERPLKRLCRQ ANDQPHGGPP QPAAGVPLQP YMQLAQLLQL  180
SRHPAPNLHV PALAQSADAA LLPSATLSAA SQQPASPAPP LVADPPDAAP RPQLRLIRPQ  240
APHDITECAG QQLPSPSRSA AFHANPAPGD LSADNEVSAA AALLIQMRHQ SINLPYVDAG  300
EEVLCLSTGP AGSGGGDSGD GAAEAASEAR GGRGGGGGGS EEAPLEDMEV VELPVDVSSM  360
VDVGGGVYVT PSTAGPGISF RSLSGRQQDG FGVLAFDNVP RRPRSQVGRQ PPEPTAAHLK  420
RLSALATQPR ASAVAAAAVP ALLPGTFGGG QLPFGAAAAG QQLLGLADPL SAAVAAALAV  480
TDPAAAAAWM AGGRLAPALA SVLATHLAPR IPGAHIASGA AAAAVGLSGP NSSRGAAPGG  540
GTGARQRQPL PAAEHDDTSI NSESSAEDSR HFRSRGATRT PTPQLSAGTL PLGSQPHQQQ  600
LKASQLLSSL QQQQHHHQQQ QSRPATGRSA GASRRKTRPA SDRQRPGSAG SDPSFEVRGR  660
DGDNGGGGGG AADAPPPGAS AASWQPPPRP PRNHKGPLLC ANCGTTQTPL WRKDRVTGDT  720
VCNACGIYKQ THGFDRPVGG RQPAPQQASG KRHAALRTVP TLAPTAGGAA AGSGVGGASS  780
RGAPRSPSPQ PSRSPSRVSA PCPPPQPPPT APSTGATLGD GSLPELVKTD EEDVASASQQ  840
PLRVAAAAAA VAAAAVAAAS AAAATPAWSA QGGPGGAAAG PCRGKQIRLS LQRDIAASDG  900
LEPEPESEAL LASLCEREAP AKARDDGLGV GVGGGPSNNC MRRSIPLPCR TASGGNGDPD  960
GGIAVSPLRL DVAAAAGAAA SSGMMRVYEA RTPETAAPSD EGALAPQQPQ LLRGSSLRDW  1020
LPYGVYQPAA AAVPPPPPPL PSQSLKRQRE DWEVQPALSP QVPTESGESR GALQEGSRVP  1080
SNGARLQQQQ QRPAGSSYSA EQLSDTGGEG SDVAGAAAAA AAAAAALRTR GAVDAWEDQT  1140
RCLEGEGEGG TSMAGVLPAA LPPPRPPVPP PPASPLLLLL PPKVKTEGPE GRERDLAAAD  1200
GGGDAAAAGP SMPYSEPEEV PEQGVCEAAR PPSQQQQQPK QQPQPKQRAV TAPRSGFTAM  1260
SVASDDLTTF GDLVRLRPRA TPELRSDEVT EWEQSRQREV RQLRERQTGH WEDHHHYHHH  1320
HWQGEYSARA PGVHVPGPMA QWPQKDEADA RPAAAAAAAV QYSATVRRPA LPPGNLVCVR  1380
LPNGRLAYLQ AMAPPPQPPP RQLLAYGSRA VVEMSAGGFP LRQEVSGVSR GWDRWQEQPH  1440
GGEESDQEHR VSGGGENSDS GARWGYGTQQ QQSQQWQQQQ RRSRRDSESA WTAAAAATGT  1500
RMPYRPARHE EDENDYGDAA AVGRSTAAAA GTTSAAATSL AAATSAARVP RKLVLSADAD  1560
GTAGTGSVYI TGQDADQQSQ HSDHSDNGGW GRRRGGGDSS GCPEGDLEQA VGSSRALCTT  1620
VRQEGVPEGG SGPGSGPSRC PPPAASERGS LGSLGVPPHP LRPRLPPPTA VTTTSSQAQL  1680
QQQQQEHSEP GRQIVTAAAA AAVENLALRQ YVELYEQATA DPQEELMAWQ RRRQQRAAAV  1740
VTAAATAAQG ILSYRPPADG ELSKYDLYGD TTSADGGGSG SFSRPSPSPA ATQPDFDGAA  1800
RRSYTGTIVD RDSYGHRPRN HSHHQLPIEP TNRQRGLERP EVLDAELGPM YGRGELAVDE  1860
HPGLLYGGML PPVQASYVGM RGGTDPRPAS PLLPPSQSEG HPNSHAHLVS HPHPHLRHHT  1920
HPHPAGVFLP GPPFVVARRG PYPQQHAAAL PPRPLGHSAA QLADGPDGPR QQQQQQPPSS  1980
SGGLQAAAKH YLSAPAGPRR SSGTVVEGGR SGAAAVTSEA AATQPP*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1330338GGRGGGGGG
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002945915.10.0hypothetical protein VOLCADRAFT_86378
TrEMBLD8TIL90.0D8TIL9_VOLCA; Uncharacterized protein
STRINGXP_002945915.10.0(Volvox carteri)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
ChlorophytaeOGCP4830810