PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PNH03891.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Tetrabaenaceae; Tetrabaena
Family GATA
Protein Properties Length: 1395aa    MW: 146516 Da    PI: 9.6005
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PNH03891.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA36.85.5e-12771804134
        GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkg 34 
                 CsnCgt +TplWR++     t+CnaCG+y + +g
  PNH03891.1 771 CSNCGTAHTPLWRKDRATGLTVCNACGIYKQTHG 804
                 ****************77777********99886 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1395 aa     Download sequence    
MASRKRPDPM EDAQVLPLAC AKTPRLHFSG GVRLHLAQHP RTSQMRPSPE ARDCAGPADR  60
PTWTVHALPA HPSAVAQPGT RGGTAAGQQP AQADRHGLPG RAATGLQQGI YSDAGGQVET  120
QQEALEPPPW GSGEYGGQRC TITGARSGLQ ERGSTAARDG MLGSPSGAGD EAPHKDKLQQ  180
LRLQQLRMEL LNQLLQQQRQ HHQQQRHEQQ QQAADLAAMA PPPFQGPQQH QRPSQLQQHL  240
ARLALQVQLE NGHGHKAAGA SGQGQWAEGR WVQVASPRLQ PASDAADASP PERAWHGAGV  300
LQLPSRGPHA PVGGHHELRH PLPPAHPQQQ AQQQAQQQQA PPLAASTQWL VPLSPSTGPP  360
LRVLQQQPPH HQQQQTALPL QGLTPCGNGC KAEQQCEEPL AADRDLSVDE DARTVAAVLV  420
RILHSRAPQD GVSGGPGSKS LEGMAVCEEL ANVAGQCSAY NSGQEEEGRG TRVASDWWAM  480
EEAPLPDVEV LELHAAEDGM VSAGNGLFLT PTSTGPGISF RQLVGEQADH PFPDLPRRPR  540
TRPAWPLKPL SVNHSARLRA IANSGPAARR LPRRHPRAAM GRAACPSGAQ PPPLLATMAD  600
PAATTAAAGG PAASPPEHQP PEPPAVPERL ACAVAATVDV AASATEPLGA AAGGGAERAG  660
APQPLLRKQS LPSEPREPSP AAPAAAATAR SGAPAPAPLH APAAGHTYRG VDTGGASAAA  720
PPCSRPVLTA AARQPPNPHG SATAPPGGPP PSSRPPPAPR PPRNHLGPLL CSNCGTAHTP  780
LWRKDRATGL TVCNACGIYK QTHGCERPVD RWNEGPQPTR RSGPAPRAAA PAAASSPPPP  840
PQYVPVARQG GRGHAPDGAP HPQLARLQAG ADAGGLPAGL RWAGEAREGG PAQQQEQQRA  900
RRAADGGPPF QQRAWDQQLG AQQGVGWGRD PADGGAAGSP RSREHPPPPD GDRRRGRPTS  960
PKPRSGRGGG AVAGPRGGAG GAEARAGGGG GEGAALQQSR NGGDGGPAAA GLQLAGGRRA  1020
RDEQYDTAEL PPPRRVAREG RGAGGEQPAP QRMPRDEWAQ RAHELVGLMG LDDGGAALRP  1080
QLQEAGGARF VQLSQGQVEY LPVARSVRSS RQARDDSGLY DMALGAGSRR EPAGEERPWP  1140
RQQQASAGPA SPRELIAEVL GGADGDSNRL LLQHLINQVR QRRQQMMPEL PYEQQQAQHR  1200
PLDSPDRGPV DGWRGDPWPQ ADARPHRLRE QAASAAAAPG AGSSLLLQQL MQRLHQQQQQ  1260
QQGDEEFYRD RHQQPLTQPS QRNEQPAQHD RRPRGPARGE AALLPYPGKH APLSAGSPGG  1320
NGSSGPRQQG RYGSDSRAPS VVVNLELLAL QLEQQQQQGL LGQQAMSLRD FQAAGMFAQQ  1380
GGNAGGGRGQ RQYNV