PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID MDP0000309902
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Maloideae; Maleae; Malus
Family GATA
Protein Properties Length: 1161aa    MW: 129804 Da    PI: 8.8904
Description GATA family protein
Gene Model
Gene Model ID    Type     Source   Coding Sequence
MDP0000309902    genome   GDR      View CDS
Signature Domain
No.   Domain   Score   E-value   Start   End   HMM Start   HMM End
1     GATA     57.8    1.4e-18   656     688   1           33
           GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkk 33 
                    C+ C++t+TplWR+gp g+k+LCnaCG++yrkk
  MDP0000309902 656 CTDCKATETPLWRSGPAGPKSLCNACGIRYRKK 688
                    *******************************97 PP
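The alignment above appears to follow the usual HMMER layout: model consensus, match line, query residues, then a posterior-probability line. Assuming that layout, the short Python sketch below recounts the identical and conservatively substituted columns from the three strings as printed. The variable and function names are illustrative and not part of PlantTFDB.

```python
# Strings copied from the alignment block (33 columns each).
CONSENSUS = "CsnCgttkTplWRrgpdgnktLCnaCGlyyrkk"  # GATA HMM, columns 1-33
MIDLINE = "C+ C++t+TplWR+gp g+k+LCnaCG++yrkk"    # match line
QUERY = "CTDCKATETPLWRSGPAGPKSLCNACGIRYRKK"      # MDP0000309902, residues 656-688


def count_identities(consensus: str, query: str) -> int:
    """Count columns where the query residue equals the consensus residue."""
    return sum(c.upper() == q.upper() for c, q in zip(consensus, query))


n_columns = len(QUERY)
n_identical = count_identities(CONSENSUS, QUERY)
# In the match line, a letter marks an identity and '+' a positive-scoring
# substitution (assumed HMMER convention); a blank marks neither.
n_letters = sum(ch.isalpha() for ch in MIDLINE)
n_positive = MIDLINE.count("+")

print(f"{n_identical}/{n_columns} identical, {n_positive} conservative")
```

The identity count computed from the consensus and query lines agrees with the number of letters in the match line (22 of 33 columns), which is a quick check that the three strings were transcribed consistently.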

Protein Features
Database         Entry ID           E-value    Start  End   InterPro ID  Description
PROSITE profile  PS51214            15.509     1      58    IPR002652    Importin-alpha, importin-beta-binding domain
SuperFamily      SSF48371           8.45E-131  12     499   IPR016024    Armadillo-type fold
Pfam             PF01749            3.6E-21    12     94    IPR002652    Importin-alpha, importin-beta-binding domain
Gene3D           G3DSA:1.25.10.10   2.1E-138   12     496   IPR011989    Armadillo-like helical
SMART            SM00185            3.8E-9     104    145   IPR000225    Armadillo
Pfam             PF00514            2.7E-10    105    145   IPR000225    Armadillo
CDD              cd00020            3.44E-35   109    230   No hit       No description
PROSITE profile  PS50176            10.762     115    158   IPR000225    Armadillo
SMART            SM00185            7.1E-11    147    187   IPR000225    Armadillo
Pfam             PF00514            5.7E-13    148    186   IPR000225    Armadillo
PROSITE profile  PS50176            11.742     158    200   IPR000225    Armadillo
SMART            SM00185            0.83       188    230   IPR000225    Armadillo
Pfam             PF00514            3.8E-7     189    230   IPR000225    Armadillo
SMART            SM00185            0.0049     232    271   IPR000225    Armadillo
PROSITE profile  PS50176            10.482     242    284   IPR000225    Armadillo
CDD              cd00020            9.33E-34   243    356   No hit       No description
Pfam             PF00514            1.1E-7     243    270   IPR000225    Armadillo
SMART            SM00185            1.5E-6     273    313   IPR000225    Armadillo
Pfam             PF00514            1.3E-8     274    312   IPR000225    Armadillo
SMART            SM00185            8.6E-9     315    356   IPR000225    Armadillo
Pfam             PF00514            1.9E-9     316    356   IPR000225    Armadillo
SMART            SM00185            6.3E-8     358    398   IPR000225    Armadillo
Pfam             PF00514            3.6E-11    358    397   IPR000225    Armadillo
CDD              cd00020            2.55E-23   364    494   No hit       No description
SMART            SM00185            2.7E-5     401    441   IPR000225    Armadillo
Pfam             PF00514            3.0E-6     402    439   IPR000225    Armadillo
Pfam             PF16186            6.4E-19    456    501   IPR032413    Atypical Arm repeat
SuperFamily      SSF57716           9.03E-11   650    693   No hit       No description
PROSITE profile  PS50114            12.492     650    686   IPR000679    Zinc finger, GATA-type
SMART            SM00401            1.3E-12    650    706   IPR000679    Zinc finger, GATA-type
Gene3D           G3DSA:3.30.50.10   2.0E-14    654    688   IPR013088    Zinc finger, NHR/GATA-type
CDD              cd00202            2.49E-12   656    688   No hit       No description
PROSITE pattern  PS00344            0          656    681   IPR000679    Zinc finger, GATA-type
Pfam             PF00320            1.7E-16    656    688   IPR000679    Zinc finger, GATA-type
SuperFamily      SSF53098           1.24E-37   956    1132  IPR012337    Ribonuclease H-like domain
Gene3D           G3DSA:3.30.420.10  2.3E-38    958    1132  IPR012337    Ribonuclease H-like domain
SMART            SM00479            6.4E-36    958    1137  IPR013520    Exonuclease, RNase T/DNA polymerase III
CDD              cd06127            5.45E-37   960    1129  No hit       No description
Pfam             PF00929            8.0E-21    960    1126  IPR013520    Exonuclease, RNase T/DNA polymerase III
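The Pfam rows of the Protein Features table can be condensed into a domain architecture by collapsing consecutive hits that share an entry ID. The Python sketch below is one possible way to do this. The coordinates are read from the Pfam rows of the table, while the tuple layout and the `collapse_repeats` helper are hypothetical names introduced only for illustration.

```python
from itertools import groupby

# (Pfam ID, start, end, description), transcribed from the Pfam rows above.
PFAM_HITS = [
    ("PF01749", 12, 94, "Importin-alpha, importin-beta-binding domain"),
    ("PF00514", 105, 145, "Armadillo"),
    ("PF00514", 148, 186, "Armadillo"),
    ("PF00514", 189, 230, "Armadillo"),
    ("PF00514", 243, 270, "Armadillo"),
    ("PF00514", 274, 312, "Armadillo"),
    ("PF00514", 316, 356, "Armadillo"),
    ("PF00514", 358, 397, "Armadillo"),
    ("PF00514", 402, 439, "Armadillo"),
    ("PF16186", 456, 501, "Atypical Arm repeat"),
    ("PF00320", 656, 688, "Zinc finger, GATA-type"),
    ("PF00929", 960, 1126, "Exonuclease, RNase T/DNA polymerase III"),
]


def collapse_repeats(hits):
    """Merge runs of consecutive hits with the same Pfam ID into one region."""
    ordered = sorted(hits, key=lambda hit: hit[1])  # order by start coordinate
    regions = []
    for pfam_id, run in groupby(ordered, key=lambda hit: hit[0]):
        run = list(run)
        regions.append({
            "pfam": pfam_id,
            "description": run[0][3],
            "start": run[0][1],
            "end": max(hit[2] for hit in run),
            "copies": len(run),
        })
    return regions


regions = collapse_repeats(PFAM_HITS)
for region in regions:
    print(f'{region["start"]:>5}-{region["end"]:<5} '
          f'{region["pfam"]} x{region["copies"]}  {region["description"]}')
```

Read this way, the record shows an importin-alpha-like N-terminal half (an importin-beta-binding domain followed by eight Armadillo repeat hits), a single GATA zinc finger, and a C-terminal exonuclease domain.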
Gene Ontology
GO Term      GO Category         GO Description
GO:0006355   Biological Process  regulation of transcription, DNA-templated
GO:0006606   Biological Process  protein import into nucleus
GO:0005618   Cellular Component  cell wall
GO:0005635   Cellular Component  nuclear envelope
GO:0005730   Cellular Component  nucleolus
GO:0005829   Cellular Component  cytosol
GO:0003700   Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0005515   Molecular Function  protein binding
GO:0008270   Molecular Function  zinc ion binding
GO:0008565   Molecular Function  protein transporter activity
GO:0043565   Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 1161 aa
MSLRPNARAE VRRNRYKVAV DAEEGRRRRE DNMVEIRKNR REESLQKKRR EGLQTQQLPS  60
SLHSAGLDKK LEHLPSMVAG VWGDEGTMQL EATTQFRKLL SIERSPPIEE VIQAGVVPRF  120
VEFLMREDFP QLQFEAAWAL TNIASGTSDN TRVVIDHGAV PIFVKLLSSP SDDVREQAVW  180
ALGNVAGDSP RCRDLVLGNG ALLPLLSQLN ENAKLSMLRN ATWTLSNFCR GKPQPPFEQV  240
KPALPALARL IHSNDEEVLT DACWALSYLS DGTNDKIQAV IESGVCPRLV ELLMHPSPSV  300
LIPALRTVGN IVTGDDMQTQ VIIQHQALPC LLNLLTNNYK KSIKKEACWT ISNITAGNKE  360
QIRAVVEANI IGPLVNLLQN AEFDIKKEAA WAISNATSGG SHEQIKFLVS QGCIKPLCDL  420
LVCPDPRIVT VCLEGLENIL KVGEAEKNLG TTGGVNLYAQ AIEDADGLEK IESLQSHDNT  480
EIYEKAVKML ETYWLEDDDE TMPPGDAAPT TFNFGGNDVP TVPSETFLSK AIIQSGQVRS  540
GQVKRHVMFE SCCRVECCRG WLAQVFEPAG LNRFRLPWAL IIGTSPTQIK GKNRSXFFWS  600
RYFRWSDGVA DGVFSHWHEA IMHMHFDADS TFSNCHQLFM VIQDMNVKNV KNKKCCTDCK  660
ATETPLWRSG PAGPKSLCNA CGIRYRKKIP TVSXCKGPKR WKKDKTYAAA AXPVXSPPQP  720
PQTLLLLLLP PPPRAPLPEK PKLVAVVVRK IGKRVXEIED KLDMKETYHF RXNIHLKFCD  780
INEIKIVGKR VXAVRNLNFQ GRQYFSKKPD RARDPQVLPL PSVLAIPTHM RTLSMCFSSL  840
SQVPRSCSVY XLAHFWSEGF PNLSRTCGYN CNSKLHDSRI YAVDGGNSRK WTRRSLTTKT  900
EGRNKSPIKL SREILDVTVS TSAALNINKM ETSQYQQTQP LDFRQAIAQN KDLADLVTVI  960
VFDIETTGFS REKDRIIEIA LQDLEGGENS TFQTLVNPER SVLNSHVHGI TTDMVNRPDV  1020
PRFKDLLPIL VKYVKSRQKP GGCVMLVAHN ARTFDVPFXR SEFLRCSVDI PSTWLFRDTM  1080
PLGREAMKSE GSSSRSISLQ ALREHFQIPL DGTAHRAMAD VKVLSAVLQR LTCILKLPLS  1140
SLVGEAFLAS ETGTAKKKGS R
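The Signature Domain table places the GATA domain at residues 656 to 688. As a consistency check, the Python sketch below parses rows in the layout used by the sequence listing (blocks of ten residues followed by the coordinate of the row's last residue) and slices out that range. Only the two rows spanning the domain are embedded. The helper names `parse_rows` and `extract` are hypothetical, and the parser assumes every row it is given ends with a coordinate.

```python
import re

# Two rows copied verbatim from the sequence listing; the trailing integer
# is the 1-based coordinate of the last residue in that row.
ROWS = """\
RYFRWSDGVA DGVFSHWHEA IMHMHFDADS TFSNCHQLFM VIQDMNVKNV KNKKCCTDCK  660
ATETPLWRSG PAGPKSLCNA CGIRYRKKIP TVSXCKGPKR WKKDKTYAAA AXPVXSPPQP  720
"""


def parse_rows(text):
    """Return {1-based position: residue} for rows that end in a coordinate."""
    residues = {}
    for line in text.splitlines():
        match = re.fullmatch(r"\s*([A-Z ]+?)\s+(\d+)\s*", line)
        if match is None:
            continue  # skip rows without a trailing coordinate
        seq = match.group(1).replace(" ", "")
        end = int(match.group(2))
        start = end - len(seq) + 1
        for offset, aa in enumerate(seq):
            residues[start + offset] = aa
    return residues


def extract(residues, start, end):
    """Return the residues from start to end, inclusive (1-based)."""
    return "".join(residues[pos] for pos in range(start, end + 1))


residues = parse_rows(ROWS)
gata = extract(residues, 656, 688)
print(gata)
```

The extracted segment is CTDCKATETPLWRSGPAGPKSLCNACGIRYRKK, which is the query line of the GATA domain alignment, so the domain coordinates and the sequence listing agree.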
3D Structure
Structure
PDB ID   Evalue   Query Start   Query End   Hit Start   Hit End   Description
4b8j_A   0.0      1             515         3           519       IMPORTIN SUBUNIT ALPHA-1A
Expression -- UniGene
UniGene ID   E-value   Expressed in
Mdo.7061     0.0       fruit | leaf
Regulation -- PlantRegMap
Source        Upstream Regulator   Target Gene
PlantRegMap   Retrieve             -
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT5G26930.1   1e-14     GATA transcription factor 23