PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pd.00g519740.m01
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family GATA
Protein Properties Length: 678aa    MW: 73497.2 Da    PI: 6.3851
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pd.00g519740.m01genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA542.3e-17267301135
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C +C t kTp+WR gp g+ktLCnaCG++y++ +l
  Pd.00g519740.m01 267 CLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRL 301
                       99*****************************9885 PP

2GATA542.3e-17606640135
              GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                       C +C t kTp+WR gp g+ktLCnaCG++y++ +l
  Pd.00g519740.m01 606 CLHCATDKTPQWRTGPMGPKTLCNACGVRYKSGRL 640
                       99*****************************9885 PP

Sequence ? help Back to Top
Protein Sequence    Length: 678 aa     Download sequence    
MEAPEYFQNS FCPQFTPEKR HSFDNNNNKA TNGGGGGGDH FMVEDLLDFS NDDAVITDGG  60
ATFDNVTGNS TDSSTLTVID SCNSSSLSGS EPNVIPDIGS RNIAEGPFSS DLCVPYDDLA  120
ELEWLSNFVE ESFSSQDLQK LQLISGMKAR PDEAASETRQ FQPERNRNDN AHNTTTTTNN  180
NPIFNPDVSV PAKARSKRSR AAPCNWTSRL LLLSQPTSSS EQSDVVSSGP ASPSPPPSTG  240
KKTVKSAPKK KESPEGPGGG PGDGRKCLHC ATDKTPQWRT GPMGPKTLCN ACGVRYKSGR  300
LVPEYRPASS PTFVLTKHSN SHRKVLELRR QKEMRILPEL VAMEAPEYFQ NSFCPQFTPE  360
KRHSFDNNNN KATNGGGGGG DHFMVEDLLD FSNDDAVITD GGATFDNVTG NSTDSSTLTV  420
IDSCNSSSLS GSEPNVIPDI GSRNIAEGPF SSDLCVPYDD LAELEWLSNF VEESFSSEDL  480
QKLQLISGMK ARPDEAASET RQFQPERNRN DNAHNTNNNP IFNPDVSVPA KARSKRSRAA  540
PCNWTSRLLL LSQPTSSSEQ SDVVSSGPAS PLPPPSTTGK KTVKSAPKKK ESPEGPGGGP  600
GEGRKCLHCA TDKTPQWRTG PMGPKTLCNA CGVRYKSGRL VPEYRPASSP TFVLTKHSNS  660
HRKVLELRRQ KEMDPRQC
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G32890.13e-68GATA family protein