PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Pd.00g703280.m01
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family GATA
Protein Properties Length: 773 aa    MW: 87651.9 Da    pI: 8.9002
Description GATA family protein
Gene Model
Gene Model ID       Type      Source    Coding Sequence
Pd.00g703280.m01    genome    GDR       View CDS
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1      GATA      41       2.6e-13    723      755    1            31
              GATA   1 Cs..nCgttkTplWRrgpdgnktLCnaCGlyyr 31 
                       C+  nC++t++p+WR gp g+k+LCn CG++ r
  Pd.00g703280.m01 723 CTnyNCRVTESPMWRTGPLGPKSLCNRCGIRSR 755
                       55559*************************977 PP

Sequence
Protein Sequence    Length: 773 aa
MAFLETSIPN HYNNLVCVKR GLGVVIRDSM GDLMMAAYKV TNSKSRKSQD GNKSQKSSKP  60
PGQAKEHKDQ LERLSEKDPE FYDFLKEHDQ ELLQFNDEDI EEDSDTNLKE DETPVDDEIQ  120
VDEETGRHDV VQKKKKPSKK VITSEMVDSW CNSIREDGKL SAIHSLMKAF RTACHYGDDK  180
EDESMLDFGV MSSSVFNKVM LFVLKEMDGI IRKLLELPAF GGKKETILDA MNTKRWKNYN  240
HLVKSYLGNA LHVLRQMTDT EMISFTLRRL QYSSIFLAAF PVLLRKYIKT AVDLWGLGGG  300
SLPLVSLLFL RDLCVRLGSD CLDECFKGMY KAYVLNCQFI TAAKLQHVQF RANCVIELYG  360
VDLPTAYQHA FVFIRQLAMI LREALNAKTK EAFRKVYEWN FMNCLELWTG AISSYGSEAD  420
FRPVVYPLAQ IIYGVARLVP TARYFPLRLR CVRMLNRIAA STGTFTPVSM LLLDMLEMKE  480
LNRPATGGVG KALDLRTILK VSKPTLKTRA FQEACVLSVV DELAEHLAQW SYSIAFPELS  540
FIPAVRLRSF CKSTKVERFR KAMRELIRQI EANCQFTNER RMSISFLPND PAAASFLERK  600
HMDPKGVQNG FEMTNFDQHE ANLGGTDCLV DLTLRLGIPS SDKNNDQQSH TANGSSTSQA  660
VDRSSLNNLH VNGYKQYEFP APPELKNYCI INISNRRGKT GGSRKRKTTG RRPAKVGDID  720
KTCTNYNCRV TESPMWRTGP LGPKSLCNRC GIRSRVDAYS QALNCLSFNK LGL
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      696      713    RRGKTGGSRKRKTTGRRP
2      703      708    SRKRKT
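
The Start and End coordinates in this record appear to be 1-based and inclusive, since 696 to 713 spans the 18 residues of the first NLS. The following minimal Python sketch shows one way to recover the GATA domain and NLS segments from the grouped protein sequence block. The helper names `parse_block` and `region` and the `OFFSET` constant are hypothetical and not part of PlantTFDB; only the last two lines of the sequence block (residues 661 to 773) are used, to keep the example short.

```python
import re


def parse_block(text: str) -> str:
    """Join a grouped sequence block into one string.

    Drops everything that is not a letter: spaces, newlines, and the
    running position counters at the end of each line.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()


def region(seq: str, start: int, end: int, offset: int = 0) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the number of residues of the full protein that precede
    `seq` (0 when `seq` is the complete sequence).
    """
    return seq[start - 1 - offset : end - offset]


# Last two lines of the protein sequence block (residues 661-773).
tail = parse_block("""
VDRSSLNNLH VNGYKQYEFP APPELKNYCI INISNRRGKT GGSRKRKTTG RRPAKVGDID  720
KTCTNYNCRV TESPMWRTGP LGPKSLCNRC GIRSRVDAYS QALNCLSFNK LGL
""")
OFFSET = 660  # assumption: `tail` starts at residue 661

gata = region(tail, 723, 755, OFFSET)  # signature domain coordinates
nls1 = region(tail, 696, 713, OFFSET)  # NLS no. 1
nls2 = region(tail, 703, 708, OFFSET)  # NLS no. 2

print(gata)
print(nls1)
print(nls2)
```

With the full 773-residue sequence passed through `parse_block`, the same calls would work with `offset` left at 0.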
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT3G20750.1    9e-12      GATA family protein