PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PCP024933.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family GATA
Protein Properties Length: 715aa    MW: 78592 Da    PI: 7.7106
Description GATA family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PCP024933.1genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GATA55.67.1e-18611645135
         GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                  C+ Cgt+kTplWR gp g+k+LCnaCG++ rkk++
  PCP024933.1 611 CADCGTSKTPLWRGGPAGPKSLCNACGIRSRKKRR 645
                  ********************************986 PP

Sequence ? help Back to Top
Protein Sequence    Length: 715 aa     Download sequence    
MSLRPSARTE VRRNRYKVAV DADEGRRRRE DNMVEIRKSK REESLLKKRR EGLQAQQFAP  60
SLQPSNLEKK LESLPAMVAG VWSDDNSLQL ESTTQFRKLL SIERSPPIEE VIQSGVVPRF  120
VAFLMREDFP QLQFEAAWAL TNIASGTSEN TKVVIEQGAV PIFVKLLASP SDDVREQAVW  180
ALGNVAGDSP RCRDLVLSSG ALMPLLAQLN EHAKLSMLRN ATWTLSNFCR GKPQPPFDQV  240
KPALPALERL VHSNDEEVLT DACWALSYLS DGTNDKIQAV IEAGVCVRLV QLLLHPSPSV  300
LIPALRTVGN IVTGDDLQTQ CIINNGSLPC LLSLLTHNHK KSIKKEACWT ISNITAGNRD  360
QIQSVIEAGL IGPLVNLLQH AEFDIKKEAA WAISNATSGG TPDQIKYLVG EGCIKPLCDL  420
LVCPDPRIVT VCLEGLENIL KVGEAEKAVG NSEVNLYAQL VDDAEGLEKI ENLQSHDNTE  480
IYEKSVKILE TYWLEEDEET LPAGDGTQPG FRFGGSEVQK FLMRVKFGVK LRSSWSGSGA  540
SRVSGFSLVR CPLSSEMTRV IIGIVVGTEA KDLQNISYRM LDPSDKGSES EGMTIKTPDA  600
GSSEEGFKKT CADCGTSKTP LWRGGPAGPK SLCNACGIRS RKKRRAILGL NKENPNDKKG  660
KKNKQLGDGL KQRLLALGRE VLMQRSTVER QRRKLGEEEQ AAVLLMALSY GSVYA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1642662KKRRAILGLNKENPNDKKGKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G06740.16e-50GATA family protein