PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pav_sc0000689.1_g110.1.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family C2H2
Protein Properties Length: 592aa    MW: 64027.7 Da    PI: 8.8554
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pav_sc0000689.1_g110.1.mkgenomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H215.93.7e-0596118123
                                EEETTTTEEESSHHHHHHHHHHT CS
                    zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                                + C++C+k F r  nL+ H r H
  Pav_sc0000689.1_g110.1.mk  96 FICEICNKGFQRDQNLQLHRRGH 118
                                89*******************88 PP

2zf-C2H212.50.00045172194123
                                EEETTTTEEESSHHHHHHHHHHT CS
                    zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                                +kC +C+k++  +s+ k H +t+
  Pav_sc0000689.1_g110.1.mk 172 WKCDKCSKRYAVQSDWKAHSKTC 194
                                58*****************9998 PP

Sequence ? help Back to Top
Protein Sequence    Length: 592 aa     Download sequence    
MMKGLMFQQQ QQQQQQHSQL VDENMSNLTS ASGEAASVSS GNRNEIGTSF SQQFFAPPPS  60
QTQPALKKKR NLPGNPDPEA EVITLSPKTL MATNRFICEI CNKGFQRDQN LQLHRRGHNL  120
PWKLKQRTSK EVRKKVYVCP EASCVHHDPS RALGDLTGIK KHFCRKHGEK KWKCDKCSKR  180
YAVQSDWKAH SKTCGTREYR CDCGTLFSRR DSFITHRAFC DALAEESARA ITTGNNNPLL  240
ISPQQQQLQQ PGSSSASHHH HMNLNQVQLA HQFQDLHGFS LKKEQQSFTS LRPDLPPWLA  300
CPGPPNNTSI DLSSSSSIFS TRLDQNFTQT HQDLSLHDHN STAPNPNPNP NPSLGPTLPP  360
FQPAPSPHMS ATALLQKAAQ MGATMSSKNS TASAAAATSA GSSPQPMMRS HQNNQGHVPD  420
FGGHVSSFGN NTAAAATTGA GGASASSNGT GPPPSSSGIH QHHHHQNQNQ NQNQNQHQAS  480
LLHDMMNSLS SGTGFEGASF ELDAFGSMPT VLNNSKKDTN NSNNPSSAHF NRSSASDEGG  540
GHGEGLTRDF LGLRALSHSD FLNIAGLGSC VTSAATAAAS SSLDQTHKPW QG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1123135KLKQRTSKEVRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G03840.11e-113C2H2 family protein