PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pav_sc0000069.1_g660.1.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family C2H2
Protein Properties Length: 512aa    MW: 56190.3 Da    PI: 7.9701
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pav_sc0000069.1_g660.1.mkgenomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H2157e-0590110323
                                ETTTTEEESSHHHHHHHHHHT CS
                    zf-C2H2   3 CpdCgksFsrksnLkrHirtH 23 
                                C++C+k F r  nL+ H r H
  Pav_sc0000069.1_g660.1.mk  90 CEICNKGFQREQNLQLHRRGH 110
                                *******************88 PP

2zf-C2H212.50.00043164186123
                                EEETTTTEEESSHHHHHHHHHHT CS
                    zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                                +kC++C+k++  +s+ k H + +
  Pav_sc0000069.1_g660.1.mk 164 WKCEKCSKRYAVQSDWKAHSKIC 186
                                58*****************9876 PP

Sequence ? help Back to Top
Protein Sequence    Length: 512 aa     Download sequence    
MMKGLMGDEN MSNLTCASGD LSASNSSIRN ESSSAGTLYP QQQSSSANDI QQQQPPPPKK  60
KRNLPGNPDP DAEVIALSPK SLMATNRFLC EICNKGFQRE QNLQLHRRGH NLPWKLKQRT  120
NKEVRKKVYL CPEPTCVHHD PSRALGDLTG IKKHFCRKHG EKKWKCEKCS KRYAVQSDWK  180
AHSKICGTRE RDSFITHRAF CDALAQENST STAARSAMPA SINPLLSSLP QLHTHGLQGL  240
SVKREQDQQQ LLPPWLSCLP EGEATAPSGL PSMNMSSPLF STSSFQQYYS FDHQNPNPSS  300
SSTTTLPDFQ QSHTASPHMS ATALLQKASE MGATISKPSH SPMFLKPPSH QAHVSNEAFD  360
DGFGNNDNKS SLLHDMMMMS NSFGDQQVSS AFGDHHQAFN GIMVDGNNNF VEINNNNNNN  420
INPPKNYKSM ELAQLGRSGT DNNNNDEGLT RDFLGLRAPF SSSAAAAAAA SHGHGDFFNI  480
NMAGLDHHVN SSSSSAYNGQ PSDHQNQTSW QG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1115127KLKQRTNKEVRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G50700.11e-96C2H2 family protein