PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PCP028205.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Maleae; Pyrus
Family C2H2
Protein Properties Length: 1529aa    MW: 170577 Da    PI: 6.2341
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PCP028205.1genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H213.60.0002905926223
                  EETTTTEEESSHHHHHHHHHHT CS
      zf-C2H2   2 kCpdCgksFsrksnLkrHirtH 23 
                  kC +C++ F ++ n++rHir+H
  PCP028205.1 905 KCDKCPREFFSSINYRRHIRVH 926
                  7********************6 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1529 aa     Download sequence    
MGSVGEVGEG RVLSDVEETI FVAVGKDLKQ SETTLAWAVK NFAGKRICLL HVHKPSLLLS  60
LSEGIGSASK LKQGAVKALQ GAERTKLRKL LDQYRMVLTG AGVEANELWI EMGNVEEGIV  120
EIIARHHIRW LVMGAAADEH YSEGDREDES EIEIAPPLLL MNSAFGTEQP EHLRSESTDS  180
LRNLSSADAE EDTSELEEIS GRSTLLMIDA EGEANGESYH SVEQAIVDTK NLKQKEFEQT  240
TRRWKEEDDE MEAKCKAKAF ESLCTKEMSQ RKEMEEKLAR LKQEIDGVKD ESEEFIRQLS  300
MVEDKKLVLE KQLEASQCEV RELEEKIISA AGLLMSFREK RDTMKIEHRN AIRKVRRLET  360
MINGEAASFD QAEFPIFSFM EINEATHNFD PSWKIKEGRR GSVYKGILRH MHVAIKMLPS  420
YGSKTQLDFE HEVEVLSRVR HPNLVTLIGT CPESRSLVYE YLRNGCLEDL LACKDSTPPL  480
PWQIRTSIAT EICSALIFLH SNIPCLVHGN LKPSNILFDA NFVSKLVNLG ISSLIPQNEN  540
PTNFTKICSD PNGAHVYMDP EYLETGKLTP ESDVYSFGII LLQLLTARPL SGLVKDVKYA  600
LENDNFKALL DVSAGDWPLE QAKEMAYIAL RCCENKRLDR LDLLVAHEAM RASCAASASC  660
LVSKKLIRTP SHFLCPILQE VMQDPHIAAD GFTYEEEAIR GWLKSGHNTS PMTNLELEHC  720
KLVPNYALQP LYFLLANSLT LSGHLRLVNF LHFLGDFPSR QTLRISLPPA PIPADIPTSS  780
LFLNRSLLAP PLLSEVYSVS DQAWFCMFMI SSSGMPILDI WRSGGFVFLM SIAKPTTSDS  840
LYAMKSGEEN DSIDTIIRQV AKEPSISFSR AGDSPVPWIQ LLHALDQQEL PGWPLHSPIK  900
VQMQKCDKCP REFFSSINYR RHIRVHHRLK KLDKDSSKNR ELLREFWDKL SPEEAKEAVS  960
FNDVTLEEVP GSSIIKALTI LIRKPGFSSL PHICLKAGSA LLDIVQARPS RFPISSQELF  1020
SILDDASEKT FLCGTAISMQ RYVFDGEAGK IGLESKNLVA CTSFLVEQIL LKAWHADKDA  1080
EALRLQKLLV EEEEAAQRRQ EELLERKRLK KLRQKEQKEK DQRHGEKVDV KENIDETLEA  1140
VPLVETSSPS ATFDSDTASS DALDHACLSL EPFHFSNTGE NADLESQTGV SFGHVDSASG  1200
PNVERQLVQG NGSRRAVARW HMPPKSRRGV PNGFHDGHSS QTAKHSSIQH HGNLRDVRAA  1260
SSGNRVWSRK PKPEYDGGSL KAGVQKEASE PDQIKNHEVL IGSIPVNLGN FCQESNNLAG  1320
VHDDHPSEKG QMLKDNSQDK TNKPDLVQSG TNRSTVKLWR PVSRNGTRGP TPIHNSSKES  1380
DINVVAEKGN SQSPYSENCK RSCVVDGHND GNGNGSTHPD ETQSLGFSSR AAEDFLAQRW  1440
KEAIAADHVE LVLFQDSEPP RCPDNQNDGE VGANHLLKSK RSILGNAENR LVDGEAFDLP  1500
TAGAAKVRRR TKQGKGVKIK YIPKHSTVA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
111051112ERKRLKKL
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G25610.11e-111C2H2 family protein