PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_022954750.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Cucurbiteae; Cucurbita
Family C2H2
Protein Properties Length: 1557aa    MW: 174506 Da    PI: 8.2634
Description C2H2 family protein
Gene Model
Gene Model ID     Type      Source    Coding Sequence
XP_022954750.1    genome    NCBI      View Nucleic Acid
Signature Domain
No.    Domain     Score    E-value    Start    End     HMM Start    HMM End
1      zf-C2H2    12.9     0.00033    1466     1488    3            23
                      ET..TTTEEESSHHHHHHHHHHT CS
         zf-C2H2    3 Cp..dCgksFsrksnLkrHirtH 23  
                      Cp   Cgk F ++ +L++H r+H
  XP_022954750.1 1466 CPvkGCGKKFFSHKYLVQHRRVH 1488
                      9999*****************99 PP

2      zf-C2H2    11.6     0.00089    1524     1550    1            23
                      EEET..TTTEEESSHHHHHHHHHH..T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                      y+C    Cg++F+  s++ rH r+  H
  XP_022954750.1 1524 YVCAeqGCGQTFRFVSDFSRHKRKtgH 1550
                      89********************99666 PP

Sequence
Protein Sequence    Length: 1557 aa
MAASAMAAEP TPEVLSWLKT LPLAPEYHPT LAEFQDPISY IFKIEKEASK FGICKIVPPV  60
LPSPKKTVIL NFNKSLAARA PCSDSINTKS PPTFTTRQQQ IGFCPRKTRP VQKPVWQSGE  120
YYTFQQFEAK AKAFEKSYLK KCTKKGGLSP LELETLYWRA TLDKPFSVEY ANDMPGSAFV  180
PVSTKMFREA GDGTTLGETA WNMRAVSRAK GSLLKFMKEE IPGVTSPMVY VAMLFSWFAW  240
HVEDHDLHSL NYLHMGAGKT WYGVPRDAAV AFEEVVRVQG YGGEINPLVT FAILGEKTTV  300
MSPEVLVSSG VPCCRLVQNA GEFVVTFPRA YHTGFSHGFN CAEAANIATP EWLRVAKDAA  360
IRRASINYPP MVSHFQLLYD LALSSRSPLC TGSEPRSSRL KDKRRSEGET VIKELFVQNI  420
LENNSLLDVL GSGVSVVLLP QGSSDSIYSR LRVGSHMRGK LRFPAGFCNS KEEAESPQSF  480
DYDNLTLENS QGMNRVKGLY SVNGLYSTLS ERSTGNLCAS SSRILNATNE RGGSVHCDGL  540
SDQRLFSCVT CGILSFACVA IVQPREQAAR YLMSADCSFF NDWVVGSGIT GEGISIRDGH  600
GVASNSGKRE RCVADGLYDV PVQAVNRQLP VADQSYKANF NAEKRNETSA LGMLALAYGH  660
SSDSEEDNAE ADAALHANDA KPTICSSVDQ YQFENSGLTS SEYCKNSATS NHDPLSANSA  720
DQMQFQVNDY EEFGRARFDS KDSFNCSSEF EIDGVGSTKK NDLSTRYQDS HVNGKPSLDT  780
DTEKPMFEQS AEPVEIENMP FAPDIDEDSS RLHVFCLEHA KEVEQQLRPI GGVHILLLCH  840
PDYPKMEAEA KLMAQELSID HLWTDTTFRD ATQDEEKRIQ LALDSEEAIP GNGDWAVKLG  900
INLFYSANLS HSPLYSKQMP YNSVIYNAFG RSTSGNSSGK PKVYQRRSGK LKRVVVGKWC  960
GKVWMSNQIH PLLAKRDPQE EDVDGFPSWT MSDEKIEWKS DNIQKSETVN RKSAGKRKMT  1020
YGSGAATKKA EPIESEDIVS DNSGDDCIHQ HHRILQNKRS KIVASKDVMS DDSVEDVSYK  1080
KHGRVPVNEE AYCETDDPGS DEGPDESLGD RHTKLHRGFY GFKLPKWGEI EPAVSDDSFE  1140
RDSSQFRGKT SKSKIDKYVE RQDALSDECL ESPLKQYRRI PKSKQAKVVK KNAISHDIRD  1200
DSFLWHRQGT SRSKMATIDS EEAVSEDSFE NSSHQHMSTP RSKSAKRTAR ENVFSDDPDE  1260
DDTSLLHHRK NVRNVQSKYF ERENTPDDQL DDSANQCRTR VLRSKPVKKE TISQTKQEIL  1320
RPAKRGASRT LKEEFSQPLK RGGRHTLKLE TPQPTKQLAP NRRGKQAKRN SKLTDLESEE  1380
EQQPGGPSTR LRQRTPKPTK FSETKPNDKR PIGKKKVKNA SSLKTPAGHR DSKARDEESE  1440
YLCDIEGCNM SFGSKQELVL HKRNICPVKG CGKKFFSHKY LVQHRRVHMD DRPLKCPWRG  1500
CKMTFKWAWA RTEHIRVHTG DRPYVCAEQG CGQTFRFVSD FSRHKRKTGH STKKGRG
Nuclear Localization Signal
NLS
No.    Start    End     Sequence
1      1410     1418    RPIGKKKVK
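
The Start and End values reported for the two zf-C2H2 hits and for the NLS read as 1-based, inclusive positions in the 1557 aa protein. The sketch below checks that reading against the sequence listing. It is only an illustration and not part of PlantTFDB: the names `TAIL`, `TAIL_START`, `REPORTED`, and `region` are made up for this example, and to keep it short only residues 1381-1557 (the last three rows of the listing) are embedded rather than the full sequence.

```python
# Residues 1381-1557 of XP_022954750.1, copied from the last three rows of
# the protein sequence listing; the blanks between 10-residue groups are removed.
TAIL_START = 1381  # position of the first residue of TAIL in the full protein
TAIL = (
    "EQQPGGPSTR LRQRTPKPTK FSETKPNDKR PIGKKKVKNA SSLKTPAGHR DSKARDEESE "
    "YLCDIEGCNM SFGSKQELVL HKRNICPVKG CGKKFFSHKY LVQHRRVHMD DRPLKCPWRG "
    "CKMTFKWAWA RTEHIRVHTG DRPYVCAEQG CGQTFRFVSD FSRHKRKTGH STKKGRG"
).replace(" ", "")


def region(start, end, tail=TAIL, tail_start=TAIL_START):
    """Return residues start..end (1-based, inclusive) of the full protein,
    read from a tail fragment that begins at position tail_start."""
    last = tail_start + len(tail) - 1
    if start < tail_start or end > last or start > end:
        raise ValueError(f"{start}-{end} is outside {tail_start}-{last}")
    # Convert 1-based inclusive coordinates to a 0-based half-open slice.
    return tail[start - tail_start : end - tail_start + 1]


# (label, Start, End, residues as printed in the record)
REPORTED = [
    ("zf-C2H2 hit 1", 1466, 1488, "CPvkGCGKKFFSHKYLVQHRRVH"),
    ("zf-C2H2 hit 2", 1524, 1550, "YVCAeqGCGQTFRFVSDFSRHKRKtgH"),
    ("NLS 1", 1410, 1418, "RPIGKKKVK"),
]

for label, start, end, shown in REPORTED:
    # The alignment rows print some residues in lowercase,
    # so compare case-insensitively.
    ok = region(start, end) == shown.upper()
    print(f"{label}: {start}-{end} {'matches' if ok else 'MISMATCH'}")
```

The same `region` helper works on the full sequence by passing it as `tail` with `tail_start=1`.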
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT3G48430.1    0.0        C2H2 family protein