PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_023554009.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Cucurbiteae; Cucurbita; Cucurbita pepo
Family C2H2
Protein Properties Length: 1508aa    MW: 167757 Da    PI: 6.6985
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_023554009.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H210.90.001513931418123
                      EEET..TTTEEESSHHHHHHHHHH.T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                      ykC    C +sF +k +L  H r+ +
  XP_023554009.1 1393 YKCDleGCRMSFETKVELALHKRNqC 1418
                      99********************9876 PP

2zf-C2H211.20.001214761502123
                      EEET..TTTEEESSHHHHHHHHHH..T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                      ykC+   Cg sF+  s++ rH r+  H
  XP_023554009.1 1476 YKCKveGCGLSFRFVSDYSRHRRKtgH 1502
                      99*********************9777 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1508 aa     Download sequence    
MGGVEIPKWL KGLPLAPEFR PTDTEFADPI AYISKIEKEA SAFGICKIIP PFPKPSKKYV  60
ISNLNKSLSR SSELSRDLNA SSVRSSSKLG STDGANEREV RAVFTTRHQE LGQSVRKTKG  120
VVQNPQFGVH KQVWQSGEVY TLEQFESKSK VFARSVLGGI KEPSPLVVES LFWKAATEKP  180
IYVEYANDVP GSAFGEPEGK FRYFHRRRRK RNYYNRKERS SELKSGEMET LTETLARDSR  240
GTSTRDNLNT SAEMLKPSTS TVSSEDASHN SRGKSSDSCI NMEGTAGWRL SNSPWNLQVI  300
ARSPGSLTRY MPDDIPGVTS PMVYIGMLFS WFAWHVEDHE LHSMNFLHVG SPKTWYSIPG  360
DHAFAFEEVV RTQAYGGSVD HLAALTLLGE KTSLLSPETV IASGIRCCRL IQNPGEFVVT  420
FPRAYHVGFS HGFNCGEAAN FGTPQWLSVA KDAAVRRAAM NYLPMLSHQQ LLYLLTMSFV  480
SRVPRSLLPG VRSSRLRDRQ KEEREFMVKK GFVEDILREN NMLSVLLEKE SSCRAVLWNP  540
DMLPYLSHSQ VANTNSAVAT SPRENTSCNH IESLDRNDKN VQNFIDEMAL DLESMNDIYL  600
DSDDLSCDFQ VDSGTLACVA CGILGFPFMS VVQPSEKASR ELSGDHLSTH KRGGVLGSKD  660
VHCSPHFDGT HPEDSTSVPD VNCLSKDPSV GSVPKFDKGW NTFSKFLRPR SFCLLHAVDT  720
VELLQKKGGA NILVICHSDY HKIKANAVAI AEEIGHNFVY NEVRLDIASE EDLGLIDLAV  780
DEERDECRED WTSRLGINLR HCVKVRKSSP TKQVQHALAL GGLFLNRDHG FDLSNLNWPA  840
KRSRSKKINH LQHSKRFQSM HMHLKEEVSG EKSDSRIAKQ QEKFFQYYRR NKKSGNSTGV  900
SSVTQPASSG DSSDLCNDRS FRSNASELAI PDPTGTTDQQ DAVLQDCGNT NSISTVGRMT  960
EPQMENCLPE EAYIDGELPV DDSGMQQYIT AALDTSEPNK NAVLPSCTVG PLVNAINESF  1020
ELPQDQELLE SRNKTDQECD IASEEQSHAP AGVCSDEVNL AESTGLHCSI VLESSKVVLD  1080
SEDVKNSSSE ACDGMTRDET AIADGIKGMD EDSCSLIPIK LQLCPDTEGH SQFGHLDDRT  1140
NTGTPDAATS NLRDRTSEVS RMACEGPDLC NAATSDGLLN NLQTFDADVE TRSVSGVEVQ  1200
LKAQLSSCLA DEKSIKNLGS QEDVDNLSDA LMSSTGVQNE TPTEPRIPMD KPGFKSCILG  1260
ESPMDVETGG EASDRKNLTG GKAPGIDSPL TQSKTRDATE ICSLKHKPSS DVEKRRKRKR  1320
HDELRIENEL SSFDFIRSPC EGLRPRAIKN LTHQRDIDVN ISVQEKPESK RVRKPSDNVS  1380
PKPKKGIRRK GSYKCDLEGC RMSFETKVEL ALHKRNQCPH EGCGKRFSSH KYAMLHQRVH  1440
DDDRPLKCPW KGCSMSFKWA WARTEHIRVH TGERPYKCKV EGCGLSFRFV SDYSRHRRKT  1500
GHYVDQPT
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
113141320KRRKRKR
213151320RRKRKR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G04240.10.0C2H2 family protein