PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_022990199.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Cucurbiteae; Cucurbita
Family C2H2
Protein Properties Length: 1506aa    MW: 167527 Da    PI: 6.7583
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_022990199.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H210.90.001513911416123
                      EEET..TTTEEESSHHHHHHHHHH.T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                      ykC    C +sF +k +L  H r+ +
  XP_022990199.1 1391 YKCDleGCRMSFETKVELALHKRNqC 1416
                      99********************9876 PP

2zf-C2H211.20.001214741500123
                      EEET..TTTEEESSHHHHHHHHHH..T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                      ykC+   Cg sF+  s++ rH r+  H
  XP_022990199.1 1474 YKCKveGCGLSFRFVSDYSRHRRKtgH 1500
                      99*********************9777 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1506 aa     Download sequence    
MGGVEIPKWL KGLPLAPEFR PTDTEFADPI AYISKIEKEA SAFGICKIIP PFPKPSKKYV  60
ITNLNKSLSR SSELSRDLNA SNVRSSSKLG STDGANEREV RAVFTTRHQE LGQSVRKTKG  120
VVQNPQFGVH KQVWQSGEVY TLEQFESKSK VFARSVLGGI KEPSPLVVES LFWKAATEKP  180
IYVEYANDVP GSAFGEPEGK FRYFHRRRRK RNYYNRKERS SELKSGEMET LAETLARDSR  240
GTSTRDNLNT SAEMLKPSTS TVSSEDASHN SRGKSSDSCI NMEGTAGWRL SNSPWNLQVI  300
ARSPGSLTRY MPDDIPGVTS PMVYIGMLFS WFAWHVEDHE LHSMNFLHVG SPKTWYSIPG  360
DHAFAFEEVV RTQAYGGSVD HLAALTLLGE KTSLLSPKTV IASGIPCCRL IQNPGEFVVT  420
FPRAYHVGFS HGFNCGEAAN FGTPQWLSVA KDAAVRRAAM NYLPMLSHQQ LLYLLTMSFV  480
SRVPRSLLPG VRSSRLRDRQ KEEREFMVKK GFVEDILREN NMLSVLLEKE SSCRAVLWNP  540
DMLPYLSNSQ VANTNSAVAT SPRENTSCNH IENLDRNDKS VQNFIDEMAL DLESMNDIYL  600
DSDDLSCDFQ VDSGTLACVA CGILGFPFMS VVQPSEKASR ELSGDHLSTH KRGGVLGSKD  660
VHCSPHFDGT HPEDSTSVPD VNCLSKDPSV GSVPKFDKGW NTFSKFLRPR SFCLLHAVDT  720
VELLQKKGGA NILVICHSDY HKIKANAVAI AEEIGHNFVY NEVRLDIASE EDLGLIDLAI  780
DEERDECRED WTSRLGINLR HCVKVRKSSP TKQVQHALAL GGLFLNRDHG FDLSNLNWPA  840
KRSRSKKINH LQHSKRFQSM HLKEEVSGEK SDSRIAKQQE KFFQYYRRNK KSGNSTGVSS  900
VTQPASSGDS SDLCNDRSFR SNASELAIPD PTGTPDQQDA VLQDCGNTNS ISTVGRMTEP  960
QMENCLPEEA YIDGELPVDD SGMQQNITTA LDTSERNKKA VLPTCTVGPL VNSINESLEI  1020
PQDQELLESR NKTDQECDIA SEEQSHAPAG VCSDEVNLAE STGLHRSIVL ESSKVVLDSE  1080
DVKNSSSEAC DGMTRDETAI ADGIKGMAED SCSLIPIKLH LCPDTEGHSQ FGHLDDRINT  1140
GTPDAATSNL RDRTSEVSKM ACEGPDLCNA VTSDGLLNNL QTFGADVETR SVSGVEVQLK  1200
AQLSSCLADE KSIKNLGSQE DVDNLSDALM SSTGVQNETP TEPRTPMDEP GFKSCILGES  1260
PMDVETGGDA SDRKNLTGGK SPGIDSPLTQ SKTRDATEIC SSKHQPSSDV EKQRKRKRHD  1320
ELRIENELSS YDFIRSPCEG LRPRAIKNLT HQRDTDVNIS VQEKPERKRV RKPSDNVPPK  1380
PKKEIRRKGS YKCDLEGCRM SFETKVELAL HKRNQCPHEG CGKRFSSHKY AMLHQRVHDD  1440
DRPLKCPWKG CSMSFKWAWA RTEHIRVHTG ERPYKCKVEG CGLSFRFVSD YSRHRRKTGH  1500
YVDQPS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
113661373ERKRVRKP
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G04240.10.0C2H2 family protein