PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_022958142.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Cucurbiteae; Cucurbita
Family C2H2
Protein Properties Length: 1506 aa    MW: 167425 Da    pI: 6.7068
Description C2H2 family protein
Gene Model
Gene Model ID   Type    Source
XP_022958142.1  genome  NCBI
Signature Domain
No.  Domain   Score  E-value  Start  End   HMM Start  HMM End
1    zf-C2H2  10.9   0.0015   1391   1416  1          23
                      EEET..TTTEEESSHHHHHHHHHH.T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt.H 23  
                      ykC    C +sF +k +L  H r+ +
  XP_022958142.1 1391 YKCDleGCRMSFETKVELALHKRNqC 1416
                      99********************9876 PP

2    zf-C2H2  11.2   0.0012   1474   1500  1          23
                      EEET..TTTEEESSHHHHHHHHHH..T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                      ykC+   Cg sF+  s++ rH r+  H
  XP_022958142.1 1474 YKCKveGCGLSFRFVSDYSRHRRKtgH 1500
                      99*********************9777 PP
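
The alignment rows indicate that Start and End are 1-based, inclusive positions on the full protein. The following is a minimal Python sketch of how the two domain hits could be cut out of the sequence under that reading. It loads only the C-terminal fragment (residues 1381 to 1506) copied from this record; `slice_region`, `FRAGMENT` and `FRAGMENT_START` are hypothetical names introduced for illustration and are not part of PlantTFDB.

```python
# C-terminal fragment of XP_022958142.1 (residues 1381-1506), copied from this record.
# FRAGMENT_START is the full-protein position of the first residue in FRAGMENT.
FRAGMENT_START = 1381
FRAGMENT = (
    "PKKEIRRKGSYKCDLEGCRMSFETKVELALHKRNQCPHEGCGKRFSSHKYAMLHQRVHDD"
    "DRPLKCPWKGCSMSFKWAWARTEHIRVHTGERPYKCKVEGCGLSFRFVSDYSRHRRKTGH"
    "YVDQPT"
)


def slice_region(start, end, fragment=FRAGMENT, offset=FRAGMENT_START):
    """Return residues start..end (1-based, inclusive, full-protein numbering)."""
    last = offset + len(fragment) - 1
    if start < offset or end > last or start > end:
        raise ValueError("region lies outside the loaded fragment")
    return fragment[start - offset : end - offset + 1]


# (Start, End) pairs taken from the Signature Domain table.
domains = {1: (1391, 1416), 2: (1474, 1500)}
for number, (start, end) in domains.items():
    print(number, slice_region(start, end))
```

Each printed region should match the uppercased query row of the corresponding alignment, since the lowercase letters there mark residues aligned to insert states rather than different residues.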

Sequence
Protein Sequence    Length: 1506 aa
MGGVEIPKWL KGLPLAPEFR PTDTEFADPI AYISKIEKEA SAFGICKIIP PFPKPSKKYV  60
ISNLNKSLSR SSELSRDLNA SNVRSSSKLG STDGANEREV RAVFTTRHQE LGQSVRKTKG  120
VVQNPQFGVH KQVWQSGEAY TLEQFESKSK VFARSVLGGI KEPSPLVVES LFWKAATEKP  180
IYVEYANDVP GSAFGEPEGK FRYFHRRRRK RNYYNRKERS SELKSGEMET LTETLARDSR  240
GTSTRDNLNT SAEMLKPSTS TVSSEDASHN SRGKSSDSCI NMEGTAGWRL SNSPWNLQVI  300
ARSPGSLTRY MPDDIPGVTS PMVYIGMLFS WFAWHVEDHE LHSMNFLHVG SPKTWYSIPG  360
DHAFAFEEVV RTQAYGGSVD HLAALTLLGE KTSLLSPETV IASGIPCCRL IQNPGEFVVT  420
FPRAYHVGFS HGFNCGEAAN FGTPQWLSVA KDAAVRRAAM NYLPMLSHQQ LLYLLTMSFV  480
SRVPRSLLPG VRSSRLRDRQ KEEREFMVKK GFVEDILREN NMLSVLLEKE SSCRAVLWNP  540
DMLPYLSNSQ VANTNSAVAT SPRENTSCNH IENLDRNDKS VQNFIDEMAL DLQSMNDIYL  600
DSDDLSCDFQ VDSGTLACVA CGILGFPFMS VVQPSEKASR ELSGDHLSTH KRGGVLGPKD  660
VHCSPHFDGT HPGDSTSVPD VNCLSKDPSV GSVPKFDKGW NTFSKFLRPR SFCLLHAVDT  720
VELLQKKGGA NILVICHSDY HKIKANAVAI AEEIGHNFVY NEVRLDIASE EDLGLIDLAV  780
DEERDECRED WTSRLGINLR HCVKVRKSSP TKQVQHALAL GGLFLNRDHG FDLSNLNWPA  840
KRSRSKKINH LQHSKRFQSM HLKEEVSGEK SDSIIAKREE KFFQYYRRNK KSGNSTGVSS  900
VTQPASSGDS SDLCNDRSFR SNASELAIPD PTGTTDQQDA VLQDCGNTNS ISTVGRMTEP  960
QMENCLPEEA YIDGELPVDD SGMQQNITTA VDTSERNKKA VLPSCTVGSL VNSINESLEI  1020
PQDQELLESR NKTDQECDIA SEEQSHAPAG VCSDEVNLAE STGLHCSIVL ESSKVVLDSE  1080
DVKNSSSEAC DGMTRDETAI ADGIKGMDED SCSLIPIKLQ LCPDTEGHSQ FGHLDDRTNT  1140
GTPDAATSNL RDRTSEVSRM ACEGPDLCNA ATSDGLLNNL QTFDADVETQ SISGVEVQLK  1200
AQLSSCLADE KSIKNLGSQE DVDNLSDALM SSTGVQNETP TEPRIPMDEP GFKSCILGES  1260
PMDVETGGEA SDRKNLTGGK APGIDSPLTQ SKTRDATEIC SSKHKPSSDV EKRRKRKRHD  1320
KLRIENELSS FDFIRSPCEG LRPRAIKNLT HQRDIDVNIS VQEKPERKRV RKPSDSVPPK  1380
PKKEIRRKGS YKCDLEGCRM SFETKVELAL HKRNQCPHEG CGKRFSSHKY AMLHQRVHDD  1440
DRPLKCPWKG CSMSFKWAWA RTEHIRVHTG ERPYKCKVEG CGLSFRFVSD YSRHRRKTGH  1500
YVDQPT
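
The listing above is laid out in blocks of ten residues with a running residue count at the end of each full line. As a rough sketch, assuming that layout holds throughout, the following Python snippet shows one way to turn such a listing back into a plain sequence string. `join_sequence_block` is a hypothetical helper name, and the example input is only the last two lines of the listing.

```python
import re


def join_sequence_block(text):
    """Join a blocked sequence listing into one string of residues."""
    residues = []
    for line in text.splitlines():
        # Drop the running residue count printed at the end of full lines.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces that separate the ten-residue blocks.
        residues.append("".join(line.split()))
    return "".join(residues)


# Last two lines of the listing, copied verbatim from this record.
listing = """DRPLKCPWKG CSMSFKWAWA RTEHIRVHTG ERPYKCKVEG CGLSFRFVSD YSRHRRKTGH  1500
YVDQPT"""

tail = join_sequence_block(listing)
print(len(tail), tail[-6:])
```

Applied to the whole listing, the joined string should come out at the stated length of 1506 residues; a mismatch would point to a copy or extraction error.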
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1312   1318  KRRKRKR
2    1313   1318  RRKRKR
3    1366   1373  ERKRVRKP
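
The NLS coordinates can be cross-checked against the protein sequence. The sketch below, in Python, assumes 1-based inclusive positions and uses only residues 1311 to 1380 copied from this record. `hit_matches`, `FRAGMENT` and `FRAGMENT_START` are hypothetical names for illustration, and the check says nothing about how the signals were predicted.

```python
# Residues 1311-1380 of XP_022958142.1, copied from the protein sequence in this record.
FRAGMENT_START = 1311
FRAGMENT = (
    "EKRRKRKRHD"
    "KLRIENELSSFDFIRSPCEGLRPRAIKNLTHQRDIDVNISVQEKPERKRVRKPSDSVPPK"
)

# (No., Start, End, Sequence) as listed in the NLS table.
nls_hits = [
    (1, 1312, 1318, "KRRKRKR"),
    (2, 1313, 1318, "RRKRKR"),
    (3, 1366, 1373, "ERKRVRKP"),
]


def hit_matches(start, end, motif):
    """True if the fragment carries `motif` at start..end (1-based, inclusive)."""
    lo = start - FRAGMENT_START
    hi = end - FRAGMENT_START + 1
    return FRAGMENT[lo:hi] == motif


for number, start, end, motif in nls_hits:
    # An inclusive span covers end - start + 1 residues.
    print(number, end - start + 1, hit_matches(start, end, motif))
```

Hit 2 lies entirely inside hit 1 (1313 to 1318 within 1312 to 1318), so the table describes two distinct basic stretches rather than three.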
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G04240.1  0.0      C2H2 family protein