PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_022134320.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Momordiceae; Momordica
Family C2H2
Protein Properties Length: 1543aa    MW: 171437 Da    PI: 6.6806
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_022134320.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H210.60.001714521475223
                      EET..TTTEEESSHHHHHHHHHHT CS
         zf-C2H2    2 kCp..dCgksFsrksnLkrHirtH 23  
                      +Cp   Cgk+Fs++ + + H+r+H
  XP_022134320.1 1452 QCPheGCGKRFSSHKYAMLHQRVH 1475
                      69999*****************99 PP

2zf-C2H211.10.001215111537123
                      EEET..TTTEEESSHHHHHHHHHH..T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                      ykC+   Cg sF+  s++ rH r+  H
  XP_022134320.1 1511 YKCKveGCGLSFRFVSDYSRHRRKtgH 1537
                      99*********************9777 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1543 aa     Download sequence    
MGGVEIPKWL KGLPFAPEFR PTDTEFADPI AYISKIEKEA SAFGICKIIP PFPKPSKKYV  60
ISNLNKSLSR STELSPPNVC PSSKLGSADG ANEGEVRAVF TTRHQELGQS VKKTKGVVQN  120
PQFGVHKQVW QSGEKYTLEK FESKSKVFAR SVLSGIKEPS PLVVESLFWK AASGKPIYIE  180
YANDVPGSAF GEPRGKFRYF HRRRRKRNYY HRSKERSSEL RTGEMGTLTD SLSLDSAGTS  240
PRNDLNTSSE ILKASTSTVP SEDTSHNSRG KSSDSCINME GTAGWRLSNS PWNLQVIARS  300
PGSLTRYMPD DIPGVTSPMV YIGMLFSWFA WHVEDHELHS MNFLHVGSPK TWYSIPGDHA  360
FAFEEVVRTQ AYGGSVDHLA ALTLLGEKTT LLSPETVIAS GIPCCRLIQN PGEFVVTFPR  420
AYHVGFSHGF NCGEAANFGT PQWLSVAKDA AVRRAAMNYL PMLSHQQLLY LLTMSFVSRV  480
PRSLLPGVRS SRLRDRQKEE REFMVKKGFV EDILRENNML SVLLEKESSC RAVLWNPDML  540
PYSSNSQVAT NSAVATSRKE NISCNHTESI DGNDKNMQNF MDEMTLDLDT VNDIYLESDD  600
LSCDFQVDSG TLACVACGIL GFPFMSVVQP SEKAARELSA DNLSIHKRGG VFGPKDAHDS  660
PDFGGTHPDS TSVPDVNCLS KNLSVASIPK FDKGWSTFGK FLRPRSFCLQ HAVDIIELLK  720
NKGGANILVI CHSDYHKIKA NAVAIAEEIG NHFVYNEVRL DIASEEDLRL IDLAVDVERN  780
ECREDWTSRL GINLRHCVKV RKSSPTKQVQ HALELGGLFL NRNHGFDLSP INWPSKKSRS  840
KKISRPRYYK PFQSMPLKDE VLGKRSDCKI AKREEKVFQY YRRNKKSGNS KGVGSATQPV  900
SSGDSIDLCN MRTFRSNTSE LAIPGPIGTT NQQNAVLQDR GNTNSDPASS MVADSICAVV  960
GRMTEPRIEN CTPEVVDVNG ESCHLPVDTS GMQQKIMTTS DTSEPNEKAV LPSFTCPHVN  1020
AINESEMHKE QEIVGSCNNT NQVCDIASEG QSHALADVGL DETSSIHFES SKVMMDNADV  1080
RNLNCEACDG TTKDDDAEQE IEIANRLKDV EEDSCSLIPI KQQHCVATEC DSQLGHLEDR  1140
IEQEMEPTCR SNESEPILVN TGTASAATSH SRDENSEVPG VGCEAPNLCN AVTSVDLVNN  1200
CQIDADVETQ SVSGVVVQSK TQQSSCLADE RSFENLGSQE DKEHLSDIEM RTEPRSLVNE  1260
PGSNSCILGE GRPMDVEASG KEACDRENLT GGMTPDDAME CANMSGNQHV DDPSPITLET  1320
HDVAEICSSK HNEQGKNTRN LKSNPSSDVE KRRKRKREEE LIIENGFSSC DFIRSPCEGL  1380
RPRVGKNLTS RTGADVVSVQ EKPERERVRK LPDALSPKRK KEIRKGSFKC DLEGCRMSFE  1440
TRAELALHKR NQCPHEGCGK RFSSHKYAML HQRVHDDDRP LKCPWKGCSM SFKWAWARTE  1500
HIRVHTGERP YKCKVEGCGL SFRFVSDYSR HRRKTGHYID QPV
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
113511357KRRKRKR
213521357RRKRKR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G04240.10.0C2H2 family protein