PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_021638623.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Euphorbiaceae; Crotonoideae; Micrandreae; Hevea
Family C2H2
Protein Properties Length: 1782aa    MW: 202133 Da    PI: 8.6427
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_021638623.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H214.10.0001416911713323
                      ET..TTTEEESSHHHHHHHHHHT CS
         zf-C2H2    3 Cp..dCgksFsrksnLkrHirtH 23  
                      Cp   Cgk+F ++ +L++H r+H
  XP_021638623.1 1691 CPvkGCGKTFFSHKYLVQHRRVH 1713
                      9999*****************99 PP

2zf-C2H211.50.0009317491775123
                      EEET..TTTEEESSHHHHHHHHHH..T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                      y+C    Cg++F+  s++ rH r+  H
  XP_021638623.1 1749 YVCAeeGCGQTFRFVSDFSRHKRKtgH 1775
                      89********************99666 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1782 aa     Download sequence    
MAASGGLVSE PTSQQEVFQW LKNLPLAPEY HPTPAEFQDP IAYIFKIEKE AAKYGICKIV  60
PPVLAAPKKA AIANLNRSLA ARADSSTSKS PPTFTTRQQQ IGFCPRKPRP VQKPVWQSGE  120
NYTFQEFETK AKSFERNYLK KCSKKGALSP LEIETLYWKA TVDKPFSVEY ANDMPGSAFS  180
PKKTGVKEVG EGVTVGETEW NMRVVSRAKG SLLRFMKEEI PGVTSPMVYV AMLFSWFAWH  240
VEDHDLHSLN YLHMGAGKTW YGVPREAAVA FEEVVRVHGY GGEINPLVTF AILGEKTTVM  300
SPEVFISAGV PCCRLVQNAG EFVVTFPRAY HSGFSHGFNC GEAANIATPE WLRIAKDAAI  360
RRASINYPPM VSHFQLLYDL ALELCTRMPV SISAKPRSSR LKDKQKGEGE RLVKELFVNN  420
VIHNNGLLHV LGKGSSIVLL PRSSSDMSVC SNLRVGSQLR VSSSLGLCSS KGIMKSSKDS  480
VPDEVMPEMN NRINQVKGLF SMKEKFASLC ERNRFSSLNG NDSMHNMDTG TENGGTVHGD  540
KLSDQRLFSC VTCGILSFDC IAIIQPREAA ARYLMSADCS FFNDWIVGSG VTNDGFTIAH  600
GDTHTSEQNS STKWMKKNTV DGLYDVPVQS ANCQIQMIDQ NKVASNTETQ KGTSSLSLLA  660
LNYGNSSDSE EDQVEPDVLH HADEINMLNC SSESTYQHRI SALPSLKQEC HQDEADSRTL  720
SSARPDCGDE VTLQSIDCHA EHGHGNRPAN FKDESDRALN CSVEFETDNL ASMESNGLEH  780
TFRGPMSTSR IASSCSPVVH DTGKAKFNRV VVPRENMDTS FAQRSDEDSS CMHVFCLEHA  840
VEMEQQFRPI GGVHILLLCH PEYPRIEAVA RSVSEELGIE CLWNDINFRD ATKEDEENIQ  900
SALDSEEAIP GNGDWAVKLG INLFYSANLS RSSLYSKQMP YNSVIYNAFG RSSPASSPTK  960
FNVYGRKPSK QKKVVAGRWC GKVWMSNQVH PFLTKRDSED QDQEQERSFR GWTRPDEKLE  1020
IKPEIIYRTE TTLATRKSGR KRKMSVASGP GKEVKCFNTE DAASEDSQED VSCKQHTRVY  1080
SRMQIKRIER EVSYDSLEDY SHQQYGRTHR SKQAKSVERE DANSDDSLRC NTQQHRRTLR  1140
SKQAKSIESE NDVSYALVDN SSRKQHSRIP RTRSNQAKYV ESKSEFSDDS LEGDIHDWHG  1200
RVPRSTQAKF RREDAVSDDS LEESSHRPLR RVHRSKQATY FEKEDAISDD SLENSSLQRN  1260
RRISGASQAK FIERHDEVSD DMLEGSTYQQ HRGSYRNRES KFIDKEGAVS DDLLEDSTCQ  1320
QRMRIFRTKQ AKFVERQNEA SDDSLEDAIQ QQRRVIPRSK RTKFVKREDA VSDDLLEDDT  1380
HLKHRRIPGS RKAKLIERED VSDDLQEDDG QWQSRKTPRG KPAKFIESED GSDDLQDVTH  1440
WQSRKTSRRR QAASIEREDV SDDLEEGNTH WQPKKTHRCK QAKFIERKDV SDDLHEDDTH  1500
WQPRKIPRGK QAKLIEREDA VSDDLLEHNS IKKQRRILRS KQRKPATLRK MKRGAVQHVK  1560
QGTSRLKRRE NLQSIKQDKQ MKQETPRLWN AKGERNTRQF ESRVEEELEG GPSTRLRKRP  1620
SKPSKESETK LKDKLQSSRK KVKSSSAGKP PNGQKSVKNK DEDAEYQCDI EGCAMSFGSK  1680
HELALHKRNI CPVKGCGKTF FSHKYLVQHR RVHLDDRPLK CPWKGCKMTF KWAWARTEHI  1740
RVHTGARPYV CAEEGCGQTF RFVSDFSRHK RKTGHSVKKS RG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
116301641KLKDKLQSSRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G48430.10.0C2H2 family protein