PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_022770021.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Helicteroideae; Durio
Family C2H2
Protein Properties Length: 1585aa    MW: 179709 Da    PI: 9.121
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_022770021.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H212.80.0003514941516323
                      ET..TTTEEESSHHHHHHHHHHT CS
         zf-C2H2    3 Cp..dCgksFsrksnLkrHirtH 23  
                      Cp   Cgk F ++ +L++H r+H
  XP_022770021.1 1494 CPvkGCGKKFFSHKYLVQHRRVH 1516
                      9999*****************99 PP

2zf-C2H211.70.0008215521578123
                      EEET..TTTEEESSHHHHHHHHHH..T CS
         zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                      y+C    Cg++F+  s++ rH r+  H
  XP_022770021.1 1552 YVCAeeGCGQTFRFVSDFSRHKRKtgH 1578
                      89********************99666 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1585 aa     Download sequence    
MAASNLSPEP SQEVFSWLKS LPLAPEYRPS LAEFQDPIAY IFKIEKEASQ YGICKIIPPV  60
PPAPKRTAIG NLNRSLVARA AANASSDSKP APTFTTRQQQ IGFCPRKQRP VQKPVWQSGE  120
YYTFQEFEAK AKSFERNYLK RYSKKGSLSA LEVETLFWKA TVDKPFSVEY ANDMPGSAFV  180
PLDSKKSSVG GREAGEGVTV GETPWNMRAV SRAKGSLLRF MKEEIPGVTS PMVYIAMLFS  240
WFAWHVEDHD LHSLNYLHMG AAKTWYGVPR DAAVAFEEVV RVDGYAGEFN PLVTFSTLGE  300
KTTVMSPEVF ICAGVPCCRV PMSIGAKPKS SRLKDKNKSE GETLVKKLFV QNLIQNNDLL  360
HILGKGSSVV LLPKCSSDSS LCSELRVASQ LRTNPRISLG LCNYKEVRES SETLASDEIM  420
LGRSEEIKAV KGFYSVKGKF VTIYEGNRDS SFRGTDYTCR LPTQTLNTNI ERESAVQADA  480
LSDQRLFSCV TCGILCFDCI AVLQPTEQAA RYLMSADCSF FNDWTVGSGL THDGFTAAHG  540
DAITSEQNPC TRRINRSAPN ALYDVPVQSV EYKFQMVDQT NQVVEDTKEG GDTSALGLLA  600
SAYGNSSDSE DHVEPNATVF GDETSSANTS PGRKFQYNGS GFSPGNANRS HTSSLSRLDS  660
EEEAPIHVTD SCSEPGSRRV DIKYRSPQTV DRAVEFETDN LASGRSNSLE DKFRDPITVS  720
HANPSCSPAT HGTEKMRCSK AIAPMENADI PYAPRSDEDS SRMHVFCLEH AVEAEQQLRQ  780
IGGVHVFLLC HPEYPKIEAE AKLVAEELEI DYQWNDILFG DATKDDEERI HSALDSEDAL  840
HGNGDWAVKL GINLFYSANL SRSTLYSKQM PYNCVIYSAF GRKSPASSPT KLNRYGRRSS  900
KQKKVVAGKW CGKVWMSNQV HPFLAQRDPD EQEQERCFHA WATSDENLER KRENVQKAET  960
TKVAKVFNRK RKMRAEIAPS KKVTCIEPQG AVSDDSLDGS SLRQQQRFFR GKQPRLIEKE  1020
EAISYDSLED DSLLLCRNLS RRKQAKFLER EAAESEDEEE DFNAQQHWRN LRGKQGKYFE  1080
EDDVVSGDSL DESSLKQYRR IRRSRQAKFM EVEDAVSDDE QEEISHQLQR RIPRGRQIKS  1140
FERNDAISDY SRADNALKQY RRISKGKHPK FFERDYAMSD DASDDDSQRH LRRIPRGKHM  1200
KRMERDYAFS DDSLEDNPQQ QHRRRSKVAK YTDREVVVSF GPLKGNSHQQ HRKVPRSKRT  1260
KYIEREDAVL SDSPGDSSLQ QLRRIARSKH TKILKREDAV SDDSLDDTSP KQLRKTPRSR  1320
QGKFIERENA VSYDSLEENY HQPNRNLRSR KKKAQTPRQI QQETPQNVKQ GKRRTTKQVV  1380
SQQMKLETPR NRNTKIEQSA RQCNSYGEDE IEGGPSTRLR KRVRKPLKES EAKPKEKKQA  1440
SKKKVKNASN VKTLAGHNTA KVKDEEAEYQ CDMEGCTMSF GLKQELVLHK RNICPVKGCG  1500
KKFFSHKYLV QHRRVHIDDR PLKCPWKGCK MTFKWAWART EHIRVHTGAR PYVCAEEGCG  1560
QTFRFVSDFS RHKRKTGHST KKARG
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1950972RKRENVQKAETTKVAKVFNRKRK
214181424RLRKRVR
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G48430.10.0C2H2 family protein