PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_023904779.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fagales; Fagaceae; Quercus
Family C3H
Protein Properties Length: 893aa    MW: 105951 Da    PI: 7.0845
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_023904779.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH19.22.2e-06232251625
                     -SGGGGTS--TTTTT-SS-S CS
         zf-CCCH   6 CrffartGtCkyGdrCkFaH 25 
                     C+f+++tG C++G rC++ H
  XP_023904779.1 232 CPFHLKTGACRFGQRCSRVH 251
                     ******************99 PP

2zf-CCCH24.54.6e-08411438126
                     --S---SGGGGTS..--TTTTT-SS-SS CS
         zf-CCCH   1 yktelCrffartG..tCkyGdrCkFaHg 26 
                     +k ++C  ++++   tC++G  C+F+H+
  XP_023904779.1 411 WKVAICGEYMKSRfkTCSHGTACNFIHC 438
                     899************************8 PP

Sequence ? help Back to Top
Protein Sequence    Length: 893 aa     Download sequence    
MAEPLAGKED EEEERDMMEE VEEETEGMSE RLRRKEKRKA KKKMKRKQMR KEMAEKEREE  60
EEARLNDPEE QRKLRMMEEE EKERMERDRK EFEEREKAWI EAMERKKKEE EEEEEEERRK  120
KALEEDSRRQ QQAENENELI EDDDWEYVEG PAEIIWKGNE IIVKKKRVKV PKKDAHQQRI  180
DNADRPTSNP LPPQSEAFAD YKNAPKLSAE QIIENVTQQV PNFGTEQDKA HCPFHLKTGA  240
CRFGQRCSRV HFYPDKSCTL LIRNMYNGPG LAWEQDEGLE VCKRCLHVIL YWSSLLVMAI  300
TLRGLKMQAT ILVCGYIWHF LVLKMKRRKS HTDEEVEGCY EEFYEDVHTE FLKFGEIVNF  360
KVCRNGAFHL RGNVYVQYKS LDSAVLVYNS INGRYFAGKQ ISCEFVNVTR WKVAICGEYM  420
KSRFKTCSHG TACNFIHCFR NPGGDYEWAD FDKPPPRYWV RKMASLFGYS DEVGYQKEVE  480
QENLGQLRHS RKRLTDVDRY YSRSKSKEID YLNGGGSSRR NDNKNDMQKS TQRRGRTSND  540
RKQMEVLDED MRGEKTNFKG GSYRKSRSHD TDSEGEWSDR YKDRYYGSAR KSSRLRNRDV  600
NHRTHEVESE GGYSDRSEDR ETHDGHARKS SRHSRNAKFV DDCEDQEKFG GNWSDRDGDQ  660
ETHDAYMTKS SRHRRKVGHP DDRMDSKNRT HDTDGEWSDG ERDRDRHHRK RSESSRHLRK  720
VGHSDDHKDS KNRSHDPEGE WSDRDTNRDR HHHKKRKSSR RDHDHGGSKN RADDTNLISD  780
WLDSDGERHK SQTRKHTRHR NEALDYSDDE GEPAKKLKDK FHSRESSIEK SGLERGSMHS  840
HSRWDSINEN LESDGSRESN SSDQYVQRHK LHDLESSYNS DKYIDKQDRW EPD
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
13339RRKEKRK
23347RRKEKRKAKKKMKRK
33442RKEKRKAKK
43846RKEKRKAKK
54148KKKMKRKQ
6105121RKKKEEEEEEEEERRKK
7106122RKKKEEEEEEEEERRKK
8326331KRRKSH
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G10320.11e-141C3H family protein