PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_021298174.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Herrania
Family C3H
Protein Properties Length: 849 aa    MW: 98930.2 Da    pI: 9.1708
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_021298174.1 genome NCBI View CDS
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1 zf-CCCH 19.2 2.1e-06 217 236 6 25
                     -SGGGGTS--TTTTT-SS-S CS
         zf-CCCH   6 CrffartGtCkyGdrCkFaH 25 
                     C+f+++tG C++G rC++ H
  XP_021298174.1 217 CPFHLKTGACRFGQRCSRVH 236
                     ******************99 PP

2 zf-CCCH 26.3 1.3e-08 346 373 1 26
                     --S---SGGGGTS..--TTTTT-SS-SS CS
         zf-CCCH   1 yktelCrffartG..tCkyGdrCkFaHg 26 
                     +k ++C  f+++   tC++G  C+F+H+
  XP_021298174.1 346 WKVAICGEFMKSRlkTCSHGTACNFIHC 373
                     899************************8 PP
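
The cysteine and histidine spacing in the two matched segments above can be read off with a short script. This is only a sketch: the regular expression is a loose C-x(4-15)-C-x(4-6)-C-x(3)-H pattern assumed here for illustration, not the zf-CCCH profile HMM that produced the scores, and the helper name `ccch_spacing` is hypothetical.

```python
import re

# Loose CCCH spacing pattern. This is an illustrative assumption,
# not the profile HMM used to score the domains in the table above.
CCCH = re.compile(r"C(.{4,15})C(.{4,6})C(.{3})H")

# Matched segments copied from the two alignments, keyed by their
# 1-based start position in XP_021298174.1.
segments = {
    217: "CPFHLKTGACRFGQRCSRVH",
    346: "WKVAICGEFMKSRLKTCSHGTACNFIHC",
}


def ccch_spacing(segment, start):
    """Return (1-based position of the first Cys, gap lengths) or None."""
    match = CCCH.search(segment)
    if match is None:
        return None
    gaps = tuple(len(group) for group in match.groups())
    return start + match.start(), gaps


results = {start: ccch_spacing(seq, start) for start, seq in segments.items()}
for start, result in results.items():
    print(start, result)
```

Under this pattern the first segment reads as C-x8-C-x5-C-x3-H beginning at residue 217, and the second as C-x10-C-x5-C-x3-H beginning at residue 351.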

Sequence
Protein Sequence    Length: 849 aa
MGEAETVLKG EEGGGERENH QMEKSRKEKR KQMKKMKRKQ VRKEAAEKER EVEEARLNDP  60
EEQMRIQREE EEERKRREIA LKEFEERERA WVEAMEMKRK AQEEEEKEEE EKRKDLKEDA  120
NGEQEEMSDD WDYIEGSPEI IWEGNEITVR KKKVRVPKKD ANQKSKEEDA DRPTSNPLPP  180
QSEAFADYLN ASSAQQVLES VAKEVPNFGT EQDKAHCPFH LKTGACRFGQ RCSRVHFYPD  240
KSCTLLMRNM YNGPGLAWEQ DEGLEYTDEE VERCYEEFYE DVHTEFLKFG EIVNFKVCKN  300
GAFHLRGNVY VHYKMLESAV LAYHSINGRY FAGKQVKCEF VNLTRWKVAI CGEFMKSRLK  360
TCSHGTACNF IHCFRNPGGD YEWADWDKPP PRYWVKKMGA LFGYTDEAGF EKQIAQEHSR  420
QSRNPSRMIK SDADRHCSRR SKSREMNRLT GGADGRPCNK DDVEESSRSQ RGKNNDKKQT  480
KSLDGRSYRE NQSLRWDQNS EKSHDTSSDG GYSDSKRGKK YDRKRTKTLD GRSDRQRSLK  540
WDQNSEEIHD TSSDGSYSDS KRGKKNDRKQ EKVLDGGRSD RQRSLKWDEN SEKIHDTSSD  600
GGYSDGKRGK KNDRKQAKTL DGRSDRQRSL KWDENSEKIH DTGSDGGYSD SRRGKKNDRK  660
RAKGLDEGRS HRQRRLTWDQ NSEEIHDNSS DGGYSDSKRG KKSDRKRAKT LDGRSGRRRS  720
LKWDENSEKI HDTGSDGGYS DSKRGKKNDR XKILDGRSDR DRNLKWDQNR ERTLDTSSDE  780
GYSGRDIDAA RDTDEVRHRC HAKKHSKHQS GSLEYLADNR NFKNRDHEDT ENSPAQTKKR  840
TRHRSSKGG
Nuclear Localization Signal (NLS)
No. Start End Sequence
1 73 78 ERKRRE
2 74 100 RKRREIALKEFEERERAWVEAMEMKRK
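
The Start and End columns appear to be 1-based and inclusive, which can be checked against the protein sequence. The sketch below uses only the first two display lines of the sequence (residues 1-120), which is enough to cover both signals. The helper names `clean_sequence` and `region` are hypothetical.

```python
import re

# First two display lines of the protein sequence (residues 1-120),
# copied with their block spacing and trailing position counters.
raw = """
MGEAETVLKG EEGGGERENH QMEKSRKEKR KQMKKMKRKQ VRKEAAEKER EVEEARLNDP  60
EEQMRIQREE EEERKRREIA LKEFEERERA WVEAMEMKRK AQEEEEKEEE EKRKDLKEDA  120
"""


def clean_sequence(text):
    """Drop spaces, newlines and position counters, keeping only residues."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


def region(seq, start, end):
    """Slice a sequence using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(raw)
nls1 = region(seq, 73, 78)
nls2 = region(seq, 74, 100)
print(nls1)
print(nls2)
```

Both slices reproduce the sequences listed in the table, so the two predicted signals overlap and differ by one residue at the N-terminal end.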
Best hit in Arabidopsis thaliana
Hit ID E-value Description
AT1G10320.1 1e-157 C3H family protein