PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID AUR62017914-RA
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; Caryophyllales; Chenopodiaceae; Chenopodioideae; Atripliceae; Chenopodium
Family C3H
Protein Properties Length: 1044aa    MW: 115952 Da    PI: 8.0962
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
AUR62017914-RAgenomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH18.14.8e-06157176525
                     --SGGGGTS--TTTTT-SS-S CS
         zf-CCCH   5 lCrffartGtCkyGdrCkFaH 25 
                     +Cr+f+  G C++G++C+F H
  AUR62017914-RA 157 ICRDFSS-GRCRRGSSCHFLH 176
                     7******.************* PP

2zf-CCCH20.11.1e-06224244526
                     --SGGGGTS--TTTTT-SS-SS CS
         zf-CCCH   5 lCrffartGtCkyGdrCkFaHg 26 
                     +C+ f++ G C++Gd C++ H+
  AUR62017914-RA 224 ICNEFLK-GRCRRGDLCRYVHD 244
                     6******.*************7 PP

3zf-CCCH23.11.3e-07309330325
                     S---SGGGGTS--TTTTT-SS-S CS
         zf-CCCH   3 telCrffartGtCkyGdrCkFaH 25 
                     + +C+ffa+ G C+ G rCkF+H
  AUR62017914-RA 309 ETPCKFFAA-GNCRNGTRCKFSH 330
                     679**9999.************* PP

Sequence ? help Back to Top
Protein Sequence    Length: 1044 aa     Download sequence    
MKRNGAREGT QKCVLVWPVG RYADRFVKYA YSSSHLSNQD PAILSGEMSG SSKRRHSKWD  60
HKDAEALPEN FYDRAQTRRP REPLSPDRSS RRHDSRSKES AMTWDREGNC TRISPGLDEW  120
RPRHSSRSPR NGWGRSHRRE PAVNDRSRSR SGVPAQICRD FSSGRCRRGS SCHFLHQDEM  180
YDNRRLECSS PEDWESRRRK TGGSRYSPDD TKDSATRTEK SAHICNEFLK GRCRRGDLCR  240
YVHDDSAADV FDKGVVEDVY RNRDHDLRSR DIYPDHDHRS RDTYCDLDHR RRDTYPDRNH  300
EFEPPKRGET PCKFFAAGNC RNGTRCKFSH QVQEIYSPDR RSRGDRRGPD HKSEGGQVWE  360
GSRWSDATAV STVPAVQGWG EDKNERKEVS TARYKGDGRG LDHKYEGDRG WDGPSWCDVT  420
TVPNNPAAHG WGDDKNEITE VAKDMTGSRP GDDCWSHDML DIKQTRETSA NIGKSLIQDK  480
KQALQWKIEN NVPGLTENFS GDMEISPRDI KQQGMNSSSS QTASNISFPA VQQDVAGEAS  540
YEKHYQAGLQ EQKVFHDPFA DQNPDYKDKK PMLNSGGRSD VRDHDGSCIV QNQPLNLLQG  600
QGNFQSDHSV NVHPTTNINS PLGQTQLSLP SHSLPGQSQP SIPSHPPGKN QLSIPLQSPL  660
GQSQFSLFSH SPPGQNQLSL SSHPPPGQNQ LPLPGPPPPG QNQLPLPGPP PPGQNQLPLP  720
GPPPPGQIQL PFPGLPSSGK NQLSLPLQNA GQGQLSSPSY LSQGEAMQRQ VSHNPSAFLD  780
KKVDGSHAVS LASSEVAPES ATAHSSVSNK QLAQLSHLTA SLTQLLESRQ QHSQFSTAVT  840
PCSLSNSPDI AEPVPPVSAA TIQLNQAPEA LKPYDPVCNS VEPKMANLGN HVLPSVEQNI  900
SKNEIHHKDN VTQQEPVAAS KDEKYGKTTD EGIIKALENG PSQDMGADDG ADDDKKLKDT  960
KGLRQFKFAL AEFVKELLKP SWKDGHVSKD AYKTIVKKVV DKVTETLGPQ IPQTKEKIDM  1020
YLSCSKQKID KLVQAYVGKH QKS*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1196201SRRRKT
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G33835.15e-09C3H family protein