PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID AUR62043443-RA
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; Caryophyllales; Chenopodiaceae; Chenopodioideae; Atripliceae; Chenopodium
Family C3H
Protein Properties Length: 983 aa    MW: 113855 Da    pI: 8.3448
Description C3H family protein
Gene Model
Gene Model ID     Type     Source
AUR62043443-RA    genome   Phytozome
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-CCCH  23.1   1.3e-07  201    220  6          25
                     -SGGGGTS--TTTTT-SS-S CS
         zf-CCCH   6 CrffartGtCkyGdrCkFaH 25 
                     C+f+++tG C++G+rC++ H
  AUR62043443-RA 201 CPFHLKTGVCRFGARCSRVH 220
                     ******************99 PP
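
As a quick sanity check on the alignment above, the matched segment can be tested against the Cys-x8-Cys-x5-Cys-x3-His spacing typical of CCCH zinc fingers. This is a minimal sketch and not part of the database record: the regular expression and the name CCCH_PATTERN are illustrative assumptions, and the hit reported here comes from an HMM profile match, not from a regular expression.

```python
import re

# Hypothetical pattern (an assumption for illustration):
# Cys, 7-8 residues, Cys, 5 residues, Cys, 3 residues, His.
CCCH_PATTERN = re.compile(r"C.{7,8}C.{5}C.{3}H")

# Residues 201-220 of AUR62043443-RA, copied from the alignment above.
segment = "CPFHLKTGVCRFGARCSRVH"

# fullmatch requires the whole 20-residue segment to fit the spacing.
match = CCCH_PATTERN.fullmatch(segment)
print("CCCH spacing found:", match is not None)
```

The pattern only checks residue spacing; unlike the HMM score and E-value in the table, it says nothing about how well the intervening residues fit the zf-CCCH profile.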

Sequence
Protein Sequence    Length: 983 aa
MMAVKEDNRQ EVEEREIDKN TIKDNGNGND NSITMATMSR KEKRKALKKM KRKLVRKEIA  60
KKEREEEEAR WNDPEEQMRL LRLEQEEAES MARERMDCFE FKANNENASE ANEDDDWEYI  120
EEGPAEIIWI GNEIIVKKKK IRVPKKASEI LKENQESERP ISNPLAPQSE AFEDYRSAKE  180
VLESVAQQFP NFGTEQDKAH CPFHLKTGVC RFGARCSRVH FYPDKACTIL IKNMYNGPGL  240
AWKQDEGLEH TDEEVKQSFE DFYEDVHTEF FKFGELVNFK VCKNGSSHLR GNLYVHYKSL  300
ESAMLAYQST NGRYFAGKLL TCEFINVTRW RVVICGEYMK SRFKVGHKLG IEFSQSCSRG  360
SACNFIHCFQ NPGGDYEWAD WDKPAPRYWL KKMTALFGYT GDDFTDEVGL RYHSRRSRSR  420
GLDSPHRKSL SDEDDGYKSR RGRHSPHRGD GRETSNLDKE MQKERRTYLD YRCFKHSEID  480
NVDVSEGITD GHRSVGIKRR SKKRKRERVS ADQDKNRKVE AIKMYHSRRS RSRGLDSPHR  540
KSCSNEDDGY KRRRGRYSPR RGDDRETSNL DREMQEERRT CQDSRFDVSE GSTDGQGYVA  600
GTKRSSKKRE RERLFTDQDN KSRKVDVNSS KGEIAEGGVS DRYHEHKSRS SSRQNGVAES  660
ARECRHNKVD KSDVGKSPYG EKNYDEFHRY KDKRVTAKKI EAASLNYNSH GDSQDNEEDE  720
HHSRSKEKSR MMKDEAESSS SASQCSDDSR LVKVEKRQQK KRRKKFENHQ MEISLADTKR  780
NRERIDEEDR HCGHRSAQHH VKEPGMSEVD SNAYRGLVEV DAENFVSIDR SISYCGTCNE  840
TARSDFMNPD VQGGILCRSS DGDKLHDSSN RDSAPERERM LQVYRSDEEA TAPGMVATSL  900
VHTSCKGTDN EAKGELEKAC RLAALKKLEE IRMKKDTQTF HCREPIESLK ICTEENIHGR  960
VLEKANSSKV EVCETSIKRR RI*
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    40     54   RKEKRKALKKMKRKL
2    47     54   LKKMKRKL
3    499    505  RRSKKRK
4    499    507  RRSKKRKRE
5    760    764  KKRRK
6    760    765  KKRRKK
7    978    982  KRRRI
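
The NLS coordinates are 1-based and inclusive relative to the protein sequence listed earlier. The sketch below illustrates this for the first two entries (residues 40-54 and 47-54), using only the first 60 residues of the sequence. The helper name slice_1based is hypothetical and introduced only for this example.

```python
# First 60 residues of AUR62043443-RA, copied from the Sequence section
# with the block spacing and position labels removed.
first_60 = (
    "MMAVKEDNRQ" "EVEEREIDKN" "TIKDNGNGND"
    "NSITMATMSR" "KEKRKALKKM" "KRKLVRKEIA"
)

def slice_1based(seq, start, end):
    """Return seq[start..end] using 1-based, inclusive coordinates."""
    return seq[start - 1:end]

nls_1 = slice_1based(first_60, 40, 54)  # NLS entry 1
nls_2 = slice_1based(first_60, 47, 54)  # NLS entry 2
print(nls_1)
print(nls_2)
```

The same helper applies to the remaining entries once the full sequence is loaded as a single string.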
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G10320.1  1e-138   C3H family protein