PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID AUR62021019-RA
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; Caryophyllales; Chenopodiaceae; Chenopodioideae; Atripliceae; Chenopodium
Family C3H
Protein Properties Length: 1847 aa    MW: 210108 Da    PI: 7.1667
Description C3H family protein
Gene Model
Gene Model ID   Type    Source     Coding Sequence
AUR62021019-RA  genome  Phytozome  View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-CCCH  38     2.8e-12  21     46   1          26
                    --S---SGGGGTS--TTTTT-SS-SS CS
         zf-CCCH  1 yktelCrffartGtCkyGdrCkFaHg 26
                    ++++ C++++rtG+C+yG+rC+F+H+
  AUR62021019-RA 21 PGEPDCNHYLRTGYCSYGGRCRFSHP 46
                    5789*********************9 PP

2    zf-CCCH  29.5   1.3e-09  55     80   1          26
                    --S---SGGGGTS--TTTTT-SS-SS CS
         zf-CCCH  1 yktelCrffartGtCkyGdrCkFaHg 26
                    ++++ C +++r+G C +GdrC+F+H+
  AUR62021019-RA 55 PGEPDCLYYLRNGSCGFGDRCRFNHP 80
                    5789*********************8 PP

3    zf-CCCH  35.2   2.1e-11  103    127  3          27
                     S---SGGGGTS--TTTTT-SS-SSS CS
         zf-CCCH   3 telCrffartGtCkyGdrCkFaHgp 27 
                     +++C ++++tG+Ck+G +Ck++H++
  AUR62021019-RA 103 QPICEYYMKTGICKHGTSCKYHHPK 127
                     799********************96 PP

4    zf-CCCH  27.7   4.6e-09  141    166  2          27
                     -S---SGGGGTS--TTTTT-SS-SSS CS
         zf-CCCH   2 ktelCrffartGtCkyGdrCkFaHgp 27 
                     ++ +C+++ +tG Ck+G +CkF+H+p
  AUR62021019-RA 141 GEQECKYYIATGKCKFGMSCKFHHPP 166
                     6789********************96 PP
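
The four aligned segments above share the same cysteine/histidine spacing, which can be checked directly. The following is a minimal Python sketch and is not part of PlantTFDB. The segment strings and start coordinates are copied from the alignments above. The pattern `C.{8}C.{5}C.{3}H` and the helper name `ligand_positions` are assumptions made for illustration: the spacing is inferred from these four hits only, not taken from the zf-CCCH HMM itself.

```python
import re

# (start coordinate in the protein, aligned segment), copied from the alignments above
SEGMENTS = [
    (21, "PGEPDCNHYLRTGYCSYGGRCRFSHP"),
    (55, "PGEPDCLYYLRNGSCGFGDRCRFNHP"),
    (103, "QPICEYYMKTGICKHGTSCKYHHPK"),
    (141, "GEQECKYYIATGKCKFGMSCKFHHPP"),
]

# Assumed spacing C-X8-C-X5-C-X3-H, inferred from these four hits only
CCCH = re.compile(r"(C).{8}(C).{5}(C).{3}(H)")


def ligand_positions(start, segment):
    """Return 1-based protein coordinates of the C, C, C, H residues, or None."""
    match = CCCH.search(segment)
    if match is None:
        return None
    # match.start(group) is a 0-based offset within the segment;
    # offset 0 corresponds to the protein coordinate `start`.
    return [start + match.start(group) for group in range(1, 5)]


for start, segment in SEGMENTS:
    print(start, ligand_positions(start, segment))
```

For the first segment this gives residues 26, 35, 41 and 45, all inside the reported domain range 21 to 46.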

Sequence
Protein Sequence    Length: 1847 aa
MVEPKLKFVL EGEGQLYPER PGEPDCNHYL RTGYCSYGGR CRFSHPVGSV YPERPGEPDC  60
LYYLRNGSCG FGDRCRFNHP HLPSTSLGNK HVNGEDLPIR SDQPICEYYM KTGICKHGTS  120
CKYHHPKPFA PYKQGYPLRP GEQECKYYIA TGKCKFGMSC KFHHPPLSYE PVTMVKPVTP  180
HRHAPTRFRL SVVPSYVQRP YGSTSSDTSY LIDHLSTEDS SIFGPQSSSS GVQKEFFLPK  240
ENANFVAECS KGTLDTVFQP KMYRKLDILA LVLIAKIKIR AKVREARRIL HEPMLTCCLF  300
PHNHEIERET LIQLWMAGGF LEDARLKDVV FQHMGRHKGQ TTQMEDVGNC CIDLFEKRKL  360
ISASRTHNLT GKMTYVINVG TIKNMLNKVS TSTKDLYTMI KDSNSFDGSM QSWHVSLAYD  420
EFDQTTFKAL RKFHDLRTLL FIPNYGSSLK QIPSTFFLVI KLLRALDLSR SYITELPISI  480
GNLTKLRYLD CSLTPIEGLP ESIGHLRELQ TLKLKECNHL FELPKGMKDL VSLRHLDFDV  540
LGQLTSMPQG IGALTELRTL SAFIIDNNRG HNIRDLMYLN NLSGSICISG LENLCHEDDV  600
KVAALRRKKR ISKLQLRWNI FQDKKMEIQQ DSLTCIDEFI PNVCLEELQL LCYPLSRLPH  660
WIADEDLKNL VSITLIKCEN IELNTSLGEL PNLKYLEITQ MNGVRKIDCC FRGQISYPAF  720
WSLETLVIDG MSTLEAWEDV KTYDFPRLLK LIIKHCPNLA ELPFLPLIDS LKHLEISRCK  780
SLQYLTKQKL PGLIKDLIIN ECPLLKLKCS KDGEDWDKIE HVPNIWIDLK DIHSHQDDCS  840
EDDGRTTDSL SDDFDYSEDS DIADIVFREE PSKKGFMHHH PSAMEAAFQS GFASALLSIA  900
FEGMAHIILV ELKKIASADE AMIKLEATVT EAQEILSNLD ASMLDIHVNT KIVLQNLVDT  960
LARLCYKAGD LTEDLALKIM HKNIGSNKKP VLNVIRTSPT SWGKPYRFSN LQKEIDDILE  1020
QLKRLIKLLP EGRNKESNGN CSLNEVVPNL IGRECDKKEI IEELSSCSSN YKGLCIVGMD  1080
GLGKTALALT ILEQTEISEQ FDLIRLSMDT CGDFKFNRVV IQLENNLVAG SSHGSLLILF  1140
DNLLDVKSDD WDKFWSYITS KEIGGRQYTY IKILVTTVNA GVPEATKTKA HHLSFLSDEE  1200
CKKVLIERAQ FHIKPLPELE PSLIHAADGL AKQCEGLPYV ARILGVKLAH CDFDELVTMI  1260
NQPLWVSTVF KAEILPALRS GYLSLGPHLK QCFAYFSLFP QNYCYDVDEL VQLWVAEGFI  1320
EQDRSISIAL QNSNMYLNEL RNKSILQITH EATDNGKPVY KLHEFNHKFA HIAASWTCQR  1380
IDEGLSTLRS LDSDTRHLSV SCDNIDSMTI WAKLKKLKHL RTFLSLRNRK LIGAIPPELF  1440
KSLTRLRVLV LSKTDIVELP DSIKKLKRLR YLDVSYTSLK KLPETLVELP TLQYLKLKDT  1500
KSLSYLPSGF HKLTNLLSLD WERRDLMRVG VPSNIGSLSA LQTLPLFSVH DKEGYTVKEL  1560
NNMNNLRGSI CIQNLENIKD MEESKTAMLC NKSFLKSIEL KWMQSRSDSA AKDVLTGLKP  1620
HDYLTELKIT KYGSSKFPQW MCHSSLMVEK IHLKKCNVQV MPSFGELLNL KTLIIQDFAN  1680
LKCLDYHFCG IGVGHFQSLV SLELEDLAQL QQWTRLDAND MPRLRVLKIK LCPKLEDLPS  1740
LKYLTSLTDM QIEYCPILKS LPELPDTLKS LVVIECNLLK ERCQIGGSDW HKVDCIPEVE  1800
IDKEEIRTSV LPGIQLIMLT KYILELKQDN ANEGNEDGMV KLDGMK*
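
The sequence block above is laid out in groups of ten residues, with a running position count at the end of each full line and a trailing `*`, which conventionally marks the stop codon. A minimal Python sketch of turning such a block into a plain string and slicing out a domain by its 1-based coordinates follows. The function names `parse_block` and `domain` are hypothetical, and only the first three lines of the block are used to keep the example short.

```python
# First three lines of the sequence block above (residues 1-180), copied verbatim
BLOCK = """\
MVEPKLKFVL EGEGQLYPER PGEPDCNHYL RTGYCSYGGR CRFSHPVGSV YPERPGEPDC  60
LYYLRNGSCG FGDRCRFNHP HLPSTSLGNK HVNGEDLPIR SDQPICEYYM KTGICKHGTS  120
CKYHHPKPFA PYKQGYPLRP GEQECKYYIA TGKCKFGMSC KFHHPPLSYE PVTMVKPVTP  180
"""


def parse_block(block):
    """Keep only letters: drops spaces, position counts, and a trailing '*'."""
    return "".join(ch for ch in block if ch.isalpha())


def domain(sequence, start, end):
    """Slice a domain using 1-based, inclusive coordinates."""
    return sequence[start - 1:end]


seq = parse_block(BLOCK)
print(len(seq))             # 180
print(domain(seq, 21, 46))  # PGEPDCNHYLRTGYCSYGGRCRFSHP
```

The slice for coordinates 21 to 46 reproduces the query line of the first zf-CCCH alignment, which is a quick way to confirm that the Start and End columns of the domain table are 1-based and inclusive.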
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G18550.1  3e-46    C3H family protein