PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Ro05_G22157
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rubus
Family C3H
Protein Properties Length: 2040 aa    MW: 227354 Da    pI: 7.3115
Description C3H family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
Ro05_G22157      genome    GDR       View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End   HMM Start  HMM End
1    zf-CCCH  36.9   6.1e-12  1661   1683  4          26
                   ---SGGGGTS--TTTTT-SS-SS CS
      zf-CCCH    4 elCrffartGtCkyGdrCkFaHg 26  
                   e C+f+++tG Ck+G++CkF+H+
  Ro05_G22157 1661 EDCSFYLKTGNCKFGSNCKFNHP 1683
                   89********************9 PP

2    zf-CCCH  25.7   2e-08    1749   1774  2          27
                   -S---SGGGGTS--TTTTT-SS-SSS CS
      zf-CCCH    2 ktelCrffartGtCkyGdrCkFaHgp 27  
                   ++ +C++++r G CkyG  C+++Hg+
  Ro05_G22157 1749 GQTECKYYLRPGGCKYGKACRYNHGK 1774
                   6789********************96 PP

3    zf-CCCH  37.6   3.6e-12  1795   1819  2          26
                   -S---SGGGGTS--TTTTT-SS-SS CS
      zf-CCCH    2 ktelCrffartGtCkyGdrCkFaHg 26  
                   ++++C++++r+G CkyG++C+F+H+
  Ro05_G22157 1795 GERECPYYMRNGSCKYGSNCRFNHP 1819
                   789*********************9 PP

4    zf-CCCH  24.9   3.4e-08  1928   1954  1          27
                   --S---SGGGGTS--TTTTT-SS-SSS CS
      zf-CCCH    1 yktelCrffartGtCkyGdrCkFaHgp 27  
                   ++++ C+ff+rtG Ck+ ++Ck++H++
  Ro05_G22157 1928 PGQPVCSFFLRTGDCKFKSNCKYHHPK 1954
                   5789*********************96 PP

5    zf-CCCH  32.8   1.1e-10  1976   1999  3          26
                   S---SGGGGTS--TTTTT-SS-SS CS
      zf-CCCH    3 telCrffartGtCkyGdrCkFaHg 26  
                   + +C +++r+G+Ck+G+ CkF H+
  Ro05_G22157 1976 QNICTHYSRYGICKFGPACKFDHP 1999
                   679********************9 PP
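
The target rows of the five alignments above all show the same cysteine/histidine spacing, C-x8-C-x5-C-x3-H. The Python sketch below checks that spacing with a regular expression. The pattern and the helper name `find_ccch` are illustrative assumptions read off this entry only; they are not how the HMM hits in the table were scored, and zf-CCCH motifs in other proteins may use different spacings.

```python
import re

# Spacing observed in the five aligned segments of this entry (an assumption
# for illustration, not a general definition of the zf-CCCH family).
CCCH_SPACING = re.compile(r"C.{8}C.{5}C.{3}H")

# Aligned target segments keyed by their 1-based start position in the protein,
# copied from the alignment rows of this record.
segments = {
    1661: "EDCSFYLKTGNCKFGSNCKFNHP",
    1749: "GQTECKYYLRPGGCKYGKACRYNHGK",
    1795: "GERECPYYMRNGSCKYGSNCRFNHP",
    1928: "PGQPVCSFFLRTGDCKFKSNCKYHHPK",
    1976: "QNICTHYSRYGICKFGPACKFDHP",
}

def find_ccch(segment, start):
    """Return (first, last, motif) in 1-based protein coordinates, or None."""
    match = CCCH_SPACING.search(segment)
    if match is None:
        return None
    return (start + match.start(), start + match.end() - 1, match.group())

for start, segment in segments.items():
    print(start, find_ccch(segment, start))
```

Each reported range falls inside the Start and End columns of the corresponding table row, since the HMM alignment extends a few residues beyond the first cysteine and the final histidine.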

Sequence
Protein Sequence    Length: 2040 aa
MAASEQPQEV LSWLRTLPVA PEYHPTWAEF QDPIAYIFKI EKEASKYGIC KIVPPVPPAP  60
KKTAIANLNK SLVARGGPSV GEGPKSLPTF TTRQQQIGFC PRKARPVQRP VWQSGEYYTF  120
QQFEAKAKSF EKSYLKRRRK KGGGGLSPLD IETLYWKATV DKPFSVEYAN DMPGSAFVPL  180
SSKKTSASTS REAGDGVTLG ETAWNMRGVS RSRGSLLRFM KEEIPGVTCP MVYVAMMFSW  240
FAWHVEDHDL HSLNYLHMGA GKTWYGVPRE AAVAFEEVVR VQGYGGEINP LVTFATLGEK  300
TTVMSPEVFI SSGIPCCRLV QNAGEFVVTF PRAYHTGFSH GFNCGEAANI ATPEWLRVAN  360
DAAIRRASIN YPPMVSHFQL LYDLALALCS RMPVHNSAEP RSSRLKDKKK GEGETVVKGL  420
FVKNVIQNNE LLHVLGKGSS IVLLPQSSSD ISVCSKLRVG SNPDDLMIDG KQGIKQVKGF  480
YSMKGKLASL CESSRHPSLN GNSNVSAPSK MLNVSTKRES NVEGEGLSDQ RLFSCVTCGI  540
LSFSCVAIIQ PREAAARYLM SADCSFFNDS AVDCEAFQVA NGDPNSSKKG PCTGWFNLEL  600
ASLDETGLME KSTPDGLYDV PVQSADYRIQ MADPINEVES NTEMQGDTSA LGLLALTYGN  660
SSDSEEDQAE LDVPVCGDET NLSDCSLEGR YEYQSASPPL RDSYGGTTGV RSPTLPGSDC  720
GNELPTVDGY VENKHGHQYF DRSVNTETNN LALTKTNGLV GTSLDPVKVS YSGSPDAYDV  780
QTTGFGQATL QKENTSISFA PGCDHDSSRM HVFCLEHAVE VEQQLRSIGG AHILLLCHPD  840
YPRIEDEAKE IAEELGVNYP WNDMVFRHAT REDEERIQSA LDSEEAIAGN GDWAVKMGIN  900
LFYSASLSRS HLYSKQMPYN SVIYNAFGRS SPASSSAGPD VCGRRPAKQK KIVVGKWCGK  960
VWMSHQVHPF LIKREHEEKK VEQEEQRRFH GSAMPDEKLD GKSEGTRKTE KTVVTKQYSR  1020
KRKMNVEGGT TKKAKCFEKE DAVSAYSIDD NSHQQQKRFL KNKQAKYIES GPTKKAKCME  1080
TEDAVSGDSM EDDSHQQNRR MLQSGQAKYV EDDGSDDSMG IDSHQQQSRI AKSKQAKHTA  1140
RDFSMVSDDS VGVDSHHQQR QVAESNAREF SAVSDDSLED NAHQLHRRSL RRNKDKCIGR  1200
ENLTSEGLHG ASSRQQQRRT YKSKQAKIVE REDGALDDTP EDSAVLQNKR ILRGKQIKSE  1260
TLQQKKQENP GRVKQASRRL QETQKQTPKV QNIESEQNTF DINAEKEPEG GPSTRLRKRP  1320
PKEQQETGRK KAKEQPETSR KKAKEQPETG RKKAKEQQQT GRIKVNTTLA VKTKNAPARK  1380
AKNSLAVREE EAEFLCDIEG CTMSFGSKQE LNLHKRNVCP VKGCAKKFFS HKYLVQHRRV  1440
HMDDRPLRCP WKGCKMTFKW AWARTEHIRV HTGARPYVCA EPGCGQTFRA LKCGGRMELS  1500
ESIVSVPPDS HSNSDNNNEI DQVQQQLNYS EIGSQHSIPE TNDIGHQQQN HSVPSDLDHA  1560
VIDQIQKLDL KGDDFEIVVE ENKEKGEDDT EEEGGEFQEV EAEKSYNGRE TENGNESERH  1620
SEEGEQESNE GEDKSENGAE VEKKGEESNR RYQYPVRPEA EDCSFYLKTG NCKFGSNCKF  1680
NHPVRRKNNQ LKDWDFLCLE SFIIDFSFTA VAKAMSMFLI ARVLFLIRIR ECVFKDKVKE  1740
RDELAEKPGQ TECKYYLRPG GCKYGKACRY NHGKGKPSVA SVVELNFLGL PIRQGERECP  1800
YYMRNGSCKY GSNCRFNHPD PTAAGGSDPP SGFDNGGPAS LQGGSQSSSW SAPRSLSESP  1860
LYVPMMMPPS QGVPSQNTEW NSYQAPMYLG ERSMPAPPPY VINNSGTETN VYKQYPMPNQ  1920
VDEFPERPGQ PVCSFFLRTG DCKFKSNCKY HHPKSQTAVS PSFALSDKGL PLRPDQNICT  1980
HYSRYGICKF GPACKFDHPL HLTSSTTSGL DHQLPFSDLA NIKEAGIARS RSGTDDTIQL
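
The coordinates used in the domain and NLS tables are 1-based positions in this sequence, and the number at the end of each sequence line gives the index of that line's last residue. As a minimal sketch, the Python below parses two lines of the block and extracts residues 1661 to 1683. The function names `parse_numbered_block` and `subseq` are hypothetical helpers, not part of any PlantTFDB tooling.

```python
def parse_numbered_block(lines):
    """Join the 10-residue groups of a numbered sequence block.

    Returns (first_index, sequence), where first_index is the 1-based
    position of the first residue, inferred from the trailing number on
    the first numbered line.
    """
    chunks = []
    first_index = None
    for line in lines:
        tokens = line.split()
        if not tokens:
            continue
        end_index = None
        if tokens[-1].isdigit():
            end_index = int(tokens.pop())  # index of the line's last residue
        chunk = "".join(tokens)
        if first_index is None and end_index is not None:
            # Residues already collected precede this line's first residue.
            first_index = end_index - len(chunk) + 1 - sum(len(c) for c in chunks)
        chunks.append(chunk)
    return first_index, "".join(chunks)

def subseq(first_index, seq, start, end):
    """Slice seq using 1-based, inclusive protein coordinates."""
    return seq[start - first_index : end - first_index + 1]

# Two consecutive lines copied from the sequence block of this record.
excerpt = [
    "SEEGEQESNE GEDKSENGAE VEKKGEESNR RYQYPVRPEA EDCSFYLKTG NCKFGSNCKF  1680",
    "NHPVRRKNNQ LKDWDFLCLE SFIIDFSFTA VAKAMSMFLI ARVLFLIRIR ECVFKDKVKE  1740",
]

first, seq = parse_numbered_block(excerpt)
print(first)                            # 1621
print(subseq(first, seq, 1661, 1683))   # EDCSFYLKTGNCKFGSNCKFNHP
```

The extracted segment matches the target row of the first zf-CCCH alignment in the Signature Domain section.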
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1019   1024  SRKRKM
2    1329   1342  RKKAKEQPETSRKK
3    1340   1353  RKKAKEQPETSRKK
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G48430.1  0.0      C2H2 family protein