PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Rmu_sc0002661.1_g000009.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family C2H2
Protein Properties Length: 1493aa    MW: 166500 Da    PI: 9.1507
Description C2H2 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Rmu_sc0002661.1_g000009.1genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-C2H213.40.0002313981421223
                                 EET..TTTEEESSHHHHHHHHHHT CS
                    zf-C2H2    2 kCp..dCgksFsrksnLkrHirtH 23  
                                 +Cp   Cgk F ++ +L++H r+H
  Rmu_sc0002661.1_g000009.1 1398 VCPvkGCGKKFFSHKYLVQHRRVH 1421
                                 69999*****************99 PP

2zf-C2H211.80.0007314571483123
                                 EEET..TTTEEESSHHHHHHHHHH..T CS
                    zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                                 y+C    Cg++F+  s++ rH r+  H
  Rmu_sc0002661.1_g000009.1 1457 YVCAepGCGQTFRFVSDFSRHKRKtgH 1483
                                 899999****************99666 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1493 aa     Download sequence    
MAASEQPQEV LSWLRALPVA PEYHPTWAEF QDPIAYIFKI EKEASKYGIC KIVPPVPPAP  60
KKTAIANLNK SLVARGGPSV GKGPKALPTF TTRQQQIGFC PRKARPVQRP VWQSGEHYTF  120
NQFEAKAKSF EKSYLKKRRK KGGLNPLDVE TLYWRATVDK PFSVEYANDM PGSAFVPLSS  180
KKSGASTSRE AGDGVTLGET AWNMRGVSRS RGSLLRFMKE EIPGVTCPMV YVAMMFSWFA  240
WHVEDHDLHS LNYLHLGAGK TWYGVPREAA VAFEEVVRVQ GYGGEINPLV TFATLGEKTT  300
VMSPEVFISS GIPCCRLVQN AGEFVVTFPR AYHTGFSHGF NCGEAANIAT PEWLRVANDA  360
AVRRASINYP PMVSHFQLLY DLALALCSRT PVRNSAEPRS SRLKDKKKGE GETVVKGLFV  420
KNVIQNNELL HVLGKGSSIV LLPQSSSDIS VCSKLRVGSQ LRVNPDDLMI DGKQGIKQVK  480
GLYSVKGKLA SLCESSQHPS LNANGNASTP SKMLNMSAKR ESNVEGEGLS DQRLFSCVTC  540
GILSFSCVAI IQPREAAARY LMSADCSFFN DWAVDCEPIQ VANGDPNSSK KGPCTETGLM  600
QKSTHDGLYD VPVQSADYRN QITDPINEVD SNTEMQRDTN ALGLLALTYG DSSDSEEDQA  660
EPDAPVCGDE TNLSDCSLEG RYEYKSASPP LRDSYGGTAG VRSPTSPRFD CGNELPTVDS  720
NVENRREATN FKDNGHQYID YSVDLDTLTK TNGLGGTSID PVKVSYSGSP DALDIQPTRF  780
GQATLQKEST GTSFFPGCDQ DSSRMHVFCL EHAVEVEQQL RSFGGAHILL LCHPDYPRIV  840
DEAKEIAEEL GVNYPWNDMV FRDATREDEE RIQSALDSEE AIAGNGDWAV KMGINLFYSA  900
SLSRSHLYSK QMPYNSVIYN AFGRSSPASS PAGPEVCGRR PAKQKKIVVG KWCGKVWMSN  960
QVHPFLIKRD HEEKKVELEQ RRFHDSEMPD EKLDGKSEST RKTEKTMVTK QYSRKRKMTV  1020
EGGTTKKAKC PDAVSAHSVD DNSHQQQKRF LKNKQAKYIE SGPTKKAKFT ETEDAVSGDS  1080
MEDDFRQQNR RTLRSEQAKY IEGDDDVSDD SMGVDSHQQQ RRIAKSKQAK YIARDFSMLS  1140
DDSVGVNSDH QQRRVAESNA REFSAVSDDS LEDNIHQLHR RSLRRNKGKC IGRGNLTSQN  1200
LHGVSSPQQQ RRTSKSKQAK TVEREDAALD DTPDDNAALR LKSFRGRQIK PETVQQKKQE  1260
TPRRVKQGSR RLQETQQKTP RIQNIQSEQN TVDVNAEEPE GGPSTRLRKR PPKEQPETGR  1320
KKAKEQPETG RKKAREQQQT GRIKVNTALG VKTKNASARK GKNASAVREE EAEFLCDVEG  1380
CTMSFGTKQE LNLHKKNVCP VKGCGKKFFS HKYLVQHRRV HEDDRPLRCP WKGCKMTFKW  1440
AWARTEHIRV HTGARPYVCA EPGCGQTFRF VSDFSRHKRK TGHSVKKGKG RSR
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1136140KKRRK
2136141KKRRKK
310131018SRKRKM
413201333RKKAKEQPETGRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G48430.10.0C2H2 family protein