PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Rmu_sc0012005.1_g000009.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family C3H
Protein Properties Length: 1284aa    MW: 140873 Da    PI: 4.2223
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Rmu_sc0012005.1_g000009.1genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH17.86e-0612591282226
                                 -S---SGGGGTS--TTTTT-SS-SS CS
                    zf-CCCH    2 ktelCrffartGtCkyGdrCkFaHg 26  
                                 k + Cr ++++G+Ck G++C + H+
  Rmu_sc0012005.1_g000009.1 1259 KGRVCR-YFESGHCKKGSSCDYLHP 1282
                                 7789**.8888*************9 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1284 aa     Download sequence    
MAAEAEREEV EAVEMVEAER EEVGGVAEEV QVDREEEEEV KLEGEGVELE KAEVELEREE  60
IREEEEKAEE QGHVTDMVDE TIETEGRAEA EAVDGGHVSD LNLKDTVDQT LETSDGADVE  120
DTFETTDAVE EGKGNEAEVH EVNATDMGDE TNGTEAVEEG KEGVVTDMVD ETVESEAMEE  180
EGNGVAAEGH EANVTVTESD EIIETTEAVE EGKVSEAEGD EANVKDMVHE SIETEAAGEE  240
EQKAEVEGHE ADAAIEAEEV GQEAEAEGEE QNLNVKDMVD GAVETQVSRE EEQEAEVEEN  300
EANTTIEAEE AGQEVEAEGG EWNLNVKDMV DDTIQTQVSG EVGQEAEAEG GEQNLNVKDM  360
VDDTIQTQVS GEEEQEAEVE GHELTATVTV DEADEAVVTT EAVEADVSDD VMEETTEAEE  420
TEAAEGAEEM EADEMEAAEE EEAEEMEMVE DTNTSGVGGK RKRRKNSKAA ASEKVLSKKK  480
EEDVCFICFD GGELVLCDRR GCPKAYHPSC VNRDEAFFRA KGRWNCGWHL CSNCEKNAHY  540
MCYTCTFSLC KACTKDAVIL CIKGNKGFCE TCMKTVMLIE KNEHGNKEKD AVDFDDKSSW  600
EYLFKDYWTD LKERLSLTLD DLSQAKNPWK GSAGHANKHG SHDEPYDANI DGGSDSDNSE  660
NLDSTNSKRR KGKKRLKTRA KGKNSSSPAT GSGGRYADDN TEWASKELLE FVMHMRNGDS  720
SALSQFDVQA LLLEYIKRNK LRDPRRKSQI ICDLRLQSLF GKPRVGHFEM LKLLESHFFM  780
KEDSQVDDHQ GSVVDTEGNQ LEADGHSDTP AKASKDKKRK RKKNEPQSNV EDFAAIDIHN  840
INLIYLRRNL VEDLLEDTDN FQEKVAGSFV RIRISGSGQK QDLYRLVQVI GTCKAAEPYK  900
VGKRMTDILL EILNLNKTEI VTIDIISNQE FTEDECKRLR QSIKCGLINR LTVGDVQEKA  960
VVLQPVRVKD WLETETVRLQ HLRDRASEKG RRKEYPFLFQ QFTACWLIGS SVTNCHLIWC  1020
RGAPSQKIEP NNATNVPAQH PSHGYWPPTN NGTSSINTGN SAGNFPAQGF SGVPNSDAWR  1080
PPVPSNQSYI QPPAQPQVPW GPSVSDNQSA VPRMGQESQN SGWVPVAGNP NVAWGGPVPG  1140
NSNMNWAAPS QSPGWSASGQ VPVRGNAVPS WAPPPGQGPP SVSANPGWAP PGQGPALGNA  1200
NSGWSAPTAN QTQNGDRFSN QRDRGSHGGD SGFGGGKPWN RQPSFGGGGG GGSSRPPFKG  1260
RVCRYFESGH CKKGSSCDYL HPDH
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1460465KRKRRK
2817822KKRKRK
3817823KKRKRKK
4817824KKRKRKKN
5819823RKRKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G16485.10.0C3H family protein