PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Rmu_sc0008967.1_g000004.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family C2H2
Protein Properties Length: 525 aa    MW: 57281.3 Da    pI: 8.7629
Description C2H2 family protein
Gene Model
Gene Model ID              Type    Source  Coding Sequence
Rmu_sc0008967.1_g000004.1  genome  GDR     View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-C2H2  17.4   1.2e-05  76     98   1          23
                               EEETTTTEEESSHHHHHHHHHHT CS
                    zf-C2H2  1 ykCpdCgksFsrksnLkrHirtH 23
                               ++C+ C+k F r  nL+ H r H
  Rmu_sc0008967.1_g000004.1 76 FVCEVCNKGFQREQNLQLHRRGH 98
                               89*******************88 PP

2    zf-C2H2  12.4   0.00047  152    174  1          23
                                EEETTTTEEESSHHHHHHHHHHT CS
                    zf-C2H2   1 ykCpdCgksFsrksnLkrHirtH 23 
                                +kC++C+k++  +s+ k H + +
  Rmu_sc0008967.1_g000004.1 152 WKCEKCSKRYAVQSDWKAHSKIC 174
                                58*****************9876 PP
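
As a reading aid for the two alignments above, the following Python sketch lists which residues of each aligned segment sit in the columns where the zf-C2H2 consensus line shows C or H. The function name `residues_at_consensus_ch` is hypothetical and the strings are copied from the alignment blocks. The sketch only compares aligned columns and makes no claim about metal coordination.

```python
# Consensus line and aligned segments, copied from the alignment blocks above.
CONSENSUS = "ykCpdCgksFsrksnLkrHirtH"
SEGMENTS = {
    (76, 98): "FVCEVCNKGFQREQNLQLHRRGH",
    (152, 174): "WKCEKCSKRYAVQSDWKAHSKIC",
}


def residues_at_consensus_ch(consensus, segment):
    """Return (hmm_position, residue) pairs for columns where the consensus is C or H.

    Positions are 1-based. Assumes a gapless alignment, as in both blocks above.
    """
    if len(consensus) != len(segment):
        raise ValueError("gapless alignment of equal length expected")
    return [
        (pos, res)
        for pos, (con, res) in enumerate(zip(consensus, segment), start=1)
        if con in ("C", "H")
    ]


for (start, end), segment in SEGMENTS.items():
    pairs = residues_at_consensus_ch(CONSENSUS, segment)
    # Convert HMM positions to protein coordinates using the segment start.
    labels = " ".join(f"{res}{start + pos - 1}" for pos, res in pairs)
    print(f"{start}-{end}: {labels}")
# 76-98: C78 C81 H94 H98
# 152-174: C154 C157 H170 C174
```

In the second segment the final consensus H column (HMM position 23) is occupied by a cysteine, which is consistent with the `+` on the match line and the lower posterior-probability digits at the end of that alignment.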

Sequence
Protein Sequence    Length: 525 aa
MQPTSYQQVD ENMSNLTCAS GDLSASANSS IRNDSSSSTQ QQQPPQKKKR NLPGNPDPDA  60
EVIALSPKSL MATNRFVCEV CNKGFQREQN LQLHRRGHNL PWKLKQRTNK EVRKKVYLCP  120
EPTCVHHDPS RALGDLTGIK KHFSRKHGEK KWKCEKCSKR YAVQSDWKAH SKICGTREYR  180
CDCGTLFSRR DSFITHRAFC DALAQENSNA ARAINPLLSS SLPQLHGGFQ PHHLSHHHVK  240
REVQDLQLQL PPWMGGEATA SAGPSHHLSS NINLSSSSPL FSTSSFDHQQ SFLHQNPNPS  300
SSDFQPSASA HMSATALLQK ASEMGATVSK PSSMFLKPHD QQHQASHMMP ETTAFGSFME  360
QAGANSGGAA AASNNSLLHD MMMMTSHLQQ SSSAARGGFD QLSSSSSFGD HHHQGFNGSM  420
VMGSFAQNVN PSPQKKLITG ESQLRRTSST TDGGGGGGSS GGGGMNEGMT RDFLGLRAFS  480
SQDDQDFFNM AAAHGLDQHV VNSTSSANYN GQQHNHQNQT SWQGN
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    103    115  KLKQRTNKEVRKK
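
The domain and NLS coordinates in this record are 1-based and inclusive, so they can be checked directly against the protein sequence. The Python sketch below is a minimal illustration: it rebuilds the sequence from the 10-residue blocks shown in the Sequence section and slices out each reported region. The helper names `parse_blocked_sequence` and `region` are hypothetical, not part of any PlantTFDB tooling.

```python
# Sequence text as displayed in the Sequence section (blocks of 10 with position counters).
RAW = """
MQPTSYQQVD ENMSNLTCAS GDLSASANSS IRNDSSSSTQ QQQPPQKKKR NLPGNPDPDA  60
EVIALSPKSL MATNRFVCEV CNKGFQREQN LQLHRRGHNL PWKLKQRTNK EVRKKVYLCP  120
EPTCVHHDPS RALGDLTGIK KHFSRKHGEK KWKCEKCSKR YAVQSDWKAH SKICGTREYR  180
CDCGTLFSRR DSFITHRAFC DALAQENSNA ARAINPLLSS SLPQLHGGFQ PHHLSHHHVK  240
REVQDLQLQL PPWMGGEATA SAGPSHHLSS NINLSSSSPL FSTSSFDHQQ SFLHQNPNPS  300
SSDFQPSASA HMSATALLQK ASEMGATVSK PSSMFLKPHD QQHQASHMMP ETTAFGSFME  360
QAGANSGGAA AASNNSLLHD MMMMTSHLQQ SSSAARGGFD QLSSSSSFGD HHHQGFNGSM  420
VMGSFAQNVN PSPQKKLITG ESQLRRTSST TDGGGGGGSS GGGGMNEGMT RDFLGLRAFS  480
SQDDQDFFNM AAAHGLDQHV VNSTSSANYN GQQHNHQNQT SWQGN
"""


def parse_blocked_sequence(text):
    """Keep only letters, dropping spaces, newlines and position counters."""
    return "".join(ch for ch in text if ch.isalpha())


def region(seq, start, end):
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


SEQ = parse_blocked_sequence(RAW)

# Regions as reported in the Signature Domain and NLS tables of this record.
REPORTED = {
    "zf-C2H2 domain 1": (76, 98, "FVCEVCNKGFQREQNLQLHRRGH"),
    "zf-C2H2 domain 2": (152, 174, "WKCEKCSKRYAVQSDWKAHSKIC"),
    "NLS 1": (103, 115, "KLKQRTNKEVRKK"),
}

print("length:", len(SEQ))  # length: 525
for name, (start, end, expected) in REPORTED.items():
    found = region(SEQ, start, end)
    status = "ok" if found == expected else "MISMATCH"
    print(f"{name} {start}-{end}: {found} [{status}]")
```

All three regions match the sequence at the stated coordinates, and the reassembled length agrees with the 525 aa given under Protein Properties.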
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G50700.1  1e-108   C2H2 family protein