PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID SMil_00005537-RA_Salv
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Lamiales; Lamiaceae; Nepetoideae; Mentheae; Salvia
Family C3H
Protein Properties Length: 540aa    MW: 61860.2 Da    PI: 7.554
Description C3H family protein
Gene Model
Gene Model ID Type Source Coding Sequence
SMil_00005537-RA_SalvgenomeNDCTCMView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1zf-CCCH24.93.5e-08369388625
                            -SGGGGTS--TTTTT-SS-S CS
                zf-CCCH   6 CrffartGtCkyGdrCkFaH 25 
                            C+f+++tG C++G+rC+++H
  SMil_00005537-RA_Salv 369 CPFHLKTGACRFGSRCSRIH 388
                            *******************9 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5010312.765363391IPR000571Zinc finger, CCCH-type
PfamPF006424.1E-6368388IPR000571Zinc finger, CCCH-type
PRINTSPR018485.8E-12369388IPR009145U2 auxiliary factor small subunit
PRINTSPR018485.8E-12388408IPR009145U2 auxiliary factor small subunit
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0000398Biological ProcessmRNA splicing, via spliceosome
GO:0089701Cellular ComponentU2AF
GO:0003723Molecular FunctionRNA binding
GO:0046872Molecular Functionmetal ion binding
Sequence ? help Back to Top
Protein Sequence    Length: 540 aa     Download sequence    Send to blast
MAPPPAPPHT TACTTTTPLH HHSTACTPQH HRRGLGEIRG ERRRRSEGGK GRRRELLSLS  60
QGTDPAGEGR LPPLTDPAGA KDRSGEGRRT DPAGAASSDD EQGKGKWRRR RLGRKRDIIG  120
GTTQKGKWDL NSGTEGALIS SDRRDTSPEP KIMIEEDVDN TNHTGTEAAE CAAGKREKRR  180
KAVKKEKRKN KRKEMAEMAR REEEIRLNDP EEQRRLQAEE EQERLRAEEE RRHFVEMERK  240
ILEEWEKKKA LQEKEEEERR RRVEQEELLS KQNQVGHENE VDDDGWEYVE EGPSEIIWQG  300
NEIIVKKKKI RVKKKDDQQI MKEDTNRPIS NPLPPQSEAF TDHKTAQQLL DSVAQETPNF  360
GTEQDKAHCP FHLKTGACRF GSRCSRIHFY PDKSCTLLIK SMYNGPGLAW EQDEGLEVLP  420
TAFYVGKHLC LNVSCRGVPD AKRCGATFCT HVVLNVTVTH PNECHVSNIC RKGQCPHREN  480
KRSDFLNFVP PQHTDEEVER AYEEFYEDVH TEFLKFGEII NFKVNILEFL YFRCASGSAP
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1106111WRRRRL
2177191KRRKAVKKEKRKNKR
3183191KKEKRKNKR
4246261KKKALQEKEEEERRRR
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA818966
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G10320.18e-51C3H family protein