PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG027104t3
Common Name TCM_027104
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family C3H
Protein Properties Length: 851 aa    MW: 99016.5 Da    pI: 8.6433
Description C3H family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG027104t3  genome  CGD     View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-CCCH  19.2   2.1e-06  217    236  6          25
                       -SGGGGTS--TTTTT-SS-S CS
           zf-CCCH   6 CrffartGtCkyGdrCkFaH 25 
                       C+f+++tG C++G rC++ H
  Thecc1EG027104t3 217 CPFHLKTGACRFGQRCSRVH 236
                       ******************99 PP

2    zf-CCCH  26.3   1.3e-08  346    373  1          26
                       --S---SGGGGTS..--TTTTT-SS-SS CS
           zf-CCCH   1 yktelCrffartG..tCkyGdrCkFaHg 26 
                       +k ++C  f+++   tC++G  C+F+H+
  Thecc1EG027104t3 346 WKVAICGEFMKSRlkTCSHGTACNFIHC 373
                       899************************8 PP
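
The two hits above can be checked against the commonly cited general form of the CCCH motif, C-x(4-15)-C-x(4-6)-C-x3-H. The following is a minimal sketch and not part of PlantTFDB: the regular expression and the function name `ccch_spacers` are assumptions made for illustration, and the two strings are the query segments copied from the alignments.

```python
import re

# General CCCH pattern (an assumption here, not taken from PlantTFDB):
# C-x(4-15)-C-x(4-6)-C-x3-H, with the two variable spacers captured.
CCCH = re.compile(r"C(.{4,15})C(.{4,6})C.{3}H")


def ccch_spacers(segment):
    """Return the lengths of the two variable spacers, or None if no motif."""
    match = CCCH.search(segment.upper())
    if match is None:
        return None
    return len(match.group(1)), len(match.group(2))


# Query segments of the two zf-CCCH hits (residues 217-236 and 346-373).
hit1 = "CPFHLKTGACRFGQRCSRVH"
hit2 = "WKVAICGEFMKSRlkTCSHGTACNFIHC"

print(ccch_spacers(hit1))  # spacer lengths for hit 1
print(ccch_spacers(hit2))  # spacer lengths for hit 2
```

Under these assumptions, hit 1 has the C-x8-C-x5-C-x3-H spacing, while hit 2 carries ten residues in its first spacer (C-x10-C-x5-C-x3-H).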

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50103            12.23     211    239  IPR000571    Zinc finger, CCCH-type
SMART            SM00356            3.4       212    238  IPR000571    Zinc finger, CCCH-type
Pfam             PF00642            2.2E-4    216    236  IPR000571    Zinc finger, CCCH-type
PRINTS           PR01848            1.4E-36   217    236  IPR009145    U2 auxiliary factor small subunit
PRINTS           PR01848            1.4E-36   236    256  IPR009145    U2 auxiliary factor small subunit
Gene3D           G3DSA:3.30.70.330  3.9E-27   242    341  IPR012677    Nucleotide-binding alpha-beta plait domain
PROSITE profile  PS50102            9.627     243    343  IPR000504    RNA recognition motif domain
CDD              cd12540            9.21E-50  244    342  No hit       No description
SMART            SM00361            1.3E-5    267    339  IPR003954    RNA recognition motif domain, eukaryote
PRINTS           PR01848            1.4E-36   272    287  IPR009145    U2 auxiliary factor small subunit
SuperFamily      SSF54928           5.08E-16  275    343  IPR012677    Nucleotide-binding alpha-beta plait domain
PRINTS           PR01848            1.4E-36   300    322  IPR009145    U2 auxiliary factor small subunit
PRINTS           PR01848            1.4E-36   327    351  IPR009145    U2 auxiliary factor small subunit
PROSITE profile  PS50103            10.559    345    375  IPR000571    Zinc finger, CCCH-type
SMART            SM00356            0.11      345    374  IPR000571    Zinc finger, CCCH-type
Pfam             PF00642            5.4E-6    346    373  IPR000571    Zinc finger, CCCH-type
PRINTS           PR01848            1.4E-36   365    377  IPR009145    U2 auxiliary factor small subunit
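
Several databases report the same region with slightly different boundaries, so it can help to collapse overlapping hits into one region per domain type. The following is a minimal sketch of that idea, not a PlantTFDB or InterPro procedure: `merge_intervals` is a hypothetical helper, and the coordinate lists are copied by hand from the feature rows above.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) pairs; coordinates are inclusive."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# (start, end) of every hit annotated with IPR000571 (zinc finger, CCCH-type).
ccch_hits = [(211, 239), (212, 238), (216, 236),
             (345, 375), (345, 374), (346, 373)]
# Hits annotated as RNA recognition motif (IPR000504 and IPR003954).
rrm_hits = [(243, 343), (267, 339)]

print(merge_intervals(ccch_hits))
print(merge_intervals(rrm_hits))
```

With this merging, the hits resolve into two CCCH zinc-finger regions (211-239 and 345-375) flanking a single RNA recognition motif region (243-343).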
Gene Ontology
GO Term     GO Category         GO Description
GO:0000398  Biological Process  mRNA splicing, via spliceosome
GO:0089701  Cellular Component  U2AF
GO:0000166  Molecular Function  nucleotide binding
GO:0003723  Molecular Function  RNA binding
GO:0046872  Molecular Function  metal ion binding
Sequence
Protein Sequence    Length: 851 aa
MGEAETALKE EEGGGERENH QMEKSRKEKR KQMKKMKRKQ VRKEAAEKER EAEEARLNDP  60
EEQMRIQREE EEERKRREIA LKEFEERERV WIEAMEMKRK AQEEEEKEEE EKRKDLKEDA  120
NGEQEEMSDD WEYIEGSPQI IWEGNEITVR KKQVRVPKKD ANQKSKEEDA DRPTSNPLPP  180
QSEAFADYLN ASSAQQVLES VAKEVPNFGT EQDKAHCPFH LKTGACRFGQ RCSRVHFYPD  240
KSCTLLMRNM YNGPGLAWEQ DEGLEYTDEE VERCYEEFYE DVHTEFLKFG EIVNFKVCKN  300
GAFHLRGNVY VHYKMLESAV LAYHSINGRY FAGKQVKCEF VNLTRWKVAI CGEFMKSRLK  360
TCSHGTACNF IHCFRNPGGD YEWADWDKPP PRYWVKKMGA LFGYSDEAGF EKQIEQEHSG  420
QSRNRSRVIK SDADRHRSRR SKSREMNRLI GGADRSPCIE DDVEESSHSQ RGKNNDRKQT  480
KGLDGRSYRE SKSLKWDQNR EKNHDTSSDG GYSDSKRGKK IDRKRAKTLD GRSDRQRSLT  540
WDQNSEEIHD TSSDGGYSDS KRGKKNDRKQ AKTLDGRSDR QRGLKWDENS EKIHTTSSDG  600
GYSDSKRGKE NDRKQAKVLD GGRSDRQRSL TWDQNSEEIH DTSSYGGYSD SKRGKKNDRK  660
QAKTLDGRSD RQRSLKCDEN SEKIHDTSSD GSYSDSKRGK KNDRKRAKTL DGRSGRQRSL  720
KWDENSEQIH YTSSDGGYSH SKRGKKNERK KAKLLDGRSD RHRSLKWDQN RERTLDTSSD  780
EGYSERDIDA ARDADEVTHH CHAKEHSKHQ SESLEYLADN RSFKNRDYED TENSPAQTKK  840
RTRHRSSKGG *
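
The sequence block is laid out in groups of ten residues with a running position counter at the end of each line, so it has to be flattened before domain coordinates can be applied to it. The following is a minimal sketch: `parse_sequence_block` is a hypothetical helper, and only the first four lines of the block (residues 1-240) are included to keep the example short.

```python
def parse_sequence_block(text):
    """Join a block-formatted protein sequence into one string.

    Drops whitespace, the running position counters at line ends,
    and a trailing '*' stop marker.
    """
    residues = []
    for token in text.split():
        if token.isdigit() or token == "*":
            continue
        residues.append(token)
    return "".join(residues)


# First four lines of the sequence block (residues 1-240).
block = """
MGEAETALKE EEGGGERENH QMEKSRKEKR KQMKKMKRKQ VRKEAAEKER EAEEARLNDP  60
EEQMRIQREE EEERKRREIA LKEFEERERV WIEAMEMKRK AQEEEEKEEE EKRKDLKEDA  120
NGEQEEMSDD WEYIEGSPQI IWEGNEITVR KKQVRVPKKD ANQKSKEEDA DRPTSNPLPP  180
QSEAFADYLN ASSAQQVLES VAKEVPNFGT EQDKAHCPFH LKTGACRFGQ RCSRVHFYPD  240
"""

seq = parse_sequence_block(block)
# Signature-domain coordinates are 1-based and inclusive,
# so residues 217-236 correspond to the slice seq[216:236].
first_finger = seq[217 - 1:236]
print(len(seq), first_finger)
```

The slice reproduces the query segment shown in the first zf-CCCH alignment, CPFHLKTGACRFGQRCSRVH.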
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
4yh8_A  1e-30    208          372        9          167      Splicing factor U2AF 23 kDa subunit
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    72     77   ERKRRE
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  JX586054  2e-43    JX586054.1 Gossypium hirsutum clone NBRI_GE19787 microsatellite sequence.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_021298174.1  0.0      LOW QUALITY PROTEIN: zinc finger CCCH domain-containing protein 5
TrEMBL  A0A061G844      0.0      A0A061G844_THECC; Zinc finger C-x8-C-x5-C-x3-H type family protein isoform 3
STRING  EOY25725        0.0      (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G10320.1  1e-164   C3H family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]