PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG035217t1
Common Name TCM_035217
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family C3H
Protein Properties Length: 2111 aa    MW: 232535 Da    pI: 9.2214
Description C3H family protein
Gene Model
Gene Model ID    Type    Source    Coding Sequence
Thecc1EG035217t1    genome    CGD    View CDS
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1    zf-CCCH    17.4    7.7e-06    1987    2008    6    27
                        -SGGGGTS--TTTTT-SS-SSS CS
           zf-CCCH    6 CrffartGtCkyGdrCkFaHgp 27  
                        C++f +tG C+ G++Ck +H++
  Thecc1EG035217t1 1987 CPNFEATGSCPQGSKCKLHHPK 2008
                        *****************99985 PP
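
The aligned query segment above can be checked against the C-x8-C-x5-C-x3-H spacing named in this entry's TrEMBL annotation. The following is a minimal Python sketch under stated assumptions, not the database's method: the zf-CCCH hit in the table is an HMM match, whereas this uses a plain regular expression. The function name `find_ccch` and the pattern are illustrative and do not come from PlantTFDB.

```python
import re

# Hypothetical pattern for the C-x8-C-x5-C-x3-H spacing. This is an
# illustrative assumption; the reported zf-CCCH hit is an HMM match.
CCCH_PATTERN = re.compile(r"C.{8}C.{5}C.{3}H")


def find_ccch(segment, offset=1):
    """Return (start, end, matched_text) tuples in 1-based coordinates.

    `offset` is the protein position of the first residue in `segment`.
    """
    hits = []
    for m in CCCH_PATTERN.finditer(segment):
        start = offset + m.start()
        end = offset + m.end() - 1  # inclusive end position
        hits.append((start, end, m.group()))
    return hits


# Query segment copied from the alignment, residues 1987-2008.
segment = "CPNFEATGSCPQGSKCKLHHPK"
print(find_ccch(segment, offset=1987))
# → [(1987, 2006, 'CPNFEATGSCPQGSKCKLHH')]
```

The match covers the four zinc-coordinating residues (three Cys and one His) and ends two residues before the reported domain end, which is expected because the HMM envelope extends past the final His.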

Protein Features
Database    Entry ID    E-value    Start    End    InterPro ID    Description
PROSITE profile    PS50103    13.159    1878    1906    IPR000571    Zinc finger, CCCH-type
SMART    SM00356    0.015    1879    1905    IPR000571    Zinc finger, CCCH-type
SMART    SM00356    3.8    1906    1930    IPR000571    Zinc finger, CCCH-type
PROSITE profile    PS50103    6.869    1910    1931    IPR000571    Zinc finger, CCCH-type
SMART    SM00356    9.6E-4    1932    1957    IPR000571    Zinc finger, CCCH-type
PROSITE profile    PS50103    12.493    1932    1958    IPR000571    Zinc finger, CCCH-type
PROSITE profile    PS50103    6.69    1959    1980    IPR000571    Zinc finger, CCCH-type
SMART    SM00356    1.1    1959    1985    IPR000571    Zinc finger, CCCH-type
PROSITE profile    PS50103    12.361    1981    2009    IPR000571    Zinc finger, CCCH-type
SMART    SM00356    0.2    1986    2008    IPR000571    Zinc finger, CCCH-type
Gene Ontology
GO Term    GO Category    GO Description
GO:0043484    Biological Process    regulation of RNA splicing
GO:0060149    Biological Process    negative regulation of posttranscriptional gene silencing
GO:0016607    Cellular Component    nuclear speck
GO:0046872    Molecular Function    metal ion binding
Sequence
Protein Sequence    Length: 2111 aa
MDPSSYVNHH HHHHHQQNNH TRYAPLSPPL HLLSPTPPLP PHAPPYQQSP NNLYPSRYHV  60
PPPQPPPQLA PPPPPPQQPQ QQPYHPLPAP PPSVQRQQHP NHHPYNPQHP QYTFNPNFNS  120
NPKPNNVSHQ FHDFPQRRVP EFDTRPEYWP DNRASRPHSV SSLDREARYH QFDRRPASLA  180
VDRYRHDVEG SSRYRALELN LRERPELGRV HSDNWIPDRA SRDFGIVSMG FESNSNNSGF  240
CHKVAENVRW GSRLRDQLID NGNNEINERD EMRVFSRKID YYQESELERF SDRGSSREDS  300
HEFNRAPRKQ IQKKSALLRI QKAQQNHRNR EDERSHYMGY NNEGKTGSFR GKDLVLHSDH  360
GLEERERKVS PVELDVSFKS NSLVAKAIVT PSSSSPVSDL NVKPRTSKIR KVMIFDKANE  420
SRAKLDVSTS VLNSGSGSED SKQSAGKVKS CGIGNVHDGV TKPSSKRTNV SLRNGKFERS  480
CKVTVKLDDL TCVTEETPIT DKKPESLEGK STIPCIGNMR DGGLQTCSDR ANVSVRENKV  540
EGTLKSTLSD KSGASVGKPS SLKATKKKKI VRKVEKKVTN SPLNLANSKS PKSCDQPVKA  600
DTSTYCLSAI SVADKSVTPP KMKIASAAGC SAGAVGLECC PKESALLLEY EKVTGASKDV  660
VSKEVGTDVD PGSSVAPKIK RKRNSSTLPL RSSGHEESKV DQRFVNSDNS VFGLRIVSNI  720
KEDRTETLNE SITSRAFSVE DIDKQFYHSE SNNDGLLRSE DINVHEDIVD IGSSSVAMHG  780
TPGFECGSSN TQEINIACDI GNVNSGSKQA CATAGNPVVE DGTTGRLPEA NCSAGSNKMP  840
HLPCSEETQI NSGSIYADCS NHNRSTIHTP DIGYVNSGES NCEIGDDFVK HLVSSTLSLG  900
NSGAERIPNA AESPECTAGS ADVLTSANCL DTTIITSGVS APPEVMVSDF GWLDTFREVS  960
ASADGKSPEN KKRKISTSSS DVTASVINEG VAVSNISKSA VQLPSNFTDD QLQLEQAVKV  1020
SSIDGLHKEG IDLLLVNSSV VGPSQSVGFF RDAYKINHPR IDPCSAFIES VAPSSPCLHL  1080
LKLGGDQLST ATQVSAQNNH QIVAMDIEGD DRGKVHVGTA EEQKFISSEV SQCRITPEHM  1140
SSSLDQRLPS TDVEDDNHIP LKDDLPSALI SLVFGVDANE VSATNSNDEV MPAPDIVSDV  1200
GSPYNHDNFV ISASTCKAPL CQQSEKQAFG DEKFSDDKPM AEGAGNVSAL VSYSQHSRTI  1260
LKSNDAIQTN QSVAGKEVLL PSHDSKNTNS PNSISGATRR RKNPLSHVVP KSYPTRSSFV  1320
FSASKNTTPS TNITKPRTWH RTNNSSASPL SGNKPSSSAN PLQRQMPKKA AFFQSPSYIR  1380
KGNSLVRKPV AVPALPQGSH SLSSSVYRMN PGVVDEVKKG TGPNSRVGAV DLRTGGANAS  1440
FERPTTPPLS SVSKVPNCTS NSPGECTSSP LAEPSISDCC ETAINHASSM EINDVLNSPE  1500
DGLKTFETLN QNGSVNNLEE CTEQSESNLV PSNAKRLTYV KPKSNQLVAT SECGRTSILN  1560
ADKNQNFSAP SDGYYKKSKN QLIRTALESH IKQAVTMSDN KTNSVGQVAA KVMPSRTVGK  1620
RQSNKVVGKT HKPSKFSLVW TLHSARLSKN DGNSLRRPKV LPQLFPWKRM TYWRSFKLNS  1680
VSSCNSSLST ISRKMLLSRK RNTVYTRSIN GFSIRKSKVF SVGGSSLKWS KSIERNSRKA  1740
NEEATLAVAE AERKKREQKG TVSRTGKRSY SCHKVVHGTE LRPGERIFRI GSLRYKMDSS  1800
RHSLQRISDD ESSCSSDHLS ENSTKKTYVP RRLVIGNDEY VRIGNGNQLV RDPKKRTRVL  1860
ASEKVRWSLH TARLRLVKKR KYCQFFTRFG KCNKDDGKCP YIHDPSKIAV CTKFLKGLCS  1920
NPNCKLTHKV IPERMPDCSY FLQGLCTNEN CPYRHVHVNP NASTCEGFLR GYCADGNECR  1980
KKHSYVCPNF EATGSCPQGS KCKLHHPKKQ SKGKKSKRSI KHNNARGRYF GIDMLVPKRM  2040
VPESHRALDD DDVFFDGKFS DYIRLDVRDD DAGEIHQVMN DQMTFGDNDS SDLRLDDLDE  2100
LIKPIRIMNR *
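
The sequence block above is grouped in tens with a running position counter at the end of each line, so it needs flattening before it can be passed to other tools. The following is a minimal Python sketch; the function name `parse_sequence_block` is hypothetical, and treating the trailing `*` as a terminal marker to be dropped is an assumption about this record's layout.

```python
def parse_sequence_block(text):
    """Join a numbered, space-grouped sequence block into one string."""
    residues = []
    for line in text.splitlines():
        for token in line.split():
            # Skip the running position counter at the end of each line.
            if token.isdigit():
                continue
            # Assumption: a '*' marks the end of the protein and is dropped.
            residues.append(token.rstrip("*"))
    return "".join(residues)


# Only the first and last lines of the record, used to show the input format;
# they are not contiguous in the real protein.
block = """MDPSSYVNHH HHHHHQQNNH TRYAPLSPPL HLLSPTPPLP PHAPPYQQSP NNLYPSRYHV  60
LIKPIRIMNR *"""

seq = parse_sequence_block(block)
print(len(seq), seq[:10])
# → 70 MDPSSYVNHH
```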
3D Structure
Structure
PDB ID    Evalue    Query Start    Query End    Hit Start    Hit End    Description
6fbs_C    5e-20    1883    2008    41    168    Cleavage and polyadenylation specificity factor subunit 4
6fuw_C    5e-20    1883    2008    41    168    Cleavage and polyadenylation specificity factor subunit 4
Regulation -- PlantRegMap
Source    Upstream Regulator    Target Gene
PlantRegMap    Retrieve    -
Annotation -- Protein
Source    Hit ID    E-value    Description
Refseq    XP_007019226.2    0.0    PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein At1g21580
TrEMBL    A0A061FH95    0.0    A0A061FH95_THECC; Zinc finger C-x8-C-x5-C-x3-H type family protein, putative isoform 1
STRING    EOY16451    0.0    (Theobroma cacao)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
Malvids    OGEM5780    10    12
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]