PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG014549t4
Common Name TCM_014549
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family C2H2
Protein Properties Length: 1197aa    MW: 136159 Da    PI: 8.1287
Description C2H2 family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG014549t4  genome  CGD     View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End   HMM Start  HMM End
1    zf-C2H2  13.3   0.00026  1089   1111  3          23
                        ET..TTTEEESSHHHHHHHHHHT CS
           zf-C2H2    3 Cp..dCgksFsrksnLkrHirtH 23  
                        Cp   Cgk F ++ +L++H r+H
  Thecc1EG014549t4 1089 CPvkGCGKKFFSHKYLVQHRRVH 1111
                        9999*****************99 PP

2    zf-C2H2  12.1   0.0006   1147   1173  1          23
                        EEET..TTTEEESSHHHHHHHHHH..T CS
           zf-C2H2    1 ykCp..dCgksFsrksnLkrHirt..H 23  
                        y+C    Cg++F+  s++ rH r+  H
  Thecc1EG014549t4 1147 YVCAeeGCGQTFRFVSDFSRHKRKtgH 1173
                        89********************99666 PP

Protein Features
Database         Entry ID           E-value  Start  End   InterPro ID  Description
SMART            SM00355            26       1064   1086  IPR015880    Zinc finger, C2H2-like
PROSITE profile  PS50157            12.362   1087   1116  IPR007087    Zinc finger, C2H2
SMART            SM00355            0.0045   1087   1111  IPR015880    Zinc finger, C2H2-like
Gene3D           G3DSA:3.30.160.60  1.6E-5   1088   1115  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
PROSITE pattern  PS00028            0        1089   1111  IPR007087    Zinc finger, C2H2
SuperFamily      SSF57667           4.6E-9   1103   1143  No hit       No description
Gene3D           G3DSA:3.30.160.60  2.7E-8   1116   1141  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
PROSITE profile  PS50157            10.741   1117   1146  IPR007087    Zinc finger, C2H2
SMART            SM00355            0.0014   1117   1141  IPR015880    Zinc finger, C2H2-like
PROSITE pattern  PS00028            0        1119   1141  IPR007087    Zinc finger, C2H2
SuperFamily      SSF57667           4.32E-8  1135   1169  No hit       No description
Gene3D           G3DSA:3.30.160.60  2.3E-9   1142   1170  IPR013087    Zinc finger C2H2-type/integrase DNA-binding domain
PROSITE profile  PS50157            11.24    1147   1178  IPR007087    Zinc finger, C2H2
SMART            SM00355            0.85     1147   1173  IPR015880    Zinc finger, C2H2-like
PROSITE pattern  PS00028            0        1149   1173  IPR007087    Zinc finger, C2H2
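
The three PROSITE pattern (PS00028) hits listed above can be re-derived from the protein sequence with a short script. The following is a minimal sketch, not the pipeline PlantTFDB uses: the regular expression is an assumed transcription of the PS00028 consensus, C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H, and the names SEGMENT, SEGMENT_START and find_c2h2_fingers are hypothetical. The segment is residues 1081-1196 copied from the Protein Sequence section of this record.

```python
import re

# Residues 1081-1196 of Thecc1EG014549t4, copied from the Protein Sequence
# section of this record (the trailing '*' stop marker is dropped).
SEGMENT_START = 1081
SEGMENT = (
    "LLLHKRNICPVKGCGKKFFSHKYLVQHRRVHLDDRPLKCPWKGCKMTFKWAWARTEHIRV"
    "HTGARPYVCAEEGCGQTFRFVSDFSRHKRKTGHSAKKGLGSTKGLEFPHCCHSGNC"
)

# Assumption: PROSITE PS00028 written as a Python regex.
#   C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H
C2H2_PATTERN = re.compile(r"C.{2,4}C.{3}[LIVMFYWC].{8}H.{3,5}H")


def find_c2h2_fingers(segment, first_residue):
    """Return 1-based (start, end) protein coordinates of non-overlapping matches.

    first_residue is the protein coordinate of segment[0].
    """
    return [
        (first_residue + m.start(), first_residue + m.end() - 1)
        for m in C2H2_PATTERN.finditer(segment)
    ]


if __name__ == "__main__":
    for start, end in find_c2h2_fingers(SEGMENT, SEGMENT_START):
        print(start, end)
```

Run on this segment, the sketch reports 1089-1111, 1119-1141 and 1149-1173, the same ranges as the three PROSITE pattern rows in the table.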
Gene Ontology
GO Term     GO Category         GO Description
GO:0009741  Biological Process  response to brassinosteroid
GO:0009826  Biological Process  unidimensional cell growth
GO:0010228  Biological Process  vegetative to reproductive phase transition of meristem
GO:0033169  Biological Process  histone H3-K9 demethylation
GO:0035067  Biological Process  negative regulation of histone acetylation
GO:0048366  Biological Process  leaf development
GO:0005634  Cellular Component  nucleus
GO:0003676  Molecular Function  nucleic acid binding
GO:0046872  Molecular Function  metal ion binding
Sequence
Protein Sequence    Length: 1197 aa
MSRGLCNYKD VVKLSKDLAS DEIMVGGNEE IKGVKGFYSV KGKFASMYEG NRDSAFNGTD  60
HLCRLPLQTL NMSAEGENAV QGDALSDQGL FSCVTCGILC FSCIAVLQPT EQAARYLMSA  120
DCSFFNDWTV GSGVTRDGFT TTHGDVITSE QNSCTRWMNK RAPNALYDVP VQSVEDKFHM  180
ADQSNQVVED TEKGGDTSAL GLLASTYGNS SDSEEDHVEP NVTVSGDETN SANRSLERKF  240
QYNGSGFSPG DANGSNNPSL LRLESEEEAP VHVDIKSTSP QAFDHTVEFE TDNLASRRSI  300
GLEDKFRDPI TTSHANPSYS PATHGAEKMR FSKTMVPMEN ADIPFAPRSD EDSSRMHVFC  360
LEHAVEVDQQ LRQIGGVHVF LLCHPEYPKI EAEAKLVTEE LGIDYPWNDI LFGDATKEDE  420
ERIQSALDSE DAIPGNGDWA VKLGVNLFYS ANLSRSTLYS KQMPYNYVIY SAFGRNSPGS  480
SPTKLNVYGR RSGKQKKVVA GKWCGKVWMS NQVHPFLAQR DPEEQEQERG FHAWATSDEN  540
LERKPENVHK AETTKVAKKF NRKRKMRPEI ASSKKVKCIE TEGAVSDDSL DGGSLRQQQI  600
FFRGKQPRLI QKEEAISYDL LEDDSLLQQR NLSRKKLAKF IEREGAESED AEEEFTHQQH  660
WRNLRGKQGK YIEEDDAVSG DSLDESSLKQ YRRIPRSWQA KFREREDIVS EDELEEISHR  720
LHRRIPRCRQ IKSCEKNDAI SDDSRADNSL KQYRRMPKGR QANFVERDDT MSDDASEDDS  780
QHQLRRIPKG KQMKCMERDD AFSDDSLEDN LQQQHRIPRS KVAKFTDRED VVSFDSLKGS  840
SHQQRRRVSR SQLTKFIERE DAVSSDSPDD SSLQQLRRIP RSKQTKILER EDAVSDDSLD  900
DTSQQQLRKT PRSRQGKFIE REDAVSYDSL DENYHQPNRR TLRSRKKKAQ TPRQIKQETP  960
RNVKQGKRRT TKQVVSQQIK QETPRNRNTK IEQSARQCNS YGEDELEGGP STRLRKRVRK  1020
PLKESETKPK EKKQASKKKV KNASNVKTLA GHNTSKVRDE EAEYQCDMEG CTMSFGLKQE  1080
LLLHKRNICP VKGCGKKFFS HKYLVQHRRV HLDDRPLKCP WKGCKMTFKW AWARTEHIRV  1140
HTGARPYVCA EEGCGQTFRF VSDFSRHKRK TGHSAKKGLG STKGLEFPHC CHSGNC*
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
6a57_A  6e-69   1062         1177       21         136      Lysine-specific demethylase REF6
6a58_A  6e-69   1062         1177       21         136      Lysine-specific demethylase REF6
6a59_A  6e-69   1062         1177       21         136      Lysine-specific demethylase REF6
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1012   1018  RLRKRVR
Binding Motif
Motif ID  Method    Source                   Motif file
MP00608   ChIP-seq  Transfer from AT3G48430  Download
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007037855.2  0.0      PREDICTED: lysine-specific demethylase REF6
TrEMBL  A0A061G5N9      0.0      A0A061G5N9_THECC; Relative of early flowering 6, putative isoform 4
STRING  EOY22356        0.0      (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G48430.1  1e-87    relative of early flowering 6
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]