PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG025114t1
Common Name TCM_025114
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family AP2
Protein Properties Length: 730 aa    MW: 79297.4 Da    pI: 6.1473
Description AP2 family protein
Gene Model
Gene Model ID     Type    Source
Thecc1EG025114t1  genome  CGD
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    AP2     35     3.5e-11  302    358  1          55
               AP2   1 sgykGVrwdkkrgrWvAeIrd.pseng..kr.krfslgkfgtaeeAakaaiaarkkleg 55 
                       s y+GV++++++gr++A+++d  +     ++ k  + g ++ +e+Aa+a++ a++k++g
  Thecc1EG025114t1 302 SIYRGVTRHRWTGRYEAHLWDnSCR-RegQTrKGRQ-GGYDKEEKAARAYDLAALKYWG 358
                       57*******************4444.2455535555.779999**************98 PP

2    AP2     46.6   8.4e-15  401    452  1          55
               AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                       s y+GV+++++ grW A+I  +     +k  +lg+f t eeAa+a++ a+ k++g
  Thecc1EG025114t1 401 SIYRGVTRHHQHGRWQARIGRVAG---NKDLYLGTFSTQEEAAEAYDIAAIKFRG 452
                       57****************988532...5************************998 PP

Protein Features
Database         Entry ID           E-value   Start  End  InterPro ID  Description
SuperFamily      SSF54171           1.05E-14  302    368  IPR016177    DNA-binding domain
CDD              cd00018            1.58E-19  302    368  No hit       No description
Pfam             PF00847            1.2E-8    302    358  IPR001471    AP2/ERF domain
SMART            SM00380            1.1E-22   303    372  IPR001471    AP2/ERF domain
Gene3D           G3DSA:3.30.730.10  8.8E-12   303    367  IPR001471    AP2/ERF domain
PROSITE profile  PS51032            17.319    303    366  IPR001471    AP2/ERF domain
PRINTS           PR00367            4.0E-6    304    315  IPR001471    AP2/ERF domain
CDD              cd00018            7.31E-25  401    462  No hit       No description
SuperFamily      SSF54171           1.18E-17  401    462  IPR016177    DNA-binding domain
Pfam             PF00847            1.0E-9    401    452  IPR001471    AP2/ERF domain
PROSITE profile  PS51032            19.045    402    460  IPR001471    AP2/ERF domain
SMART            SM00380            1.5E-33   402    466  IPR001471    AP2/ERF domain
Gene3D           G3DSA:3.30.730.10  5.4E-18   402    461  IPR001471    AP2/ERF domain
PRINTS           PR00367            4.0E-6    442    462  IPR001471    AP2/ERF domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
Sequence
Protein Sequence    Length: 730 aa
MASMNNWLAF SLSPQELPSQ TVDQDHHSQT AVSRLGFNSD DISGADVSGE CFDLTSDSSA  60
PSLNLPPPFG ILEAFNRNNQ SQDWNMKGLG MNSDGNYKTS SELSMLMGSS CNGQSLDQSN  120
QEPKLENFLG NHSFSNHQQN KLHGCNTMYN TTTGEYMFPN CSLQLPSEDT TNARTSNGGD  180
DNDNNNNKNN NNNTNINTGN GSSSIGLSMI KTWLRNQPAP PQPEAKNNGG ASQSLSLSMS  240
TGSQTGSPLP LLTSSTGGGS GGESSSSDNN KQQKTPTGMD SESGAIEAMP RKSIDTFGQR  300
TSIYRGVTRH RWTGRYEAHL WDNSCRREGQ TRKGRQGGYD KEEKAARAYD LAALKYWGTT  360
TTTNFPISNY EKELEEMKHM TRQEYVASLR RKSSGFSRGA SIYRGVTRHH QHGRWQARIG  420
RVAGNKDLYL GTFSTQEEAA EAYDIAAIKF RGLNAVTNFD MSRYDVKSIL ESSTLPIGGA  480
AKRLKDVEQA EMALDVQRVD DDNMSSQLTD GINNYGAAHH GWPTIAFQQA QPFSMHYPYG  540
QRVWCKQEQD SDANHTFQDL HQLQLGSTHN FFQPSVLHNL MAMDSSSMEH SSGSNSVIYC  600
NGGGGDAAGS NGASGSYQAV GYGGNGGYVI PMGTVVASDS NQNQGNGFGD NEVKTLGYET  660
MYGSADPYHP RNLYYLSQQS STGGVKASSY DQASACNNWV PTAVPTIAQR SSNMAVCHGA  720
PTFTVWNDT*
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_021281898.1  0.0      AP2-like ethylene-responsive transcription factor BBM
TrEMBL  A0A061EZB2      0.0      A0A061EZB2_THECC; AP2 domain-containing transcription factor
STRING  EOY09722        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM8655              27           37
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G17430.1  1e-129   AP2 family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]