PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG036587t4
Common Name TCM_036587
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family NAC
Protein Properties Length: 626 aa    MW: 69777.8 Da    PI: 4.3756
Description NAC family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG036587t4  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    NAM     168    3e-52    19     145  2          128
               NAM   2 ppGfrFhPtdeelvveyLkkkvegkkleleevikevdiykvePwdLp.k.kvkaeekewyfFskrdkkyatgkrknratksgyWkatgkdkev 92 
                       ppGfrFhPtdeelv +yLk+k+ ++kl+l ++i+e+d+yk++P++Lp +  +k+++++w+fFs+rd+ky++g r+nrat++gyWkatgkd+++
  Thecc1EG036587t4  19 PPGFRFHPTDEELVLYYLKRKICRRKLKL-DIIRETDVYKWDPEELPaQsILKSGDRQWFFFSPRDRKYPNGARSNRATRQGYWKATGKDRTI 110
                       8****************************.99**************96346778999************************************ PP

               NAM  93 lskkgelvglkktLvfykgrapkgektdWvmheyrl 128
                       ++ ++++vg+kktLvfy grap+g+++dWvmhey+l
  Thecc1EG036587t4 111 TC-NSRVVGVKKTLVFYGGRAPNGVRSDWVMHEYTL 145
                       **.999****************************98 PP

Protein Features
Database         Entry ID   E-value   Start  End  InterPro ID  Description
SuperFamily      SSF101941  1.01E-59  15     168  IPR003441    NAC domain
PROSITE profile  PS51005    55.372    18     168  IPR003441    NAC domain
Pfam             PF02365    1.6E-27   19     145  IPR003441    NAC domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0070301  Biological Process  cellular response to hydrogen peroxide
GO:0005634  Cellular Component  nucleus
GO:0005789  Cellular Component  endoplasmic reticulum membrane
GO:0016021  Cellular Component  integral component of membrane
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 626 aa
MTVTAAAGDS CLGDDQVWPP GFRFHPTDEE LVLYYLKRKI CRRKLKLDII RETDVYKWDP  60
EELPAQSILK SGDRQWFFFS PRDRKYPNGA RSNRATRQGY WKATGKDRTI TCNSRVVGVK  120
KTLVFYGGRA PNGVRSDWVM HEYTLDEEEL KRCQNMKDYY ALYKVYKKSG PGPKNGEQYG  180
APFKEEDWVD EEYVSNPITV TPVKLPNEAI PDDNVNANVQ VQSALNEIEE FMRQLADEPA  240
LPQPQAQPGH ALPQVVSEEE TQSTLLDPSP RGVIFHEPIG VVLEQASFEF SQSPTSQLHE  300
APEVTSVADH FEQVPQICEE GFLEIDDLIG PETLTSNVGK PAENVQFNEL DGLSEFDLFH  360
DAAMFLQDMG PIDQGAVPFS YTDNMINQVS YQLEPQSNIS LMEQQLQTQS NLNLMDEQLQ  420
LQSNINLMDP QLQPQLNAFG ANQQLQPQLN AFGDNMLNQV DYQLQFQSVG DELDQQIQLD  480
QIHEPLWTHD QSSDVFAPSG SNLGNAAPTS GLIYNGNNQD QGDKNGGGAS MFSSALWSFV  540
ESIPTTPASA SETPLVNRAL ERMSSFSRLR LNARNTAVSA VDGAATARRI GGNRGIFFIS  600
ILGALCAILW FFTGTVRILG RSISS*
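
The sequence block above is laid out in groups of ten residues with a running position counter at the end of each line. The following is a minimal Python sketch for turning such a block back into a plain string and cutting out a region by the record's 1-based coordinates. The helper names `clean_sequence` and `domain_slice` are hypothetical and not part of PlantTFDB, and only the first line of the record is used as sample input. The ten full lines end at counter 600 and the last line adds 25 residues plus `*`, so the stated length of 626 appears to include the terminal stop symbol.

```python
def clean_sequence(block: str) -> str:
    """Keep residue letters and the terminal '*'.

    Spaces, newlines, and the numeric position counters are dropped.
    """
    return "".join(ch for ch in block if ch.isalpha() or ch == "*")


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive, as used in the record)."""
    return seq[start - 1:end]


# First line of the Protein Sequence block, copied from the record.
first_line = "MTVTAAAGDS CLGDDQVWPP GFRFHPTDEE LVLYYLKRKI CRRKLKLDII RETDVYKWDP  60"

seq = clean_sequence(first_line)
print(len(seq))                   # number of residues on the first line
print(domain_slice(seq, 19, 30))  # first residues of the region starting at 19
```

Applied to the full block, `domain_slice(seq, 19, 145)` would give the region reported for the NAM domain. The slice starting at position 19 begins with `PPGFRFHPTDEE`, which agrees with the query line of the domain alignment.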
3D Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
3ulx_A  6e-50   19           145        16         140      Stress-induced transcription factor NAC1
Binding Motif
Motif ID  Method  Source                   Motif file
MP00179   DAP     Transfer from AT1G34190  Download
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007020196.2  0.0      PREDICTED: NAC domain-containing protein 17 isoform X2
TrEMBL  A0A061FK08      0.0      A0A061FK08_THECC; NAC domain protein, IPR003441, putative isoform 4
STRING  EOY17421        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM2812              26           67
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G34190.1  1e-133   NAC domain containing protein 17
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]