PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG005010t1
Common NameTCM_005010
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family NAC
Protein Properties Length: 498aa    MW: 56569.2 Da    PI: 5.2405
Description NAC family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG005010t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1NAM1202.2e-37111313127
               NAM   3 pGfrFhPtdeelvveyLkkkvegkkleleevikev.diykvePwdLp..kkvkaeekewyfFskrdkkyatgkrknratksgyWkatgkdkev 92 
                       +G+rFhPtd+elv++yL +k+ +++  + ++ikev ++++ +Pw+Lp  +k+k+ ++ wyfFs+r+ +    kr +r+t++g+Wk tgk+++v
  Thecc1EG005010t1  11 VGYRFHPTDKELVDHYLWNKILDRDSLV-HAIKEVdGLCRKDPWELPrlSKIKSADQVWYFFSRRKDN----KRVKRTTDNGFWKVTGKTRDV 98 
                       7***********************9777.88****66**********5449999999******99775....59******************* PP

               NAM  93 lskkgelvglkktLvfykgrapkgektdWvmheyr 127
                       +   ++   +kktLvf++gr p+++ t Wvmhey 
  Thecc1EG005010t1  99 KG--KRGSAIKKTLVFFQGRGPNAKWTPWVMHEYI 131
                       98..555799************************6 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5100540.0689151IPR003441NAC domain
SuperFamilySSF1019413.4E-3910151IPR003441NAC domain
PfamPF023651.9E-2211130IPR003441NAC domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 498 aa     Download sequence    Send to blast
MGSISSNSDI VGYRFHPTDK ELVDHYLWNK ILDRDSLVHA IKEVDGLCRK DPWELPRLSK  60
IKSADQVWYF FSRRKDNKRV KRTTDNGFWK VTGKTRDVKG KRGSAIKKTL VFFQGRGPNA  120
KWTPWVMHEY IFTSTVLDNK EGIFLCKLKN KEDEKADTSR SEVCEPSQVA DDGIPENSAM  180
FDPDVMLATL EEPDGREEAD NNLSPSPQPM MREEQVPSCV DSAYLYEFSG GHCGFQHLFN  240
SNEQSDDSWT KYLVDSDEVY RDENEGCSLP SVYMRMTCPG ECSRKRSRFE NGGPCGAIEN  300
EECQATYEQV VSASSMLDEH SGSKKFQAMA MVNAPNETLT SVEHDPHGRE RMLAIHNESR  360
EMDAPSGDSA VDLHCIFHVA LAESVDYSYA RFDNPQYQER SQNLIEQEDD PIRITKQGKF  420
SVKTVSLDKA RDVKQHVPAD NLPQMENAVM ESNMKFKKAK ETNKKESLQM HWSESAESDE  480
RGKLRIKDLV LPPNLAG*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
3ulx_A2e-221215418171Stress-induced transcription factor NAC1
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007051378.20.0PREDICTED: NAC domain-containing protein 4 isoform X2
TrEMBLA0A061DS330.0A0A061DS33_THECC; Uncharacterized protein isoform 1
STRINGEOX955350.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM2081746
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G01550.17e-41NAC domain containing protein 69
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]