PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG007806t1
Common NameTCM_007806
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family GRAS
Protein Properties Length: 446aa    MW: 49737.6 Da    PI: 5.1908
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG007806t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS293.55.4e-90444392374
              GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfs 94 
                       ++lLl+cA a++s+d +laq++++ l++ as+ gdp+qRl++ f++AL ++ +r ++++++    s+ +++ ++     l  ++++ P+++f+
  Thecc1EG007806t1  44 EKLLLHCASALESNDVTLAQQVMWVLNNVASSVGDPNQRLTSWFLKALISKASRVCPTTMNFHGGSTFQRRLMTVI--ELAGYVDLLPWHRFG 134
                       689***************************************************9999999887777764333332..33459********** PP

              GRAS  95 hltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg...skeeleetgerLakfAeelgvpfefnvl... 181
                       + + N+aI +av g  +vHi+Df+i++++QWp+L++aLa+Rpegp slRiT+ +   ++    + ++ee+g+rLa+fA+  +vpfef+v+   
  Thecc1EG007806t1 135 FCASNSAIFKAVRGYPKVHILDFSITHCMQWPTLIDALAKRPEGPSSLRITVPSYRPPVppmLNVSTEEVGHRLANFAKFRDVPFEFHVIddp 227
                       ****************************************************99977778888999*************************** PP

              GRAS 182 .................vakrledleleeLrvkpgEalaVnlvlqlhrlldesvsles....erdevLklvkslsPkvvvvveqeadhnsesF 253
                                         ++ l+ l++++L+++++Eal++n++  l +l de+   +      rd +L+++k+l+P+++vvv++++d++++s+
  Thecc1EG007806t1 228 sfpssgeilskessafqFESLLSHLTPSALDLREDEALVINCQNWLRYLSDERIGNTAhdssLRDVFLDIIKGLNPRIIVVVDEDSDLSAPSL 320
                       *******98888777766667778899999*******************98865443333449****************************** PP

              GRAS 254 lerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvks 346
                         r++ +++y +  fd+le+ lp++s++r  +E   +g++i+n+++ eg++r+er e+ +k +er+++a F +vp+ e++++++k ll +++ 
  Thecc1EG007806t1 321 TSRITTCFNYLWIPFDALETFLPKDSSQRLEYESD-IGQKIENIISFEGFQRIERLESGAKLSERMKNASFFSVPFCEETVTEVKFLLDEHA- 411
                       ***********************************.********************************************************. PP

              GRAS 347 dgyrveeesgslvlgWkdrpLvsvSaWr 374
                        g+ ++ee++ l+l+Wk+++ v+++aW+
  Thecc1EG007806t1 412 SGWGMKEEEDMLILTWKGHKSVFATAWT 439
                       899************************6 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098544.36917420IPR005202Transcription factor GRAS
PfamPF035141.9E-8744439IPR005202Transcription factor GRAS
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
Sequence ? help Back to Top
Protein Sequence    Length: 446 aa     Download sequence    Send to blast
MKTEVRGNGA SISLQNPSLF NTPQSSITGA LRGCLGSLDG ACIEKLLLHC ASALESNDVT  60
LAQQVMWVLN NVASSVGDPN QRLTSWFLKA LISKASRVCP TTMNFHGGST FQRRLMTVIE  120
LAGYVDLLPW HRFGFCASNS AIFKAVRGYP KVHILDFSIT HCMQWPTLID ALAKRPEGPS  180
SLRITVPSYR PPVPPMLNVS TEEVGHRLAN FAKFRDVPFE FHVIDDPSFP SSGEILSKES  240
SAFQFESLLS HLTPSALDLR EDEALVINCQ NWLRYLSDER IGNTAHDSSL RDVFLDIIKG  300
LNPRIIVVVD EDSDLSAPSL TSRITTCFNY LWIPFDALET FLPKDSSQRL EYESDIGQKI  360
ENIISFEGFQ RIERLESGAK LSERMKNASF FSVPFCEETV TEVKFLLDEH ASGWGMKEEE  420
DMLILTWKGH KSVFATAWTS TGLED*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3h_B2e-551544110421Protein SHORT-ROOT
5b3h_E2e-551544110421Protein SHORT-ROOT
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007043377.10.0PREDICTED: scarecrow-like protein 32
TrEMBLA0A061EAD40.0A0A061EAD4_THECC; GRAS family transcription factor
STRINGEOX992080.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM17642910
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G49950.11e-125GRAS family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]