PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG036707t1
Common NameTCM_036707
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family GRAS
Protein Properties Length: 458aa    MW: 52208.6 Da    PI: 6.7327
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG036707t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS312.31.1e-95774551374
              GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkf 93 
                       l++lLl  A+ v++++ + a + L++l + +s  gd++qR++ayf+ +Laarl+ + s  y+ +  ++t+    +e++ a++ ++ vsP+++f
  Thecc1EG036707t1  77 LIHLLLITATSVDENNVNSALENLTELYQSVSLIGDSVQRVVAYFADGLAARLLTQKSPFYDMVMKEPTN----EEQFLAFTCLYRVSPYYQF 165
                       689****************************************************999999988877776....678889999********** PP

              GRAS  94 shltaNqaIleavege.....ervHiiDfdisqGlQWpaLlqaLasRp..egppslRiTgvgspesgskeeleetgerLakfAeel.gvpfef 178
                       +h+taNqaI ea+e+e     +++H+iDfd+ +G+QWp+L+q+L++++  ++  slR+Tg g+    s+eel+et+ rL +fA  + ++ fef
  Thecc1EG036707t1 166 AHFTANQAIIEAFEKEdeinnRALHVIDFDVCYGFQWPSLIQSLSEKAssGNRISLRLTGYGR----SSEELQETETRLVSFAGGCrNLVFEF 254
                       ****************55554555**********************99745556*********....9***************87648***** PP

              GRAS 179 nvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsl 271
                       + l    l   +l +Lr k++E+++Vnlv++l +l  + +++++    +Lk v sl+P +v++veqe +++  sFl rf+e+l+y++a+fdsl
  Thecc1EG036707t1 255 QGL----LRGSKLVNLRKKKNETVVVNLVFHLNTLN-NFLKISD----TLKSVHSLRPSIVILVEQEGSRSPRSFLSRFMESLHYFAAMFDSL 338
                       **8....66667789*****************9996.5566655....********************************************* PP

              GRAS 272 eaklpreseerikvErellgreivnvvacegae..rrerhetlekWrerleeaGFkpvplsekaakqaklllr........kvk..sdgyrve 352
                       +  lp es er ++E+  lg+ei++++ c ++e  ++ r  ++e+W+ r+e+ GF+ +++s+k   qaklll+        +++  s g+rv 
  Thecc1EG036707t1 339 DDCLPLESAERLSIEKNHLGKEIKSIINCDKDEdnKFPRYAKMETWKGRMESHGFEGMKMSSKSLIQAKLLLKirthycplQCEgeSGGFRVF 431
                       *****************************998866889*********************************9744333333333334568887 PP

              GRAS 353 ee..sgslvlgWkdrpLvsvSaWr 374
                       e+   +sl lgW+dr L+++SaW+
  Thecc1EG036707t1 432 ERddGKSLSLGWQDRCLLTASAWQ 455
                       653356777**************6 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098548.48451423IPR005202Transcription factor GRAS
PfamPF035143.7E-9377455IPR005202Transcription factor GRAS
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
Sequence ? help Back to Top
Protein Sequence    Length: 458 aa     Download sequence    Send to blast
MEDTDEDELL NLSLSIVTDP GGERNRKRKT KDISKPFNPS YEGCEGKIFR LLQVREEMLK  60
VDHKRKKMVE DGKGLHLIHL LLITATSVDE NNVNSALENL TELYQSVSLI GDSVQRVVAY  120
FADGLAARLL TQKSPFYDMV MKEPTNEEQF LAFTCLYRVS PYYQFAHFTA NQAIIEAFEK  180
EDEINNRALH VIDFDVCYGF QWPSLIQSLS EKASSGNRIS LRLTGYGRSS EELQETETRL  240
VSFAGGCRNL VFEFQGLLRG SKLVNLRKKK NETVVVNLVF HLNTLNNFLK ISDTLKSVHS  300
LRPSIVILVE QEGSRSPRSF LSRFMESLHY FAAMFDSLDD CLPLESAERL SIEKNHLGKE  360
IKSIINCDKD EDNKFPRYAK METWKGRMES HGFEGMKMSS KSLIQAKLLL KIRTHYCPLQ  420
CEGESGGFRV FERDDGKSLS LGWQDRCLLT ASAWQCV*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A3e-58664548378Protein SCARECROW
5b3h_A3e-58664547377Protein SCARECROW
5b3h_D3e-58664547377Protein SCARECROW
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017980934.10.0PREDICTED: protein SCARECROW
TrEMBLA0A061FLD00.0A0A061FLD0_THECC; GRAS family transcription factor
STRINGEOY174970.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM15761916
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G54220.18e-59GRAS family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]