PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG035571t1
Common NameTCM_035571
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family Trihelix
Protein Properties Length: 570aa    MW: 63889.3 Da    PI: 6.8676
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG035571t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix91.21.1e-2846130187
          trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                       rW++qe laL+++r++m+  +r+++lk+plWeevs+k++e g++rs+k+Ckek+en+ k++k++k+g+ ++   + +t+++fdqlea
  Thecc1EG035571t1  46 RWPRQESLALLKIRSDMDAVFRDSSLKGPLWEEVSRKLAELGYHRSAKKCKEKFENVFKYHKRTKDGRTGK--ADGKTYRFFDQLEA 130
                       8********************************************************************96..66678*******85 PP

2trihelix103.91.1e-32387472187
          trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                       rW+k ev+aLi++r++++ +++++  k+plWee+s+ mr+ g++rs+k+Ckekwen+nk++kk+ke++kkr se+s+tcpyf+ql+a
  Thecc1EG035571t1 387 RWPKAEVQALIRLRTNLNVKYQENGPKAPLWEEISAGMRKLGYSRSAKRCKEKWENINKYFKKVKESSKKR-SEDSKTCPYFHQLDA 472
                       8*********************************************************************8.89999********85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.0343105IPR001005SANT/Myb domain
CDDcd122036.12E-2345110No hitNo description
PfamPF138378.2E-1945131No hitNo description
PROSITE profilePS500907.01545103IPR017877Myb-like domain
SMARTSM007170.0018384446IPR001005SANT/Myb domain
PROSITE profilePS500907.468386444IPR017877Myb-like domain
CDDcd122038.77E-26386451No hitNo description
PfamPF138373.1E-22387473No hitNo description
Gene3DG3DSA:1.10.10.607.1E-4387443IPR009057Homeodomain-like
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 570 aa     Download sequence    Send to blast
MAESSADASA APGIHEGSEI GMVGSNSGEE DKGRVDEGDR SFGGNRWPRQ ESLALLKIRS  60
DMDAVFRDSS LKGPLWEEVS RKLAELGYHR SAKKCKEKFE NVFKYHKRTK DGRTGKADGK  120
TYRFFDQLEA LENLHSLQSQ SPPKPQTPTP TSAAMPWTNP PTASNIHVPS TTINPTNVPQ  180
TNATPSINPT ISTQAVPIHS IGPYSNSIPS SFHNISSNLF STSTSSSTAS DDDSDQGSSK  240
KKRKWKEFFW RLTKEVIEKQ EELQNKFLRT IEKCEQERTA REEAWRIQEM ARINREHEIL  300
VQERSTAAAK DAAVIAFLQK ILGQQPNTVQ VQPQENPQPT PPPPTAPLSL PPPLHQPQPQ  360
PPTPALNFDT SKMTNGAYNV VLSSPSRWPK AEVQALIRLR TNLNVKYQEN GPKAPLWEEI  420
SAGMRKLGYS RSAKRCKEKW ENINKYFKKV KESSKKRSED SKTCPYFHQL DAIYKEKISK  480
NENSVGSSGY GVKPESKMVP LMVQPEQQWP PQQQEISQQA EAMMEEAERE NVDQIQEDEE  540
DIGESEGEEY ERNAFELVAN KTAPIGTAE*
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJQ0130922e-54JQ013092.1 Gossypium hirsutum trihelix transcription factor (GT7) mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017985286.10.0PREDICTED: trihelix transcription factor GT-2
SwissprotQ391171e-146TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLA0A061FJ930.0A0A061FJ93_THECC; Duplicated homeodomain-like superfamily protein, putative
STRINGEOY167120.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM48492553
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.21e-117Trihelix family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]