PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG032181t4
Common Name TCM_032181
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family STAT
Protein Properties Length: 582 aa    MW: 65047.4 Da    PI: 5.9791
Description STAT family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG032181t4  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    STAT    180.5  2.4e-56  79     200  1          122
              STAT   1 ldvvllnalgqpvekdvevvasLlyadsglvveksddaeapLLisydGvefssedrplkllrGrasfklkisqLsskcdnrLfrikfeipklk 93 
                       ldvvllna+gqpv+k++evvasLlya + ++vek++d+eapLL+sydG+ef+s+drp+kll+Grasfklkis+Lssk++nr f+ikf i k++
  Thecc1EG032181t4  79 LDVVLLNAFGQPVNKELEVVASLLYAHNRSPVEKTNDEEAPLLASYDGIEFASSDRPSKLLNGRASFKLKISKLSSKSENRQFCIKFGISKFE 171
                       8******************************************************************************************** PP

              STAT  94 kypfleavskpirCisrsrntrsssltkk 122
                        y+fle +s +irC+sr+r+ r+s+ ++k
  Thecc1EG032181t4 172 GYRFLEDFSPSIRCVSRNRTPRTSTIIWK 200
                       ***********************999886 PP

Protein Features
3D Structure
Database         Entry ID            E-value   Start  End  InterPro ID  Description
CDD              cd10338             6.09E-57  454    560  No hit       No description
Gene3D           G3DSA:3.30.505.10   4.8E-6    458    526  IPR000980    SH2 domain
SuperFamily      SSF55550            6.01E-8   460    526  IPR000980    SH2 domain
PROSITE profile  PS50001             9.589     465    549  IPR000980    SH2 domain
Sequence
Protein Sequence    Length: 582 aa
MPNDLRKTTV VGITGGNDLQ GYIHDAKVLP STLSIKNQYV QNPPLQLSID ESSASDIEED  60
NGFWNIVGGK ASCRRIFSLD VVLLNAFGQP VNKELEVVAS LLYAHNRSPV EKTNDEEAPL  120
LASYDGIEFA SSDRPSKLLN GRASFKLKIS KLSSKSENRQ FCIKFGISKF EGYRFLEDFS  180
PSIRCVSRNR TPRTSTIIWK KTTAVHPLNG SQSFGLDDAS LEPRHNTVDE AKLSPTSKRV  240
RSGEAKISTI DQLGEECNSL AWTANQVENG YGSSMEARPE NFEEVDNSLS DSESTGARDS  300
ALKSVSNTAH SVSDLTIFRY CLGGLTDRSL LLKEIATNAS DEEISGFANQ VSLYSGCSHH  360
RHQIKITKRL IEEGTKAWNL LSQNNIQVQW ESAVFEIEEQ FMKIAHCSTR SLTQQDFELL  420
RKIAGCRDYM AQENFEKMWC WLYPVAFTLS SDWINAMWNC TSPKWIEGFI TKEEAELSLQ  480
GPRGLQEPGT FILRFPTSRS WPHPDAGSLI VTYVGSDYTL HHRLLSLDNV CSPGVREMNA  540
KVKPLQDMLL AEPELSRLGR CQIGEEKLFL TSIFLVRLIT C*
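The Start and End values in the Signature Domain table (79 and 200) are 1-based, inclusive positions in the protein sequence above. As a minimal sketch of how to check that, the Python below strips the spacing and position counters from the first four sequence rows and slices out that region. The helper names `clean_sequence` and `slice_region` are illustrative only and are not part of PlantTFDB.

```python
import re

# First four rows of the Protein Sequence block above (residues 1-240),
# copied with their spacing and trailing position counters.
RAW_ROWS = """
MPNDLRKTTV VGITGGNDLQ GYIHDAKVLP STLSIKNQYV QNPPLQLSID ESSASDIEED  60
NGFWNIVGGK ASCRRIFSLD VVLLNAFGQP VNKELEVVAS LLYAHNRSPV EKTNDEEAPL  120
LASYDGIEFA SSDRPSKLLN GRASFKLKIS KLSSKSENRQ FCIKFGISKF EGYRFLEDFS  180
PSIRCVSRNR TPRTSTIIWK KTTAVHPLNG SQSFGLDDAS LEPRHNTVDE AKLSPTSKRV  240
"""


def clean_sequence(raw: str) -> str:
    """Drop whitespace, position counters, and any '*' stop marker."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def slice_region(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


sequence = clean_sequence(RAW_ROWS)
stat_domain = slice_region(sequence, 79, 200)

# Length, first 12 residues, and last 5 residues of the sliced region.
print(len(stat_domain), stat_domain[:12], stat_domain[-5:])
# prints: 122 LDVVLLNAFGQP TIIWK
```

The slice begins with LDVVLLNAFGQP and ends with TIIWK, matching the query line of the alignment shown under Signature Domain, and its length of 122 equals the HMM span (1 to 122).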
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_007022054.2  0.0      PREDICTED: uncharacterized protein LOC18594437 isoform X2
Swissprot  Q56XZ1          0.0      SHB_ARATH; SH2 domain-containing protein B
TrEMBL     A0A061FA40      0.0      A0A061FA40_THECC; SH2 domain protein A, putative isoform 4
STRING     EOY13581        0.0      (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G78540.1  0.0      SH2 domain protein B
Publications
  1. Yamada Y, Wang HY, Fukuzawa M, Barton GJ, Williams JG
    A new family of transcription factors.
    Development, 2008. 135(18): p. 3093-101
    [PMID:18701541]
  2. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]