PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG032181t3
Common Name TCM_032181
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family STAT
Protein Properties Length: 720 aa    MW: 80990.6 Da    pI: 5.3694
Description STAT family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG032181t3 genome CGD View CDS
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1 STAT 180 3.3e-56 262 383 1 122
              STAT   1 ldvvllnalgqpvekdvevvasLlyadsglvveksddaeapLLisydGvefssedrplkllrGrasfklkisqLsskcdnrLfrikfeipklk 93 
                       ldvvllna+gqpv+k++evvasLlya + ++vek++d+eapLL+sydG+ef+s+drp+kll+Grasfklkis+Lssk++nr f+ikf i k++
  Thecc1EG032181t3 262 LDVVLLNAFGQPVNKELEVVASLLYAHNRSPVEKTNDEEAPLLASYDGIEFASSDRPSKLLNGRASFKLKISKLSSKSENRQFCIKFGISKFE 354
                       8******************************************************************************************** PP

              STAT  94 kypfleavskpirCisrsrntrsssltkk 122
                        y+fle +s +irC+sr+r+ r+s+ ++k
  Thecc1EG032181t3 355 GYRFLEDFSPSIRCVSRNRTPRTSTIIWK 383
                       ***********************999886 PP

Protein Features
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3D G3DSA:3.30.505.10 7.5E-6 641 708 IPR000980 SH2 domain
SuperFamily SSF55550 9.27E-8 642 712 IPR000980 SH2 domain
PROSITE profile PS50001 9.678 648 720 IPR000980 SH2 domain
Sequence
Protein Sequence    Length: 720 aa
PTSIIEQPVR QNNCLPLLLL LYKETEKFSG FFFVQRKGKT MGENNQIIEE KDYILLKDFK  60
VEIEVEEGKG FILCFWVYMF NPNAFPATIL KQVYSETNSS APLLVLNEKT LMLLPLTCLH  120
NEVPDPGNTA LSTEVLKVST QIEYPQYKWI HVAYEVSTDF VRLHINAEIA GELQLSSLLN  180
KVSMPNDLRK TTVVGITGGN DLQGYIHDAK VLPSTLSIKN QYVQNPPLQL SIDESSASDI  240
EEDNGFWNIV GGKASCRRIF SLDVVLLNAF GQPVNKELEV VASLLYAHNR SPVEKTNDEE  300
APLLASYDGI EFASSDRPSK LLNGRASFKL KISKLSSKSE NRQFCIKFGI SKFEGYRFLE  360
DFSPSIRCVS RNRTPRTSTI IWKKTTAVHP LNGSQSFGLD DASLEPRHNT VDEAKLSPTS  420
KRVRSGEAKI STIDQLGEEC NSLAWTANQV ENGYGSSMEA RPENFEEVDN SLSDSESTGA  480
RDSALKSVSN TAHSVSDLTI FRYCLGGLTD RSLLLKEIAT NASDEEISGF ANQVSLYSGC  540
SHHRHQIKIT KRLIEEGTKA WNLLSQNNIQ VQWESAVFEI EEQFMKIAHC STRSLTQQDF  600
ELLRKIAGCR DYMAQENFEK MWCWLYPVAF TLSSDWINAM WNCTSPKWIE GFITKEEAEL  660
SLQGPRGLQE PGTFILRFPT SRSWPHPDAG SLIVTYVGSD YTLHHRLLSL DNPWSAGNEC
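The sequence block above is laid out in groups of 10 residues with a running position count, so it needs light cleaning before it can be used programmatically. The following is a minimal sketch, not part of the PlantTFDB record, that strips the spacing and counters and then extracts the STAT signature domain using the 1-based, inclusive coordinates 262 to 383 reported in the alignment. The helper names `parse_sequence_block` and `slice_region` are hypothetical and chosen only for illustration.

```python
import re

# Sequence block copied from the record above: groups of 10 residues,
# with a running position count at the end of most lines.
RAW_BLOCK = """
PTSIIEQPVR QNNCLPLLLL LYKETEKFSG FFFVQRKGKT MGENNQIIEE KDYILLKDFK  60
VEIEVEEGKG FILCFWVYMF NPNAFPATIL KQVYSETNSS APLLVLNEKT LMLLPLTCLH  120
NEVPDPGNTA LSTEVLKVST QIEYPQYKWI HVAYEVSTDF VRLHINAEIA GELQLSSLLN  180
KVSMPNDLRK TTVVGITGGN DLQGYIHDAK VLPSTLSIKN QYVQNPPLQL SIDESSASDI  240
EEDNGFWNIV GGKASCRRIF SLDVVLLNAF GQPVNKELEV VASLLYAHNR SPVEKTNDEE  300
APLLASYDGI EFASSDRPSK LLNGRASFKL KISKLSSKSE NRQFCIKFGI SKFEGYRFLE  360
DFSPSIRCVS RNRTPRTSTI IWKKTTAVHP LNGSQSFGLD DASLEPRHNT VDEAKLSPTS  420
KRVRSGEAKI STIDQLGEEC NSLAWTANQV ENGYGSSMEA RPENFEEVDN SLSDSESTGA  480
RDSALKSVSN TAHSVSDLTI FRYCLGGLTD RSLLLKEIAT NASDEEISGF ANQVSLYSGC  540
SHHRHQIKIT KRLIEEGTKA WNLLSQNNIQ VQWESAVFEI EEQFMKIAHC STRSLTQQDF  600
ELLRKIAGCR DYMAQENFEK MWCWLYPVAF TLSSDWINAM WNCTSPKWIE GFITKEEAEL  660
SLQGPRGLQE PGTFILRFPT SRSWPHPDAG SLIVTYVGSD YTLHHRLLSL DNPWSAGNEC
"""


def parse_sequence_block(block: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_region(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"region {start}-{end} is outside 1-{len(seq)}")
    return seq[start - 1:end]


sequence = parse_sequence_block(RAW_BLOCK)

# Coordinates taken from the Signature Domain alignment (262 to 383).
stat_domain = slice_region(sequence, 262, 383)

print(len(sequence))                        # 720
print(len(stat_domain))                     # 122
print(stat_domain[:10], stat_domain[-7:])   # LDVVLLNAFG TSTIIWK
```

The parsed length matches the 720 aa stated in the record, and the slice begins and ends with the same residues shown in the alignment rows, which is a quick consistency check on the coordinates.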
Regulation -- PlantRegMap
Source Upstream Regulator Target Gene
PlantRegMap Retrieve -
Annotation -- Protein
Source Hit ID E-value Description
Refseq XP_017980338.1 0.0 PREDICTED: uncharacterized protein LOC18594437 isoform X3
Swissprot Q56XZ1 0.0 SHB_ARATH; SH2 domain-containing protein B
TrEMBL A0A061F973 0.0 A0A061F973_THECC; SH2 domain protein A, putative isoform 3 (Fragment)
STRING EOY13581 0.0 (Theobroma cacao)
Orthologous Group
Lineage Orthologous Group ID Taxa Number Gene Number
Malvids OGEM5205 26 45
Best hit in Arabidopsis thaliana
Hit ID E-value Description
AT1G78540.1 0.0 SH2 domain protein B
Publications
  1. Yamada Y, Wang HY, Fukuzawa M, Barton GJ, Williams JG
    A new family of transcription factors.
    Development, 2008. 135(18): p. 3093-101
    [PMID:18701541]
  2. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]