PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG032181t1
Common Name TCM_032181
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family STAT
Protein Properties Length: 709 aa    MW: 79506 Da    pI: 5.4548
Description STAT family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG032181t1  genome  CGD     View CDS
Signature Domain
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    STAT    180    3.2e-56  222    343  1          122
              STAT   1 ldvvllnalgqpvekdvevvasLlyadsglvveksddaeapLLisydGvefssedrplkllrGrasfklkisqLsskcdnrLfrikfeipklk 93 
                       ldvvllna+gqpv+k++evvasLlya + ++vek++d+eapLL+sydG+ef+s+drp+kll+Grasfklkis+Lssk++nr f+ikf i k++
  Thecc1EG032181t1 222 LDVVLLNAFGQPVNKELEVVASLLYAHNRSPVEKTNDEEAPLLASYDGIEFASSDRPSKLLNGRASFKLKISKLSSKSENRQFCIKFGISKFE 314
                       8******************************************************************************************** PP

              STAT  94 kypfleavskpirCisrsrntrsssltkk 122
                        y+fle +s +irC+sr+r+ r+s+ ++k
  Thecc1EG032181t1 315 GYRFLEDFSPSIRCVSRNRTPRTSTIIWK 343
                       ***********************999886 PP
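
The middle line of each alignment block repeats a letter where the HMM consensus and the query residue agree and uses `+` for a conservative substitution, as is usual for HMMER-style output. A minimal sketch of counting the identical columns follows, using the second block above as input. The function name `count_identities` is hypothetical and is not part of PlantTFDB or HMMER.

```python
def count_identities(hmm: str, query: str) -> int:
    """Count aligned columns where consensus and query share the same letter.

    Comparison is case-insensitive; gap characters ('-' or '.') never match.
    """
    if len(hmm) != len(query):
        raise ValueError("aligned strings must have equal length")
    gaps = {"-", "."}
    return sum(
        1
        for h, q in zip(hmm, query)
        if h not in gaps and q not in gaps and h.upper() == q.upper()
    )


# Second alignment block of the record: HMM columns 94-122, residues 315-343.
hmm_block = "kypfleavskpirCisrsrntrsssltkk"
query_block = "GYRFLEDFSPSIRCVSRNRTPRTSTIIWK"

identical = count_identities(hmm_block, query_block)
print(f"{identical} identical columns out of {len(hmm_block)}")
```

The count should equal the number of letters on the match line of the same block, which gives a quick consistency check when the alignment is copied by hand.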

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End  InterPro ID  Description
CDD              cd10338            2.76E-58  597    708  No hit       No description
Gene3D           G3DSA:3.30.505.10  6.7E-6    601    669  IPR000980    SH2 domain
SuperFamily      SSF55550           7.73E-8   603    669  IPR000980    SH2 domain
PROSITE profile  PS50001            10.464    608    708  IPR000980    SH2 domain
Sequence
Protein Sequence    Length: 709 aa
MGENNQIIEE KDYILLKDFK VEIEVEEGKG FILCFWVYMF NPNAFPATIL KQVYSETNSS  60
APLLVLNEKT LMLLPLTCLH NEVPDPGNTA LSTEVLKVST QIEYPQYKWI HVAYEVSTDF  120
VRLHINAEIA GELQLSSLLN KVSMPNDLRK TTVVGITGGN DLQGYIHDAK VLPSTLSIKN  180
QYVQNPPLQL SIDESSASDI EEDNGFWNIV GGKASCRRIF SLDVVLLNAF GQPVNKELEV  240
VASLLYAHNR SPVEKTNDEE APLLASYDGI EFASSDRPSK LLNGRASFKL KISKLSSKSE  300
NRQFCIKFGI SKFEGYRFLE DFSPSIRCVS RNRTPRTSTI IWKKTTAVHP LNGSQSFGLD  360
DASLEPRHNT VDEAKLSPTS KRVRSGEAKI STIDQLGEEC NSLAWTANQV ENGYGSSMEA  420
RPENFEEVDN SLSDSESTGA RDSALKSVSN TAHSVSDLTI FRYCLGGLTD RSLLLKEIAT  480
NASDEEISGF ANQVSLYSGC SHHRHQIKIT KRLIEEGTKA WNLLSQNNIQ VQWESAVFEI  540
EEQFMKIAHC STRSLTQQDF ELLRKIAGCR DYMAQENFEK MWCWLYPVAF TLSSDWINAM  600
WNCTSPKWIE GFITKEEAEL SLQGPRGLQE PGTFILRFPT SRSWPHPDAG SLIVTYVGSD  660
YTLHHRLLSL DNVCSPGVRE MNAKVKPLQD MLLAEPELSR LGRIIRSH*
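
The listing above is laid out in blocks of ten residues with a running position counter at the end of each line, so it has to be flattened before domain coordinates such as 222-343 can be applied to it. The sketch below shows one way to do that, assuming the trailing `*` is a stop marker and that coordinates are 1-based and inclusive. The helper names `parse_sequence_block` and `slice_region` are hypothetical.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Flatten a block-formatted listing into one uppercase residue string.

    Everything that is not a letter is dropped: spaces, line breaks,
    position counters such as '60', and a terminal '*'.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()


def slice_region(seq: str, start: int, end: int) -> str:
    """Return residues start..end of seq (1-based, inclusive)."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("region lies outside the sequence")
    return seq[start - 1:end]


# First four lines of the listing (residues 1-240).
listing = """\
MGENNQIIEE KDYILLKDFK VEIEVEEGKG FILCFWVYMF NPNAFPATIL KQVYSETNSS  60
APLLVLNEKT LMLLPLTCLH NEVPDPGNTA LSTEVLKVST QIEYPQYKWI HVAYEVSTDF  120
VRLHINAEIA GELQLSSLLN KVSMPNDLRK TTVVGITGGN DLQGYIHDAK VLPSTLSIKN  180
QYVQNPPLQL SIDESSASDI EEDNGFWNIV GGKASCRRIF SLDVVLLNAF GQPVNKELEV  240
"""

seq = parse_sequence_block(listing)
print(len(seq))
# First ten residues of the STAT domain, which starts at position 222.
print(slice_region(seq, 222, 231))
```

The residues returned for 222-231 can be compared against the start of the query line in the Signature Domain alignment to confirm that the coordinates were applied correctly.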
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_007022054.2  0.0      PREDICTED: uncharacterized protein LOC18594437 isoform X2
Swissprot  Q56XZ1          0.0      SHB_ARATH; SH2 domain-containing protein B
TrEMBL     A0A061FGJ1      0.0      A0A061FGJ1_THECC; SH2 domain protein A, putative isoform 1
STRING     EOY13581        0.0      (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G78540.1  0.0      SH2 domain protein B
Publications
  1. Yamada Y, Wang HY, Fukuzawa M, Barton GJ, Williams JG
    A new family of transcription factors.
    Development, 2008. 135(18): p. 3093-101
    [PMID:18701541]
  2. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]