PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG031704t2
Common Name TCM_031704
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family NZZ/SPL
Protein Properties Length: 337 aa    MW: 36109.4 Da    pI: 9.0154
Description NZZ/SPL family protein
Gene Model
Gene Model ID      Type    Source  Coding Sequence
Thecc1EG031704t2   genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    NOZZLE  20.3   8.7e-07  22     67   44         87
            NOZZLE 44 grkpgsktaqqkqkkptlrgmgvaklerfiieeekkkl..vvatvg 87
                      g   g  + +qk kk   rg+gva+le+ ++ee++kk   v+a+ g
  Thecc1EG031704t2 22 GNSVGRSSKKQKPKKVPQRGLGVAQLEKIRLEEQQKKDaaVAAAAG 67
                      55667778899999999***************99887643444444 PP

Protein Features
3D Structure
Database  Entry ID  E-value  Start  End  InterPro ID  Description
Pfam      PF08744   7.8E-5   23     68   IPR014855    Plant transcription factor NOZZLE
Sequence
Protein Sequence    Length: 337 aa
MAQEDQSQRC SNSNTSSGGG FGNSVGRSSK KQKPKKVPQR GLGVAQLEKI RLEEQQKKDA  60
AVAAAAGILP SPSSSVISQP THHKSSSYLS LPIPSNFHPS NQSSYSSSSS SIPFPADLSP  120
PNLIFRPPLS VQNADVVSAN TVPLTTGSSP GWHPGAGVVL PGNGSVNSGH KLWSSREYSI  180
EKECSGLDPG LAFRTNLSLP YESEPIWPPP SLMQRAQPFQ QPSSSMVNLS SRTSSTSVLN  240
YQMEPPSNQS YYGNCTPLLP EEEKMVGMKR SYPFSLDNAP GPPLHGKYPP IVHPINGHVE  300
AASSSNGSTF NFEPGTPNFR FDLNFFHLLY VMKIGV*
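
The sequence block above is grouped in tens with running position counters, so it cannot be sliced directly with the 1-based Start/End coordinates reported for the NOZZLE domain. The following is a minimal Python sketch of one way to flatten it and extract residues 22 to 67. The helper names `clean_sequence` and `domain_slice` are hypothetical and not part of PlantTFDB, and treating the trailing `*` as a terminator symbol rather than a residue is an assumption.

```python
import re

# Sequence block copied from this record, position counters included.
RAW_BLOCK = """
MAQEDQSQRC SNSNTSSGGG FGNSVGRSSK KQKPKKVPQR GLGVAQLEKI RLEEQQKKDA  60
AVAAAAGILP SPSSSVISQP THHKSSSYLS LPIPSNFHPS NQSSYSSSSS SIPFPADLSP  120
PNLIFRPPLS VQNADVVSAN TVPLTTGSSP GWHPGAGVVL PGNGSVNSGH KLWSSREYSI  180
EKECSGLDPG LAFRTNLSLP YESEPIWPPP SLMQRAQPFQ QPSSSMVNLS SRTSSTSVLN  240
YQMEPPSNQS YYGNCTPLLP EEEKMVGMKR SYPFSLDNAP GPPLHGKYPP IVHPINGHVE  300
AASSSNGSTF NFEPGTPNFR FDLNFFHLLY VMKIGV*
"""


def clean_sequence(block: str) -> str:
    """Keep letters only: drops spaces, newlines, position counters,
    and the trailing '*' (assumed here to be a terminator symbol)."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


sequence = clean_sequence(RAW_BLOCK)
# Start/End taken from the Signature Domain table of this record.
nozzle = domain_slice(sequence, 22, 67)

print(len(sequence))
print(nozzle)
```

The extracted slice can be compared against the query row of the alignment shown under Signature Domain. Counting letters this way gives 336, so the reported Length of 337 appears to include the trailing `*`, although that reading is an inference from this record alone.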
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007021650.2  0.0      PREDICTED: uncharacterized protein LOC18594125 isoform X1
TrEMBL  A0A061F841      0.0      A0A061F841_THECC; Actin cytoskeleton-regulatory complex protein pan1, putative isoform 2
STRING  EOY13175        0.0      (Theobroma cacao)
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]