PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG001889t1
Common NameTCM_001889
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family Nin-like
Protein Properties Length: 270aa    MW: 29443 Da    PI: 8.2379
Description Nin-like family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG001889t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1RWP-RK69.74.1e-221460248
            RWP-RK  2 ekeisledlskyFslpikdAAkeLgvclTvLKriCRqyGIkRWPhRk 48
                      ++++s++d++ +Fslp+ dAA++Lgvc+++LK+iCR++G++RWPhRk
  Thecc1EG001889t1 14 TQSLSFDDIADFFSLPLYDAASTLGVCASALKKICRENGLDRWPHRK 60
                      789*******************************************9 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5151915.733184IPR003035RWP-RK domain
PfamPF020424.8E-191761IPR003035RWP-RK domain
Sequence ? help Back to Top
Protein Sequence    Length: 270 aa     Download sequence    Send to blast
MMSLQQQNSS SAATQSLSFD DIADFFSLPL YDAASTLGVC ASALKKICRE NGLDRWPHRK  60
FLAGKSIEEI KRHAARERRK ELTELSKVHR QGSQPQNNEL SKLQGAAALP NLQQQGTKNI  120
QTGQALNFGH RSLMTGMTTS DEFKYGFPSD GLSIATNKWW GSSKSDGHED VQVDGAETEG  180
EDKHQSVEKP GDMANEKPEE NGKLDDGIGP QGSGLLTAVR KRAVEEGGEA LKLGVYKGYG  240
IKKLGTREAS LLLRIFKSSL QKDWIHGPS*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017973224.10.0PREDICTED: uncharacterized protein LOC18612177 isoform X2
TrEMBLA0A061DK620.0A0A061DK62_THECC; Uncharacterized protein isoform 1
STRINGEOX930360.0(Theobroma cacao)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76350.16e-10Nin-like family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]