PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG037359t1
Common NameTCM_037359
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family Nin-like
Protein Properties Length: 534aa    MW: 59105.5 Da    PI: 5.5248
Description Nin-like family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG037359t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1RWP-RK23.21.4e-07441491350
            RWP-RK   3 keisledlskyFslpikdAAkeLgvclTvLKriCRqy.G..IkRWPhRkik 50 
                       k ++++dl+ +F++  kdAA+ L+++ TvL  i  +  G   +RWP+R i+
  Thecc1EG037359t1 441 KMLTFDDLKPFFEMLRKDAARRLNLSETVLHNIFDEAtGkkGRRWPYREIA 491
                       789****************************997664131137*****996 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5151910.876422513IPR003035RWP-RK domain
PfamPF020421.7E-7443491IPR003035RWP-RK domain
Sequence ? help Back to Top
Protein Sequence    Length: 534 aa     Download sequence    Send to blast
MADPNANDPD DDPHYVPLVD EALNFMNNME PILPDDAMNI FMGNVNHPFM DDSRNNFLNA  60
INPTLFGNSS SDNSDVLNLR VNSQSDPSMN NLRTSTQMQQ NFENEDLNNL HVPTRVNSTV  120
QEPSQHPFIP QSEQEATNPL QIATNGEGMH SEPNLENAQA SMRNELNLEN SPTGMRKEPN  180
LENSQVETGF GIQRLDRNKG KLSVGLSRPP PSGCSGCQML REITHRKGSL VKKLQIHGEL  240
SRGRFFHALN NVLDDDTTVV LDAENIDFYD KGYRDVEKFL SQYFIKQEQE GWSMHDDPRA  300
VFFKVLCFGP GGVQTGEAAN TSNREKNPVP PTAGVLATAP AEAANTSNRE KNPVPPTARV  360
LATAPAEAAN TSNREKNPVR QTARVLATAP AEASNHSNAE AANTSISQAG QVAATGSHET  420
TNPNNSGRRS INLSEQRKRI KMLTFDDLKP FFEMLRKDAA RRLNLSETVL HNIFDEATGK  480
KGRRWPYREI AANRRKIAKL TAIADSTDNP AASDRARDEI RTLEEQIDAL YRP*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021276287.10.0uncharacterized protein LOC110410746
TrEMBLA0A061GLH30.0A0A061GLH3_THECC; Uncharacterized protein isoform 1
STRINGEOY300020.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM40762342
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]