PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG037359t2
Common Name TCM_037359
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family Nin-like
Protein Properties Length: 336 aa    MW: 37581 Da    pI: 9.1072
Description Nin-like family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG037359t2  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    RWP-RK  24     7.6e-08  243    293  3          50
            RWP-RK   3 keisledlskyFslpikdAAkeLgvclTvLKriCRqy.G..IkRWPhRkik 50 
                       k ++++dl+ +F++  kdAA+ L+++ TvL  i  +  G   +RWP+R i+
  Thecc1EG037359t2 243 KMLTFDDLKPFFEMLRKDAARRLNLSETVLHNIFDEAtGkkGRRWPYREIA 293
                       789****************************997664131137*****996 PP

Protein Features
Database         Entry ID  E-value  Start  End  InterPro ID  Description
PROSITE profile  PS51519   10.876   224    315  IPR003035    RWP-RK domain
Pfam             PF02042   9.5E-8   245    293  IPR003035    RWP-RK domain
Sequence
Protein Sequence    Length: 336 aa
MHSEPNLENA QASMRNELNL ENSPTGMRKE PNLENSQVET GFGIQRLDRN KGKLSVGLSR  60
PPPSGCSGCQ MLREITHRKG SLVKKLQIHG ELSRGRFFHA LNNVLDDDTT VVLDAENIDF  120
YDKGYRDVEK FLSQYFIKQE QEGWSMHDDP RAVFFKVLCF GPGGVQTGEA ANTSNREKNP  180
VRQTARVLAT APAEASNHSN AEAANTSISQ AGQVAATGSH ETTNPNNSGR RSINLSEQRK  240
RIKMLTFDDL KPFFEMLRKD AARRLNLSET VLHNIFDEAT GKKGRRWPYR EIAANRRKIA  300
KLTAIADSTD NPAASDRARD EIRTLEEQID ALYRP*
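
The domain coordinates reported in this record can be checked directly against the protein sequence. The following is a minimal sketch, not part of PlantTFDB itself: the helper names `clean_sequence` and `domain_slice` are hypothetical, and the sequence blocks and the aligned string are copied from this record. It strips the position counters and the trailing `*`, then extracts residues 243 to 293 (1-based, inclusive) and compares them with the query row of the RWP-RK alignment.

```python
import re

# Sequence blocks copied verbatim from this record. The position counters
# at the line ends and the trailing "*" are removed by clean_sequence().
RAW = """
MHSEPNLENA QASMRNELNL ENSPTGMRKE PNLENSQVET GFGIQRLDRN KGKLSVGLSR  60
PPPSGCSGCQ MLREITHRKG SLVKKLQIHG ELSRGRFFHA LNNVLDDDTT VVLDAENIDF  120
YDKGYRDVEK FLSQYFIKQE QEGWSMHDDP RAVFFKVLCF GPGGVQTGEA ANTSNREKNP  180
VRQTARVLAT APAEASNHSN AEAANTSISQ AGQVAATGSH ETTNPNNSGR RSINLSEQRK  240
RIKMLTFDDL KPFFEMLRKD AARRLNLSET VLHNIFDEAT GKKGRRWPYR EIAANRRKIA  300
KLTAIADSTD NPAASDRARD EIRTLEEQID ALYRP*
"""


def clean_sequence(raw: str) -> str:
    """Keep letters only: drops spaces, newlines, counters, and '*'."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)

# Start/End of the RWP-RK hit as listed in the Signature Domain table.
rwp_rk = domain_slice(seq, 243, 293)

# Query row of the alignment in this record; lowercase letters mark
# residues aligned to insert states, so compare case-insensitively.
aligned = "KMLTFDDLKPFFEMLRKDAARRLNLSETVLHNIFDEAtGkkGRRWPYREIA"

print(rwp_rk)
print("matches alignment row:", rwp_rk == aligned.upper())
```

The same `domain_slice` call works for the Pfam (245 to 293) and PROSITE (224 to 315) coordinates listed under Protein Features.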
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_021276287.1  0.0      uncharacterized protein LOC110410746
TrEMBL  A0A061GKR1      0.0      A0A061GKR1_THECC; Uncharacterized protein isoform 2
STRING  EOY30002        0.0      (Theobroma cacao)
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]