PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG002653t1
Common NameTCM_002653
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family FAR1
Protein Properties Length: 278aa    MW: 31787.2 Da    PI: 7.1683
Description FAR1 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG002653t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1FAR185.39.7e-27112201190
              FAR1   1 kfYneYAkevGFsvrkskskkskrngeitkrtfvCskegkreeekkktekerrtraetrtgCkaklkvkkekdgkwevtkleleHnHela 90 
                       kfY eYA++vGF vr+ + ++s  +g++  r++ C+k+g++ ++k +    +++r++ r gCka++ vk+ek+gkw+vt++++eHnH+l 
  Thecc1EG002653t1 112 KFYVEYARQVGFVVRIMQRRRSGIDGRTLARRLGCNKQGFSPNHKGTFGPDKKPRPSAREGCKATILVKMEKTGKWVVTRFVKEHNHPLI 201
                       6**********************************************9****************************************95 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF031011.4E-24112201IPR004330FAR1 DNA binding domain
Sequence ? help Back to Top
Protein Sequence    Length: 278 aa     Download sequence    Send to blast
MFIWVTRMMK PEPNSSPETT LATNCLGDPR FYPTVEIHHF ILRLVFLSAS DQRIIASQII  60
FGCLSETLDS MDLDKEVGTV DSSEEKTAEP EGREILEPYV GMEFESEDDA RKFYVEYARQ  120
VGFVVRIMQR RRSGIDGRTL ARRLGCNKQG FSPNHKGTFG PDKKPRPSAR EGCKATILVK  180
MEKTGKWVVT RFVKEHNHPL IATANGFSTT GDKDKKIEEL SMELAHQEQL CSAYREKLFT  240
FMNNVEEQTE ELSSKIQVIV DNVRKLESET QRFSHRR*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021301085.11e-169uncharacterized protein LOC110429388
TrEMBLA0A061DMU80.0A0A061DMU8_THECC; Far-red impaired responsive family protein isoform 1
STRINGEOX937220.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM39182855
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G12850.13e-74FAR1 family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]