PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG007052t1
Common Name TCM_007052
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family FAR1
Protein Properties Length: 287 aa    MW: 32657.2 Da    pI: 8.4127
Description FAR1 family protein
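The molecular weight listed under Protein Properties can be approximately reproduced from the protein sequence given in the Sequence section of this record. The record does not say how the value was computed, so the sketch below is an assumption: it sums standard average residue masses and adds one water molecule, ignoring the terminal `*`. The names `AVG_RESIDUE_MASS`, `RAW` and `molecular_weight` are illustrative and not part of PlantTFDB.

```python
import re

# Standard average residue masses in Da (free amino acid minus one water).
AVG_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water is added back for the free N- and C-termini

# Sequence block as displayed in this record (spacing, counters and '*' included).
RAW = """
MENKENNLLD AKYLHKLSKD DILGLEFDDL EDVYEFYKAY ACAMGFGVRK GGCRRNKDGI  60
EVMKHFACSK EGHKAEKREK LENRAQEPKR SSRIDCKANI RVILNKDIGK WIPLNLIMKR  120
WTKNAKDDAP AVVDDNVDPK YQTILRYASL SSHCNRLCHV ASQFVETFNK ARSEIASLTR  180
RYEEMCKVNT DGISNLTEHV RDPTRVKVKG KVGAKSEGKK KPRKCGNCRM EGHTRNKCPQ  240
LELTLCSLDS SSCLLDDNDV DVYERNKEIW PSQLGTLFGG DSSEEE*
"""

# Keep residue letters only: drops spaces, line counters and the stop symbol.
protein = re.sub(r"[^A-Z]", "", RAW)


def molecular_weight(seq):
    """Estimate the average molecular weight of a peptide chain in Da."""
    return sum(AVG_RESIDUE_MASS[aa] for aa in seq) + WATER


print(len(protein))
print(round(molecular_weight(protein), 1))
```

Under these assumptions the estimate should land very close to the listed 32657.2 Da. The cleaned sequence has 286 residues, so the listed length of 287 appears to count the trailing `*`.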
Gene Model
Gene Model ID       Type      Source    Coding Sequence
Thecc1EG007052t1    genome    CGD       View CDS
Signature Domain
No.   Domain   Score   E-value   Start   End   HMM Start   HMM End
1     FAR1     54.6    3.6e-17   35      111   1           76
              FAR1   1 kfYneYAkevGFsvrkskskkskrngeitkrtfvCskegkreeekkk.tekerrtraetrtgCkaklkvkkek.dgkw 76 
                       +fY++YA  +GF vrk   +++ ++g  ++++f Cskeg+++e+++k ++++++++ ++r++Cka+++v ++k  gkw
  Thecc1EG007052t1  35 EFYKAYACAMGFGVRKGGCRRN-KDGIEVMKHFACSKEGHKAEKREKlENRAQEPKRSSRIDCKANIRVILNKdIGKW 111
                       6********************9.666677899*********99998867888889**************999877888 PP
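The alignment block above looks like a HMMER-style domain alignment (an assumption from its layout, since the record does not name the tool): profile consensus, match line, query sequence, and a per-column posterior-probability line ending in `PP`. As a rough consistency check, the sketch below strips gap symbols from the two sequence rows and compares the residue counts with the Start/End and HMM Start/HMM End coordinates. The function name `ungapped_length` is illustrative.

```python
# Rows copied from the alignment block of this record.
HMM_ROW = "kfYneYAkevGFsvrkskskkskrngeitkrtfvCskegkreeekkk.tekerrtraetrtgCkaklkvkkek.dgkw"
QUERY_ROW = "EFYKAYACAMGFGVRKGGCRRN-KDGIEVMKHFACSKEGHKAEKREKlENRAQEPKRSSRIDCKANIRVILNKdIGKW"

# Coordinates from the Signature Domain table (1-based, inclusive).
HMM_START, HMM_END = 1, 76
SEQ_START, SEQ_END = 35, 111


def ungapped_length(row):
    """Count residues in an alignment row, skipping '.' and '-' gap symbols."""
    return sum(1 for ch in row if ch not in ".-")


hmm_len = ungapped_length(HMM_ROW)
query_len = ungapped_length(QUERY_ROW)

# Both rows span the same number of alignment columns.
print(len(HMM_ROW) == len(QUERY_ROW))
# Residue counts should equal the coordinate spans.
print(hmm_len, HMM_END - HMM_START + 1)
print(query_len, SEQ_END - SEQ_START + 1)

# Columns where the profile row shows '.' are insertions in the query,
# which this layout writes in lowercase.
inserts = [q for h, q in zip(HMM_ROW, QUERY_ROW) if h == "."]
print(inserts)
```

The single `-` in the query row marks a profile position with no matching residue, which is why the query covers 77 residues across 78 columns while the profile covers 76.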

Protein Features
Database      Entry ID    E-value    Start   End   InterPro ID   Description
Pfam          PF03101     2.7E-14    35      111   IPR004330     FAR1 DNA binding domain
SuperFamily   SSF57756    5.44E-5    212     241   IPR001878     Zinc finger, CCHC-type
Gene Ontology
GO Term       GO Category          GO Description
GO:0003676    Molecular Function   nucleic acid binding
GO:0008270    Molecular Function   zinc ion binding
Sequence
Protein Sequence    Length: 287 aa
MENKENNLLD AKYLHKLSKD DILGLEFDDL EDVYEFYKAY ACAMGFGVRK GGCRRNKDGI  60
EVMKHFACSK EGHKAEKREK LENRAQEPKR SSRIDCKANI RVILNKDIGK WIPLNLIMKR  120
WTKNAKDDAP AVVDDNVDPK YQTILRYASL SSHCNRLCHV ASQFVETFNK ARSEIASLTR  180
RYEEMCKVNT DGISNLTEHV RDPTRVKVKG KVGAKSEGKK KPRKCGNCRM EGHTRNKCPQ  240
LELTLCSLDS SSCLLDDNDV DVYERNKEIW PSQLGTLFGG DSSEEE*
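The Start and End coordinates reported for this protein (35 to 111 for the FAR1 domain, 212 to 241 for the CCHC-type zinc finger) can be mapped back onto the sequence above, assuming they are 1-based and inclusive. The sketch below cleans the displayed sequence block, slices both regions, and searches the zinc finger region for the commonly cited CCHC zinc-knuckle spacing C-X2-C-X4-H-X4-C. That pattern is an assumption about what the SuperFamily hit reflects, not something the record states, and the helper `region` is hypothetical.

```python
import re

# Sequence block as displayed in this record (spacing, counters and '*' included).
RAW = """
MENKENNLLD AKYLHKLSKD DILGLEFDDL EDVYEFYKAY ACAMGFGVRK GGCRRNKDGI  60
EVMKHFACSK EGHKAEKREK LENRAQEPKR SSRIDCKANI RVILNKDIGK WIPLNLIMKR  120
WTKNAKDDAP AVVDDNVDPK YQTILRYASL SSHCNRLCHV ASQFVETFNK ARSEIASLTR  180
RYEEMCKVNT DGISNLTEHV RDPTRVKVKG KVGAKSEGKK KPRKCGNCRM EGHTRNKCPQ  240
LELTLCSLDS SSCLLDDNDV DVYERNKEIW PSQLGTLFGG DSSEEE*
"""

# Keep residue letters only: drops spaces, line counters and the stop symbol.
protein = re.sub(r"[^A-Z]", "", RAW)


def region(seq, start, end):
    """Return residues start..end, treating coordinates as 1-based and inclusive."""
    return seq[start - 1:end]


far1 = region(protein, 35, 111)   # FAR1 domain (Pfam PF03101)
zf = region(protein, 212, 241)    # Zinc finger, CCHC-type (SSF57756)

print(far1[:10], len(far1))
print(zf)

# Assumed CCHC zinc-knuckle spacing: C-X2-C-X4-H-X4-C.
match = re.search(r"C.{2}C.{4}H.{4}C", zf)
if match:
    # Convert offsets within the slice back to protein coordinates.
    first = 212 + match.start()
    last = 212 + match.end() - 1
    print(match.group(), first, last)
```

The FAR1 slice begins with the same residues as the query row of the domain alignment, which supports reading the coordinates as 1-based and inclusive.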
Annotation -- Protein
Source    Hit ID            E-value   Description
Refseq    XP_017974464.1    1e-85     PREDICTED: protein FAR1-RELATED SEQUENCE 5
TrEMBL    A0A061DZX4        0.0       A0A061DZX4_THECC; Uncharacterized protein
STRING    EOX98245          0.0       (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID         E-value   Description
AT5G18960.1    3e-11     FAR1-related sequence 12
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]