PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG026600t1
Common Name TCM_026600
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family RAV
Protein Properties Length: 436aa    MW: 49561.4 Da    PI: 7.7489
Description RAV family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG026600t1  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    AP2     31.6   4e-10    139    186  1          54
               AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkle 54 
                       s++kGV + k +g+W A+++       + +r++lg+f ++ +Aa a++ a+ k++
  Thecc1EG026600t1 139 SRFKGVIRQK-NGQWGAQLYA------NhTRIWLGTFKSETDAAMAYDSAAIKFR 186
                       789***6566.9******887......34**********99**********9987 PP

2    B3      83.5   1.9e-26  262    357  1          89
                       EEEE-..-HHHHTT-EE--HHH.HTT......---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeeh......ggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgr 87 
                       f+k+ltpsdv+k++rlv+pkk+a ++      +g+k ++ +l+++d+  r W+++++y+++s+++v+t+GW++F k+++Lk++D++ F++ ++
  Thecc1EG026600t1 262 FQKELTPSDVGKLNRLVIPKKYAVKFfppiegSGSKGSDAELIFYDKFMRLWKFRYCYWNSSQSFVFTRGWNRFLKEKELKANDVISFYVCES 354
                       99******************************8888999**************************************************7643 PP

                       .SE CS
                B3  88 .se 89 
                        +e
  Thecc1EG026600t1 355 rKE 357
                       233 PP

Protein Features
Database         Entry ID           E-value   Start  End  InterPro ID  Description
SuperFamily      SSF54171           2.09E-14  139    196  IPR016177    DNA-binding domain
CDD              cd00018            1.97E-20  139    195  No hit       No description
Gene3D           G3DSA:3.30.730.10  2.3E-16   140    196  IPR001471    AP2/ERF domain
PROSITE profile  PS51032            17.135    140    195  IPR001471    AP2/ERF domain
SMART            SM00380            3.0E-20   140    201  IPR001471    AP2/ERF domain
Gene3D           G3DSA:2.40.330.10  5.3E-31   257    360  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          2.75E-27  260    355  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            3.60E-24  261    350  No hit       No description
SMART            SM01019            2.2E-21   262    369  IPR003340    B3 DNA binding domain
PROSITE profile  PS50863            12.351    262    369  IPR003340    B3 DNA binding domain
Pfam             PF02362            1.8E-23   262    357  IPR003340    B3 DNA binding domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
Sequence
Protein Sequence    Length: 436 aa
MIHPGWTVSF GCSELSFIGR LVKPMFQPFD CKLMLVNVEL KAAVLMVLGR DVTGQTFGVP  60
MEGYELQQQL YIVYFSTSVL AFVLSFNIDM FNFLETMDEE MLSVISSGEG NATSEVSDSI  120
STSHPARKRQ RSGSNGTSSR FKGVIRQKNG QWGAQLYANH TRIWLGTFKS ETDAAMAYDS  180
AAIKFRTGDT HRNFPLTDIT VEEPKFQSNY SAEAVLSMIR DGSYQYKFMD FLKNSFRNGK  240
VEIDLNSVRK YSGKGLSCKQ LFQKELTPSD VGKLNRLVIP KKYAVKFFPP IEGSGSKGSD  300
AELIFYDKFM RLWKFRYCYW NSSQSFVFTR GWNRFLKEKE LKANDVISFY VCESRKEQEV  360
QRFCMIDVNN YGNDDALAEA ANLQVEREVD LQLRLGHCYA FDGGKQVKQE QELMAVDATE  420
DVNTTGFKLF GMQIN*
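
The domain coordinates in the Signature Domain table can be checked against the sequence block above. The following is a minimal sketch, not part of the PlantTFDB record: the names `SEQUENCE_BLOCK`, `parse_sequence`, and `domain` are illustrative, and the Start/End values are assumed to be 1-based and inclusive, which agrees with the alignment rows shown earlier. The block holds 435 residue letters plus a terminal `*`, so the listed length of 436 appears to count the stop symbol.

```python
import re

# Sequence block copied from this record (position counters and stop symbol included).
SEQUENCE_BLOCK = """\
MIHPGWTVSF GCSELSFIGR LVKPMFQPFD CKLMLVNVEL KAAVLMVLGR DVTGQTFGVP  60
MEGYELQQQL YIVYFSTSVL AFVLSFNIDM FNFLETMDEE MLSVISSGEG NATSEVSDSI  120
STSHPARKRQ RSGSNGTSSR FKGVIRQKNG QWGAQLYANH TRIWLGTFKS ETDAAMAYDS  180
AAIKFRTGDT HRNFPLTDIT VEEPKFQSNY SAEAVLSMIR DGSYQYKFMD FLKNSFRNGK  240
VEIDLNSVRK YSGKGLSCKQ LFQKELTPSD VGKLNRLVIP KKYAVKFFPP IEGSGSKGSD  300
AELIFYDKFM RLWKFRYCYW NSSQSFVFTR GWNRFLKEKE LKANDVISFY VCESRKEQEV  360
QRFCMIDVNN YGNDDALAEA ANLQVEREVD LQLRLGHCYA FDGGKQVKQE QELMAVDATE  420
DVNTTGFKLF GMQIN*
"""


def parse_sequence(block: str) -> str:
    """Drop spaces, line breaks, position counters, and the stop symbol."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


protein = parse_sequence(SEQUENCE_BLOCK)

# Coordinates taken from the Signature Domain table of this record.
ap2 = domain(protein, 139, 186)
b3 = domain(protein, 262, 357)

print(len(protein))
print(ap2)
print(b3)
```

If the assumptions hold, the `ap2` and `b3` strings should match the query rows of the two alignments once gap characters are removed and lower-case letters are upper-cased.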
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
1wid_A  5e-28   258          358        10         109      DNA-binding protein RAV1
Functional Description
Source   Description
UniProt  Probably acts as a transcriptional activator. Binds to the GCC-box pathogenesis-related promoter element. May be involved in the regulation of gene expression by stress factors and by components of stress signal transduction pathways (By similarity). {ECO:0000250}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_021283244.1  0.0      LOW QUALITY PROTEIN: AP2/ERF and B3 domain-containing transcription factor At1g50680-like
Swissprot  Q9C6P5          1e-95    RAVL2_ARATH; AP2/ERF and B3 domain-containing transcription factor At1g50680
TrEMBL     A0A061F3C8      0.0      A0A061F3C8_THECC; AP2/B3 transcription factor family protein, putative isoform 1
STRING     EOY11418        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM15647617
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G50680.1  4e-98    RAV family protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]