PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG040397t1
Common NameTCM_040397
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family RAV
Protein Properties Length: 362aa    MW: 41556.9 Da    PI: 6.9784
Description RAV family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG040397t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1AP226.91.2e-084893354
               AP2  3 ykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkle 54
                      +kGV    ++g W A+I+       + +r++lg+f +++eAa  ++ a+ kl+
  Thecc1EG040397t1 48 FKGVV-PQQNGHWGAQIYA------NhQRIWLGTFKSEKEAAMSYDSAAIKLR 93
                      78885.556899****999......44**********99***********998 PP

2B389.23.3e-28171265183
                       EEEE-..-HHHHTT-EE--HHH.HTT............---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEE CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeeh............ggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvv 81 
                       f+k+ltpsdv+k++rlv+pkk+a ++             g+  e+++l+++d+  r+W+++++y+++s+++v+t+GW++Fvk+++Lke+D+++
  Thecc1EG040397t1 171 FQKELTPSDVGKLNRLVIPKKYAVKYfpyicendeenvAGVGVEEMELVFYDRLMRTWKFRYCYWRSSQSFVFTRGWNRFVKEKKLKERDIIT 263
                       99*****************************877666644444999*********************************************** PP

                       EE CS
                B3  82 Fk 83 
                       F+
  Thecc1EG040397t1 264 FH 265
                       *9 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
CDDcd000183.11E-2246104No hitNo description
Gene3DG3DSA:3.30.730.103.2E-1547102IPR001471AP2/ERF domain
PROSITE profilePS5103216.89847102IPR001471AP2/ERF domain
SMARTSM003805.2E-1847108IPR001471AP2/ERF domain
SuperFamilySSF541717.19E-1447103IPR016177DNA-binding domain
Gene3DG3DSA:2.40.330.101.0E-30167269IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019364.19E-26169268IPR015300DNA-binding pseudobarrel domain
CDDcd100172.42E-24170265No hitNo description
PfamPF023622.0E-25171265IPR003340B3 DNA binding domain
PROSITE profilePS5086312.506171286IPR003340B3 DNA binding domain
SMARTSM010196.3E-18171276IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 362 aa     Download sequence    Send to blast
MDEDESSMVS NAKWKVAAAE TSDCGNSNYP LRASKRSRHE TNASLARFKG VVPQQNGHWG  60
AQIYANHQRI WLGTFKSEKE AAMSYDSAAI KLRSVDSHRN FPWAEHNIQE PNFQSLYSTE  120
DVLNMIRAGS YQAKFAEFVN ILSERNGILG SKSVNKNLVH GDIHFSCVQL FQKELTPSDV  180
GKLNRLVIPK KYAVKYFPYI CENDEENVAG VGVEEMELVF YDRLMRTWKF RYCYWRSSQS  240
FVFTRGWNRF VKEKKLKERD IITFHTCECP ALVEKDALNF FLIDVNYNGE QRCINEDKVL  300
NGLESSPQDL QVELELNLGK SFYCRIDNCN SLFNEDKGLS GLKSSHDVEE KRVTLFGVQI  360
N*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1wid_A3e-2717026413100DNA-binding protein RAV1
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtProbably acts as a transcriptional activator. Binds to the GCC-box pathogenesis-related promoter element. May be involved in the regulation of gene expression by stress factors and by components of stress signal transduction pathways (By similarity). {ECO:0000250}.
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007014835.20.0PREDICTED: AP2/ERF and B3 domain-containing transcription factor At1g51120
SwissprotQ9C6881e-110RAVL3_ARATH; AP2/ERF and B3 domain-containing transcription factor At1g51120
TrEMBLA0A061GZ080.0A0A061GZ08_THECC; AP2/B3 transcription factor family protein, putative
STRINGEOY324540.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM17052788
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G51120.11e-113RAV family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]