PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG037484t2
Common NameTCM_037484
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family ARF
Protein Properties Length: 1147aa    MW: 128139 Da    PI: 6.5465
Description ARF family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG037484t2genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B370.62e-22126227199
                       EEEE-..-HHHHTT-EE--HHH.HTT......---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeeh......ggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgr 87 
                       f+k+lt sd++++g +++p++ ae++      +++++  ++l+ +d++ ++W++++iyr++++r++lt+GW+ Fv+ ++L +gD+v+F +  +
  Thecc1EG037484t2 126 FCKTLTASDTSTHGGFSVPRRAAEKIfppldfSMQPPA-QELVARDLHDNTWTFRHIYRGQPKRHLLTTGWSVFVSTKRLFAGDSVLFIR--D 215
                       99*****************************9445444.38************************************************4..4 PP

                       SEE..EEEEE-S CS
                B3  88 sefelvvkvfrk 99 
                       ++ +l++++ r+
  Thecc1EG037484t2 216 EKSQLLLGIKRA 227
                       666678888765 PP

2Auxin_resp125.14.7e-41252334183
        Auxin_resp   1 aahaastksvFevvYnPrastseFvvkvekvekalkvkvsvGmRfkmafetedsserrlsGtvvgvsdldpvrWpnSkWrsLk 83 
                       aahaa+++s+F+++YnPras+seFv++++k++ka++++vs+GmRf+m+fete+s  rr++Gt++g+sdldpvrW+nS+Wr+L+
  Thecc1EG037484t2 252 AAHAAANNSPFTIFYNPRASPSEFVIPLAKYNKAMYTQVSLGMRFRMMFETEESGVRRYMGTITGISDLDPVRWKNSQWRNLQ 334
                       79*******************************************************************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019363.14E-44116255IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.103.3E-41118241IPR015300DNA-binding pseudobarrel domain
CDDcd100171.08E-19125226No hitNo description
SMARTSM010192.8E-23126228IPR003340B3 DNA binding domain
PfamPF023621.3E-20126227IPR003340B3 DNA binding domain
PROSITE profilePS5086312.224126228IPR003340B3 DNA binding domain
PfamPF065075.0E-36252334IPR010525Auxin response factor
PROSITE profilePS5174525.68310191112IPR000270PB1 domain
PfamPF023095.6E-910291105IPR033389AUX/IAA domain
SuperFamilySSF542771.08E-810331097No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0009638Biological Processphototropism
GO:0009723Biological Processresponse to ethylene
GO:0009734Biological Processauxin-activated signaling pathway
GO:0009785Biological Processblue light signaling pathway
GO:0010311Biological Processlateral root formation
GO:0045893Biological Processpositive regulation of transcription, DNA-templated
GO:0048366Biological Processleaf development
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
GO:0005515Molecular Functionprotein binding
Sequence ? help Back to Top
Protein Sequence    Length: 1147 aa     Download sequence    Send to blast
MKAPPNGFLA NSAEGERKSI NSELWHACAG PLVSLPPVGS LVVYFPQGHS EQVAASMQKE  60
TDFIPSYPNL PSKLICMLHN VTLHADPETD EVYAQMTLQP VNKYDKEALL ASDMGLKQSR  120
QPAEFFCKTL TASDTSTHGG FSVPRRAAEK IFPPLDFSMQ PPAQELVARD LHDNTWTFRH  180
IYRGQPKRHL LTTGWSVFVS TKRLFAGDSV LFIRDEKSQL LLGIKRANRQ QPALSSSVIS  240
SDSMHIGILA AAAHAAANNS PFTIFYNPRA SPSEFVIPLA KYNKAMYTQV SLGMRFRMMF  300
ETEESGVRRY MGTITGISDL DPVRWKNSQW RNLQVGWDES TAGERPSRVS IWEIEPVVTP  360
FYICPPPFFR PRFPKQPGMP DDESDIENAF KRAMPWLGDD FGMKDAPSSI FPGLSLVQWM  420
SMQQNNQFPA AQSGCFPSMV SSNPLHNNLS TDDPSKLLNF QAPVSPASNM PFNKANANQV  480
NQLPQAPMTW PQQQQLQQLL QTPLSQHQQQ QQQTQQQLQR QQPQQPQQPQ QQPQQHLLHQ  540
QQPQSQPQQQ QQQQQQQQQQ QQRQQPQLQQ QLQQQAFLPA QVNNGIIAPT QISNQNLHQP  600
AVYSQLQQQQ LLTGNSQSTQ AILSANKTSY PLTSLPQDTQ IQQQMEQQTN LIQRQQQQTQ  660
LQQQQTQLQQ QQTQLQQSPL QLLQQSLSQR TQQQPQIQQL SPQGLSDQQL QLQLLQKLQQ  720
QQQQQQQQQS SQQLLSPAGS LLQPPMVQQQ QTHQQNQPLQ QLPLSQSQPQ PLGSNGFSTS  780
TLMQPQQLSM NQPQSQNKPL VAMRTHSGLT DGDAPSCSTS PSTNNCQVSP SNFLNRSQQV  840
PSILVTDPVV EPASTLVQEL QNKSDIRIKH ELPTSKGPDQ SKYKSTVTDQ LEASSSGTSY  900
CLDAGTIQHN FSLPPFLEGD VQSHPRNNLP FTANIDGLAP DTLLSRGYDS QKDLQNLLSN  960
YGGTPRDIDT ELSTAAISSQ SFGVPNIPFK PGCSNDVAIN DTGVLNGGLW ASQTQRMRTY  1020
TKKVQKRGSV GRSIDVTRYK GYDELRHDLA RMFGIEGQLE DPQSSDWKLV YVDHENDILL  1080
VGDDPWEEFV SCVQSIKILS SAEVQQMSLD GDLGNVAVPN QACSGTDSGN AWRGHYDDTS  1140
AASFNR*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4ldu_A1e-1741535545388Auxin response factor 5
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012444798.10.0PREDICTED: auxin response factor 19-like isoform X1
TrEMBLA0A061GLE10.0A0A061GLE1_THECC; Auxin response factor
STRINGEOY301980.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM26292665
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G19220.10.0auxin response factor 19
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]