PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG029410t2
Common NameTCM_029410
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family ARF
Protein Properties Length: 903aa    MW: 99537 Da    PI: 6.3735
Description ARF family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG029410t2genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B375.27.6e-24130231199
                       EEEE-..-HHHHTT-EE--HHH.HTT.......---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-S CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeeh.......ggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldg 86 
                       f+k+lt sd++++g +++p++ ae+        +++++  ++l+ +d++ ++W++++i+r++++r++lt+GW+ Fv+a++L +gD+v+F   +
  Thecc1EG029410t2 130 FCKTLTASDTSTHGGFSVPRRAAEKVfppldfsQQPPA--QELIARDLHDNEWKFRHIFRGQPKRHLLTTGWSVFVSAKRLVAGDSVLFI--W 218
                       99*****************************8544444..48************************************************..8 PP

                       SSEE..EEEEE-S CS
                B3  87 rsefelvvkvfrk 99 
                       +++ +l+++++r+
  Thecc1EG029410t2 219 NEKNQLLLGIRRA 231
                       899999****997 PP

2Auxin_resp119.62.4e-39256339183
        Auxin_resp   1 aahaastksvFevvYnPrastseFvvkvekvekalk.vkvsvGmRfkmafetedsserrlsGtvvgvsdldpvrWpnSkWrsLk 83 
                       aahaa+t+s F+++YnPras+seFv++++k+ ka++ ++vsvGmRf+m fete+ss rr++Gt++g+sdldp rWpnS+Wrs+k
  Thecc1EG029410t2 256 AAHAAATNSRFTIFYNPRASPSEFVIPLAKYIKAVYhTRVSVGMRFRMLFETEESSVRRYMGTITGISDLDPARWPNSHWRSVK 339
                       79*********************************9789******************************************985 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019364.19E-49115259IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.103.1E-42123245IPR015300DNA-binding pseudobarrel domain
CDDcd100171.08E-21129230No hitNo description
PfamPF023623.6E-22130231IPR003340B3 DNA binding domain
PROSITE profilePS5086312.548130232IPR003340B3 DNA binding domain
SMARTSM010192.0E-24130232IPR003340B3 DNA binding domain
PfamPF065072.1E-34256339IPR010525Auxin response factor
PfamPF023091.2E-7762858IPR033389AUX/IAA domain
SuperFamilySSF542777.19E-7765844No hitNo description
PROSITE profilePS5174524.392769853IPR000270PB1 domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009734Biological Processauxin-activated signaling pathway
GO:0009908Biological Processflower development
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
GO:0005515Molecular Functionprotein binding
Sequence ? help Back to Top
Protein Sequence    Length: 903 aa     Download sequence    Send to blast
MRLASAGFNP QTQEDFAGEK RVLNSELWHA CAGPLVSLPP VGSRVVYFPQ GHSEQVAAST  60
NKEVDAHIPN YPSLPPQLIC QLHNVTMHAD VETDEVYAQM TLQPLSPQEQ KEAYLPAELG  120
TPSKQPTNYF CKTLTASDTS THGGFSVPRR AAEKVFPPLD FSQQPPAQEL IARDLHDNEW  180
KFRHIFRGQP KRHLLTTGWS VFVSAKRLVA GDSVLFIWNE KNQLLLGIRR ANRPQTVMPS  240
SVLSSDSMHL GLLAAAAHAA ATNSRFTIFY NPRASPSEFV IPLAKYIKAV YHTRVSVGMR  300
FRMLFETEES SVRRYMGTIT GISDLDPARW PNSHWRSVKV GWDESTAGER QPRVSLWEIE  360
PLTTFPMYPA PFPLRLKRPW PPGLPSFHGI KDDDLGMNSP LMWLRGDADR GMQSLNLQGI  420
GVTPWMQPRL DASMVGLPAD MYQAMAAAAL QDLRAVDPSK PATASLLQFQ QPQNLPCRPA  480
ALMQPQMLQQ SQPQAFLQGV EDNQHQSQSQ AQTPPHLLQQ QLQHQNSFNN QQHPQHPLSQ  540
QHQQLVDHQQ IHSAVSAMSQ YASASQSQSS SLQAMPSLCQ QQSFSDSNGN TVTSPIVSPL  600
HSLLGSFPQD ESSNLLNLPR SNPVITSAAW PSKRAAVEVL SSGSPQCVLP QVEQLGPTQT  660
NMSQNSISLP PFPGRECSID QEGGTDPQSH LLFGVNIEPS SLLMPNGMSS LRGVGSDSDS  720
TTIPFSSNYM STAGTDFSVN PAMTPSSCID ESGFLQSPEN VGQGNPQTRT FVKVYKSGSF  780
GRSLDISKFS SYNELRSELA RMFGLEGQLE DPLRSGWQLV FVDRENDVLL LGDDPWPEFV  840
NSVWCIKILS PQEVQQMGKR GLELLNSVPV QRLSNGSCDD YVSRQDSRNL SSGIASVGSL  900
DY*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4ldu_A1e-1621436041388Auxin response factor 5
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtAuxin response factors (ARFs) are transcriptional factors that bind specifically to the DNA sequence 5'-TGTCTC-3' found in the auxin-responsive promoter elements (AuxREs).
UniProtAuxin response factors (ARFs) are transcriptional factors that bind specifically to the DNA sequence 5'-TGTCTC-3' found in the auxin-responsive promoter elements (AuxREs).
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007024963.10.0PREDICTED: auxin response factor 6 isoform X1
SwissprotA2X1A10.0ARFF_ORYSI; Auxin response factor 6
SwissprotQ6H6V40.0ARFF_ORYSJ; Auxin response factor 6
TrEMBLA0A061GCH90.0A0A061GCH9_THECC; Auxin response factor
STRINGEOY275850.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM13442894
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G30330.20.0auxin response factor 6
Publications ? help Back to Top
  1. Kikuchi S, et al.
    Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice.
    Science, 2003. 301(5631): p. 376-9
    [PMID:12869764]
  2. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]