PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG029410t1
Common Name TCM_029410
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family ARF
Protein Properties Length: 900aa    MW: 99203.7 Da    PI: 6.4417
Description ARF family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG029410t1  genome  CGD     View CDS
Signature Domain
Signature Domain
No.  Domain      Score  E-value  Start  End  HMM Start  HMM End
1    B3          75.2   7.6e-24  127    228  1          99
                       EEEE-..-HHHHTT-EE--HHH.HTT.......---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-S CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeeh.......ggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldg 86 
                       f+k+lt sd++++g +++p++ ae+        +++++  ++l+ +d++ ++W++++i+r++++r++lt+GW+ Fv+a++L +gD+v+F   +
  Thecc1EG029410t1 127 FCKTLTASDTSTHGGFSVPRRAAEKVfppldfsQQPPA--QELIARDLHDNEWKFRHIFRGQPKRHLLTTGWSVFVSAKRLVAGDSVLFI--W 215
                       99*****************************8544444..48************************************************..8 PP

                       SSEE..EEEEE-S CS
                B3  87 rsefelvvkvfrk 99 
                       +++ +l+++++r+
  Thecc1EG029410t1 216 NEKNQLLLGIRRA 228
                       899999****997 PP

2    Auxin_resp  119.6  2.4e-39  253    336  1          83
        Auxin_resp   1 aahaastksvFevvYnPrastseFvvkvekvekalk.vkvsvGmRfkmafetedsserrlsGtvvgvsdldpvrWpnSkWrsLk 83 
                       aahaa+t+s F+++YnPras+seFv++++k+ ka++ ++vsvGmRf+m fete+ss rr++Gt++g+sdldp rWpnS+Wrs+k
  Thecc1EG029410t1 253 AAHAAATNSRFTIFYNPRASPSEFVIPLAKYIKAVYhTRVSVGMRFRMLFETEESSVRRYMGTITGISDLDPARWPNSHWRSVK 336
                       79*********************************9789******************************************985 PP

Protein Features
Database         Entry ID           E-value   Start  End  InterPro ID  Description
SuperFamily      SSF101936          4.19E-49  112    256  IPR015300    DNA-binding pseudobarrel domain
Gene3D           G3DSA:2.40.330.10  3.1E-42   120    242  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            1.08E-21  126    227  No hit       No description
PROSITE profile  PS50863            12.548    127    229  IPR003340    B3 DNA binding domain
SMART            SM01019            2.0E-24   127    229  IPR003340    B3 DNA binding domain
Pfam             PF02362            3.5E-22   127    228  IPR003340    B3 DNA binding domain
Pfam             PF06507            2.1E-34   253    336  IPR010525    Auxin response factor
Pfam             PF02309            1.2E-7    759    855  IPR033389    AUX/IAA domain
SuperFamily      SSF54277           7.19E-7   762    841  No hit       No description
PROSITE profile  PS51745            24.392    766    850  IPR000270    PB1 domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0009734  Biological Process  auxin-activated signaling pathway
GO:0009908  Biological Process  flower development
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0005515  Molecular Function  protein binding
Sequence
Protein Sequence    Length: 900 aa
MRLASAGFNP QTQEGEKRVL NSELWHACAG PLVSLPPVGS RVVYFPQGHS EQVAASTNKE  60
VDAHIPNYPS LPPQLICQLH NVTMHADVET DEVYAQMTLQ PLSPQEQKEA YLPAELGTPS  120
KQPTNYFCKT LTASDTSTHG GFSVPRRAAE KVFPPLDFSQ QPPAQELIAR DLHDNEWKFR  180
HIFRGQPKRH LLTTGWSVFV SAKRLVAGDS VLFIWNEKNQ LLLGIRRANR PQTVMPSSVL  240
SSDSMHLGLL AAAAHAAATN SRFTIFYNPR ASPSEFVIPL AKYIKAVYHT RVSVGMRFRM  300
LFETEESSVR RYMGTITGIS DLDPARWPNS HWRSVKVGWD ESTAGERQPR VSLWEIEPLT  360
TFPMYPAPFP LRLKRPWPPG LPSFHGIKDD DLGMNSPLMW LRGDADRGMQ SLNLQGIGVT  420
PWMQPRLDAS MVGLPADMYQ AMAAAALQDL RAVDPSKPAT ASLLQFQQPQ NLPCRPAALM  480
QPQMLQQSQP QAFLQGVEDN QHQSQSQAQT PPHLLQQQLQ HQNSFNNQQH PQHPLSQQHQ  540
QLVDHQQIHS AVSAMSQYAS ASQSQSSSLQ AMPSLCQQQS FSDSNGNTVT SPIVSPLHSL  600
LGSFPQDESS NLLNLPRSNP VITSAAWPSK RAAVEVLSSG SPQCVLPQVE QLGPTQTNMS  660
QNSISLPPFP GRECSIDQEG GTDPQSHLLF GVNIEPSSLL MPNGMSSLRG VGSDSDSTTI  720
PFSSNYMSTA GTDFSVNPAM TPSSCIDESG FLQSPENVGQ GNPQTRTFVK VYKSGSFGRS  780
LDISKFSSYN ELRSELARMF GLEGQLEDPL RSGWQLVFVD RENDVLLLGD DPWPEFVNSV  840
WCIKILSPQE VQQMGKRGLE LLNSVPVQRL SNGSCDDYVS RQDSRNLSSG IASVGSLDY*
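
The Start and End columns in the domain tables are 1-based residue positions in this sequence. The following is a minimal Python sketch, not part of the PlantTFDB record, for turning a grouped, numbered sequence block like the one above into a plain string and slicing it by such coordinates. The helper names `clean_sequence` and `slice_1based` are hypothetical, and only the first three lines of the sequence are used for brevity.

```python
import re


def clean_sequence(block: str) -> str:
    """Drop position counters, whitespace, and a trailing stop '*' from a sequence block."""
    letters = re.sub(r"[^A-Za-z*]", "", block)
    return letters.rstrip("*").upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, both inclusive, using 1-based coordinates."""
    return seq[start - 1:end]


# First three lines of the protein sequence from this record.
block = """
MRLASAGFNP QTQEGEKRVL NSELWHACAG PLVSLPPVGS RVVYFPQGHS EQVAASTNKE  60
VDAHIPNYPS LPPQLICQLH NVTMHADVET DEVYAQMTLQ PLSPQEQKEA YLPAELGTPS  120
KQPTNYFCKT LTASDTSTHG GFSVPRRAAE KVFPPLDFSQ QPPAQELIAR DLHDNEWKFR  180
"""

seq = clean_sequence(block)
print(len(seq))                     # 180
print(slice_1based(seq, 127, 135))  # FCKTLTASD, the start of the B3 domain hit
```

Slicing the full 900-residue sequence with the B3 coordinates (127 to 228) in the same way should reproduce the query row of the B3 alignment without its gap characters.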
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
4ldu_A  1e-162  11           357        41         388      Auxin response factor 5
Functional Description
Source   Description
UniProt  Auxin response factors (ARFs) are transcriptional factors that bind specifically to the DNA sequence 5'-TGTCTC-3' found in the auxin-responsive promoter elements (AuxREs).
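
As an illustration of the 5'-TGTCTC-3' element mentioned in the description, the sketch below scans a DNA string for that motif on both strands. It is an assumption-laden example, not a PlantTFDB or PlantRegMap tool: the function names are hypothetical, the `promoter` string is invented test data, and exact string matching ignores any sequence-context effects on real ARF binding.

```python
AUXRE = "TGTCTC"  # motif quoted in the functional description


def reverse_complement(dna: str) -> str:
    """Reverse complement of an uppercase A/C/G/T string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_auxre(dna: str, motif: str = AUXRE):
    """Return sorted (1-based start, strand) pairs for exact motif matches on both strands."""
    dna = dna.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = dna.find(pattern)
        while start != -1:
            hits.append((start + 1, strand))
            start = dna.find(pattern, start + 1)  # allow overlapping matches
    return sorted(hits)


promoter = "AATGTCTCGGGAGACATT"  # hypothetical sequence, for illustration only
print(find_auxre(promoter))  # [(3, '+'), (11, '-')]
```

A hit on the "-" strand is reported at the position where the reverse complement (GAGACA) begins on the given strand.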
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_007024962.1  0.0      PREDICTED: auxin response factor 6 isoform X2
Swissprot  A2X1A1          0.0      ARFF_ORYSI; Auxin response factor 6
Swissprot  Q6H6V4          0.0      ARFF_ORYSJ; Auxin response factor 6
TrEMBL     A0A061GKH9      0.0      A0A061GKH9_THECC; Auxin response factor
STRING     EOY27585        0.0      (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G30330.1  0.0      auxin response factor 6
Publications
  1. Kikuchi S, et al.
    Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice.
    Science, 2003. 301(5631): p. 376-9
    [PMID:12869764]
  2. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]