PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG000516t1
Common NameTCM_000516
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family B3
Protein Properties Length: 568aa    MW: 64464.3 Da    PI: 9.9003
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG000516t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B356.55.1e-18211131100
                       EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..E CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelv 93 
                       ffk+ + + +l+ g l +p kf++++g+   ++    l+ ++g++W+v+l  +k++g+++l++GW+eF +   L+ g f+vF+++g+ +f  +
  Thecc1EG000516t1  21 FFKIIL-PETLRDGKLGIPTKFVKKYGNGMSSP--ALLKVPNGEVWKVEL--TKSDGKVWLKNGWQEFLNHYSLEYGHFLVFRYEGNCNF--H 106
                       556554.45677788***********7665555..89999**********..**********************************9999..9 PP

                       EEEE-SS CS
                B3  94 vkvfrks 100
                       v +f++s
  Thecc1EG000516t1 107 VIIFDRS 113
                       9999985 PP

2B363.53.2e-20239334196
                       EEEE-..-HHHHT..T-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTE..EEE-TTHHHHHHHHT--TT-EEEEEE-SSSE CS
                B3   1 ffkvltpsdvlks..grlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgr..yvltkGWkeFvkangLkegDfvvFkldgrse 89 
                       f+ v+ ps+v+ +  +rl +p++f+++h +k+ +   + l +++g++W+v +  r k ++    l++GWk F+++n++++gD++vF+l++  +
  Thecc1EG000516t1 239 FLLVMQPSYVGFKstCRLAIPNNFVRKHLMKEDC--VVNLCNSNGKTWTVSFHCREKERKlnASLQSGWKTFANDNNIQVGDVCVFELTN--C 327
                       677889999987777*************877656..7***************77666666889************************854..4 PP

                       E..EEEE CS
                B3  90 felvvkv 96 
                        e  +kv
  Thecc1EG000516t1 328 IEISFKV 334
                       4455554 PP

3B361.31.6e-19386482199
                       EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTE...EEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE CS
                B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgr...yvltkGWkeFvkangLkegDfvvFkldgrsef 90 
                       f  vl ps+v+ ++ l++p kfa+++ +kk  + +++l+ ++g+sW vk+   ++s r     l +GW++Fv +n+L++g ++vF+l++  e+
  Thecc1EG000516t1 386 FAVVLQPSYVHLNK-LSVPEKFARKFFKKK--HNEVILRLSNGKSWPVKY--YQHSIRtpsAKLCNGWRKFVLDNKLEVGNVCVFELTEGIET 473
                       56677777777776.***********8674..458***************..33333334588999********************9987666 PP

                       ..EEEEE-S CS
                B3  91 elvvkvfrk 99 
                       +++v+++rk
  Thecc1EG000516t1 474 SFKVTIYRK 482
                       689999987 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:2.40.330.108.5E-2714113IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019363.92E-2615113IPR015300DNA-binding pseudobarrel domain
CDDcd100172.53E-2119111No hitNo description
PROSITE profilePS5086313.95820113IPR003340B3 DNA binding domain
SMARTSM010195.3E-2121113IPR003340B3 DNA binding domain
PfamPF023622.5E-1522112IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.101.4E-25231338IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.18E-24232338IPR015300DNA-binding pseudobarrel domain
CDDcd100179.36E-23237337No hitNo description
PfamPF023623.7E-18239333IPR003340B3 DNA binding domain
SMARTSM010193.3E-11239340IPR003340B3 DNA binding domain
PROSITE profilePS5086312.985241340IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.102.2E-28377482IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.43E-25379482IPR015300DNA-binding pseudobarrel domain
CDDcd100174.92E-17384481No hitNo description
PROSITE profilePS5086314.677385483IPR003340B3 DNA binding domain
SMARTSM010191.2E-15386483IPR003340B3 DNA binding domain
PfamPF023626.5E-18387482IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 568 aa     Download sequence    Send to blast
MASSHRKGND HSTFTTESPH FFKIILPETL RDGKLGIPTK FVKKYGNGMS SPALLKVPNG  60
EVWKVELTKS DGKVWLKNGW QEFLNHYSLE YGHFLVFRYE GNCNFHVIIF DRSASEIEYP  120
YTSNNHGQHK ELPEEKIEES EGDNSIQILE DIAPSRKTRE KSHLSCLRPH KMMRSTNSAN  180
KTESNLKSES LFPQFRHDGS PARKGDKSTS RHRIQKLKAD NKAKALQRAR AFKSENPFFL  240
LVMQPSYVGF KSTCRLAIPN NFVRKHLMKE DCVVNLCNSN GKTWTVSFHC REKERKLNAS  300
LQSGWKTFAN DNNIQVGDVC VFELTNCIEI SFKVCIYQGK TDVFHPVKRN AGKSSTGQDY  360
QKPLNAFEKA KAIQIASAFR SENPSFAVVL QPSYVHLNKL SVPEKFARKF FKKKHNEVIL  420
RLSNGKSWPV KYYQHSIRTP SAKLCNGWRK FVLDNKLEVG NVCVFELTEG IETSFKVTIY  480
RKQAIEDANL GSSLADKSTE NQVESKVSLV INVDSDSVHD NGNMNQEAQN LNFSGLRPFI  540
VSQVLTGLKL TLNEVKNGSS SKFEEMV*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4i1k_A9e-1536148124141B3 domain-containing transcription factor VRN1
4i1k_B9e-1536148124141B3 domain-containing transcription factor VRN1
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017972198.10.0PREDICTED: B3 domain-containing transcription factor VRN1
TrEMBLA0A061DG540.0A0A061DG54_THECC; AP2/B3-like transcriptional factor family protein, putative
STRINGEOX912700.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM14219723
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18990.11e-46B3 family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]