PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | |||||||||
---|---|---|---|---|---|---|---|---|---|
TF ID | Thecc1EG021168t2 | ||||||||
Common Name | TCM_021168 | ||||||||
Organism | Theobroma cacao | ||||||||
Taxonomic ID | |||||||||
Taxonomic Lineage | cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma | ||||||||
Family | MYB_related | ||||||||
Protein Properties | Length: 1187 aa; MW: 132274 Da; pI: 9.1989 | ||||||||
Description | MYB_related family protein | ||||||||
Gene Model |
|
Signature Domain | |||||||
---|---|---|---|---|---|---|---|
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End |
1 | Myb_DNA-binding | 32.8 | 1.6e-10 | 47 | 82 | 3 | 40 |
SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHH CS
Myb_DNA-binding 3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqc 40
WT+eE e++++a +++G++ Wk++a + +R+ +++
Thecc1EG021168t2 47 QWTKEELERFYEAYRKYGKD-WKKVATVVR-NRSVEMV 82
6*****************99.*********.**98876 PP |
Protein Features | ||||||
---|---|---|---|---|---|---|
Database | Entry ID | E-value | Start | End | InterPro ID | Description |
PROSITE profile | PS51293 | 10.496 | 43 | 96 | IPR017884 | SANT domain |
SuperFamily | SSF46689 | 5.91E-11 | 44 | 91 | IPR009057 | Homeodomain-like |
SMART | SM00717 | 5.9E-6 | 44 | 92 | IPR001005 | SANT/Myb domain |
Gene3D | G3DSA:1.10.10.60 | 2.3E-5 | 47 | 81 | IPR009057 | Homeodomain-like |
Pfam | PF00249 | 4.5E-9 | 47 | 82 | IPR001005 | SANT/Myb domain |
CDD | cd00167 | 1.84E-7 | 48 | 82 | No hit | No description |
Pfam | PF06584 | 6.1E-32 | 642 | 742 | IPR033471 | DIRP domain |
SMART | SM01135 | 2.3E-55 | 642 | 743 | IPR033471 | DIRP domain |
Gene Ontology | ||||||
---|---|---|---|---|---|---|
GO Term | GO Category | GO Description | ||||
GO:0005730 | Cellular Component | nucleolus | ||||
GO:0016592 | Cellular Component | mediator complex | ||||
GO:0003677 | Molecular Function | DNA binding |
Sequence |
---|
Protein Sequence Length: 1187 aa |
MAPSRKSKSV NKKFSYVNEV ASSKDGDSSA KRSGQRKRKL SDMLGPQWTK EELERFYEAY 60
RKYGKDWKKV ATVVRNRSVE MVEALYTMNR AYLSLPEGTA SVVGLIAMMT DHYCVMGGSD 120
SEQESNEGVG ASRKPQKRSR GKLRDQPSKS LDKSFPDLLQ FHSAASSYGC LSLLKRRRSE 180
SRPRAVGKRT PRVPISFSHD KNKGERYFSP IRQGMKLKVD TVDDDVAHEI ALVLTEASQR 240
GGSPQVSRTP NRKAEASSPI LNSERMNAES ETTSAKIHGS EMDEDACELS LGSTEADNAD 300
YARGKNYSMN IEGTGTIEVQ QKGKRYYRRK PGVEESVNNH LEDTKEACSG TEEDQKLCDF 360
KGKFEAEVAD TKPSRGSIKG LRKRSKKVLF GRVEDTSFDA LQTLADLSLM MPETAADTES 420
SVQFKEEKNE VVEKTKLKGN HPVSGAKGTA PKTCKQGKVF GHDVRAIPEA KEETHPGNVG 480
MRKRRQKSSP YKLQIPKDET DADSHLGESR NIEALDEVKN FPSKGKRSNN VAHSKQGKSV 540
RPPEHRSSST DHGRDLNNST PSTIQVSPVN QVNLPTKVRS KRKIDAQKQV IGKDIKSSDG 600
IVKGKFSVPV SLFHDRALNL KEKLCNFLCP YQARRWCTFE WFCSTIDYPW FAKREFVEYL 660
DHVGLGHVPR LTRVEWGVIR SSLGKPRRFS EQFLKEEREK LYQYRESVRT HYAELRAGIG 720
EGLPTDLARP LSVGQRVIAI HPKTREIHDG NVLIVDHSRY RIQFDSTELG VESVMDIDCM 780
ALNPLENLPA SLVRQNAAVR KFFENYNELK MNGQPKESKM EENIKFAPCE ENANSPSRTS 840
PSTFSVGNLS QPVKVDPSSP NLQLKVGPME TVYTQQAVNS QLSALALIQA READVEALSQ 900
LTRALDKKHL QEAVVSELRR MNDEVLENQK GGDNSIKDSD SFKKQYAAVL LQLNEVNEQV 960
SSALFSLRQR NTYQGTSSVR LLKPLAKIGE HGCQLSSFDH SMHHAQESVS HVAEIVESSR 1020
TKARSMVDAA MQAMSSLRKG GKSIERIEDA IDFVNNQLSV DDLSVPAPRS SIPIDSAHST 1080
VTFHDHLTAF VSNPLATGHA PDTKLQNSSD QDDLRIPSDL IVHCVATLLM IQKCTERQFP 1140
PGDVAQVLDS AVTSLKPCCS QNLSIYAEIQ KCMGIIRNQI LALVPT* |
Nuclear Localization Signal | |||
---|---|---|---|
No. | Start | End | Sequence |
1 | 480 | 485 | MRKRRQ |
Annotation -- Protein | |||||||
---|---|---|---|---|---|---|---|
Source | Hit ID | E-value | Description | ||||
Refseq | XP_007035526.2 | 0.0 | PREDICTED: protein ALWAYS EARLY 3 isoform X1 | ||||
Swissprot | Q6A332 | 0.0 | ALY3_ARATH; Protein ALWAYS EARLY 3 | ||||
TrEMBL | A0A061EQ26 | 0.0 | A0A061EQ26_THECC; Always early, putative isoform 2 | ||||
STRING | EOY06452 | 0.0 | (Theobroma cacao) |
Orthologous Group | |||
---|---|---|---|
Lineage | Orthologous Group ID | Taxa Number | Gene Number |
Malvids | OGEM12714 | 24 | 27 |
Best hit in Arabidopsis thaliana | ||||||
---|---|---|---|---|---|---|
Hit ID | E-value | Description | ||||
AT3G21430.2 | 0.0 | DNA binding |
Link Out | |
---|---|
Phytozome | Thecc1EG021168t2 |
Entrez Gene | 18603464 |