PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gh_A13G0225
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family MYB
Protein Properties Length: 1587 aa    MW: 173591 Da    pI: 6.3619
Description MYB family protein
Gene Model
Gene Model ID  Type    Source   Coding Sequence
Gh_A13G0225    genome  NAU-NBI  View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End   HMM Start  HMM End
1    Myb_DNA-binding  27.1   9.9e-09  809    850   3          46
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +WT eE e++ d  + +G++ +++Ia  +  ++t  +c+++++k
      Gh_A13G0225 809 PWTSEEKEIFMDKLAAFGKD-FRKIATFLD-HKTTADCVEFYYK 850
                      8*****************99.*********.***********98 PP

2    Myb_DNA-binding  30.3   9.5e-10  1028   1067  4          45
                       S-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
  Myb_DNA-binding    4 WTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                       WT eE   +++av  +G++ ++ I+r++  +R++ qck ++ 
      Gh_A13G0225 1028 WTDEEKSVFIQAVSSYGKD-FAMISRCVR-TRSRDQCKVFFS 1067
                       *****************99.*********.********8776 PP
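The Start and End values in the domain table appear to be 1-based and inclusive, which can be checked against the alignments above. The following is a minimal sketch, not part of PlantTFDB: the helper name `slice_region` and its `first_pos` parameter are hypothetical, and the fragment is residues 781-850 copied from the protein sequence of this record.

```python
def slice_region(seq: str, start: int, end: int, first_pos: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    first_pos is the sequence position of seq[0], so a fragment that
    does not begin at residue 1 can still be addressed by the
    coordinates used in the domain table.
    """
    last_pos = first_pos + len(seq) - 1
    if not (first_pos <= start <= end <= last_pos):
        raise ValueError("region falls outside the supplied sequence")
    return seq[start - first_pos : end - first_pos + 1]


# Residues 781-850 of Gh_A13G0225 (assumption: copied by hand from this record).
fragment = (
    "QEQLSRFISSNGLVEDPCAIEKERALINPW"
    "TSEEKEIFMDKLAAFGKDFRKIATFLDHKT"
    "TADCVEFYYK"
)

# Domain 1 is reported at positions 809-850.
domain1 = slice_region(fragment, 809, 850, first_pos=781)

# The aligned query line for domain 1, with gap characters removed,
# should equal the sliced region.
aligned_query = "PWTSEEKEIFMDKLAAFGKD-FRKIATFLD-HKTTADCVEFYYK"
assert domain1 == aligned_query.replace("-", "")
print(domain1, len(domain1))
```

The same helper applies to domain 2 (positions 1028-1067) when given the full 1587-residue sequence and the default `first_pos=1`.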

Protein Features
Database         Entry ID          E-value   Start  End   InterPro ID  Description
SuperFamily      SSF46689          1.05E-13  793    854   IPR009057    Homeodomain-like
PROSITE profile  PS51293           14.53     805    856   IPR017884    SANT domain
SMART            SM00717           1.6E-7    806    854   IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  3.1E-5    808    854   IPR009057    Homeodomain-like
Pfam             PF00249           8.2E-6    808    850   IPR001005    SANT/Myb domain
PROSITE profile  PS51293           11.174    1023   1074  IPR017884    SANT domain
SMART            SM00717           1.0E-7    1024   1072  IPR001005    SANT/Myb domain
SuperFamily      SSF46689          3.04E-10  1026   1074  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  5.4E-6    1027   1068  IPR009057    Homeodomain-like
Pfam             PF00249           3.6E-7    1028   1067  IPR001005    SANT/Myb domain
CDD              cd00167           7.59E-7   1028   1066  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1587 aa
MPQEPLPWDR KDIYKDRKHE RAELQPPPLS AARWREASSM SSYQHGSFRE FARWGSADFR  60
RPPGHGKQGN WHLFPEDIGG HGYVPWRSSD KILDGETYRQ SVSRGDGKYG RSYSRDNNRG  120
SYNQRDWRGH SLETSNGSPN TSVRPHDVNN EQRSVDDMFT YPSRTHSDFV NTWNQLQKDQ  180
HDNRTCGVNG LGTGQRCERE NSLGSVDWKP LKWSRSGSLS SWGSGFSHSS SSKSLGGVDS  240
GEAKLELHQK NLAPVQSPSG DAAACVTSAP PSDETTSRKK PRLGWGEGLA KFEKKKDEGP  300
DTSINSGGAA ISLCNTEPNT SLNSNLVDKS PRVLGFSDCS SPATPSSVAC SSSPGVEEKS  360
FGKAANIDND VNNLCGSPSF GSQNQLEGSS FSLEKLDINS IINMGSSLID LLQSDEPSTM  420
DSSFVQSTAI NKLLLWKGDI LKALEMTESE IDSLETELKS SKDDPGRRCQ CPATSSSFPV  480
QENGKSCEEQ EAASSMIPQP ASLKIDPSND VLEVLQEANA DIKDGVIDSP GTATSDFMVS  540
SSLEKAESLC DVVKVQDCSG NSSSAQLTTM EQVILATDSC NEEAAAVVSG EGSVLVKIDN  600
EAHVPESSNS DAAGENMTCD VILTTNKELA NRASLVFKKL LPKDQYSIEI SEISNAVWGQ  660
ISSLIREKIA MRKRHLRFKE RVLTLKFKAF QYAWKEDMLS PAMRKYWAKS QKKYELSLRS  720
TYGGYQKHRS SYRSRVTSSA GNLVLEPTAE MINFTSKLLL DSHVKLYRNA LKMPALILDE  780
QEQLSRFISS NGLVEDPCAI EKERALINPW TSEEKEIFMD KLAAFGKDFR KIATFLDHKT  840
TADCVEFYYK NHKSECFKKT KKKLDLTKQG KSSANTYLLT SGKKWSKEFN AASIDVLGAA  900
SVIATHAESG MQKHQTSSSR IFFGGHYSKI SRADDRIANR SSSFDIIGND RETTAADVLA  960
GICGSLSSEA MSSCITSSVD PGESFHRDWK CHKVDSLFKR RSTSNVAQNV DDGTCSDESC  1020
GEMDPADWTD EEKSVFIQAV SSYGKDFAMI SRCVRTRSRD QCKVFFSKAR KCLGLDLIDP  1080
RTRNLGTPMS DDANGGGSDT ENACVLECLV VSSDKLGSKP EDLPSNIVCT NMDERNPTSK  1140
IILPTDLNVP DENDRKLVDH RDSEAVQTVD SDAGRAELIT ECSVDMNIDS KAGSLQVQKS  1200
VVALGNLNAG RDVTEQGVSV AVSAFLGAAA YPCIPSLDSV AESKPATSLY EHDTKCSAET  1260
SSQSICRMDS NKATDESVGK NSCSSFSLSA KGLHQIPPDL DSAKKPSVSN NSSANGSALH  1320
DSDALRCEKI CNLDRLSSTL DYEENETKQA QKSVREDESG RLSGKTSVNI TEPRQILRGY  1380
PLQVSTLKEM NGDVKGLATS KTGSAGPCLA QECYLQKCNS SKSAAELPLL VENLEQAKDR  1440
LKSHSRISDT ENPGRNGNVK LFGQILNSSS RDDKVSHFSK QNTKPSNLKL IGNNVDGNSK  1500
FDANNHVAEN VPKRSYGFWD GKRIQTGLSS LPDSSILMAK YPSAFANYPP TSSSQMEQQA  1560
LQTVVHSTDR TLNGVSFPLK GNKQQQR
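The sequence above is laid out in blocks of 10 residues, with a running position counter at the end of each full line. The following is a minimal sketch of turning that layout back into a plain string. The function name `parse_blocked_sequence` is hypothetical, and the sample holds only the first two lines of the record.

```python
def parse_blocked_sequence(text: str) -> str:
    """Join 10-residue blocks into one string.

    Whitespace is dropped, and purely numeric tokens (the running
    position counter at the end of each line) are ignored.
    """
    residues = []
    for line in text.splitlines():
        # Keep alphabetic tokens only; "60", "120", ... are counters.
        residues.extend(tok for tok in line.split() if tok.isalpha())
    return "".join(residues)


# First two lines of the Gh_A13G0225 protein sequence as shown in this record.
sample = """\
MPQEPLPWDR KDIYKDRKHE RAELQPPPLS AARWREASSM SSYQHGSFRE FARWGSADFR  60
RPPGHGKQGN WHLFPEDIGG HGYVPWRSSD KILDGETYRQ SVSRGDGKYG RSYSRDNNRG  120
"""

seq = parse_blocked_sequence(sample)
print(len(seq))  # 120
```

Applied to all lines of the record, the result can be compared with the stated length of 1587 aa as a quick integrity check.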
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
4a69_C  2e-17   768          858        4          94       NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D  2e-17   768          858        4          94       NUCLEAR RECEPTOR COREPRESSOR 2
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  JX578805  0.0      JX578805.1 Gossypium hirsutum clone NBRI_GE10901 microsatellite sequence.
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_016697564.1      0.0      PREDICTED: uncharacterized protein LOC107913477 isoform X1
Refseq  XP_016697565.1      0.0      PREDICTED: uncharacterized protein LOC107913477 isoform X1
TrEMBL  A0A1U8K4X3          0.0      A0A1U8K4X3_GOSHI; uncharacterized protein LOC107913477 isoform X1
STRING  Gorai.013G026300.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM5260              27           44
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G52250.1  0.0      MYB family protein