PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Ahy022820
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Dalbergieae; Arachis
Family MYB
Protein Properties Length: 1061 aa    MW: 116788 Da    pI: 7.1
Description MYB family protein
Gene Model
Gene Model ID           Type      Source     Coding Sequence
gnl|UG|Ahy#S59551041    PU_ref    Unigene    View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End   HMM Start  HMM End
1    Myb_DNA-binding  27.2   8.8e-09  799    840   3          46
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +WT eE e++++ ++ +G++ +++Ia+ +  ++t  +c+++++k
        Ahy022820 799 PWTSEEREIFLEKFAVFGKD-FRKIASFLH-HKTTADCVEFYYK 840
                      8*****************99.*********.***********98 PP

2    Myb_DNA-binding  29     2.5e-09  994    1035  3          46
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46  
                        WT +E   +++av  +G + + + ar++g +R+++qck ++ k
        Ahy022820  994 DWTDDEKAAFIQAVSSFGRD-FVKLARCIG-TRSPEQCKVFFSK 1035
                       5*****************87.*********.********88765 PP
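
The Start and End columns above are 1-based, inclusive protein coordinates, which is easy to get wrong when slicing in a 0-based language. The following is a minimal sketch, not part of the PlantTFDB record, of how a domain region could be cut out of a sequence; the helper name `extract_region` and its `first_pos` parameter are hypothetical, and the fragment is residues 781-840 copied from the Sequence section of this entry.

```python
def extract_region(sequence: str, start: int, end: int, first_pos: int = 1) -> str:
    """Return residues start..end (1-based, inclusive) from `sequence`.

    `first_pos` is the protein coordinate of sequence[0], so a partial
    fragment can be sliced using full-length coordinates.
    (Hypothetical helper for illustration only.)
    """
    last_pos = first_pos + len(sequence) - 1
    if not (first_pos <= start <= end <= last_pos):
        raise ValueError("region lies outside the supplied sequence")
    # Convert 1-based inclusive coordinates to a 0-based half-open slice.
    return sequence[start - first_pos : end - first_pos + 1]


# Residues 781-840 of Ahy022820, with the grouping spaces removed.
fragment = "NGLVEDPLAIEKEKALINPWTSEEREIFLEKFAVFGKDFRKIASFLHHKTTADCVEFYYK"

# First Myb_DNA-binding hit: Start 799, End 840.
domain_1 = extract_region(fragment, 799, 840, first_pos=781)
print(domain_1)
```

The result should match the Ahy022820 row of the first alignment once the gap characters (`-`) are removed.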

Protein Features
Database         Entry ID            E-value    Start  End   InterPro ID  Description
SuperFamily      SSF46689            2.92E-14   783    843   IPR009057    Homeodomain-like
PROSITE profile  PS51293             15.926     795    846   IPR017884    SANT domain
SMART            SM00717             1.3E-8     796    844   IPR001005    SANT/Myb domain
Pfam             PF00249             2.2E-6     798    840   IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60    9.2E-6     798    843   IPR009057    Homeodomain-like
PROSITE profile  PS51293             11.343     990    1041  IPR017884    SANT domain
SMART            SM00717             8.3E-9     991    1039  IPR001005    SANT/Myb domain
SuperFamily      SSF46689            3.2E-9     992    1041  IPR009057    Homeodomain-like
Pfam             PF00249             3.2E-7     994    1034  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60    2.1E-5     994    1035  IPR009057    Homeodomain-like
CDD              cd00167             1.17E-7    995    1033  No hit       No description
Gene Ontology
GO Term       GO Category           GO Description
GO:0003677    Molecular Function    DNA binding
Sequence
Protein Sequence    Length: 1061 aa
MPPEPLPWDR KDFFKERKHE RSESLGSVAR WRDSSHHRDF HRWGSAEFRR PPGHGKQGGW  60
HVFSEDSGHG YGISRSSSEK MLDEDCRPSV SRGDGKYGRG SRENRGPFGQ RDWRGQSWET  120
TNGSMNLPRR PPDVNNDHRS VDDNLTYSAH PHSDFVNTWD PHHLKDQHDK IGGANGFGTG  180
ARSDRENFLA SIDWKPLKWT RSGSLSSRGS GFSHSSSSRS AGGADSHEAK AELHPKNATV  240
NESHSGEAAV CVTSSAPCED TTSRKKPRLN WGEGLAKYEK KKVEVPDGSA NKDGPVLSNG  300
SIEPCAFPGS SLVDKSPKVT GFSDCACASP ATPSSVACSS SPGVDDKLFG KPANVDNDVS  360
NLTCSPVPGS QDHFQRFSFN LEKLDIESLN SLNSSIIELI QSDDTSYVNS GPMRSTAMNK  420
LLIWKADISK VLETTESEID SLENELKSLR SASGDRGSYP AVLGSQMVGN NENPFEVPVG  480
VSDEVTRPEP LKILSSDDPD AEKLPLSTNL NSIHENGKEE DIDSPGSATS KLSEPPPLVK  540
AVSSSDTRRY DTFLEDANAG QSNGMKCLIP CTTRKYPSNS ACSDVNAASE VPDSIITASG  600
ASLRSSTEDS LYKKIISSNK ELAKSACGVF AKLLPQGYTK IDKVGASSDL CSQTSIMEKF  660
AEKKQFARFK ERVITLKFKA LHHLWKEDMR LLSIKKCRPK YHKKHELSVR STFNGNQKNR  720
FSIRSRFPLP AGNHLSLVPT AEVINFTRKL LSEPQVKIHR DALKMPALVL DEKIPKFISS  780
NGLVEDPLAI EKEKALINPW TSEEREIFLE KFAVFGKDFR KIASFLHHKT TADCVEFYYK  840
NHKSDCFEKL KKQQKLGKSF LAKTDLVASG KKWNHEANTA SLDILSAASV MADGFACNKK  900
MRPGNFLMGG YVNVKASRVD DSIRERSSSF DILGDEREAF ADVMASSEAM SFCGTSSVEP  960
VEGSRDSRLM PDTAENVDDE TCSDESCGEM DPTDWTDDEK AAFIQAVSSF GRDFVKLARC  1020
IGTRSPEQCK VFFSKARKCL GLDLMRPMPE NVGSPANDGA X
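
The sequence above is laid out in groups of 10 residues with a running position counter at the end of each full line. As a rough sketch, assuming the block has been copied as plain text, it could be collapsed into one contiguous string as shown below; the function name `parse_numbered_block` is hypothetical and only the first two lines of the record are used as input.

```python
import re


def parse_numbered_block(text: str) -> str:
    """Collapse a grouped, position-numbered sequence block into one string.

    Everything that is not a letter (spaces, newlines, position counters)
    is dropped. (Hypothetical helper for illustration only.)
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()


# First two lines of the Ahy022820 protein sequence as displayed above.
block = """
MPPEPLPWDR KDFFKERKHE RSESLGSVAR WRDSSHHRDF HRWGSAEFRR PPGHGKQGGW  60
HVFSEDSGHG YGISRSSSEK MLDEDCRPSV SRGDGKYGRG SRENRGPFGQ RDWRGQSWET  120
"""

seq = parse_numbered_block(block)
print(len(seq))  # residue count after stripping counters and spaces
```

Applied to the full block, the resulting length can be compared with the stated 1061 aa as a sanity check. Note that the record ends in `X` (an unspecified residue), which this sketch keeps as part of the sequence.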
3D Structure
Structure
PDB ID    E-value    Query Start    Query End    Hit Start    Hit End    Description
4a69_C    1e-15      771            848          17           94         NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D    1e-15      771            848          17           94         NUCLEAR RECEPTOR COREPRESSOR 2
Annotation -- Protein
Source    Hit ID             E-value    Description
Refseq    XP_020966324.1     0.0        uncharacterized protein LOC107617278 isoform X2
Refseq    XP_025675909.1     0.0        uncharacterized protein LOC112776118 isoform X2
TrEMBL    A0A444XTP9         0.0        A0A444XTP9_ARAHY; Uncharacterized protein
STRING    GLYMA20G31871.1    0.0        (Glycine max)
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT3G52250.1    1e-170     MYB family protein