PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_A10G1853
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family MYB
Protein Properties Length: 1723aa    MW: 188761 Da    PI: 6.0062
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_A10G1853genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding23.61.2e-07816857346
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +WT +E e++ d  + +G++ ++++a+ +  ++t  +c+++++k
      Gh_A10G1853 816 PWTSQEKEIFMDKLAAFGKD-FRKVASFLD-HKTTADCVEFYYK 857
                      8*****************99.*********.***********98 PP

2Myb_DNA-binding33.21.2e-1010341074345
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                        WT eE   +++av  +G++ ++ I+r++g +R++ qck ++ 
      Gh_A10G1853 1034 HWTDEEKSAFLQAVSSYGKD-FDMISRYVG-TRSRDQCKVFFS 1074
                       6*****************99.*********.********8776 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466896.13E-13800861IPR009057Homeodomain-like
PROSITE profilePS5129314.74812863IPR017884SANT domain
SMARTSM007179.9E-7813861IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.602.7E-4816857IPR009057Homeodomain-like
PROSITE profilePS5129312.74410301081IPR017884SANT domain
SMARTSM007174.2E-910311079IPR001005SANT/Myb domain
SuperFamilySSF466895.38E-1110321081IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.602.0E-610331075IPR009057Homeodomain-like
PfamPF002491.4E-810341074IPR001005SANT/Myb domain
CDDcd001677.92E-810351073No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1723 aa     Download sequence    Send to blast
MPPEPLPWDR KDFYKERKHE RTQSLPQQPL TARWRESSSM SPYQHASFRE FTRWGSADFR  60
RPPGHGRQGS WHLFAEENGG HGYVPSRSSN KILDDENFRQ LDSRVDGKYS RNSRENNRGS  120
YSQRDWRGHS WENCNGSPST PGRPHHVNNE RRSVDDMPTY LSHTHSDFVN TWDQLQKSQL  180
DNKTIAVNGL GTGQKCQSEN LVGSIDWKPL KWTRSGSLSS RGSGFSHSSS SKSLGGVDSG  240
EGKLESQQKN LTPVQSPSGD AAACVTSPAP SDETSSRKKP RLAWGEGLAK YEKKKVEGPD  300
TSIDRAGAKI SVRNTEFNNF LSSNLADKSP RVLGFSDCAS PATPSSVACS SSPGVEEKSF  360
GKAANVDNDT SNLCGSPTLG SQNHLEGPSF NLEKLDINSI INMGSSLTNL LQADDPCTVD  420
SSFVRSTAIS KLLLWKSDVL KALEMTESEI DSLENELKLL KGDSRSRCPC PATSSSFPVE  480
EHGKACGEQE AASSQIPRRA PLQIDACGDV LVEKQPLCNG VLEEVNDDVK DGDIDSPGTA  540
TSKFMEPLSL EKAVSPSDVV KFHECSGDFG TVQLMSMGKV ILATGSGNEG TATTISAEGS  600
VLKRIDNDAH VPESSNSDVG GENVMYEMIL ATNKELANVA SEVFNKLLPK DQYNAEISEI  660
GNVACTQSDS AIREKIAIRK QYIRFKERVL TIKFKAFQNA WKEDLRSPLM RKYRAKSQKK  720
YEFSLRSTHG GYQKHRSSIH SRFTFPAGNP ILEPSVEMNF TSKLLLGSHG RLYRNALKMP  780
ALILDEKEKK VSRFISSNGL VEDPCAIEKE RALINPWTSQ EKEIFMDKLA AFGKDFRKVA  840
SFLDHKTTAD CVEFYYKNHK SECFEKTKKN DLSKQQGKSA VNTYLLTSGK KRGRELNAAS  900
LDVLGAASVI AAHAESGMRN RHTSGRILLR GRFDSKRSQL DDSIAERSSN FDIVGSDQDT  960
VAADVLAGIC GSFSSEAMSS CITSSADPGE GYHHDWKFHK VDSVVKRPST SDVLQNVDGD  1020
TCSDESCGEM DSSHWTDEEK SAFLQAVSSY GKDFDMISRY VGTRSRDQCK VFFSKARKCL  1080
GLDLIHSRTR NMGTPMSDDA NGGETDTEDA CVQESSVVCS EKLGSKVEED LPSTIVSMNV  1140
DESDLTREAN LQSDHNISEG NIERLVDHKD SVAAEVNFSN VDQTEPMSEC GAGDMDVDSN  1200
QAESLHVLNN VALANLSALE NHVAEEGVSG AVSATHRGTG DCHPSLDASV EPKSGAAALS  1260
TEGFGNNLEA QETFSSKNVM DVRDTRCNAE IGSQVICRPD LDKSSGESID KNSCLDFSFS  1320
SEGLRQVPLD LGSAGKPSIL LFPNENFSAK NSASHSDASQ CEKICNQDRL SVTLAYQGNE  1380
DKQPNNAVSG HEPEHLSGKP SVDLAELQIS TLKEMDIDIG HCQLPEVKRL STSEKGVTGS  1440
YLVQDFLQKC NGPKSPSEFP QLVQNLEQAN SRPKFHSHSL SDTEKPCRNG NVKLFGQILN  1500
SSSQDDGKVR FPEQSMKSSN LNFRGYNNVD GNASFSKFDQ NIIFAPENVP RRSYGFWDGN  1560
RIQTGLSSLP DSEILVAKYP AAFVNYPASS SQMQLQASQS IVRNTDRNMN GVSVFTPREI  1620
SSNNGVMDYQ VYGGHDCTKV VPFAMDMKRR EMFSEMQRRN GFDAISNLQH QGRGMVGMNV  1680
VGTGVGGVVG GSCPNLSDPV AVLRMQYAKT EQYGGQSGSI MRE
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C1e-16774865494NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D1e-16774865494NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Ghi.204230.0boll
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016722048.10.0PREDICTED: uncharacterized protein LOC107934192
RefseqXP_016722049.10.0PREDICTED: uncharacterized protein LOC107934192
TrEMBLA0A1U8M9260.0A0A1U8M926_GOSHI; uncharacterized protein LOC107934192
STRINGGorai.011G240800.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM52602744
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.10.0MYB family protein