PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gh_D10G2114
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family MYB
Protein Properties Length: 1721 aa    MW: 188016 Da    pI: 6.2099
Description MYB family protein
Gene Model
Gene Model ID  Type    Source   Coding Sequence
Gh_D10G2114    genome  NAU-NBI  View CDS
Signature Domain
Signature Domain
No.  Domain           Score  E-value  Start  End   HMM Start  HMM End
1    Myb_DNA-binding  23.1   1.7e-07  814    855   3          46
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +WT +E e++    + +G++ +++Ia+ +  ++t  +c+++++k
      Gh_D10G2114 814 PWTSQEKEIFMAKLAAFGKD-FRKIASFLD-HKTTADCVEFYYK 855
                      8*****************99.*********.***********98 PP

2    Myb_DNA-binding  31.7   3.6e-10  1032   1072  3          45
                       SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                        WT eE   +++av  +G++ ++ I+r++g +R++ qck ++ 
      Gh_D10G2114 1032 HWTDEEKSAFLQAVLSYGKD-FDMISRYVG-TRSRDQCKVFFS 1072
                       6*****************99.*********.********8776 PP
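
The Start and End coordinates in the domain table can be cross-checked against the alignment rows: the number of non-gap residues in each Gh_D10G2114 row should equal End - Start + 1. The following is a minimal Python sketch of that check; the function names `ungapped_length` and `span_matches` are illustrative and not part of PlantTFDB or any library.

```python
def ungapped_length(aligned: str) -> int:
    """Count residues in an aligned row, ignoring gap characters ('-' and '.')."""
    return sum(1 for ch in aligned if ch not in "-.")


def span_matches(aligned: str, start: int, end: int) -> bool:
    """True if the ungapped row length equals the 1-based inclusive span."""
    return ungapped_length(aligned) == end - start + 1


# Query rows and coordinates copied from the two alignments above.
hits = [
    ("PWTSQEKEIFMAKLAAFGKD-FRKIASFLD-HKTTADCVEFYYK", 814, 855),
    ("HWTDEEKSAFLQAVLSYGKD-FDMISRYVG-TRSRDQCKVFFS", 1032, 1072),
]

for row, start, end in hits:
    print(start, end, ungapped_length(row), span_matches(row, start, end))
```

Both rows pass the check: the first covers 42 residues and the second 41.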

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End   InterPro ID  Description
SuperFamily      SSF46689          2.42E-13  798    859   IPR009057    Homeodomain-like
PROSITE profile  PS51293           14.387    810    861   IPR017884    SANT domain
SMART            SM00717           4.0E-7    811    859   IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  6.9E-5    814    855   IPR009057    Homeodomain-like
PROSITE profile  PS51293           13.272    1028   1079  IPR017884    SANT domain
SMART            SM00717           3.0E-8    1029   1077  IPR001005    SANT/Myb domain
SuperFamily      SSF46689          4.79E-11  1030   1079  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  5.0E-6    1030   1073  IPR009057    Homeodomain-like
Pfam             PF00249           5.5E-8    1032   1072  IPR001005    SANT/Myb domain
CDD              cd00167           3.95E-7   1033   1071  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1721 aa
MPPEPLPWDR KDFYKERKHE RTQSLPQQPL TARWRESSSM SPYQHASFRE FTRWGSADFR  60
RPPGHGRQGS WHLFAEENGG HGYVPSRSSD KILDDENFRQ LDSRVDGKYS RNSRENNRGS  120
SSQRDWRGHS WENCNGSPSA PGRPHLVNNE RRSVDDMPTY LSHTHSDFVN TWDQLQKSQL  180
DNKTIAVNGL GTGQKCQSEN LVGSIDWKPL KWTRSGSLSS RGSGFSHSSS SKSLGGVDSG  240
EGKLELQQKN LTPVQSPSGD AAACVTSPAP SDETSSRKKP RLAWGEGLAK YEKKKVEGPD  300
TSIDRAGAKI SVRNTEFNNF LSSNLADKSP RVLGFSDCAS PATPSSVACS SSPGVEEKSF  360
GKAANVDNDT SNLCGSPTLG SQNHLEGPSF NLEKLDINSI INMGSSLTNL LQADDPCTVD  420
SSFVRSTAIS KLLLWKSDVL KALEMTESEI DSLESELKLL KGDSRSRCPC PATSSSFPVE  480
EHGKACGEQE AASSLIPRPA PLQIDACGDV LVGKQPLCNG VLEEVNDDVK DGDIDSPGTA  540
TSKFMEPLSL EKAVSPSDVV KFHECSGDFG TVQLMSMGKV ILATGSGNAG TATTISAEGS  600
VLKRIDNDAH VPESSNSDVG DENVMYEMIL ATNKELANVA SEVFNKLLPK DQYNSEIGNV  660
ACTQSDSAIR KKIAIRKQYL RFKERVLTIK FKAFQNAWKE DLRSPSMRKY RAKSQKKYEF  720
SLRSAHGGYQ KHRSSIHSRL TSPAGNPILE PRAEMINFTS KLLLGSHGRL YRNALKMPAL  780
ILDEKEKKVS RFISSNGLVE DPCAIEKERA LINPWTSQEK EIFMAKLAAF GKDFRKIASF  840
LDHKTTADCV EFYYKNHKSE CFEKTKKNDL SKQQGKSAVN TYLLTSGKKR GRELNAASLD  900
VLGAASVIAA HAESGMRNRH TSGRILLRGR FDSKRSQLDD SIAERSSNFD IVGSDQDTVA  960
ADVLAGICGS FSSEAMSSCI TSSADPGEGY HHDWKFHKVD SVVKRPSTSD VLQNVDGDTC  1020
SDESCGEMES SHWTDEEKSA FLQAVLSYGK DFDMISRYVG TRSRDQCKVF FSKARKCLGL  1080
DLIHSRTRNI GTPMSDDANG GETDTEDACV QESSVVCSEK LGSKVEEDLP STIVSMNVDE  1140
SDLTREASLQ SDHNISEGNI ERLADHKDSV AAEVNFSNVD HTEPISECGA GDMDVDSNQA  1200
ESLHVQNNVA LANLSALENH VAEEGVSVAV SASHGGTGDC HPSLDASVEP KSGAAVLSTE  1260
GFGNNLEAQE TLSSKNVMDA RDTRCNAEIG SQVICRPDLD KSSGESIDKN SCLDFSFNSE  1320
GLRQVPLDLG SAGKPSILLF PNENFSAKNS ASHSDASQCD KICNQDRLSA TLAYQGNEDK  1380
QPNNAVSGHE PEHLSGKPSV DLAELQISTL KEMDIDIGHS QLPEVKRLST SDKGVTGLYL  1440
VQDYLQKCNG PKSPSEFPQL VQNLEQANSR PKSHSRSLSD TEKPCRNGNV KLFGQILNSS  1500
SQDDGKIRFR EQSMKSSNLN FRGHNNVDGN ASFSKFDQNI IFAPENVPRR SYGFWDGNRI  1560
QTGLSSLPDS EILVAKYPAA FVNYPASSSQ MQLQASQSIV RNTDRNMNGV SVFTPREISS  1620
NNGVMDYQVY GGHDCTKVVP FAMDMKRREM FSEMQRRNGF DAISNLQHQG RGMVGMNVVG  1680
TGVGGVVGGS CPNLSDPVAV LRMQYAKTEQ YGGQSGSIMR E
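
The sequence above is printed in blocks of ten residues with a running position counter at the end of each full line. The following minimal Python sketch, assuming that layout, strips the spacing and counters and then slices out a 1-based inclusive region. The helper names `parse_numbered_sequence` and `subsequence` are hypothetical, and the sample holds only the two lines covering residues 781 to 900.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Join residue blocks, dropping whitespace and position counters."""
    return "".join(re.findall(r"[A-Za-z]+", text)).upper()


def subsequence(seq: str, start: int, end: int, first_pos: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    first_pos is the position of seq[0] in the full protein, so a partial
    excerpt can be sliced with full-length coordinates.
    """
    return seq[start - first_pos:end - first_pos + 1]


# Two lines copied from the sequence block (residues 781-900).
sample = """
ILDEKEKKVS RFISSNGLVE DPCAIEKERA LINPWTSQEK EIFMAKLAAF GKDFRKIASF  840
LDHKTTADCV EFYYKNHKSE CFEKTKKNDL SKQQGKSAVN TYLLTSGKKR GRELNAASLD  900
"""

excerpt = parse_numbered_sequence(sample)
domain1 = subsequence(excerpt, 814, 855, first_pos=781)
print(len(excerpt), domain1)
```

The extracted region corresponds to the first Myb_DNA-binding hit (residues 814 to 855) and matches the Gh_D10G2114 alignment row once its gap characters are removed.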
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
4a69_C  2e-16    772          863        4          94       NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D  2e-16    772          863        4          94       NUCLEAR RECEPTOR COREPRESSOR 2
Expression -- UniGene
UniGene ID  E-value  Expressed in
Ghi.20423   0.0      boll
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_016700569.1      0.0      PREDICTED: uncharacterized protein LOC107915891 isoform X1
Refseq  XP_016700570.1      0.0      PREDICTED: uncharacterized protein LOC107915891 isoform X1
TrEMBL  A0A1U8KIP6          0.0      A0A1U8KIP6_GOSHI; uncharacterized protein LOC107915891 isoform X1
STRING  Gorai.011G240800.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM52602744
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G52250.1  0.0      MYB family protein