PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID RcHm_v2.0_Chr4g0421591
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Rosoideae; Rosoideae incertae sedis; Rosa
Family MYB_related
Protein Properties Length: 894aa    MW: 104925 Da    PI: 7.8311
Description MYB_related family protein
Gene Model
Gene Model ID Type Source Coding Sequence
RcHm_v2.0_Chr4g0421591genomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding23.51.3e-074681340
                            SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHH CS
         Myb_DNA-binding  3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqc 40
                             WTteE + ++++ +++G++ W ++a+ +  +R+l+++
  RcHm_v2.0_Chr4g0421591 46 QWTTEELKHFYEGYRKYGKD-WMKVASVVR-HRSLENV 81
                            6*****************99.*********.***9886 PP

Sequence ? help Back to Top
Protein Sequence    Length: 894 aa     Download sequence    
MAPVRKTKGV NKRFPYINEV ASNKYGDNAN KNKQKKRKMS DMLGPQWTTE ELKHFYEGYR  60
KYGKDWMKVA SVVRHRSLEN VEALYTMNRA YLSLPEGAAS VIGLTAMMCD YYSMLERSDS  120
EQNCIDNAGT PRKSQKLARG KLRNDASKGL EGHILDISQS RSIASDGCLS LLKNRRTDED  180
SSFDALLTLA DLSLRMPEAT AEMESSALVE EENFNIAKKS KLKGNHSVTM VEDTALKTSQ  240
LGKLKEGVQQ SDTGIQKRKQ KSPVKLQINE NEAQTEFPWS DNQMIEILKS SFDSRILRLW  300
IRRNKMARPK FKIISSSSEE FREDTQSQES LDEETPTSFQ KTLQHEMTKI KHRKKKQAED  360
TDDEETAEEE TEDEDEEETT SRYCVKKRSQ KRNTTDDEDE EYDQKEKKKR KVQLPKTKEQ  420
KAKKHGRKKK EEKEETEKKE ENIAKIDWRQ QKCTLSAFWR LIDAHKSRIP AATWEILRHT  480
VFWGMIEPFL ERKLTENQLH KHEADLEIIM RYFDKKKNKF IFGEKEMEIT VQDVKTLFDL  540
PTEGIYMELN KKLSKEERKE SAIFDTNVKK DSVLKTEVEK KLVKELTKEK KEKDLKKQQD  600
PKKIASLIIM YLFAAFFFSR TATNITWDLI SVCEDIDNIN KFNWSRMIID FLLDGIQKYQ  660
KDKPTTLSGC LLLIYYWFLE KTKIKNWIPG KETETPRFVR WSIKEIFNLQ EVYKNPGWPE  720
RLIKDGPWLE KEWLEELEFE EKIEEQETPS FNNWVPQASQ REEEEIIRLT GNMRVIIEEL  780
ETVLREEPTR ENMEEKLKRL AEGNEQLSKQ NKEMWTKLKI ADEMIESMEK KKHDLLLETR  840
ASKNQIAALY KLLAKAKHSG NAGASQQKES QLQIVQISTE EEQVRQAEFE DNRA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1407412KKKRKV
2407430KKKRKVQLPKTKEQKAKKHGRKKK
3407433KKKRKVQLPKTKEQKAKKHGRKKKEEK
4408431KKKRKVQLPKTKEQKAKKHGRKKK
5408434KKKRKVQLPKTKEQKAKKHGRKKKEEK
6409432KKKRKVQLPKTKEQKAKKHGRKKK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G21430.21e-44MYB_related family protein