PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID SMil_00009366-RA_Salv
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Lamiales; Lamiaceae; Nepetoideae; Mentheae; Salvia
Family Trihelix
Protein Properties Length: 446aa    MW: 51320.9 Da    PI: 6.3876
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
SMil_00009366-RA_SalvgenomeNDCTCMView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix75.58.6e-24118217186
               trihelix   1 rWtkqevlaLiearremeerlrrgk.............lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tse 74 
                            +Wt+++v++Li+a++++ e+  ++              +kk++W++vsk+m+erg  +sp+qC++k+++lnkryk+++e+ +++ ++e
  SMil_00009366-RA_Salv 118 KWTDSMVRLLITAVSYISEEAAAEYgggggarrkyanlQKKGKWKSVSKVMAERGHFVSPQQCEDKFNDLNKRYKRLNEILGRGtSCE 205
                            7**************99999997534455667778888**********************************************6699 PP

               trihelix  75 ssstcpyfdqle 86 
                            ++++ +++d ++
  SMil_00009366-RA_Salv 206 VVEKPALLDVMD 217
                            999999999887 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138371.8E-22116243No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 446 aa     Download sequence    Send to blast
MEGNLSSRSG MQGNTSFGGF DLQGPLRVHH HQQQALAPHH QNLRQGSSMV HHTIHENFPL  60
TMGSNQEEPS FSLTDYNKLD SRGKSASDED DPSFTEDAAD NRSDPSKGKK ASPWQRVKWT  120
DSMVRLLITA VSYISEEAAA EYGGGGGARR KYANLQKKGK WKSVSKVMAE RGHFVSPQQC  180
EDKFNDLNKR YKRLNEILGR GTSCEVVEKP ALLDVMDHIP EKAKEEVRKI LSSKHLHYEE  240
MCSYHNGNRL HLPADPELQR SLRLALRSRD DHDESDAKRH LNEDNDEDDQ EGEYDDHGEY  300
EDNQVLAGDH RAYLLGNSAK RAKQCQTHDE FSFRNSLNSL DCNKTFSFQV ENTDSDANRA  360
PAEGSKTNAI VQKQSLSQWN LQIEEQRLNI EMQMLELEKE KFKWQRFCRK KDRELEIMRM  420
EIERMKLENE RMALDLRRKE LAIDSS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1398410KEKFKWQRFCRKK
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_011074759.10.0uncharacterized protein LOC105159405
TrEMBLA0A4D8ZD590.0A0A4D8ZD59_SALSN; Uncharacterized protein
STRINGEOY167050.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA76001830
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21200.11e-115sequence-specific DNA binding transcription factors