PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID 462914776
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Chloridoideae; Eragrostideae; Eragrostidinae; Eragrostis
Family Trihelix
Protein Properties Length: 447aa    MW: 50881.1 Da    PI: 6.7962
Description Trihelix family protein
Gene Model
Gene Model ID    Type    Source    Coding Sequence
462914776    genome    Tef    View CDS
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1    trihelix    82.1    7.5e-26    120    216    1    86
   trihelix   1 rWtkqevlaLiearremeerlrrgk..........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsessstcpyfdqle 86 
                +Wt+++v++Li+a+++++e+ +++           +kk++W+++sk+m erg+++sp+qC++k+++lnkryk+++++ +++ +++++ + +++++++
  462914776 120 KWTDSMVKLLITAVSYTGEDPGADLgggrrnftimQKKGKWKAISKVMGERGCHVSPQQCEDKFNDLNKRYKRLTDILGRGtACNVVANPALLESMN 216
                7*********************9744455677788**********************************************5599999999999997 PP

Protein Features
3D Structure
Database    Entry ID    E-value    Start    End    InterPro ID    Description
Pfam    PF13837    4.2E-20    118    242    No hit    No description
Sequence
Protein Sequence    Length: 447 aa
GQLHSKNSTR VLGEGGASGK LILLRGAAIP VLAAPPTDGK QLFSNSHMPG NITMSMNRVT  60
EPDDFPGFQF KEHVHTDNNQ HHSHHSKNCM SDDEEHEMTE DANDTPSGKG KKGSAWHRMK  120
WTDSMVKLLI TAVSYTGEDP GADLGGGRRN FTIMQKKGKW KAISKVMGER GCHVSPQQCE  180
DKFNDLNKRY KRLTDILGRG TACNVVANPA LLESMNHLSD KMKDDAKKIM SSKHLFYEEM  240
CSYHNNNRAN LPEDPALQHS LQLALRCKED YDSRRDVSGD ADEDDQSADS DYYEDYEEHH  300
AVHTNMREPS MLKRMRHTDM AFVNSCSHEG SARSDPHGIT VDINKVLPDG TNLVLSQKDL  360
ASQSLEIQKH RLQIDAKELE LTQQCLKWER FKKKKDRELE RMTLENEHMR IENKRLELEL  420
RQKELELELK LKGQGNHFSE GQEQVKL
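
For readers who want to work with this record programmatically, the numbered sequence block can be joined into one string and checked against the stated length and the trihelix domain coordinates (120 to 216). The following is a minimal Python sketch, not part of PlantTFDB: the names `RAW_BLOCK`, `parse_sequence_block` and `slice_region` are illustrative, and it assumes the domain Start/End values are 1-based and inclusive, which agrees with the alignment shown in this record.

```python
import re

# Sequence block copied from this record: 10-residue groups with a
# running residue count at the end of each full line.
RAW_BLOCK = """
GQLHSKNSTR VLGEGGASGK LILLRGAAIP VLAAPPTDGK QLFSNSHMPG NITMSMNRVT  60
EPDDFPGFQF KEHVHTDNNQ HHSHHSKNCM SDDEEHEMTE DANDTPSGKG KKGSAWHRMK  120
WTDSMVKLLI TAVSYTGEDP GADLGGGRRN FTIMQKKGKW KAISKVMGER GCHVSPQQCE  180
DKFNDLNKRY KRLTDILGRG TACNVVANPA LLESMNHLSD KMKDDAKKIM SSKHLFYEEM  240
CSYHNNNRAN LPEDPALQHS LQLALRCKED YDSRRDVSGD ADEDDQSADS DYYEDYEEHH  300
AVHTNMREPS MLKRMRHTDM AFVNSCSHEG SARSDPHGIT VDINKVLPDG TNLVLSQKDL  360
ASQSLEIQKH RLQIDAKELE LTQQCLKWER FKKKKDRELE RMTLENEHMR IENKRLELEL  420
RQKELELELK LKGQGNHFSE GQEQVKL
"""


def parse_sequence_block(block: str) -> str:
    """Join a numbered, space-grouped sequence block into one string."""
    # Keep letters only: this drops the spaces, newlines and running counts.
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


sequence = parse_sequence_block(RAW_BLOCK)
domain = slice_region(sequence, 120, 216)

print(len(sequence))  # 447
print(domain)         # begins KWTDSMVKLL and ends LLESMN
```

The extracted region is 97 residues long and matches the query line of the trihelix alignment once the lowercase insert-state residues are uppercased.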
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1    388    394    ERFKKKK
2    389    395    RFKKKKD
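
The NLS entries can be compared against the protein sequence with a plain substring search. The sketch below is a hedged illustration in Python: `ROW_361_420` is the sequence row ending at residue 420 copied from this record, and `locate` is a hypothetical helper. In the sequence as printed here, the 0-based indices of the two motifs coincide with the listed Start/End values, while the 1-based positions are one higher. Whether the database intends 0-based coordinates for this table is an assumption; the record does not say.

```python
# Row of the protein sequence covering residues 361-420 (copied from this record).
ROW_361_420 = "ASQSLEIQKH RLQIDAKELE LTQQCLKWER FKKKKDRELE RMTLENEHMR IENKRLELEL"
ROW_START = 361  # 1-based position of the first residue in this row


def locate(motif, row=ROW_361_420, row_start=ROW_START):
    """Return (start, end) of the first match as 1-based inclusive positions, or None."""
    residues = row.replace(" ", "")
    idx = residues.find(motif)
    if idx < 0:
        return None
    start = row_start + idx
    return start, start + len(motif) - 1


for motif in ("ERFKKKK", "RFKKKKD"):
    start, end = locate(motif)
    # Subtracting 1 expresses the same span as 0-based indices.
    print(motif, "1-based:", (start, end), "0-based:", (start - 1, end - 1))
```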
Cis-element
Source    Link
PlantRegMap    462914776
Regulation -- PlantRegMap
Source    Upstream Regulator    Target Gene
PlantRegMap    Retrieve    -
Annotation -- Protein
Source    Hit ID    E-value    Description
Refseq    XP_002449041.1    0.0    uncharacterized protein LOC8070669
TrEMBL    A0A0A9LZQ4    0.0    A0A0A9LZQ4_ARUDO; Uncharacterized protein
STRING    Sb05g003890.1    0.0    (Sorghum bicolor)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
Monocots    OGMP29203886
Best hit in Arabidopsis thaliana
Hit ID    E-value    Description
AT1G21200.1    3e-94    sequence-specific DNA binding transcription factors