PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID KFK42101.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Arabideae; Arabis
Family Trihelix
Protein Properties Length: 591aa    MW: 66140.1 Da    PI: 5.5173
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
KFK42101.1genomeMPIPBRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix90.51.8e-2865148186
    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                 rW++qe+laL+++r++m+ ++r+++ k+plWeevs+km+e gf r++k+Ckek+en+ k++k++keg+ ++++++  t+++fdql+
  KFK42101.1  65 RWPRQETLALLKIRSDMGIAFRDASVKGPLWEEVSRKMAELGFIRNAKKCKEKFENVYKYHKRTKEGRTGKSEGK--TYRFFDQLQ 148
                 8********************************************************************975544..6******98 PP

2trihelix101.37.8e-32398483187
    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                 rW+k e+ aLi++r++++ +++++  k+plWe++s  m++ gf+r++k+Ckekwen+nk++kk+ke++kkr +e+s+tcpyf+ql+a
  KFK42101.1 398 RWPKVEIEALIKLRTNLDAKYQENGPKGPLWEQISGGMKRLGFNRNSKRCKEKWENINKYFKKVKESNKKR-PEDSKTCPYFHQLDA 483
                 8*********************************************************************8.99***********85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.1362124IPR001005SANT/Myb domain
PROSITE profilePS500906.91164122IPR017877Myb-like domain
PfamPF138371.4E-1764150No hitNo description
CDDcd122034.04E-2164129No hitNo description
SMARTSM007170.0071395457IPR001005SANT/Myb domain
PfamPF138371.8E-21397484No hitNo description
PROSITE profilePS500906.911397455IPR017877Myb-like domain
CDDcd122031.32E-24398462No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0010192Biological Processmucilage biosynthetic process
GO:0044212Molecular Functiontranscription regulatory region DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 591 aa     Download sequence    Send to blast
MMQLGGTPTA STAENVPPPQ PPPPXXSNDS AVAAVEATEA AAAAAAVGAF EVSEEMSDRG  60
FGGNRWPRQE TLALLKIRSD MGIAFRDASV KGPLWEEVSR KMAELGFIRN AKKCKEKFEN  120
VYKYHKRTKE GRTGKSEGKT YRFFDQLQAL ETHHQQQQQQ QPLQPQPLQT QTPLRPHNNN  180
NNSVFSTPPP VVPTVTPPIT NLTPVNLPPF PNISGDFLSD NSTSSSSSYS TSSDIEIGGN  240
TKKRKRKWKG FFERLMKQVV DKQEELQNKF LEAVEKREHE RLLREESWRV QEIARINREH  300
EILAQERSMS AAKDAAVMAF LQKLSEKPSQ TPQPQPQQQM QLNNLQQTPQ PPQPPPQTPQ  360
PPVITPSETT KTDNGDQYLP PMSAEASVAV AAAGSSSRWP KVEIEALIKL RTNLDAKYQE  420
NGPKGPLWEQ ISGGMKRLGF NRNSKRCKEK WENINKYFKK VKESNKKRPE DSKTCPYFHQ  480
LDALYKERSN NNNNKANGGN IGASSSSNSG LVKQDDSVPL MVQPEQQWPP ATATVVAQVS  540
LPVGQPLDQS FDDEEGTDEE YDDEEDENVE EEEEGEFEIV PSNNNNKTTT D
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1241246KKRKRK
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00243DAPTransfer from AT1G76880Download
Motif logo
Cis-element ? help Back to Top
SourceLink
PlantRegMapKFK42101.1
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0792839e-88AC079283.4 Arabidopsis thaliana chromosome 1 BAC F7O12 genomic sequence, complete sequence.
GenBankAK2301529e-88AK230152.1 Arabidopsis thaliana mRNA for hypothetical protein, partial sequence., clone: RAFL22-85-G13.
GenBankCP0026849e-88CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_013693759.10.0trihelix transcription factor GT-2-like
SwissprotQ391171e-143TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLA0A087HIZ90.0A0A087HIZ9_ARAAL; Uncharacterized protein
STRINGA0A087HIZ90.0(Arabis alpina)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM48492553
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76880.11e-108Trihelix family protein