PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID KFK42102.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Arabideae; Arabis
Family Trihelix
Protein Properties Length: 574aa    MW: 65783.3 Da    PI: 8.2796
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
KFK42102.1genomeMPIPBRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix93.71.8e-2937121187
    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                 rW++ e+laL+++r+em++++r++ lk+plWee+s+km + g++rs+k+Ckek+en+ k++k++k+g+ ++++++  t+++f++lea
  KFK42102.1  37 RWPRPETLALLRIRSEMDKAFRDSTLKAPLWEEISRKMVDLGYKRSAKKCKEKFENVYKYHKRTKDGRTGKSEGK--TYRFFEELEA 121
                 8********************************************************************975544..6******985 PP

2trihelix107.68.1e-34396481187
    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                 rW+k ev aLi++r+++e +++++  k+plWee+s+ mr+ g++rs+k+Ckekwen+nk++kk+ke++kkr + +s+tcpyf+qlea
  KFK42102.1 396 RWPKTEVEALIRIRKNLEANYQENGTKGPLWEEISAGMRRLGYNRSAKRCKEKWENINKYFKKVKESNKKR-PVDSKTCPYFHQLEA 481
                 8*********************************************************************8.9999*********85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007170.00263496IPR001005SANT/Myb domain
PfamPF138372.2E-1936121No hitNo description
PROSITE profilePS500907.0043694IPR017877Myb-like domain
CDDcd122033.58E-2436101No hitNo description
PROSITE profilePS500907.526389453IPR017877Myb-like domain
SMARTSM007179.2E-4393455IPR001005SANT/Myb domain
CDDcd122032.86E-25395460No hitNo description
PfamPF138377.3E-24395482No hitNo description
Gene3DG3DSA:1.10.10.602.8E-4395452IPR009057Homeodomain-like
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 574 aa     Download sequence    Send to blast
MSLNSSGPLD SSGGVEEKDM KMEETGDGGG GGGGGNRWPR PETLALLRIR SEMDKAFRDS  60
TLKAPLWEEI SRKMVDLGYK RSAKKCKEKF ENVYKYHKRT KDGRTGKSEG KTYRFFEELE  120
AFETLNSYAP EPESQPTTTV IATATATSMI PWINSNNPSI EKSSLPLKNH HQMSVKPITT  180
NPTFLSKQPS LTTPFPFYSN NHTTTVDTTG CKTTSNDLMN NVSSLNLFSS STSSSTASDE  240
EEDHHQGKRS RKKRKYWKGL FTKLTKELME KQEKMQKRFL ETLENREKER ISREEAWRVQ  300
EIARINREHE TLVHERSNAA TKDAAIISFL HKISGGQQQQ PQQHAQQNHK VSQRKQYQSE  360
NSITFESKEP RTILLDTTMK MGNYDTNHSV SPSSSRWPKT EVEALIRIRK NLEANYQENG  420
TKGPLWEEIS AGMRRLGYNR SAKRCKEKWE NINKYFKKVK ESNKKRPVDS KTCPYFHQLE  480
ALYNERNKSG ALPMLPLMVT PKRQLLLSQE TQTEFETDQR DKVGKEGEED EGDSEEDDYE  540
EEGEGEGDNE TSEFEIVLNK TSSSPMDRNN NLFT
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
17987KRSAKKCKE
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00244DAPTransfer from AT1G76890Download
Motif logo
Cis-element ? help Back to Top
SourceLink
PlantRegMapKFK42102.1
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqNP_177815.10.0Duplicated homeodomain-like superfamily protein
SwissprotQ391170.0TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLA0A087HJ000.0A0A087HJ00_ARAAL; Uncharacterized protein
STRINGA0A087HJ000.0(Arabis alpina)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM59952847
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.20.0Trihelix family protein