PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID CA12g14080
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family Trihelix
Protein Properties Length: 562aa    MW: 63442.3 Da    PI: 7.7312
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
CA12g14080genomePEPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix40.66.7e-131483786
    trihelix 37 kmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86
                km++ g++rs+k+Ckek+en+ k+++++keg+ ++   + +t+++fdql+
  CA12g14080  1 KMADLGYHRSAKKCKEKFENVYKYHRRTKEGRASK--ADGKTYRFFDQLQ 48
                6999*****************************96..66678******98 PP

2trihelix101.28.2e-32362447187
    trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                 rW+k+e+ aLi +r+ ++ +++++  k+plWee+s+ mr+ g++r++k+Ckekwen+nk++kk+ke++kkr +e+s+tcpyf+qlea
  CA12g14080 362 RWPKEEIDALISLRTCLDLKYQENGPKGPLWEEISAGMRKLGYNRNAKRCKEKWENINKYFKKVKESNKKR-PEDSKTCPYFNQLEA 447
                 8*********************************************************************8.99***********85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138371.5E-7150No hitNo description
SMARTSM007170.0066359421IPR001005SANT/Myb domain
PfamPF138374.6E-22361448No hitNo description
CDDcd122037.77E-27361426No hitNo description
PROSITE profilePS500907.201361419IPR017877Myb-like domain
Sequence ? help Back to Top
Protein Sequence    Length: 562 aa     Download sequence    Send to blast
KMADLGYHRS AKKCKEKFEN VYKYHRRTKE GRASKADGKT YRFFDQLQAL ENNPASHSLP  60
PTPLAAPPIT MAMAMPMRSG NAYVNPPPMP MPQNNAASSP QSLPFVVSHN NVVTAAVPAV  120
NHPMMPALLL SQPQQQSQPA IQQPMANLNQ INQPQGNTTT SFLSNSTSSS STSSDEDIQR  180
RHMKKRKWKD FFERLMKDVI KKQEELQKKF METLEKRERD RMVREETWRV QEMARLNREH  240
DLLVQERSMA AAKDATIIAF LQKITEQQNT PVLNSNTINT SPTQMQSKLP KKPSVAPPHS  300
QSPQTPQPQP PPAIAVSLPM TIHTPVPTPT PETMSLPVAT KSFETPKTDN GGENFSPESS  360
SRWPKEEIDA LISLRTCLDL KYQENGPKGP LWEEISAGMR KLGYNRNAKR CKEKWENINK  420
YFKKVKESNK KRPEDSKTCP YFNQLEALYK EKSKNETVPN TVVFGLRPEN STPIAPIMAQ  480
PEQQWPLPQI QPQVQQQGST NYNNQHHHDN HESDSMDHNE DEDDIEEDED DEDEGDGYEI  540
VTNKQPSSIA AATVTTTATA V*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1178186QRRHMKKRK
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00244DAPTransfer from AT1G76890Download
Motif logo
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016551158.10.0PREDICTED: trihelix transcription factor GT-2-like
TrEMBLA0A2G3B3E40.0A0A2G3B3E4_CAPCH; Trihelix transcription factor GTL1
STRINGSolyc12g056510.1.10.0(Solanum lycopersicum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA24022459
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76880.14e-92Trihelix family protein