PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID CCG015645.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Salicaceae; Saliceae; Populus
Family TALE
Protein Properties Length: 1315 aa    MW: 145050 Da    pI: 9.1096
Description TALE family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
CCG015645.1    genome  LZU     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End   HMM Start  HMM End
1    Homeobox  25.3   2.6e-08  1234   1266  23         55
                   SS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHH CS
     Homeobox   23 rypsaeereeLAkklgLterqVkvWFqNrRake 55  
                   +yp++ee+++L++ +gL+++q+ +WF N+R ++
  CCG015645.1 1234 PYPTEEEKAKLSEITGLDQKQINNWFINQRKRH 1266
                   8*****************************885 PP

2    ELK       33.6   8.4e-12  1187   1207  2          22
          ELK    2 LKhqLlrKYsgyLgsLkqEFs 22  
                   LK +L+rKYsg+L++L++EF+
  CCG015645.1 1187 LKGMLMRKYSGHLSNLRKEFL 1207
                   9*******************8 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End   InterPro ID  Description
SMART            SM01406           8.4E-32   441    526   IPR006880    INO80 complex subunit B-like conserved region
Pfam             PF04795           1.8E-20   441    526   IPR006880    INO80 complex subunit B-like conserved region
PROSITE profile  PS50293           10.945    631    664   IPR013026    Tetratricopeptide repeat-containing domain
Gene3D           G3DSA:1.25.40.10  2.3E-26   631    742   IPR011990    Tetratricopeptide-like helical domain
PROSITE profile  PS50005           11.476    631    664   IPR019734    Tetratricopeptide repeat
SMART            SM00028           1.9E-5    631    664   IPR019734    Tetratricopeptide repeat
SuperFamily      SSF48452          3.8E-23   632    736   IPR011990    Tetratricopeptide-like helical domain
Pfam             PF00515           8.6E-8    635    663   IPR001440    Tetratricopeptide repeat 1
PROSITE profile  PS50293           8.755     692    725   IPR013026    Tetratricopeptide repeat-containing domain
PROSITE profile  PS50005           7.788     692    725   IPR019734    Tetratricopeptide repeat
SMART            SM00028           0.89      692    725   IPR019734    Tetratricopeptide repeat
Pfam             PF13877           1.7E-21   886    973   IPR025986    RNA-polymerase II-associated protein 3-like, C-terminal domain
SMART            SM01255           1.0E-20   1044   1088  IPR005540    KNOX1
Pfam             PF03790           1.8E-19   1045   1084  IPR005540    KNOX1
SMART            SM01256           6.1E-26   1091   1142  IPR005541    KNOX2
Pfam             PF03791           5.1E-24   1094   1141  IPR005541    KNOX2
PROSITE profile  PS51213           10.284    1186   1206  IPR005539    ELK domain
SMART            SM01188           1.2E-6    1186   1207  IPR005539    ELK domain
Pfam             PF03789           9.5E-9    1187   1207  IPR005539    ELK domain
PROSITE profile  PS50071           12.163    1206   1269  IPR001356    Homeobox domain
SuperFamily      SSF46689          3.04E-18  1208   1280  IPR009057    Homeodomain-like
SMART            SM00389           2.4E-12   1208   1273  IPR001356    Homeobox domain
CDD              cd00086           1.85E-12  1210   1270  No hit       No description
Gene3D           G3DSA:1.10.10.60  1.9E-26   1211   1271  IPR009057    Homeodomain-like
Pfam             PF05920           4.3E-16   1226   1265  IPR008422    Homeobox KN domain
PROSITE pattern  PS00027           0         1244   1267  IPR017970    Homeobox, conserved site
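Several databases report the same InterPro entry at slightly different coordinates (for example, three hits for the ELK domain). As a rough illustration, the sketch below merges overlapping hits that share an InterPro ID into a single span. The `HITS` list is a hand-copied subset of the rows above, and `merge_by_interpro` is a hypothetical helper name, not part of any PlantTFDB tooling.

```python
from collections import defaultdict

# (database, start, end, InterPro ID): a subset of the Protein Features rows,
# copied by hand for illustration.
HITS = [
    ("SMART", 1044, 1088, "IPR005540"),
    ("Pfam", 1045, 1084, "IPR005540"),
    ("SMART", 1091, 1142, "IPR005541"),
    ("Pfam", 1094, 1141, "IPR005541"),
    ("PROSITE profile", 1186, 1206, "IPR005539"),
    ("SMART", 1186, 1207, "IPR005539"),
    ("Pfam", 1187, 1207, "IPR005539"),
    ("PROSITE profile", 1206, 1269, "IPR001356"),
    ("SMART", 1208, 1273, "IPR001356"),
]


def merge_by_interpro(hits):
    """Merge overlapping hits that share an InterPro ID into (start, end) spans."""
    by_ipr = defaultdict(list)
    for _db, start, end, ipr in hits:
        by_ipr[ipr].append((start, end))

    merged = {}
    for ipr, spans in by_ipr.items():
        spans.sort()
        out = [list(spans[0])]
        for start, end in spans[1:]:
            if start <= out[-1][1]:
                # Overlaps the current span: extend it if needed.
                out[-1][1] = max(out[-1][1], end)
            else:
                out.append([start, end])
        merged[ipr] = [tuple(span) for span in out]
    return merged


spans = merge_by_interpro(HITS)
for ipr, regions in sorted(spans.items(), key=lambda kv: kv[1][0]):
    print(ipr, regions)
```

Merging per InterPro ID rather than across all hits keeps adjacent but distinct domains (here ELK and Homeobox, which touch at residue 1206) as separate spans.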
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0031011  Cellular Component  Ino80 complex
GO:0005515  Molecular Function  protein binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 1315 aa
MEGFGFSDAS SAVRKKRSNT SRRPRNESHA PSGYIDVTSL SSTPPSETVS KVSSDENNDH  60
GSILRKKKVS LILCNSRASS SNLADCESAQ NMMKNEDGGF GESDEASNNG SFRGSSERRH  120
SGVDSRRLNQ GVLTPANWKS TSSSLGRVGG FSDGVGNESK VKKVKLKVGG ITRTITAKSA  180
SDGASAIAVG SSSSKPSRFP DPRQKTVEEN LDNNHSFISG KGSGLQGVPW KEFSRSGRND  240
GKADGLRGEN LSSKQTGQSE PVRKSKRLPK KRLLDGVLDD GAEDYDEIQF LEKVKTSKIS  300
TNHGAGFEDE EGGSRKQRKI LRVLKRNVDG LNDVDSGVHG STRFGKEGKK SKSGRVSEDT  360
DYVEDEDLGS DGDPTSKRKK PRKELADLSA DSKKEMTVTT RQRALQTGKD VSSGFASLIE  420
FPNGLPPAPP KKQKEKLSEV EQQLKRAEAL QRRRMQVEKA NRESEAEAIR KILGQDSTRK  480
KREDKLKKRQ EEMAQEKATN AMVLASDHVR WVMGPAGTTV TFPTEMGLPS IFDSKPCRNF  540
TPSLTAMARA PGKHGRDQAL DWELLKDTDK KVKTKSQASD VKIGEDGRSK GKTSAVDSSR  600
SGSGQYDYSR NFGAINRLSS NFTTDEITVD ATTEKELGNE YFKQKKFNEA IECYSRSIAL  660
SPTAVAYANR AMAFREAEDD CTETLNLDDH YIKAYSRRAT ARKELGKLKE SIEDSEFALK  720
LEPINQEIKK QYAEVKSLYE KASDYLMLEI LQKASGALRS SLQGTQKGGR SEASVNGHAV  780
HPVSIPTQNT GVSASKKDNT KENDGNNLVK KSVRVKEVRN KGTGAGSKSD GHVGNDSPAN  840
ATLSSSVDSV QKNNRTQRQE LKTSVMELAS QAASRAMAEA AKNITPPNSA YQFEVSWRGF  900
SGDRALQAHL LKVTSPSALP QIFKNALSVP ILIDIIKCVA SFFIDDMDLA VKYLENLTKV  960
PRFGVLIMCL SSTDTSANNF MEEFYRFNPT FFSSPDDSVR LENLAVANFP DASTSTTTEF  1020
HSHASSFLQA GNGHREVTGS DMYDAIKTQI ANHPRYPDLV SAHLECQKVG APPEMVSLLE  1080
AIGRGNYKIN TCYEIGADPE LDEFMESYCE VLHRYKQELS KPFDEATTFL SSIESQLSSL  1140
CKGTLTKIFD YGSDEPAGTS EEELSCGEVE ASESQETTGV SSEEQNLKGM LMRKYSGHLS  1200
NLRKEFLKNR KKGKLPKDAR TTLLDWWNHH YRWPYPTEEE KAKLSEITGL DQKQINNWFI  1260
NQRKRHWKPS EDMRFPRMDG VSGDPGASPN MLWNSLGNVK GNLRSALCIN WSYIS
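The Start and End columns of the Signature Domain table are 1-based, inclusive positions in this sequence. A minimal sketch of recovering the ELK and Homeobox segments from the blocked listing follows. To keep it short it uses only the last three lines of the listing (residues 1141-1315) and an offset; `parse_blocks` and `region` are hypothetical helper names.

```python
import re


def parse_blocks(text: str) -> str:
    """Join a blocked listing into one string, dropping spaces and position counters."""
    return re.sub(r"[^A-Za-z]", "", text)


def region(seq: str, start: int, end: int, offset: int = 0) -> str:
    """Return residues start..end (1-based, inclusive).

    offset is the number of residues that precede seq in the full protein.
    """
    return seq[start - 1 - offset : end - offset]


# Last three lines of the listing above, i.e. residues 1141-1315.
TAIL = """
CKGTLTKIFD YGSDEPAGTS EEELSCGEVE ASESQETTGV SSEEQNLKGM LMRKYSGHLS  1200
NLRKEFLKNR KKGKLPKDAR TTLLDWWNHH YRWPYPTEEE KAKLSEITGL DQKQINNWFI  1260
NQRKRHWKPS EDMRFPRMDG VSGDPGASPN MLWNSLGNVK GNLRSALCIN WSYIS
"""

tail = parse_blocks(TAIL)
elk = region(tail, 1187, 1207, offset=1140)
homeobox = region(tail, 1234, 1266, offset=1140)

print("ELK:     ", elk)
print("Homeobox:", homeobox)
```

The two segments should agree with the query lines shown in the Signature Domain alignments. With the full 1315-residue sequence loaded, the same call works with `offset=0`.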
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
6fo1_G  7e-28    625          974        124        635      RNA polymerase II-associated protein 3
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    483    488  DKLKKR
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AC209045  0.0      AC209045.1 Populus trichocarpa clone POP061-B03, complete sequence.
Annotation -- Protein
Source  Hit ID          E-value  Description
TrEMBL  V4T5T7          0.0      V4T5T7_9ROSI; Uncharacterized protein
STRING  XP_006433540.1  0.0      (Citrus clementina)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G23380.1  5e-78    KNOTTED1-like homeobox gene 6