PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID evm.model.supercontig_27.230
Organism Carica papaya
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Caricaceae; Carica
Family Trihelix
Protein Properties Length: 412 aa    MW: 46691.9 Da    pI: 7.8569
Description Trihelix family protein
Gene Model
Gene Model ID                 Type    Source  Coding Sequence
evm.model.supercontig_27.230  genome  ASGPB   View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  88.7   6.3e-28  298    384  1          86
                      trihelix   1 rWtkqevlaLiearremeerlrr.gklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcp 80 
                                   rW++ ev+aLi +r+ me+++r+ g +k   W+e+s+ m   g++rs+k+Ckekwen+nk+++k   + kk+++e+s+ cp
  evm.model.supercontig_27.230 298 RWPDAEVQALIMLRTAMEHKFRTtGVSKCSIWDEISAGMTNMGYTRSAKKCKEKWENINKYFRKSIGNGKKHHPEKSKACP 378
                                   8*********************7478999**************************************************** PP

                      trihelix  81 yfdqle 86 
                                   yf++l+
  evm.model.supercontig_27.230 379 YFQELD 384
                                   ****97 PP
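
The Start and End columns of the signature domain table can be checked against the protein sequence of this record. The following is a minimal sketch, assuming the coordinates 298 and 384 are 1-based and inclusive; the names `PROTEIN`, `ALIGNED_ROWS` and `domain_slice` are illustrative and not part of PlantTFDB.

```python
# Protein sequence copied from this record, with spacing, position counters
# and the trailing stop symbol (*) removed.
PROTEIN = (
    "MELLTADRQLREIDGFPQHVDPFPQTNGCDLVYDQHQPTAVIHPLEDHIASLVATNPSPP"
    "QKLRPIRFSGRMEDPPPPFPDDSAALAGTLDRLPDLGFGGGNVVQCFHGQVKPLVGDMAD"
    "AGGGGSDAVDLERVGSETSEENRGGSVNSLGNMSVSKFDNRGHLQKVIKKRHQKLKEPVN"
    "RKRKRETRKKLEVFLEQLVEKVMERQEQMHKQLIETMERRENERIMREEAWRQKEIERMR"
    "RDEEARAQETARSLALISFIQSTVGHEIEIPQQPLSTIFCTEDNGIQKEIKCDPSNRRWP"
    "DAEVQALIMLRTAMEHKFRTTGVSKCSIWDEISAGMTNMGYTRSAKKCKEKWENINKYFR"
    "KSIGNGKKHHPEKSKACPYFQELDILYKNGLLSPGNSLVSGGNESETKREN"
)

# Target rows of the alignment as printed in this record. A lowercase letter
# marks a residue inserted relative to the HMM, so rows are upper-cased
# before comparison.
ALIGNED_ROWS = [
    "RWPDAEVQALIMLRTAMEHKFRTtGVSKCSIWDEISAGMTNMGYTRSAKKCKEKWENINKYFRKSIGNGKKHHPEKSKACP",
    "YFQELD",
]


def domain_slice(protein: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    return protein[start - 1:end]


domain = domain_slice(PROTEIN, 298, 384)
aligned = "".join(ALIGNED_ROWS).replace("-", "").upper()

print("domain length:", len(domain))
print("matches alignment rows:", domain == aligned)
```

The domain spans 87 residues against 86 HMM positions, which is consistent with the single inserted residue shown in lowercase in the alignment.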

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50090   6.748     291    356  IPR017877    Myb-like domain
Pfam             PF13837   9.1E-20   297    384  No hit       No description
CDD              cd12203   5.55E-25  297    362  No hit       No description
Sequence
Protein Sequence    Length: 412 aa
MELLTADRQL REIDGFPQHV DPFPQTNGCD LVYDQHQPTA VIHPLEDHIA SLVATNPSPP  60
QKLRPIRFSG RMEDPPPPFP DDSAALAGTL DRLPDLGFGG GNVVQCFHGQ VKPLVGDMAD  120
AGGGGSDAVD LERVGSETSE ENRGGSVNSL GNMSVSKFDN RGHLQKVIKK RHQKLKEPVN  180
RKRKRETRKK LEVFLEQLVE KVMERQEQMH KQLIETMERR ENERIMREEA WRQKEIERMR  240
RDEEARAQET ARSLALISFI QSTVGHEIEI PQQPLSTIFC TEDNGIQKEI KCDPSNRRWP  300
DAEVQALIML RTAMEHKFRT TGVSKCSIWD EISAGMTNMG YTRSAKKCKE KWENINKYFR  360
KSIGNGKKHH PEKSKACPYF QELDILYKNG LLSPGNSLVS GGNESETKRE N*
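
The listed length and molecular weight can be approximated from the sequence block itself. The sketch below is an illustration only: it assumes standard average residue masses plus one water molecule, which may differ from the method PlantTFDB uses, and the helper names `clean_sequence` and `molecular_weight` are hypothetical. Note that the cleaned sequence has 411 residues, so the listed length of 412 appears to count the trailing stop symbol.

```python
import re

# Sequence block as printed in this record, including position counters
# and the stop symbol (*).
RAW_BLOCK = """
MELLTADRQL REIDGFPQHV DPFPQTNGCD LVYDQHQPTA VIHPLEDHIA SLVATNPSPP  60
QKLRPIRFSG RMEDPPPPFP DDSAALAGTL DRLPDLGFGG GNVVQCFHGQ VKPLVGDMAD  120
AGGGGSDAVD LERVGSETSE ENRGGSVNSL GNMSVSKFDN RGHLQKVIKK RHQKLKEPVN  180
RKRKRETRKK LEVFLEQLVE KVMERQEQMH KQLIETMERR ENERIMREEA WRQKEIERMR  240
RDEEARAQET ARSLALISFI QSTVGHEIEI PQQPLSTIFC TEDNGIQKEI KCDPSNRRWP  300
DAEVQALIML RTAMEHKFRT TGVSKCSIWD EISAGMTNMG YTRSAKKCKE KWENINKYFR  360
KSIGNGKKHH PEKSKACPYF QELDILYKNG LLSPGNSLVS GGNESETKRE N*
"""

# Average residue masses in daltons (amino acid mass minus one water).
AVG_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528


def clean_sequence(raw: str) -> str:
    """Drop whitespace, position counters and the stop symbol."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def molecular_weight(seq: str) -> float:
    """Sum average residue masses and add one water for the free termini."""
    return sum(AVG_RESIDUE_MASS[aa] for aa in seq) + WATER


seq = clean_sequence(RAW_BLOCK)
mw = molecular_weight(seq)
print("residues:", len(seq))
print("approximate MW (Da):", round(mw, 1))
```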
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    168    184  KKRHQKLKEPVNRKRKR
2    231    240  RQKEIERMRR
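
The NLS coordinates can be cross-checked by searching for each motif in the protein sequence. This is a small sketch with a hypothetical helper, `locate`; `PROTEIN_PREFIX` holds only the first 300 residues of this record, which is enough to cover both signals. In this record the listed Start and End values appear to coincide with zero-based offsets (first residue numbered 0), whereas the signature domain coordinates match one-based positions.

```python
# First 300 residues of the protein sequence from this record
# (spacing and position counters removed).
PROTEIN_PREFIX = (
    "MELLTADRQLREIDGFPQHVDPFPQTNGCDLVYDQHQPTAVIHPLEDHIASLVATNPSPP"
    "QKLRPIRFSGRMEDPPPPFPDDSAALAGTLDRLPDLGFGGGNVVQCFHGQVKPLVGDMAD"
    "AGGGGSDAVDLERVGSETSEENRGGSVNSLGNMSVSKFDNRGHLQKVIKKRHQKLKEPVN"
    "RKRKRETRKKLEVFLEQLVEKVMERQEQMHKQLIETMERRENERIMREEAWRQKEIERMR"
    "RDEEARAQETARSLALISFIQSTVGHEIEIPQQPLSTIFCTEDNGIQKEIKCDPSNRRWP"
)

# NLS sequences as listed in the table.
NLS_MOTIFS = ["KKRHQKLKEPVNRKRKR", "RQKEIERMRR"]


def locate(protein: str, motif: str) -> tuple[int, int]:
    """Return zero-based (start, end) of the first match, end inclusive."""
    start0 = protein.find(motif)
    if start0 < 0:
        raise ValueError(f"motif not found: {motif}")
    return start0, start0 + len(motif) - 1


for number, motif in enumerate(NLS_MOTIFS, start=1):
    start0, end0 = locate(PROTEIN_PREFIX, motif)
    # Adding 1 to each offset gives the conventional one-based position.
    print(number, "zero-based:", (start0, end0), "one-based:", (start0 + 1, end0 + 1))
```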
Binding Motif
Motif ID  Method  Source                   Motif file
MP00552   DAP     Transfer from AT5G47660  Download
Cis-element
Source       Link
PlantRegMap  evm.model.supercontig_27.230
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Protein
Source  Hit ID                        E-value  Description
Refseq  XP_021902301.1                0.0      trihelix transcription factor GTL1
TrEMBL  A0A067L8R1                    1e-108   A0A067L8R1_JATCU; Uncharacterized protein
STRING  evm.model.supercontig_27.230  0.0      (Carica papaya)
Orthologous Group
Lineage               Orthologous Group ID  Taxa Number  Gene Number
Malvids               OGEM94112736
Representative plant  OGRP1130157
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G47660.1  4e-65    Trihelix family protein