PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Bostr.7128s0454.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Boechereae; Boechera
Family bHLH
Protein Properties Length: 971aa    MW: 109048 Da    PI: 7.3143
Description bHLH family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Bostr.7128s0454.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HLH13.30.000163744071652
                          HHHHHHHHHCTSCCC...TTS-STCHHHHHHHHHHHH CS
                  HLH  16 iNsafeeLrellPkaskapskKlsKaeiLekAveYIk 52 
                          i   f+ Lr++l ++    + Kl+K++iL  +  YI+
  Bostr.7128s0454.1.p 374 IHRTFDLLRQMLAES---EDVKLDKVTILHAVLVYIN 407
                          666799*********...9**************9997 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF484521.2E-53469IPR011990Tetratricopeptide-like helical domain
PROSITE profilePS513756.0294276IPR002885Pentatricopeptide repeat
PROSITE profilePS513757.103119149IPR002885Pentatricopeptide repeat
PfamPF015350.72122150IPR002885Pentatricopeptide repeat
PROSITE profilePS513758.923150185IPR002885Pentatricopeptide repeat
PfamPF015354.1E-5154180IPR002885Pentatricopeptide repeat
TIGRFAMsTIGR007568.4E-4154186IPR002885Pentatricopeptide repeat
PROSITE profilePS513755.568186220IPR002885Pentatricopeptide repeat
PROSITE profilePS513757.465221251IPR002885Pentatricopeptide repeat
PfamPF015350.054226252IPR002885Pentatricopeptide repeat
PROSITE profilePS513759.076252286IPR002885Pentatricopeptide repeat
PfamPF015353.6E-5254283IPR002885Pentatricopeptide repeat
TIGRFAMsTIGR007560.0018254283IPR002885Pentatricopeptide repeat
PROSITE profilePS513755.448326356IPR002885Pentatricopeptide repeat
PfamPF015350.87331351IPR002885Pentatricopeptide repeat
PROSITE profilePS513759.01357391IPR002885Pentatricopeptide repeat
PfamPF015350.0065359385IPR002885Pentatricopeptide repeat
TIGRFAMsTIGR007560.0012359387IPR002885Pentatricopeptide repeat
PROSITE profilePS513756.862429459IPR002885Pentatricopeptide repeat
PfamPF130413.8E-8460507IPR002885Pentatricopeptide repeat
PROSITE profilePS5137510.095460494IPR002885Pentatricopeptide repeat
TIGRFAMsTIGR007567.2E-4463495IPR002885Pentatricopeptide repeat
PROSITE profilePS513756.193495529IPR002885Pentatricopeptide repeat
PROSITE profilePS513756.204530560IPR002885Pentatricopeptide repeat
Gene3DG3DSA:1.25.40.103.0E-9558595IPR011990Tetratricopeptide-like helical domain
PROSITE profilePS5137511.093561595IPR002885Pentatricopeptide repeat
PfamPF015354.9E-6563593IPR002885Pentatricopeptide repeat
TIGRFAMsTIGR007562.1E-4563595IPR002885Pentatricopeptide repeat
PROSITE profilePS513756.971631661IPR002885Pentatricopeptide repeat
PfamPF130413.5E-11661708IPR002885Pentatricopeptide repeat
Gene3DG3DSA:1.25.40.103.0E-9661830IPR011990Tetratricopeptide-like helical domain
PROSITE profilePS5137512.353662696IPR002885Pentatricopeptide repeat
TIGRFAMsTIGR007563.2E-7665697IPR002885Pentatricopeptide repeat
SuperFamilySSF484521.2E-5675827IPR011990Tetratricopeptide-like helical domain
PROSITE profilePS513756.917697727IPR002885Pentatricopeptide repeat
PROSITE profilePS513758.265733764IPR002885Pentatricopeptide repeat
PROSITE profilePS513755.437766796IPR002885Pentatricopeptide repeat
PROSITE profilePS513757.487800834IPR002885Pentatricopeptide repeat
PfamPF144322.6E-37837960IPR032867DYW domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005515Molecular Functionprotein binding
GO:0008270Molecular Functionzinc ion binding
Sequence ? help Back to Top
Protein Sequence    Length: 971 aa     Download sequence    Send to blast
MASVLLPLPQ LSVLFDYRRS RKEPSFPRAV YNTNAVSSNS TNANHFLRRI SSFCETGNLD  60
ESFRVVQEFA GDDESSSDAF LLVREALGLL LQASGKRKDI EIGRKIHQLV SGSTRLRNDD  120
VLCTRMITMY AMCGSPDDSR SVFNALRSKN LFQWNAVISS YSRNELYHEV LEMFIKMISK  180
TDLLPDNFTF PCVIKACAGI SDVGIGLAVH GLVLKTGLLE DVFVGNALVS FYGTHGFVSH  240
ALKLFDIMPK RNLVSWNSMI RVFSDNGFSE ESFLFLGEMM EEGDDGAFMP DVATVVTVLP  300
VCAREREIGV GKGVHGWAVK LNLDKELVVN NALMDMYSKC GCITDAQMIF KLNNNKNVVS  360
WNTMVGGFSA EGDIHRTFDL LRQMLAESED VKLDKVTILH AVLVYINESV LPSLKELHCY  420
SLKQEFIHDE LVANAFVSTY AKCGSLSYAQ RVFHGIRSKT VNSWNALIGG YAQNRDPRSS  480
LDAYLQMKYS GLLPDNFTVC SLLSACSQLK SLRLGKEVHG FIIRNRLEKD LFVYMSVLSL  540
YIHCEELCKV QVLFDAMEDK SLVSWNTVIT GYLQNGFPER ALGHFRQMVL YGIQPCEISM  600
MNVFGACSLL PSLRLGREAH AYALKRLLEE NVFIACSIID MYAKNGSITQ SFKVFNGLKE  660
KNTTSWNAMI MGYGIHGLAK EAIKLFEEMQ RTGHNPDDLT FLGVLTACNH SGLIHEGLRY  720
LDQMKSSFGL KPNLKHYACI IDMLGRAGQL DKALRVAAEE MSEEPDVGIL NSLLSSCRIH  780
GKLEMGEKIA AKLFELEPQK PENYVLLSNL YAGLGKWDDV RKVRQRMKEM SLRKDAGCSW  840
IELNGKVLSF VAGESSSGGF EEIKSLWSIL EMKIWKMGYR PDTSSVQHDL SEEEKIEQLR  900
GHSEKLAITY GLIKTSEGTT LRVYKNLRIC VDCHNAAKLI SKVMEREIVV RDNKRFHHFK  960
NGICSCGDYW *
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5iww_D1e-58507973339PLS9-PPR
Search in ModeBase
Cis-element ? help Back to Top
SourceLink
PlantRegMapBostr.7128s0454.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC0133540.0AC013354.6 Genomic sequence for Arabidopsis thaliana BAC F15H18 from chromosome I, complete sequence.
GenBankCP0026840.0CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010498238.10.0PREDICTED: pentatricopeptide repeat-containing protein At1g18485
SwissprotQ0WN600.0PPR48_ARATH; Pentatricopeptide repeat-containing protein At1g18485
TrEMBLD7KGE60.0D7KGE6_ARALL; Pentatricopeptide repeat-containing protein
STRINGBostr.7128s0454.1.p0.0(Boechera stricta)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G64100.25e-18pentatricopeptide (PPR) repeat-containing protein
Publications ? help Back to Top
  1. Aubourg S,Boudet N,Kreis M,Lecharny A
    In Arabidopsis thaliana, 1% of the genome codes for a novel protein family unique to plants.
    Plant Mol. Biol., 2000. 42(4): p. 603-13
    [PMID:10809006]
  2. Lurin C, et al.
    Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins reveals their essential role in organelle biogenesis.
    Plant Cell, 2004. 16(8): p. 2089-103
    [PMID:15269332]