PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gh_A13G0768
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family bHLH
Protein Properties Length: 1302 aa    MW: 143238 Da    pI: 7.385
Description bHLH family protein
Gene Model
Gene Model ID | Type | Source | Coding Sequence
Gh_A13G0768 | genome | NAU-NBI | View CDS
Signature Domain
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End
1 | HLH | 25.7 | 2.1e-08 | 372 | 420 | 3 | 55
                  HHHHHHHHHHHHHHHHHHHHHHCTSCCC...TTS-STCHHHHHHHHHHHHHHH CS
          HLH   3 rahnerErrRRdriNsafeeLrellPkaskapskKlsKaeiLekAveYIksLq 55 
                   +h+ +Er RR++i +++  L++l+P +    +k   Ka +L + ++Y++sLq
  Gh_A13G0768 372 NSHSLAERVRREKISERMKFLQDLVPGC----NKVTGKAVMLDEIINYVQSLQ 420
                  58*************************9....677*****************9 PP
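
The Start and End values for the HLH hit appear to be 1-based, inclusive residue positions on the query protein; this is an assumption, though it agrees with the alignment row labelled 372 to 420. A minimal Python sketch of turning such coordinates into a slice follows. The helper name `domain_slice` and the 60-residue window are illustrative choices, not part of PlantTFDB; the window is residues 361 to 420 as given in the Sequence section.

```python
# Residues 361-420 of Gh_A13G0768, copied from the Sequence section (spaces removed).
WINDOW_START = 361  # 1-based position of the first residue in WINDOW
WINDOW = (
    "IHVRARRGQA" "TNSHSLAERV" "RREKISERMK"
    "FLQDLVPGCN" "KVTGKAVMLD" "EIINYVQSLQ"
)

def domain_slice(window, window_start, start, end):
    """Return residues start..end (1-based, inclusive) from a sequence window.

    Hypothetical helper: assumes `window` holds consecutive residues beginning
    at 1-based position `window_start` of the full protein.
    """
    window_end = window_start + len(window) - 1
    if start > end or start < window_start or end > window_end:
        raise ValueError("coordinates fall outside the window")
    offset = start - window_start  # convert to a 0-based index
    return window[offset : offset + (end - start + 1)]

hlh = domain_slice(WINDOW, WINDOW_START, 372, 420)
print(len(hlh), hlh)
```

The returned string should match the query row of the alignment once the gap characters (`-`) are removed.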

Protein Features
Database | Entry ID | E-value | Start | End | InterPro ID | Description
CDD | cd00083 | 1.58E-10 | 367 | 424 | No hit | No description
SuperFamily | SSF47459 | 1.83E-16 | 367 | 437 | IPR011598 | Myc-type, basic helix-loop-helix (bHLH) domain
PROSITE profile | PS50888 | 15.478 | 369 | 419 | IPR011598 | Myc-type, basic helix-loop-helix (bHLH) domain
Gene3D | G3DSA:4.10.280.10 | 2.0E-16 | 370 | 437 | IPR011598 | Myc-type, basic helix-loop-helix (bHLH) domain
Pfam | PF00010 | 1.7E-5 | 372 | 420 | IPR011598 | Myc-type, basic helix-loop-helix (bHLH) domain
SMART | SM00353 | 5.5E-10 | 375 | 425 | IPR011598 | Myc-type, basic helix-loop-helix (bHLH) domain
PROSITE profile | PS51375 | 5.536 | 563 | 597 | IPR002885 | Pentatricopeptide repeat
Gene3D | G3DSA:1.25.40.10 | 2.3E-9 | 566 | 629 | IPR011990 | Tetratricopeptide-like helical domain
PROSITE profile | PS51375 | 6.829 | 598 | 628 | IPR002885 | Pentatricopeptide repeat
Pfam | PF01535 | 0.4 | 603 | 625 | IPR002885 | Pentatricopeptide repeat
PROSITE profile | PS51375 | 9.208 | 629 | 663 | IPR002885 | Pentatricopeptide repeat
Pfam | PF01535 | 0.0069 | 631 | 659 | IPR002885 | Pentatricopeptide repeat
TIGRFAMs | TIGR00756 | 0.0012 | 631 | 661 | IPR002885 | Pentatricopeptide repeat
Pfam | PF01535 | 0.019 | 662 | 689 | IPR002885 | Pentatricopeptide repeat
PROSITE profile | PS51375 | 6.555 | 696 | 730 | IPR002885 | Pentatricopeptide repeat
PROSITE profile | PS51375 | 5.371 | 731 | 761 | IPR002885 | Pentatricopeptide repeat
PROSITE profile | PS51375 | 10.928 | 762 | 792 | IPR002885 | Pentatricopeptide repeat
Pfam | PF13041 | 2.2E-7 | 762 | 792 | IPR002885 | Pentatricopeptide repeat
TIGRFAMs | TIGR00756 | 4.3E-7 | 764 | 792 | IPR002885 | Pentatricopeptide repeat
Gene3D | G3DSA:1.25.40.10 | 2.3E-9 | 791 | 1039 | IPR011990 | Tetratricopeptide-like helical domain
PROSITE profile | PS51375 | 12.156 | 793 | 827 | IPR002885 | Pentatricopeptide repeat
Pfam | PF13041 | 9.2E-11 | 793 | 839 | IPR002885 | Pentatricopeptide repeat
TIGRFAMs | TIGR00756 | 3.3E-8 | 795 | 828 | IPR002885 | Pentatricopeptide repeat
PROSITE profile | PS51375 | 8.188 | 863 | 893 | IPR002885 | Pentatricopeptide repeat
Pfam | PF13041 | 2.1E-11 | 893 | 941 | IPR002885 | Pentatricopeptide repeat
PROSITE profile | PS51375 | 11.805 | 894 | 928 | IPR002885 | Pentatricopeptide repeat
TIGRFAMs | TIGR00756 | 4.1E-7 | 896 | 929 | IPR002885 | Pentatricopeptide repeat
PROSITE profile | PS51375 | 6.555 | 929 | 963 | IPR002885 | Pentatricopeptide repeat
PROSITE profile | PS51375 | 7.366 | 964 | 994 | IPR002885 | Pentatricopeptide repeat
Pfam | PF01535 | 0.0023 | 969 | 991 | IPR002885 | Pentatricopeptide repeat
PROSITE profile | PS51375 | 12.715 | 995 | 1029 | IPR002885 | Pentatricopeptide repeat
Pfam | PF13041 | 3.2E-9 | 995 | 1042 | IPR002885 | Pentatricopeptide repeat
TIGRFAMs | TIGR00756 | 1.7E-6 | 997 | 1031 | IPR002885 | Pentatricopeptide repeat
PROSITE profile | PS51375 | 9.646 | 1030 | 1065 | IPR002885 | Pentatricopeptide repeat
TIGRFAMs | TIGR00756 | 3.9E-4 | 1032 | 1063 | IPR002885 | Pentatricopeptide repeat
PROSITE profile | PS51375 | 6.95 | 1066 | 1096 | IPR002885 | Pentatricopeptide repeat
Pfam | PF01535 | 0.0019 | 1069 | 1093 | IPR002885 | Pentatricopeptide repeat
TIGRFAMs | TIGR00756 | 9.8E-4 | 1070 | 1093 | IPR002885 | Pentatricopeptide repeat
Gene3D | G3DSA:1.25.40.10 | 2.3E-9 | 1130 | 1149 | IPR011990 | Tetratricopeptide-like helical domain
PROSITE profile | PS51375 | 7.596 | 1132 | 1166 | IPR002885 | Pentatricopeptide repeat
Pfam | PF14432 | 3.9E-40 | 1168 | 1292 | IPR032867 | DYW domain
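
The PROSITE profile PS51375 rows above tile most of the region between the bHLH domain and the DYW domain. As a rough illustration of reading the Start and End columns, the Python sketch below counts those hits and lists the stretches between them. It assumes the coordinates are 1-based and inclusive and that the hits do not overlap (true for the rows listed here); `find_gaps` is a hypothetical helper, not a PlantTFDB or InterPro function.

```python
# (start, end) of each PROSITE PS51375 hit, copied from the table above.
# Assumed 1-based, inclusive coordinates on the 1302 aa protein.
PPR_HITS = [
    (563, 597), (598, 628), (629, 663), (696, 730), (731, 761),
    (762, 792), (793, 827), (863, 893), (894, 928), (929, 963),
    (964, 994), (995, 1029), (1030, 1065), (1066, 1096), (1132, 1166),
]

def find_gaps(hits):
    """Return (start, end) stretches lying between consecutive hits.

    Assumes the hits do not overlap one another.
    """
    ordered = sorted(hits)
    gaps = []
    for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:]):
        if next_start > prev_end + 1:
            gaps.append((prev_end + 1, next_start - 1))
    return gaps

# Total residues covered by the hits (inclusive lengths summed).
covered = sum(end - start + 1 for start, end in PPR_HITS)
gaps = find_gaps(PPR_HITS)
print(len(PPR_HITS), covered, gaps)
```

Stretches reported as gaps are not necessarily repeat-free: several of them are covered by Pfam or Gene3D rows in the same table.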
Gene Ontology
GO Term | GO Category | GO Description
GO:0008270 | Molecular Function | zinc ion binding
GO:0046983 | Molecular Function | protein dimerization activity
Sequence
Protein Sequence    Length: 1302 aa
MDVGEKDKYE LEKRNDNSLN YQTSGMSSAW QFGGSNLTST PMSLVSSDNP LVIGSSSASA  60
SMGDSFCSNL WEHPSNSQNL GFCEINVQNG ASSSNAMGIG RGGPASVRSS IDRPFEMGWN  120
AASSMLKGGI FLPNASGVLP SSLSQLQTDS AFIERAAKFS SFNGGNFSDI VNPFGIPEAM  180
GVYARGVGLM QGPQDVYTIS GVKSISGVES QRSKLMTTEA SRDLQAENRA TQESPLKNER  240
NSGSLVRSNE EAKQGNGGSG NESNEAESSG GAGGHDEPSA LDGTAGESSA KVLSSKKRKR  300
IVQEAEVDQA KGSQSPVEAA KDGAENQQKG DQNQTTMVNK TTAKHGKQGS QASDPPKEEY  360
IHVRARRGQA TNSHSLAERV RREKISERMK FLQDLVPGCN KVTGKAVMLD EIINYVQSLQ  420
RQVEFLSMKL ATVNPRVDCN IEGLLAKDII QSRAGPSTLG FSPDLSVGYP PLHPSQPGLC  480
PGGFPVMGNN ADIIRRTLSS HFTPMTGGFK EPNQLSNAWE DELHNVVQMN YGTSTASDSQ  540
EVNGRKGPTP IPVVFNRQKM SSNSNYYCSL LKICTETRNR SQAKKIHCHI LRTIKDPETF  600
LLNNLVNAYS KLGDLTYARN VFDKIPQPNL FSWNTILFTY SKSGNLSDMN DIFNRMPKRD  660
GVSWNSLISG YASRALVTDA VKGYNSMLGD EAANLNRITF STMLILSSSQ GCIDLGRQIH  720
GQIVKFGFGS YLFVGCPLMD MYSKAGFVYD AKQVFDETPE RNVVMYNTMI TGFLRCGMVE  780
DSWSLFHSMR EKDPISWTTM ITGLTQNGLY KEAIDLFREM RTEGLVMDQF TFGSMLTACG  840
GLMALKEGKQ AHAFVIRTNH MDNVFVGSAL VDMYCKCKRI ASAEAVFKRM THKNVVSWTA  900
LLVGYGQNGY SEEAIRVFGD MQRNDINPDY YTLGSVISSC ANLASLEEGS QFHGQAIVSG  960
LISFTTVSNA LVTLYSKCGS IEEANRLFNE MNFRDEVSWT ALVSGYAQFG KADETIDLFQ  1020
KMLAHGLKPD EVTFVGVLSA CSRAGLVEKG YQYFESMVKE HGIMPVVDHY TCMIDLLSRA  1080
GRLEEARCFI NKMPMPPDAI GWSTLLSSCR LHGNLEVGKW AAASLQELEP NNPAGYILLS  1140
SIYAAKGKWD YVSELRRGMR NKGVRKEPGC SWIKYKGKVH IFSADDQTSP FSDRIYAELD  1200
KLNLKMIEEG YVPNLSTVLH DVEESEKKKM LNYHSERLAI AFGLIFIPPG LPIRIVKNLR  1260
VCGDCHNATK YISKITQREI LVRDAVRFHL FKDGTCSCGD FW
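
The listing above is laid out in blocks of 10 residues, 60 per line, with a running position counter at the end of each full line. A minimal Python sketch for recovering a plain sequence string from this layout follows, assuming every letter is a residue and everything else (spaces, line breaks, counters) can be discarded. `parse_listing` is a hypothetical helper, and the sample holds only the first two lines of the listing.

```python
import re

def parse_listing(text):
    """Strip spaces, line breaks and position counters from a grouped listing.

    Hypothetical helper: keeps letters only, so it assumes the listing
    contains no letter characters other than residue codes.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()

# First two lines of the Gh_A13G0768 listing (residues 1-120).
SAMPLE = """\
MDVGEKDKYE LEKRNDNSLN YQTSGMSSAW QFGGSNLTST PMSLVSSDNP LVIGSSSASA  60
SMGDSFCSNL WEHPSNSQNL GFCEINVQNG ASSSNAMGIG RGGPASVRSS IDRPFEMGWN  120
"""

seq = parse_listing(SAMPLE)
print(len(seq), seq[:20])
```

Applied to the full listing, the length of the result can be compared with the stated length of 1302 aa as a quick check that nothing was lost in copying.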
3D Structure
Structure
PDB ID | E-value | Query Start | Query End | Hit Start | Hit End | Description
5iww_D | 2e-74 | 567 | 1116 | 17 | 337 | PLS9-PPR
Expression -- UniGene
UniGene ID | E-value | Expressed in
Ghi.1316 | 1e-116 | boll
Annotation -- Protein
Source | Hit ID | E-value | Description
Refseq | XP_016675647.1 | 0.0 | PREDICTED: putative pentatricopeptide repeat-containing protein At1g68930
Swissprot | Q9CAA8 | 0.0 | PP108_ARATH; Putative pentatricopeptide repeat-containing protein At1g68930
TrEMBL | A0A1Q3ATJ3 | 0.0 | A0A1Q3ATJ3_CEPFO; HLH domain-containing protein/PPR domain-containing protein/PPR_2 domain-containing protein/DYW_deaminase domain-containing protein (Fragment)
STRING | Lus10030581 | 0.0 | (Linum usitatissimum)
Best hit in Arabidopsis thaliana
Hit ID | E-value | Description
AT1G68920.3 | 1e-116 | bHLH family protein
Publications
  1. Aubourg S, Boudet N, Kreis M, Lecharny A
    In Arabidopsis thaliana, 1% of the genome codes for a novel protein family unique to plants.
    Plant Mol. Biol., 2000. 42(4): p. 603-13
    [PMID:10809006]
  2. Lurin C, et al.
    Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins reveals their essential role in organelle biogenesis.
    Plant Cell, 2004. 16(8): p. 2089-103
    [PMID:15269332]