PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Csa07g038230.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family Whirly
Protein Properties Length: 244aa    MW: 26699.3 Da    PI: 10.2114
Description Whirly family protein
Gene Model
Gene Model ID    Type     Source   Coding Sequence
Csa07g038230.1   genome   CSGP     View CDS
Signature Domain
No.   Domain   Score   E-value   Start   End   HMM Start   HMM End
1     Whirly   213.1   2.7e-66   64      202   1           139
          Whirly   1 svyktkaalkvkavrptfealdsgnlklkraGglllelanataerkydWekkqsfalsatevaelvdlaskesceffhdpaakgsneGkvrkalk 95 
                     +++k+kaal+v++v+ptf+++dsgn++++r+G+l+++++++++erkyd  +kq f+ls+tev++l++++sk+s+effhdp++k+sn+G+vrk+l+
  Csa07g038230.1  64 AIFKGKAALSVEPVLPTFTEIDSGNRRIERRGSLMMTFMPSIGERKYDGAQKQLFSLSPTEVGSLISMGSKDSSEFFHDPSMKSSNAGQVRKTLS 158
                     59********************************************************************************************* PP

          Whirly  96 vePlpdGsGlfvnlsvtnslvkgnesfsvPvskaefavlrsllv 139
                     ++P++dGsG+f++lsv n+++++n++f+vPv+ aefav++++++
  Csa07g038230.1 159 IKPHADGSGYFFSLSVVNNILNTNDRFVVPVTTAEFAVMKTAFS 202
                     *****************************************995 PP

Protein Features
3D Structure
Database      Entry ID           E-value    Start   End   InterPro ID   Description
Gene3D        G3DSA:2.30.31.10   1.4E-72    52      219   IPR009044     ssDNA-binding transcriptional regulator
SuperFamily   SSF54447           3.92E-66   56      241   IPR009044     ssDNA-binding transcriptional regulator
Pfam          PF08536            1.6E-58    65      199   IPR013742     Plant transcription factor
Gene Ontology
GO Term      GO Category          GO Description
GO:0006281   Biological Process   DNA repair
GO:0006355   Biological Process   regulation of transcription, DNA-templated
GO:0005739   Cellular Component   mitochondrion
GO:0003677   Molecular Function   DNA binding
Sequence
Protein Sequence    Length: 244 aa
MMKQARTLLS RSLCDQSKSL LEEKVRATTL RGFASWSDSS ASRGGFSGQK DAAKPSGRVF  60
APYAIFKGKA ALSVEPVLPT FTEIDSGNRR IERRGSLMMT FMPSIGERKY DGAQKQLFSL  120
SPTEVGSLIS MGSKDSSEFF HDPSMKSSNA GQVRKTLSIK PHADGSGYFF SLSVVNNILN  180
TNDRFVVPVT TAEFAVMKTA FSFALPHIMG WDRVIDQVGT GTQKTTSQHL TTGPQHIEQE  240
WDK*
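The coordinates reported in this record (for example the Whirly domain at positions 64 to 202) are 1-based and inclusive, so they can be checked directly against the sequence listing. The following is a minimal sketch, assuming the listing is pasted as plain text with its block spacing and position counters intact; the helper names `parse_sequence` and `subsequence` are hypothetical and not part of any PlantTFDB tooling.

```python
import re

# Sequence listing as it appears in this record, including the
# position counters at line ends and the trailing "*" terminator.
RAW_LISTING = """
MMKQARTLLS RSLCDQSKSL LEEKVRATTL RGFASWSDSS ASRGGFSGQK DAAKPSGRVF  60
APYAIFKGKA ALSVEPVLPT FTEIDSGNRR IERRGSLMMT FMPSIGERKY DGAQKQLFSL  120
SPTEVGSLIS MGSKDSSEFF HDPSMKSSNA GQVRKTLSIK PHADGSGYFF SLSVVNNILN  180
TNDRFVVPVT TAEFAVMKTA FSFALPHIMG WDRVIDQVGT GTQKTTSQHL TTGPQHIEQE  240
WDK*
"""


def parse_sequence(raw: str) -> str:
    """Drop spaces, newlines, and position counters; strip a trailing '*'."""
    residues = re.sub(r"[^A-Za-z*]", "", raw)
    return residues.rstrip("*").upper()


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"coordinates {start}-{end} outside 1-{len(seq)}")
    return seq[start - 1:end]


protein = parse_sequence(RAW_LISTING)

# Whirly signature domain coordinates taken from the Signature Domain table.
whirly = subsequence(protein, 64, 202)

print(len(protein), len(whirly))
print(whirly[:10], "...", whirly[-10:])
```

The extracted slice should begin and end with the same residues as the query row of the domain alignment above. Note that the listing holds 243 residue letters followed by the `*` terminator.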
3D Structure
Structure
PDB ID   E-value   Query Start   Query End   Hit Start   Hit End   Description
4kop_A   1e-104    50            214         2           166       Single-stranded DNA-binding protein WHY2, mitochondrial
4kop_B   1e-104    50            214         2           166       Single-stranded DNA-binding protein WHY2, mitochondrial
4kop_C   1e-104    50            214         2           166       Single-stranded DNA-binding protein WHY2, mitochondrial
4kop_D   1e-104    50            214         2           166       Single-stranded DNA-binding protein WHY2, mitochondrial
Functional Description
Source    Description
UniProt   Single-stranded DNA-binding protein that associates with mitochondrial DNA and may play a role in the regulation of the gene expression machinery. Seems also to be required to prevent break-induced DNA rearrangements in the mitochondrial genome. Can bind to melt double-stranded DNA in vivo. {ECO:0000269|PubMed:18423020, ECO:0000269|PubMed:20551348, ECO:0000269|PubMed:22762281}.
Cis-element
Source        Link
PlantRegMap   Csa07g038230.1
Regulation -- PlantRegMap
Source        Upstream Regulator   Target Gene
PlantRegMap   Retrieve             -
Annotation -- Nucleotide
Source    Hit ID     E-value   Description
GenBank   AB493532   0.0       AB493532.1 Arabidopsis thaliana At1g71260 mRNA for hypothetical protein, partial cds, clone: RAAt1g71260.
GenBank   AY072110   0.0       AY072110.1 Arabidopsis thaliana unknown protein (At1g71260) mRNA, complete cds.
GenBank   AY122961   0.0       AY122961.1 Arabidopsis thaliana unknown protein (At1g71260) mRNA, complete cds.
Annotation -- Protein
Source      Hit ID           E-value   Description
Refseq      XP_010415836.1   0.0       PREDICTED: single-stranded DNA-binding protein WHY2, mitochondrial
Swissprot   Q8VYF7           1e-132    WHY2_ARATH; Single-stranded DNA-binding protein WHY2, mitochondrial
TrEMBL      R0GIE3           1e-138    R0GIE3_9BRAS; Uncharacterized protein
STRING      XP_010415836.1   0.0       (Camelina sativa)
Orthologous Group
Lineage   Orthologous Group ID   Taxa Number   Gene Number
Malvids   OGEM12358              27            31
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT1G71260.1   1e-134    WHIRLY 2