PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cagra.6197s0012.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family Trihelix
Protein Properties Length: 374aa    MW: 42733.8 Da    PI: 5.3677
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cagra.6197s0012.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix92.44.4e-2950134286
             trihelix   2 WtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                          W ++e+++Li +r em++ ++++k++k+lWee+s+kmre+gf rsp +C++kw+n+ k++kk k+++ k ts+ s +++y+++++
  Cagra.6197s0012.1.p  50 WVQDETRTLISLRGEMDNLFNTSKSNKHLWEEISNKMREKGFDRSPAMCTDKWRNILKEFKKAKHQDDKATSGGSAKMSYYKEID 134
                          ***********************************************************************************98 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500908.43242106IPR017877Myb-like domain
Gene3DG3DSA:1.10.10.602.8E-443108IPR009057Homeodomain-like
SMARTSM007172.1E-446108IPR001005SANT/Myb domain
CDDcd122031.18E-2948113No hitNo description
PfamPF138376.8E-2149136No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 374 aa     Download sequence    Send to blast
MFVSDNNPSQ DINMMIDLPT QQQNHQIILG DSSGGEDHEI KAPKKRAETW VQDETRTLIS  60
LRGEMDNLFN TSKSNKHLWE EISNKMREKG FDRSPAMCTD KWRNILKEFK KAKHQDDKAT  120
SGGSAKMSYY KEIDDIFRER NKKVALYKSP ATPPSSAKVD SFMQFTDKGF EDTGISFTSV  180
EANGRPSLNL ETQLDHDGLP LAMASADPVT ANGVPPWNWR DASGNGGDVQ PFVGRIITVK  240
FGEYTRRFGI DGTAEAIKEA IRSAFRLRTR RAFWLEDEEQ VVRSLDRDMP LGNYTLHVDE  300
GIAVRVCHYD ESDPLPVHQE EKIFYTEEDY KDFLARRGWT CLREFDAFRN IDVVRNIDNM  360
DDLQPGVLYR GMR*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
2jmw_A5e-4344132186DNA binding protein GT-1
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00003PBMTransfer from 484361Download
Motif logo
Cis-element ? help Back to Top
SourceLink
PlantRegMapCagra.6197s0012.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAB0723700.0AB072370.1 Arabidopsis thaliana mRNA for transcription factor GT-4, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006291280.20.0trihelix transcription factor GT-4 isoform X2
SwissprotQ9LU920.0TGT4_ARATH; Trihelix transcription factor GT-4
TrEMBLR0HJX10.0R0HJX1_9BRAS; Uncharacterized protein (Fragment)
STRINGCagra.6197s0012.1.p0.0(Capsella grandiflora)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM28912768
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G25990.10.0Trihelix family protein