PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cagra.20992s0001.1.p
Organism Capsella grandiflora
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family Trihelix
Protein Properties Length: 580 aa    MW: 66387.8 Da    pI: 7.1193
Description Trihelix family protein
Gene Model
Gene Model ID           Type      Source    Coding Sequence
Cagra.20992s0001.1.p    genome    JGI       View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      trihelix    95.7     4.4e-30    45       129    1            87
              trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                           rW++ e+laL+++r+em++++r++ lk+plWee+s+km+e g++rs+k+Ckek+en+ k++k++keg+ ++++++  t+++f++lea
  Cagra.20992s0001.1.p  45 RWPRPETLALLRIRSEMGKAFRDSTLKAPLWEEISRKMMELGYKRSAKKCKEKFENVYKYHKRTKEGRTGKSEGK--TYRFFEELEA 129
                           8********************************************************************975544..6******985 PP

2      trihelix    107      1.3e-33    398      483    1            87
              trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                           rW+k ev aLi++r+++e +++++  k+plWee+s+ mr+ g++r++k+Ckekwen+nk++kk+ke++kkr + +s+tcpyf+qlea
  Cagra.20992s0001.1.p 398 RWPKTEVEALIRIRKNLEANYQENGTKGPLWEEISAGMRRLGYNRNAKRCKEKWENINKYFKKVKESNKKR-PLDSKTCPYFHQLEA 483
                           8*********************************************************************8.9************85 PP
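The Start and End columns can be cross-checked against the alignment rows: if the coordinates are 1-based and inclusive, End - Start + 1 should equal the number of residues in the query row once gap characters are removed. The following is a minimal sketch of that check, using the two query rows copied from the alignments above; the function names are illustrative and not part of any PlantTFDB tooling.

```python
# Query rows copied from the two alignment blocks above ('-' marks an alignment gap).
DOMAINS = [
    # (start, end, aligned query segment)
    (45, 129,
     "RWPRPETLALLRIRSEMGKAFRDSTLKAPLWEEISRKMMELGYKRSAKKCKEKFENVYKYHKRTKEGRTGKSEGK--TYRFFEELEA"),
    (398, 483,
     "RWPKTEVEALIRIRKNLEANYQENGTKGPLWEEISAGMRRLGYNRNAKRCKEKWENINKYFKKVKESNKKR-PLDSKTCPYFHQLEA"),
]


def ungapped_length(aligned: str) -> int:
    """Count residues in an aligned segment, ignoring '-' gap characters."""
    return sum(1 for ch in aligned if ch != "-")


def span_is_consistent(start: int, end: int, aligned: str) -> bool:
    """True if a 1-based inclusive [start, end] span holds exactly the ungapped residues."""
    return end - start + 1 == ungapped_length(aligned)


for start, end, aligned in DOMAINS:
    print(start, end, ungapped_length(aligned), span_is_consistent(start, end, aligned))
```

A mismatch would suggest either a transcription error in the row or a different coordinate convention.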

Protein Features
3D Structure
Database           Entry ID            E-value     Start    End    InterPro ID    Description
SMART              SM00717             0.0046      42       104    IPR001005      SANT/Myb domain
Pfam               PF13837             8.2E-20     44       129    No hit         No description
CDD                cd12203             3.01E-23    44       109    No hit         No description
PROSITE profile    PS50090             7.515       44       102    IPR017877      Myb-like domain
PROSITE profile    PS50090             7.352       391      455    IPR017877      Myb-like domain
SMART              SM00717             0.0056      395      457    IPR001005      SANT/Myb domain
Pfam               PF13837             2.3E-23     397      484    No hit         No description
Gene3D             G3DSA:1.10.10.60    4.2E-4      397      454    IPR009057      Homeodomain-like
CDD                cd12203             7.49E-25    398      462    No hit         No description
Gene Ontology
GO Term       GO Category           GO Description
GO:0003677    Molecular Function    DNA binding
Sequence
Protein Sequence    Length: 580 aa
MSGNSSGPLE SSGGGAGGSG EEEKDMKMMM EETGEVAGGG GGGNRWPRPE TLALLRIRSE  60
MGKAFRDSTL KAPLWEEISR KMMELGYKRS AKKCKEKFEN VYKYHKRTKE GRTGKSEGKT  120
YRFFEELEAF ETLNSYHHPE SQPAKSSATL TTASLIPWIS SNNNPSTEKI SLPLKHHHQV  180
SVQPITTNPT FLTKQPSSTT PFPFYSNNNT TTLSQPPISN DLMNNVSSLH LFSSSTSSST  240
ASDEEEDHHD QGKRSRKRRR YWKGLFTKLT KELMDKQEKM QRRFLETLEN REKERISREE  300
AWRVQEIARI NREHETFLHE RSNAAAKDAA IISFLHKISG GQQQQPQQQN HKPSQRKQYQ  360
SDHSITFESK EPKTILLETT TKIGNYDTSH SISPSSSRWP KTEVEALIRI RKNLEANYQE  420
NGTKGPLWEE ISAGMRRLGY NRNAKRCKEK WENINKYFKK VKESNKKRPL DSKTCPYFHQ  480
LEALYNERNK NGAMPLPLPL PLMVTPERQL LVSQETQTEL ETDQRDKVGD KEDEEEGESE  540
EDEYDEEEEG EGDNETSEFE IVLNKTSSSP MDINNNLFT*
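To reuse the sequence outside this page, the grouped layout (blocks of ten residues, a running position counter at the end of each line, and a trailing `*`) has to be flattened first. The sketch below is one way to do that; the function name is illustrative. On the text as printed here, it yields 579 residue letters plus the `*`, so the listed length of 580 appears to count the stop symbol. That is an observation about this record, not documented database behavior.

```python
import re

# Sequence block copied verbatim from the record, including position counters.
RAW_BLOCK = """\
MSGNSSGPLE SSGGGAGGSG EEEKDMKMMM EETGEVAGGG GGGNRWPRPE TLALLRIRSE  60
MGKAFRDSTL KAPLWEEISR KMMELGYKRS AKKCKEKFEN VYKYHKRTKE GRTGKSEGKT  120
YRFFEELEAF ETLNSYHHPE SQPAKSSATL TTASLIPWIS SNNNPSTEKI SLPLKHHHQV  180
SVQPITTNPT FLTKQPSSTT PFPFYSNNNT TTLSQPPISN DLMNNVSSLH LFSSSTSSST  240
ASDEEEDHHD QGKRSRKRRR YWKGLFTKLT KELMDKQEKM QRRFLETLEN REKERISREE  300
AWRVQEIARI NREHETFLHE RSNAAAKDAA IISFLHKISG GQQQQPQQQN HKPSQRKQYQ  360
SDHSITFESK EPKTILLETT TKIGNYDTSH SISPSSSRWP KTEVEALIRI RKNLEANYQE  420
NGTKGPLWEE ISAGMRRLGY NRNAKRCKEK WENINKYFKK VKESNKKRPL DSKTCPYFHQ  480
LEALYNERNK NGAMPLPLPL PLMVTPERQL LVSQETQTEL ETDQRDKVGD KEDEEEGESE  540
EDEYDEEEEG EGDNETSEFE IVLNKTSSSP MDINNNLFT*
"""


def parse_sequence_block(block: str) -> str:
    """Flatten a grouped sequence block into one string.

    Letters and the '*' stop symbol are kept; spaces, newlines, and the
    numeric position counters are layout only and are dropped.
    """
    return re.sub(r"[^A-Za-z*]", "", block)


sequence = parse_sequence_block(RAW_BLOCK)  # still ends with '*'
residues = sequence.rstrip("*")             # amino-acid letters only

print(len(sequence), len(residues))
print(residues[44:50])  # residues 45-50 in 1-based numbering
```

Stripping the `*` before passing the sequence to downstream tools is usually the safer choice, since many of them reject non-residue characters.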
Nuclear Localization Signal
NLS
No.    Start    End    Sequence
1      87       95     KRSAKKCKE
2      255      259    RKRRR
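The NLS coordinates are worth checking against the protein sequence before they are used for slicing. The sketch below compares both indexing conventions on the first 300 residues as printed on this page. On this record the listed motifs are reproduced only when Start and End are read as 0-based inclusive positions, whereas the Signature Domain alignment numbers its residues from 1. The helper name and the `zero_based` flag are illustrative assumptions, and the result is an observation about this entry rather than a documented rule.

```python
# First 300 residues of the protein sequence, copied from the Sequence section.
N_TERMINAL = "".join([
    "MSGNSSGPLE SSGGGAGGSG EEEKDMKMMM EETGEVAGGG GGGNRWPRPE TLALLRIRSE",
    "MGKAFRDSTL KAPLWEEISR KMMELGYKRS AKKCKEKFEN VYKYHKRTKE GRTGKSEGKT",
    "YRFFEELEAF ETLNSYHHPE SQPAKSSATL TTASLIPWIS SNNNPSTEKI SLPLKHHHQV",
    "SVQPITTNPT FLTKQPSSTT PFPFYSNNNT TTLSQPPISN DLMNNVSSLH LFSSSTSSST",
    "ASDEEEDHHD QGKRSRKRRR YWKGLFTKLT KELMDKQEKM QRRFLETLEN REKERISREE",
]).replace(" ", "")

# (No., Start, End, Sequence) rows from the NLS table.
NLS_ROWS = [
    (1, 87, 95, "KRSAKKCKE"),
    (2, 255, 259, "RKRRR"),
]


def nls_matches(seq: str, start: int, end: int, motif: str, zero_based: bool) -> bool:
    """Check whether seq[start..end] (inclusive) equals motif under one convention."""
    offset = 0 if zero_based else 1
    return seq[start - offset : end - offset + 1] == motif


for number, start, end, motif in NLS_ROWS:
    print(
        number,
        "0-based:", nls_matches(N_TERMINAL, start, end, motif, zero_based=True),
        "1-based:", nls_matches(N_TERMINAL, start, end, motif, zero_based=False),
    )
```

In 1-based terms the two motifs therefore sit at residues 88-96 and 256-260 of the sequence shown here.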
Functional Description
Source     Description
UniProt    Probable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Binding Motif
Motif ID    Method    Source                     Motif file
MP00244     DAP       Transfer from AT1G76890    Download
Motif logo
Cis-element
Source         Link
PlantRegMap    Cagra.20992s0001.1.p
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              Retrieve
Annotation -- Nucleotide
Source     Hit ID      E-value    Description
GenBank    AY087117    0.0        AY087117.1 Arabidopsis thaliana clone 3190 mRNA, complete sequence.
Annotation -- Protein
Source       Hit ID                  E-value    Description
Refseq       XP_006300884.2          0.0        trihelix transcription factor GT-2
Swissprot    Q39117                  0.0        TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBL       R0GDA8                  0.0        R0GDA8_9BRAS; Uncharacterized protein (Fragment)
STRING       Cagra.20992s0001.1.p    0.0        (Capsella grandiflora)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
MalvidsOGEM59952847
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT1G76890.2    0.0        Trihelix family protein