PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Carubv10019974m
Common Name CARUB_v10019974mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family Trihelix
Protein Properties Length: 624 aa    MW: 71359.4 Da    pI: 9.0023
Description Trihelix family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
Carubv10019974m  genome  JGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  95.5   4.9e-30  88     172  1          87
         trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                      rW++ e+laL+++r+em++++r++ lk+plWee+s+km+e g++rs+k+Ckek+en+ k++k++keg+ ++++++  t+++f++lea
  Carubv10019974m  88 RWPRPETLALLRIRSEMGKAFRDSTLKAPLWEEISRKMMELGYKRSAKKCKEKFENVYKYHKRTKEGRTGKSEGK--TYRFFEELEA 172
                      8********************************************************************975544..6******985 PP

2    trihelix  106.9  1.4e-33  442    527  1          87
         trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                      rW+k ev aLi++r+++e +++++  k+plWee+s+ mr+ g++r++k+Ckekwen+nk++kk+ke++kkr + +s+tcpyf+qlea
  Carubv10019974m 442 RWPKTEVEALIRIRKNLEANYQENGTKGPLWEEISAGMRRLGYNRNAKRCKEKWENINKYFKKVKESNKKR-PLDSKTCPYFHQLEA 527
                      8*********************************************************************8.9************85 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SMART            SM00717           0.0046    85     147  IPR001005    SANT/Myb domain
CDD              cd12203           8.17E-24  87     152  No hit       No description
PROSITE profile  PS50090           7.515     87     145  IPR017877    Myb-like domain
Pfam             PF13837           9.1E-20   87     172  No hit       No description
PROSITE profile  PS50090           7.352     435    499  IPR017877    Myb-like domain
SMART            SM00717           0.0056    439    501  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  4.6E-4    441    498  IPR009057    Homeodomain-like
Pfam             PF13837           2.6E-23   441    528  No hit       No description
CDD              cd12203           1.96E-25  442    506  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 624 aa
QNKKKKRKEK GKSTTGFVFS SNSPPHHISL SLFKKDPNSS FQVSMSGNSS GPLESSGGGA  60
GGSGEEEKDM KMMMEETGEV AGGGGGNRWP RPETLALLRI RSEMGKAFRD STLKAPLWEE  120
ISRKMMELGY KRSAKKCKEK FENVYKYHKR TKEGRTGKSE GKTYRFFEEL EAFETLNSYH  180
HPESQPAKSS ATLTTASLIP WISSNNNPST EKSSLPLKHH HHQVSVQPIT TNPTFLTKQP  240
SSTTPFPFYS NNNTTTLSQP PLSSDLMNNV SSLHLFSSST SSSTASDEEE DHHDQGKRSR  300
KRRRYWKGLF TKLTKELMDK QEKMQRRFLE TLENREKERI SREEAWRVQE IARINREHET  360
FLHERSNAAA KDAAIISFLH KISGGQQQQP QQQNHKPSQR KQYQSDHSIT FESKEPKTIL  420
LETTTKIGNY DTSHSISPSS SRWPKTEVEA LIRIRKNLEA NYQENGTKGP LWEEISAGMR  480
RLGYNRNAKR CKEKWENINK YFKKVKESNK KRPLDSKTCP YFHQLEALYN ERNKNGAMPL  540
PLPLPLMVTP ERQLLVSQET QTELETDQRD KVGDKEDEEE GESEEDEYDE EEEGEGDNET  600
SEFEIVLNKT SSSPMDINNN LFT*
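
The sequence block above carries grouping spaces, running position counts, and a trailing `*`, so it has to be cleaned before domain coordinates can be applied to it. The following is a minimal Python sketch of that step, assuming the Start/End values in the Signature Domain table are 1-based and inclusive (which is consistent with the alignment shown for hit 1). The helper names `clean_sequence` and `region` are hypothetical, and only the first three lines of the sequence are embedded to keep the example short.

```python
import re

# First three lines of the Protein Sequence block, copied verbatim
# (residues 1-180), including grouping spaces and position counts.
RAW_BLOCK = """
QNKKKKRKEK GKSTTGFVFS SNSPPHHISL SLFKKDPNSS FQVSMSGNSS GPLESSGGGA  60
GGSGEEEKDM KMMMEETGEV AGGGGGNRWP RPETLALLRI RSEMGKAFRD STLKAPLWEE  120
ISRKMMELGY KRSAKKCKEK FENVYKYHKR TKEGRTGKSE GKTYRFFEEL EAFETLNSYH  180
"""


def clean_sequence(block: str) -> str:
    """Drop whitespace, position counts, and any '*' stop marker."""
    return re.sub(r"[\s\d*]", "", block)


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, read as 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW_BLOCK)

# Hit 1 in the Signature Domain table: Start 88, End 172.
domain1 = region(seq, 88, 172)
print(len(seq), len(domain1))
print(domain1)
```

The extracted region is 85 residues long and matches the ungapped Carubv10019974m row of the first trihelix alignment.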
Nuclear Localization Signal (NLS)
No.  Start  End  Sequence
1    2      7    KKKKRK
2    130    138  KRSAKKCKE
3    299    303  RKRRR
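
When the NLS coordinates are checked against the displayed protein sequence, the listed motifs line up only if Start and End are read as 0-based inclusive offsets, unlike the 1-based domain coordinates. This is an observation about this record, not documented PlantTFDB behavior. The sketch below compares both readings for NLS rows 1 and 2; the function names are hypothetical, and row 3 is left out because it lies beyond the 180-residue prefix embedded here.

```python
# First 180 residues of the displayed protein sequence (grouping spaces
# are removed by split/join).
SEQ_PREFIX = "".join("""
QNKKKKRKEK GKSTTGFVFS SNSPPHHISL SLFKKDPNSS FQVSMSGNSS GPLESSGGGA
GGSGEEEKDM KMMMEETGEV AGGGGGNRWP RPETLALLRI RSEMGKAFRD STLKAPLWEE
ISRKMMELGY KRSAKKCKEK FENVYKYHKR TKEGRTGKSE GKTYRFFEEL EAFETLNSYH
""".split())

# (start, end, motif) for NLS rows 1 and 2 as listed in this record.
NLS_ROWS = [(2, 7, "KKKKRK"), (130, 138, "KRSAKKCKE")]


def slice_zero_based(seq: str, start: int, end: int) -> str:
    """Residues start..end read as 0-based, inclusive offsets."""
    return seq[start:end + 1]


def slice_one_based(seq: str, start: int, end: int) -> str:
    """Residues start..end read as 1-based, inclusive positions."""
    return seq[start - 1:end]


for start, end, motif in NLS_ROWS:
    zero_ok = slice_zero_based(SEQ_PREFIX, start, end) == motif
    one_ok = slice_one_based(SEQ_PREFIX, start, end) == motif
    print(f"{motif}: 0-based match={zero_ok}, 1-based match={one_ok}")
```

For both rows only the 0-based reading reproduces the listed motif, so adding 1 to Start and End gives positions comparable with the domain tables.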
Functional Description
Source   Description
UniProt  Probable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Binding Motif
Motif ID  Method  Source
MP00244   DAP     Transfer from AT1G76890
Cis-element
Source       Link
PlantRegMap  Carubv10019974m
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            Retrieve
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AY087117  0.0      AY087117.1 Arabidopsis thaliana clone 3190 mRNA, complete sequence.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_006300884.2  0.0      trihelix transcription factor GT-2
Swissprot  Q39117          0.0      TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBL     R0GDA8          0.0      R0GDA8_9BRAS; Uncharacterized protein (Fragment)
STRING     XP_006300884.1  0.0      (Capsella rubella)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM5995              28           47
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76890.2  0.0      Trihelix family protein