PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Carubv10003996m
Common NameCARUB_v10003996mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family bHLH
Protein Properties Length: 1333aa    MW: 153382 Da    PI: 5.0903
Description bHLH family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Carubv10003996mgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HLH36.68.2e-1211791225354
                       HHHHHHHHHHHHHHHHHHHHHHCTSCCC...TTS-STCHHHHHHHHHHHHHH CS
              HLH    3 rahnerErrRRdriNsafeeLrellPkaskapskKlsKaeiLekAveYIksL 54  
                       ++h+ +ErrRR riN  f++Lr++lP++      K +Ka++L ++v+Y ++L
  Carubv10003996m 1179 KKHSDAERRRRLRINCQFATLRTILPNL-----VKQDKASVLGETVRYFNEL 1225
                       68*************************9.....7**************9988 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF474597.85E-1211681231IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
PROSITE profilePS5088815.00811761225IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
CDDcd000834.41E-711791230No hitNo description
Gene3DG3DSA:4.10.280.102.4E-1211791230IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
PfamPF000101.4E-911791225IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
SMARTSM003534.6E-1211821231IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005739Cellular Componentmitochondrion
GO:0046983Molecular Functionprotein dimerization activity
Sequence ? help Back to Top
Protein Sequence    Length: 1333 aa     Download sequence    Send to blast
MEKVYEELDE VKAANEKLRM DYRNKTEILD NLKKVQKEQL VDIREARLVN EKQCFEIEEK  60
TRDVSELKRA NEDLQRCLRE KDSVLKRVND ANDKLRANGE DKYRELEEEK RSMMSALDEA  120
TEKNIDLEQE NNVYRAEIEG LKGLLGAAEK KRIQVEKTVE AMKEMRGRDD MVIKMEEEKA  180
QVEEKLKWKK EQFKHLEEAY EKLQNLFKAS KKEWEEEKST LLDEIYSLQA KLDSLTRISE  240
DLQKKLQMSN SALTQEETRR KRLEVQVSEF KTRYEDAFAE YKDARTQLDD LAGKRDEEVA  300
ELRQSLSMKD TYLKEMKYEN GKLEQENQEL LGSLKELQEA TIQGSGNSAL SKLKNKFRNL  360
ENIHKTCSAN LRSKEAEWSS RLDKMAEEIN DYQLRLQSKE EALKDVELEL ENCHSSAAKV  420
RLQYEEISVM FLVLSRTVSE AQSRLANVKD EQIKDEKREE KSYSLLMEQL DQKNAALAKA  480
HLEIEEERER VACLLKRIDM LDLIEDQKIQ MEKEVERYKE MVEESSRFQT QMKEKLEEAE  540
NDYEEKLLQV CDALDNTNSD LVSEREKVVA LTRQIESFGF VKEKNLVMEK EIEKYKEMLE  600
ESQKSRVLLE EQISQLESDS KENIRELCSK VDIAYAKLAE EVEKTTSLVR KSEAIDLNEE  660
NREQELDNYK GRLEESTKSQ LRLQEKVVEV ENDSKRKLAD VSEALETANS ELSDKTSEVY  720
QIEFQLWIWK SIAKRLKAEL EQNQNLRKRV EASLIEQVGV GEAIMQERNE LMHKLKSINS  780
TRSSVSEIET LMRDKDDILE NLQRELELLE QESLSRELED VFIAHTIAET ELQKEREIFA  840
GALQQKEQDL REVKHKWEGS FKSVSLLLAE EQNKVNMLHK AWEKLSATQI LTAVESESKK  900
MMIIELEGEI FSLSKKLKAS GENASCFRQE ATKLGAELET KQRELKEVTT QMQVKLKTSE  960
AEKTELVKEV ASLSSEKGNL LSFISAMEDG MLKLCDGDTK LMKTLERVTQ CCDGFGKENN  1020
NGETTGSPRL AMKHEDAVIE DSYYLLVSCP YITSYYMFSS FSLYVHTAKS AFPLLFLPSL  1080
CYFFSHHQSN MFSSYSVAEP IICCIACVVC LMQLEQGVRP ISRCYNPNPT TYPTTIGRNI  1140
FFPGAGAGAG AATSSKLFSR GFSVTKPKPK TESKEAAAKK HSDAERRRRL RINCQFATLR  1200
TILPNLVKQD KASVLGETVR YFNELKKMVQ DIPTTPSLED SLRLGHCNNN RDLARVVFSC  1260
SDREGLMSEV AESLKAAKTK AVRAEIMTVG GRTKCALFVQ GVNGNEGLVK LKKSLKPVVN  1320
GKISSEAKEQ HQ*
Cis-element ? help Back to Top
SourceLink
PlantRegMapCarubv10003996m
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAL0355380.0AL035538.1 Arabidopsis thaliana DNA chromosome 4, BAC clone F20D10 (ESSA project).
GenBankAL1615920.0AL161592.2 Arabidopsis thaliana DNA chromosome 4, contig fragment No. 88.
GenBankCP0026870.0CP002687.1 Arabidopsis thaliana chromosome 4 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_023634603.10.0uncharacterized protein At4g38062
SwissprotP0CB230.0Y4862_ARATH; Uncharacterized protein At4g38062
TrEMBLR0GUB70.0R0GUB7_9BRAS; Uncharacterized protein
STRINGXP_006283007.10.0(Capsella rubella)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM959266
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G38070.10.0bHLH family protein