PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Neem_6198_f_1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Meliaceae; Azadirachta
Family B3
Protein Properties Length: 1678 aa    MW: 188770 Da    pI: 7.5924
Description B3 family protein
Gene Model
Gene Model ID  Type    Source
Neem_6198_f_1  genome  NGD
Signature Domain
No.  Domain  Score  E-value  Start  End   HMM Start  HMM End
1    B3      46.8   5.3e-15  1143   1219  17         99
                     E--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
             B3   17 vlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99  
                      lp kf+++++ +k+++ +++l  +sg++W v l  +++ +  +++kGW  Fv+++ ++ gD +vF++dg+  f  +v+vf++
  Neem_6198_f_1 1143 KLPLKFVRHME-SKTCG-QVSLIGPSGNVWHVDL--TQGNDDLFFAKGWPAFVRDHFIECGDLLVFRYDGELHF--TVQVFDQ 1219
                     69*****6555.55676.9***************..999999**************************996666..9999987 PP

2    B3      38.2   2.5e-12  1344   1426  13         98
                     TT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE...EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE- CS
             B3   13 sgrlvlpkkfaeehggkkeesktltledesgrsWevkliy.rkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfr 98  
                     s  l +p kf  +h   +++++ ++l++++g sW v+    +k   ++++ +GW  Fv++n++k gD+++F+l+++ e  l+v+++r
  Neem_6198_f_1 1344 SYTLNIPYKFSMAHL--PKCKTVVILRNLKGASWIVNSVPtTKVHTSHTFCGGWLAFVRSNEIKLGDICIFELVRKCE--LRVHILR 1426
                     45689*****98885..348889**************95557777779999*********************998444..5888887 PP

3    B3      44.6   2.6e-14  1571   1651  13         96
                     TT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTE.EEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEE CS
             B3   13 sgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgr.yvltkGWkeFvkangLkegDfvvFkldgrsefelvvkv 96  
                     s  l +p kf  +h   + ++++++l++++g+ W+v+   + k+++ +++ +GW  Fv+ n++k gD+++F+l+++ e   +v++
  Neem_6198_f_1 1571 SYTLKIPYKFSMAHL--PDCKTEIVLRNLKGECWTVNSLPDSKGRTvHTFCGGWMAFVRGNDIKIGDICIFELISKCEM--RVHI 1651
                     45699*****99995..348899***************6655544469999**********************986555..4444 PP
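The Start and End columns can be checked against the protein sequence. The sketch below assumes those columns are 1-based and inclusive, which is consistent with the alignment rows above; `slice_domain` and `FRAGMENT` are hypothetical names introduced for illustration, and the fragment is residues 1141-1220 copied from the Protein Sequence block of this entry.

```python
def slice_domain(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment whose
    first residue sits at position fragment_start in the full protein."""
    if start < fragment_start or end < start:
        raise ValueError("coordinates fall outside the fragment")
    offset = start - fragment_start      # convert to a 0-based index
    length = end - start + 1             # inclusive range
    if offset + length > len(fragment):
        raise ValueError("fragment is too short for the requested range")
    return fragment[offset:offset + length]


# Residues 1141-1220 of Neem_6198_f_1 (assumption: copied by hand from the
# sequence listing, with the grouping spaces removed).
FRAGMENT = (
    "LPKLPLKFVRHMESKTCGQVSLIGPSGNVWHVDLTQGNDDLFFAKGWPAFVRDHFIECGD"
    "LLVFRYDGELHFTVQVFDQS"
)

# First B3 hit: Start 1143, End 1219.
domain1 = slice_domain(FRAGMENT, 1141, 1143, 1219)
print(len(domain1), domain1[:8], domain1[-7:])
```

With the gap characters removed, the query row of the first alignment should match the returned string.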

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End   InterPro ID  Description
SuperFamily      SSF53383           2.89E-12  154    398   IPR015424    Pyridoxal phosphate-dependent transferase
Gene3D           G3DSA:3.40.640.10  4.5E-30   178    389   IPR015421    Pyridoxal phosphate-dependent transferase, major region, subdomain 1
SMART            SM01019            5.9E-5    1133   1220  IPR003340    B3 DNA binding domain
SuperFamily      SSF101936          8.63E-19  1143   1225  IPR015300    DNA-binding pseudobarrel domain
Gene3D           G3DSA:2.40.330.10  4.6E-17   1143   1222  IPR015300    DNA-binding pseudobarrel domain
PROSITE profile  PS50863            13.267    1144   1220  IPR003340    B3 DNA binding domain
CDD              cd10017            9.86E-16  1144   1218  No hit       No description
Pfam             PF02362            2.1E-11   1144   1219  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  1.4E-19   1324   1426  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          2.16E-21  1327   1427  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            3.84E-18  1330   1426  No hit       No description
PROSITE profile  PS50863            12.844    1332   1428  IPR003340    B3 DNA binding domain
SMART            SM01019            9.6E-10   1332   1428  IPR003340    B3 DNA binding domain
Pfam             PF02362            7.2E-10   1342   1426  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10  4.9E-21   1552   1652  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936          1.51E-22  1552   1653  IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017            1.23E-19  1557   1651  No hit       No description
PROSITE profile  PS50863            12.449    1559   1655  IPR003340    B3 DNA binding domain
SMART            SM01019            4.9E-12   1559   1655  IPR003340    B3 DNA binding domain
Pfam             PF02362            2.9E-11   1569   1651  IPR003340    B3 DNA binding domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
GO:0003824  Molecular Function  catalytic activity
Sequence
Protein Sequence    Length: 1678 aa
MHLSLWKPIT HCAALIMDKK TKRRNRSGLT VDVKRKSSVL RQLQENKLKE ALEEASEEGS  60
LAKSQDIDSD SLNHETSFGR SRSLARLHAQ KEFLRATALA ADRIFSSEDS ILQLHDAFSK  120
FLTMYPKYQN TEKIDQLRSD EYGHLSESFA KVCLDYCGFG LFSYIQTQQY WESSAFTLSE  180
ITANLSNHAL YGGAERGSTE HDIKTRIMDY LNIPENEYGL VFTVSRGSAF KLLAESYPFD  240
RNRRLLTMFD HESQSVNWMA QSAKEKGAKV YSAWFKYPSL KLCSRELRKQ ISNKKKKKKG  300
CANGLFVFPV QSRVTGAKYS YQWMALAQQN NWHVLLDAGS LGPKDMDSLG LSLFRPDFII  360
TSFYRVFGSD PTGFGCLLIK KSVMGTLQNQ SGRTGSGMVK ILPVYLQYLS DSMDGLDALA  420
RIDNDAINGN EESMPETEGG SQMPAFSGVF TSNQVRDVFE IEMDQDNSSD RDGASTIFEE  480
VESISVGEVM KSPIFSEDES SDNSYWIDLG QSPFGSENSG QLTKQKTGSP LPPSWFSGRR  540
NNKQFSPKAT SKLSKSPMYE DRRVNLGLHD EPALSFDAAV LSMSQDLDDV KENPEEEQFA  600
ETEPAFGNGE KHTHLEHVGE IQEETDIRDG SLLHDSTLSA VANGFRNKNQ TSGLGHVNFG  660
DASASEIRQD KDSAIRRETE GEFRLLGRRE RDRFAVGRFF VVEENDRVPS MGRRVSFSME  720
EYRKENLSHL EQSDISLTAL PDDESVSDAE YDDEQDWERQ EPEIICRHLD HINMLGLNKT  780
TLRLRYLINW LVTSLLRLRI PSSGEDVGVP LVQIYGPKIK YERGASVAFN VRESSEGRLV  840
HPEIVQKLAE KNGISVGIGI LSHIRIVDSP KQHCGAFELE DRDLCKPMAN GRGDGKNVFY  900
RVEVVTASLG FLTNFEDVYK MWAFVANVFT LTAQVEISYL DSWEMLKQTG KDAKGTLLCS  960
LSLLLFLGFE GDSLFSTVNA TANKLKLSFI VYVVLNNVHI YLSQIFSYPE CDMRKHQEED  1020
YGFERVYKHD SPQWTGPSRA KYNSYLLKAI DQFYLQTGKK LIDLILHLNK SKKIQMQTNN  1080
PGYDAQEKNE VAETAIYINT RYRKKLHRTG TPPASYAASS NCWQAISARA SNLLGIALAR  1140
LPKLPLKFVR HMESKTCGQV SLIGPSGNVW HVDLTQGNDD LFFAKGWPAF VRDHFIECGD  1200
LLVFRYDGEL HFTVQVFDQS ACEKEASFNS QSSQNSRKFD DSRGQKRERE EMAAPSDKVF  1260
QGVLKKLREV SSEFQSECID KNQEAGSCEE KKWCVSLSNS FALPSQSKVC NVKPEDEEKN  1320
VAQSFMSSFP YFVRLMKRFN ISGSYTLNIP YKFSMAHLPK CKTVVILRNL KGASWIVNSV  1380
PTTKVHTSHT FCGGWLAFVR SNEIKLGDIC IFELVRKCEL RVHILRVGKE DQHSQSGKIA  1440
FGLNVGSAGI SCKMFDGVPK KVKNSLKIHS KCTKKVKLCD MEGSNMCDIK KHVGTTKNSA  1500
SGAPCCQSKI GNEKSEVAIQ IGNSIGAEIG SEARSKLRMM VALDEEKAAR SFGSCVPHFV  1560
RIMRKFNISG SYTLKIPYKF SMAHLPDCKT EIVLRNLKGE CWTVNSLPDS KGRTVHTFCG  1620
GWMAFVRGND IKIGDICIFE LISKCEMRVH ISGIGRTELY HQSGKSTSNE SSICDPPL
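The listing above groups residues in blocks of ten and ends most lines with a running position counter. A minimal sketch of turning such a block into a plain sequence string follows; `clean_sequence_block` is a hypothetical helper, and the sample input is only the first two lines of this entry rather than the full protein.

```python
import re


def clean_sequence_block(text: str) -> str:
    """Join a grouped sequence listing into one string.

    Trailing position counters (for example "  60") and all internal
    whitespace are dropped; residue letters are kept unchanged.
    """
    parts = []
    for line in text.splitlines():
        line = re.sub(r"\s+\d+\s*$", "", line)   # strip the end-of-line counter
        parts.append(re.sub(r"\s+", "", line))   # strip grouping spaces
    return "".join(parts)


# First two lines of the Protein Sequence block (residues 1-120).
block = (
    "MHLSLWKPIT HCAALIMDKK TKRRNRSGLT VDVKRKSSVL RQLQENKLKE ALEEASEEGS  60\n"
    "LAKSQDIDSD SLNHETSFGR SRSLARLHAQ KEFLRATALA ADRIFSSEDS ILQLHDAFSK  120"
)

seq = clean_sequence_block(block)
print(len(seq))
```

Applied to the whole listing, the resulting length can be compared with the stated Length of 1678 aa as a sanity check on the copy.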
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_015884772.1  0.0      uncharacterized protein LOC107420352
TrEMBL  A0A2P5FMX8      0.0      A0A2P5FMX8_TREOI; Glycine dehydrogenase (Decarboxylating)
STRING  EMJ26538        0.0      (Prunus persica)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G33280.1  5e-17    B3 family protein