PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pav_sc0001673.1_g130.1.mk
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Amygdaloideae; Amygdaleae; Prunus
Family ARR-B
Protein Properties Length: 1160aa    MW: 131053 Da    PI: 6.3439
Description ARR-B family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pav_sc0001673.1_g130.1.mkgenomeGDRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1G2-like74.51.5e-23788841155
                    G2-like   1 kprlrWtpeLHerFveaveqLGGsekAtPktilelmkvkgLtlehvkSHLQkYRl 55 
                                kp+l+Wt+eLH++F++av+ L G + A Pk+il++m+v+gL  e+v SHLQkYRl
  Pav_sc0001673.1_g130.1.mk 788 KPKLVWTNELHKSFLQAVRLL-GVDSAHPKNILQHMNVPGLRKENVSSHLQKYRL 841
                                79*******************.********************************8 PP

2Myb_DNA-binding34.54.7e-11790840148
                                TSSS-HHHHHHHHHHHHHTTTT..-HHHHHHHHT.TTS-HHHHHHHHHHHT CS
            Myb_DNA-binding   1 rgrWTteEdellvdavkqlGgg..tWktIartmg.kgRtlkqcksrwqkyl 48 
                                ++ WT e+++ +++av++lG +  ++k I ++m+  g+  +++ s++qky+
  Pav_sc0001673.1_g130.1.mk 790 KLVWTNELHKSFLQAVRLLGVDsaHPKNILQHMNvPGLRKENVSSHLQKYR 840
                                679***********************************************7 PP

3Response_reg49.91.7e-176067131108
                                EEEESSSHHHHHHHHHHHHHTTCEEEEEESSHHHHHHHHHHHH..ESEEEEESSCTTSEHHHHHHHHHHHTTTSEEEEEESTTT CS
               Response_reg   1 vlivdDeplvrellrqalekegyeevaeaddgeealellkekd..pDlillDiempgmdGlellkeireeepklpiivvtahge 82 
                                vl+vd + + + +l ++l + gy +v +a+ + +al ++++k+   +l+l+   +p+md +ell+++++ + k+p++++++ ++
  Pav_sc0001673.1_g130.1.mk 606 VLVVDGDSACLAILSRMLCNLGY-KVMTAKRAYDALSIAQKKEdeLHLVLTESHLPDMDKYELLEKMKAVS-KVPVVIMSDDDD 687
                                89*********************.***************888889***********************999.************ PP

                                HHHHHHHHHTTESEEEESS--HHHHH CS
               Response_reg  83 eedalealkaGakdflsKpfdpeelv 108
                                e+ +l  l  Ga  +++Kp  ++ l 
  Pav_sc0001673.1_g130.1.mk 688 ENAMLGGLFKGAVFYFVKPLTINSLK 713
                                ********************999886 PP

Sequence ? help Back to Top
Protein Sequence    Length: 1160 aa     Download sequence    
MGVGGKFWDL LKPYARHEGF DFLRNKRVAV DLSFWIVQHE TAIKDRARSP HLRVTFFRTI  60
NLFSKFGAFP VFVVDGSPSP LKSEARIARF FRSSGIDSSS LPVAGDVKAK GEAEALCAQL  120
DAEGHVDACI TSDSDVFLFG AKCVIKTFQS NSREPFECYY MSDIEAGLGL KRKHLIAISL  180
LVGNDYDLNG VQGVGLDTAL RFAQTFSEDV ILNRLREIGN GDASLLQGEI RSVDDSVPSP  240
DESSLKRKFS HCSFCGHPGS KRTHFKSSCE YCSSTMGEGC MKKSEGFKCS CSSCDMDRKE  300
KEQKKQDNWR LKVLSKIALE PNFPNDAIIE MYLCNSHGYF TENDGLHISW GSPKTEMVVD  360
FLAYHQLWEP SYIRQRMLPM LSTIFLREMA KDPVKSLLYG QYEFDSIDRL KIRYGHQFYV  420
VKWKKSAPSL GCVSCTVPPE ESDMQQDDVM EVDESINPFD ESDVPTIDIN NGCCLLLTDE  480
NMDLVHAAFP EEVDRFLQEK ELKELKRRKT GKPETAGSRG VQLNITEFYR SAKVYETEPG  540
EILTKKTEPG EILSSQRAET SKEKRKPSSS NLPKSKLFEK LWMLKAIANL SDLLIFSSPR  600
LEPVSVLVVD GDSACLAILS RMLCNLGYKV MTAKRAYDAL SIAQKKEDEL HLVLTESHLP  660
DMDKYELLEK MKAVSKVPVV IMSDDDDENA MLGGLFKGAV FYFVKPLTIN SLKNLWQFAI  720
IKNRNHVVID LTEEESSVYG ESQQENTSNE GLESESFMTR DRWLRRKNPE GTYKDEETEN  780
SDSTSQKKPK LVWTNELHKS FLQAVRLLGV DSAHPKNILQ HMNVPGLRKE NVSSHLQKYR  840
LSLKREQEAI MKARARGPKK SNFPSHQSSS TLNFRGGCSQ TPNRYSTSTA YQPKRRSHVQ  900
DLSSHMSRPS PAGSVCFPSQ VYSSGHPTLS NESSFPSMPY HSNYKNGQPT LSNQLYYTNY  960
HNPNHIGNRI TSNKELAGFR QVGSFVRYSN FDGNKNFSSF GDHGHSNLVD FPILLHNSTQ  1020
QEVQQQQQLQ PQVLFSPPLQ MPSPPPQPQE QQENDILGQE RQLQPQFLPS SPLQMPSPSP  1080
LPAAAEQQGE QDDIFGLEKQ LMQSPQWHMP SMPSMPAAPT AHEQEKDDMF GIESEETDDL  1140
FDIAKGTTQH FNDVDFDDLW
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1502508LKELKRR
2503509KELKRRK
3894898KRRSH
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G16110.19e-53ARR-B family protein