PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID XP_022737338.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Helicteroideae; Durio
Family ARR-B
Protein Properties Length: 1339 aa    MW: 151164 Da    pI: 6.5625
Description ARR-B family protein
Gene Model
Gene Model ID   Type    Source  Coding Sequence
XP_022737338.1  genome  NCBI    View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End   HMM Start  HMM End
1    DUF702           46.3   1.5e-14  73     131   2          71
          DUF702   2 rsgtasCqdCGnqakkdCaheRCRtCCksrgfdCathvkstWvpaakrrerqqqlaaasskaaasaaeaa 71 
                     r+g+++CqdCGn+ kk+     C      + f+C+t vk tWvpaakrr rqq +aa+++k++  + + +
  XP_022737338.1  73 RQGSMNCQDCGNKPKKEY----C------QVFQCQTDVKGTWVPAAKRRRRQQLTAAQQEKKKIFK-NTK 131
                     7899************84....2......47********************998888766554322.222 PP

2    G2-like          88.3   7.1e-28  1099   1152  1          55
         G2-like    1 kprlrWtpeLHerFveaveqLGGsekAtPktilelmkvkgLtlehvkSHLQkYRl 55  
                      kpr++W+ eLH++F+ av++L G +kA+Pk+il+lm++kgLt+ehv+SHLQkYRl
  XP_022737338.1 1099 KPRIVWSVELHREFIAAVNKL-GLDKAVPKKILDLMNIKGLTREHVASHLQKYRL 1152
                      79*******************.********************************8 PP

3    Myb_DNA-binding  40     9.3e-13  1103   1151  3          48
                       SS-HHHHHHHHHHHHHTTTT..-HHHHHHHHT.TTS-HHHHHHHHHHHT CS
  Myb_DNA-binding    3 rWTteEdellvdavkqlGgg..tWktIartmg.kgRtlkqcksrwqkyl 48  
                        W+ e++  ++ av++lG +   +k+I   m+ kg+t++++ s++qky+
   XP_022737338.1 1103 VWSVELHREFIAAVNKLGLDkaVPKKILDLMNiKGLTREHVASHLQKYR 1151
                       6***********************************************7 PP

4    Response_reg     64.9   3.8e-22  919    1025  1          109
                      EEEESSSHHHHHHHHHHHHHTTCEEEEEESSHHHHHHHHHHHH..ESEEEEESSCTTSEHHHHHHHHHHHTTTSEEEEEESTTTHHHHHHHHH CS
    Response_reg    1 vlivdDeplvrellrqalekegyeevaeaddgeealellkekd..pDlillDiempgmdGlellkeireeepklpiivvtahgeeedalealk 91  
                      vl vdD+p ++++l+++l+k +y +v+++ ++  al++l+e++  +Dl++  +++p+mdG++ll+    e ++lp+i+++ah +  + ++a+ 
  XP_022737338.1  919 VLAVDDDPCFLKVLENMLRKCQY-HVTTTNQAITALKMLRENRnkYDLVISEVNLPDMDGFKLLELVVLE-MDLPVIMLSAHLD--TPIKAIT 1007
                      799********************.***************999889*******************988664.59*********98..999**** PP

                      TTESEEEESS--HHHHHH CS
    Response_reg   92 aGakdflsKpfdpeelvk 109 
                       Ga d+l Kp+ + el +
  XP_022737338.1 1008 HGACDCLLKPVRMTELKN 1025
                      *************99986 PP
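The Start and End columns of the domain table can be cross-checked against the alignment blocks: once gap characters are removed, the query row should contain exactly End - Start + 1 residues. The following is a minimal Python sketch of that check, assuming the query row has been copied as plain text. The helper name `ungapped_span` is hypothetical and is not part of PlantTFDB.

```python
def ungapped_span(aligned: str, start: int, end: int) -> str:
    """Strip gap characters from an alignment row and check that the
    residue count matches the 1-based, inclusive range start..end."""
    # "-" and "." are treated as gap/padding characters (an assumption
    # based on the alignment rows shown in this record).
    residues = aligned.replace("-", "").replace(".", "")
    expected = end - start + 1
    if len(residues) != expected:
        raise ValueError(f"expected {expected} residues, found {len(residues)}")
    return residues


# Query row of the G2-like hit (domain 2), reported as residues 1099-1152.
g2_row = "KPRIVWSVELHREFIAAVNKL-GLDKAVPKKILDLMNIKGLTREHVASHLQKYRL"
g2 = ungapped_span(g2_row, 1099, 1152)
print(len(g2))  # 54
```

A mismatch raises `ValueError`, which is a quick way to catch a mis-split row when the table columns have been run together.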

Sequence
Protein Sequence    Length: 1339 aa
MTVMKLMYRR GRKKFGPNEG IYNKRFEIWS QYFHKQQQDV NNYSLEAGPS RRASELNLSD  60
ESSSRSAGFM IMRQGSMNCQ DCGNKPKKEY CQVFQCQTDV KGTWVPAAKR RRRQQLTAAQ  120
QEKKKIFKNT KTKLQLSVPK ASGSRLTELR KEEEWAAREI HLTGDKHNVS ELLSVPKASG  180
SGLTEPQKEE EWTAKEIHFT GNKHNVSELP KSPNCSSLIA VYLQGNYELT AIPPLFFQHM  240
ALLQVLDLSR TSIKSLPESL PKLVALKRLL LRGCELFMEL SPQVGKLKNL EELDLDETQI  300
MDLPREIGEL LKLRHFRVSF YHICGKKKGK SDIVIHPETI SSLSQLAQLS IDVNPEDKRW  360
EDSVEVVVKE VCNSKALRTL SLYLPRFQLL ENISFLYPSL SHFRFTVGHH KRRVISRVPA  420
EVEAKFRNGD KCLKFVNGEN IPIEIKGVLY YATSFFLDHH KTATNLSEFG IENMKWLKFC  480
LLAECNKMET LIDGEMHYER NEDDRSESDL GSVEHVLESL EYLSIYYMDN LGSIWRGPNR  540
YGCMSKLKFL ALHTCPQLIN IFSHSLLENF VNLEEIILED CPQVTSLLSH ASVKPKISDK  600
IFLPRLKRLL LLYLPELVSI SNGLLIAPKL ESIGCYNCPK LKSILKMELS SKTLKIIKGE  660
CQWWEDLNWN ETEWGNRPAY LMRIFSPINN EKDVMTQLAE DRDLLEATIE NEGQQPDDEK  720
LLDVSARDHK GQCSGYTEEI KTETDMITMS TPSVSPYLYE MQHGSIPMPL KMEEGPMYED  780
WNELAESQIR EMAYLEKKII EAEAELMPKV DVTSPTSEKG MNSDENVSIE SAHLSGPDCV  840
FANEYAGIAK VSIQDESQES EDENLFEVSK QADDQQTDGK AERERPILER NKKRSCLEKM  900
GGSTSEDGHM DRFPVGMRVL AVDDDPCFLK VLENMLRKCQ YHVTTTNQAI TALKMLRENR  960
NKYDLVISEV NLPDMDGFKL LELVVLEMDL PVIMLSAHLD TPIKAITHGA CDCLLKPVRM  1020
TELKNIWQHV VRRKKPDSED QINDPNQDKA RRGTGEAGQT STSSSDQKVK KKRKDQGEDE  1080
EAEGVDNGHE NEDPSTQKKP RIVWSVELHR EFIAAVNKLG LDKAVPKKIL DLMNIKGLTR  1140
EHVASHLQKY RLYLRRLACV ATQQANMSAA LGSKDPSYLG MGSLDGFGDL RTLTGPGMLS  1200
NTSLSSCQPG VMISGLNSSA ALSLRGISPG VIQPGHSQTL NNSIDGLGKI EPAVLPAKQN  1260
QNGTLFRGIP TSVELNQPSQ SKSTNHFGEF NSVNDPNVFG VATYFQDARV TVGSSSNSLF  1320
TASGNPLLLQ ANTQHTEGI
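The sequence above is laid out in blocks of 10 residues, with a running residue count at the end of each full line. Below is a minimal Python sketch for turning that layout back into a contiguous string, assuming the block is pasted as plain text. The function name `parse_numbered_blocks` is hypothetical, and only the first two lines of the record are used as sample input.

```python
def parse_numbered_blocks(text: str) -> str:
    """Join whitespace-separated residue blocks into one string,
    validating the running count at the end of each line when present."""
    seq = ""
    for line in text.strip().splitlines():
        tokens = line.split()
        count = None
        # A trailing all-digit token is the running residue count.
        if tokens and tokens[-1].isdigit():
            count = int(tokens.pop())
        seq += "".join(tokens)
        if count is not None and count != len(seq):
            raise ValueError(f"running count {count} != parsed length {len(seq)}")
    return seq


# First two lines of the Protein Sequence block (residues 1-120).
sample = """
MTVMKLMYRR GRKKFGPNEG IYNKRFEIWS QYFHKQQQDV NNYSLEAGPS RRASELNLSD  60
ESSSRSAGFM IMRQGSMNCQ DCGNKPKKEY CQVFQCQTDV KGTWVPAAKR RRRQQLTAAQ  120
"""
seq = parse_numbered_blocks(sample)
print(len(seq))  # 120
```

The final line of the record carries no running count, so the sketch skips the check for lines that do not end in a number.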
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    109    128   KRRRRQQLTAAQQEKKKIFK
2    1032   1074  RRKKPDSEDQINDPNQDKARRGTGEAGQTSTSSSDQKVKKKRK
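The Start and End coordinates in this record are 1-based and inclusive, so they need a small conversion before they can be used as Python slice indices. The sketch below illustrates the conversion on the first NLS, using a fragment of the protein sequence (residues 61-180) copied from this record. The helper `slice_1based` and its `fragment_start` parameter are hypothetical names introduced for illustration.

```python
def slice_1based(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment whose
    first residue sits at position fragment_start in the full protein."""
    lo = start - fragment_start        # 0-based index of the first residue
    hi = end - fragment_start + 1      # exclusive upper bound for slicing
    if lo < 0 or hi > len(fragment):
        raise IndexError("requested range falls outside the fragment")
    return fragment[lo:hi]


# Residues 61-180 of XP_022737338.1, taken from the Protein Sequence block.
fragment = (
    "ESSSRSAGFMIMRQGSMNCQDCGNKPKKEYCQVFQCQTDVKGTWVPAAKRRRRQQLTAAQ"
    "QEKKKIFKNTKTKLQLSVPKASGSRLTELRKEEEWAAREIHLTGDKHNVSELLSVPKASG"
)
nls1 = slice_1based(fragment, 61, 109, 128)
print(nls1 == "KRRRRQQLTAAQQEKKKIFK")  # True
```

With the full 1339-residue sequence as input, `fragment_start` would be 1 and the same call would apply to the second NLS (1032-1074).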
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G25180.1  1e-101   ARR-B family protein