PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID AUR62023866-RA
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; Caryophyllales; Chenopodiaceae; Chenopodioideae; Atripliceae; Chenopodium
Family B3
Protein Properties Length: 2041aa    MW: 231973 Da    PI: 6.0892
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
AUR62023866-RAgenomePhytozomeView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B368.21.2e-21262360598
                     -..-HHHHTT-EE--HHH.HTT.....---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-.SS.SEE.. CS
              B3   5 ltpsdvlksgrlvlpkkfaeeh.....ggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkld.gr.sefel 92 
                     ltps v+k+++l +pk fae++     +  + +s+ l +ed++g+ W+++++y+++ + y+l++GW++Fv++++L +gD+v F++  g  ++ ++
  AUR62023866-RA 262 LTPSGVSKLKCLPIPKVFAEKYfpkllS--SIKSILLNFEDLTGKLWTFQYSYWNTYQCYILAEGWSRFVEEKQLSVGDTVFFYRGvGGeDRYKF 354
                     799*********************9554..45899*************************************************54444466777 PP

                     EEEEE- CS
              B3  93 vvkvfr 98 
                     ++++ r
  AUR62023866-RA 355 FIHWSR 360
                     777766 PP

Sequence ? help Back to Top
Protein Sequence    Length: 2041 aa     Download sequence    
MLTKSLINPV SASASSSREY YCESGTRFLR RDSLISHQLF SDAVSIKEDL TAIAMLSPVS  60
KQPKLQLAEQ DAKTSVSNNP QELAVADPFE QSVITSVSNK PRELNVADPF EQSSMTSVSN  120
KPRELNVADP FEHSAMTFVS NKPRELNVAD PFEQSSMTSV SSKPQELNVA DPFAHTTMPS  180
VSTRQPQECN VADSVAQIDM HLREALTAGT ELMENLKLSG SENDLAASEL AEQVILSAKR  240
VQKLLPVKKE MMFRKELLPM DLTPSGVSKL KCLPIPKVFA EKYFPKLLSS IKSILLNFED  300
LTGKLWTFQY SYWNTYQCYI LAEGWSRFVE EKQLSVGDTV FFYRGVGGED RYKFFIHWSR  360
RGALAKQFLG NALLFRMLNS PPNASYDEMD ADDVHAPFTP FPFTPSPLTP YPFTPSPLTP  420
YPFTPSPTLS DGLIVDDALA PYFHSLSPNH CSPCEADFDG VIADKALRYC ALFPHNYEFE  480
KETVIQLWMA HGVFVREPME GNADFFFENW LTRGYFLFCQ TNLVTGKAMY KFKRDCCLFP  540
LSILSGNHVV LIEGNSNLND VSVEVSHVSI RCYTQKNSTV FQELKRFKHL RTLMFLRGHD  600
LSFKQVPRDF FLSLKLLQVL DLSQTCLLEL PSSIGNLKCL RYLDLSETLI KRLPAAIDCL  660
ENLLTLKLRG CLNLTSLPKG MNKLVNLRHL ELDVRRQLRS MPPGLGTLTN IQTLSAFLVD  720
LEDECSIRQL QNMNNLSGNF CICRLENVLT KDEAIEACLC YKSKLTKLEL QWSDSQNEDV  780
SGSGEVLSSL VPNTSLEELH ISCFDGFDLP GWICDPAFSK LVRITLFKCE NCLVLPSLGQ  840
LPSLKFLNIL DFHKLKVIDH NFYGWLDFQG VIAFPKLEKL AIENMLSLEE WSGMEKSVFP  900
CLLKLTIKYC PKLCSLNTLS NLSFLKYLEI SHCDMLESWT DGRLPPSVET LIIEDCPMIS  960
VESLKNGGQD WHKVAHIQSL WVDNQEIPSG KSFHATFLYL SSRRTCIVFF FFFKGQQSSF  1020
PSIVILPLLS QLILQWLNER RFTEVSDTED FHIALLVTLF RLYSKWTGPM VMNFAGDDMA  1080
ASVISTLKLI ILRKKGCIVY DVVTGELSSL NKNTKIDFAL ALLQDLLERF KFLVFEEAGQ  1140
VVQFDAEDEL KKLKRKLLKA ETLFDSFQLT PNNTWQHWVG EVTGVCYDAE DLVDDIVLGA  1200
SKTSVIEKMF SFFERRKMAQ QIQELQGRLE DIISGLDMVN RTNQQALQCS LGCYKEIAHT  1260
REQIRVPVRL FGRESDKEKI VSMLLEETLI TVSIVGMGGL GKTTLAQSIQ DDSRIQEKFH  1320
RIVWISVSAE FDMTKITDFI LNRRQEGEYS FHPERIQSSF GDLYLGRSIL FVLDDLREVK  1380
GNDWCSFCHY FLCSSGSKAL LTTSNPNVTS ITNATPYHLQ MMKDEDCQVL IMDRALSFNN  1440
ISERKLVILE DIAGAMAQKC KGLPLAANIL GLLLSSKHDD DDWVTLSEKD ICELSIFKEE  1500
IFPAFRLSNP NLASHLKKCL AYCSLFPRDY DFEKDNLVQL WMAEGFLLPQ GMTNLEQIGC  1560
EYFNELLWRS VFQLSHLGDQ EMPSYKIHEF IHRFAEFVAS DTCFRLAKGE WSCSAPLYKK  1620
VRHLSLLCDC IKLPFLKEIE KCDGLRTFLL LSEHGTQIGQ VPYSFFQKLV RLRVLDLSHT  1680
NIDELPESLG RLKHLRYLDA SQTHILRLPN SASDLHGLQV LKLSGCSELR ELPKQIKNLT  1740
NLIHLDVDMR KLRCMPASIG SLSCLKTLPA FMVGKKEGYR ITELKNLKHL RGTIYLGKLE  1800
YVRDGAEARE AMICDKPFIK RLELEWSRCS RDGSVQMDIL SGLQPHKNLK ELLLINYGGS  1860
RFPVWLTSPS CLLVSIHVQF CKQDDVLPSL GQLLFLKTLN IEGMDRVKCV DNHFCGEGRN  1920
EAFPSLESLK IQDMMCLARW FKLPDNSMPQ LRELTIEDCP NLLSMQSLKH MNSLQTFELN  1980
RCMELMSLPE LPVSIQSLII TDCDMVKQRC QPEEGPDWSS IRAIPYVEID YETVVPQASS  2040
*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
111501157LKKLKRKL
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G46870.11e-27B3 family protein