PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID PNH10332.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Tetrabaenaceae; Tetrabaena
Family YABBY
Protein Properties Length: 2524 aa    MW: 255399 Da    pI: 7.4374
Description YABBY family protein
Gene Model
Gene Model ID    Type      Source    Coding Sequence
PNH10332.1       genome    NCBI      View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    YABBY   42.7   2e-13    737    789  110        162
       YABBY 110 prvppvirPPekrqrvPsaynrfikeeiqrikasnPdishreafsaaaknWah 162
                 p++ +  +PP kr r Ps yn f+++e++r++a+ P +  r+af  aa  W+ 
  PNH10332.1 737 PQQLQLQQPPAKRSREPSQYNLFVRDEVKRLRAETPGLDNRDAFRQAASKWSV 789
                 6677899********************************************75 PP
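
The Start and End values in the domain table are 1-based, inclusive positions in the protein sequence. As a minimal sketch of how they map onto the sequence, the Python below slices residues 737-789 out of two rows copied from this record's protein sequence (residues 721-840) and compares the result with the query line of the alignment. The helper name `slice_1based` and the variable names are illustrative only and are not part of PlantTFDB.

```python
# Rows covering residues 721-840, copied from the protein sequence in this record.
ROW_START = 721  # 1-based position of the first residue in `rows`
rows = (
    "PGAGTGRPPL QPEGPAPQQL QLQQPPAKRS REPSQYNLFV RDEVKRLRAE TPGLDNRDAF "
    "RQAASKWSVL QSSRRDRSSG GGSALSGPRG ASTVSATTSL GGHAGAQQQV SSGARAASGA"
)
residues = rows.replace(" ", "")  # drop the 10-residue block separators


def slice_1based(seq, seq_start, start, end):
    """Return residues start..end (1-based, inclusive) from a fragment
    whose first residue sits at sequence position seq_start."""
    return seq[start - seq_start : end - seq_start + 1]


# Domain coordinates from the Signature Domain table (Start 737, End 789).
domain = slice_1based(residues, ROW_START, 737, 789)

# Query line of the alignment block, copied from this record.
aligned_query = "PQQLQLQQPPAKRSREPSQYNLFVRDEVKRLRAETPGLDNRDAFRQAASKWSV"

print(len(domain), domain == aligned_query)
```

The slice is 789 - 737 + 1 = 53 residues long, the same length as the HMM span 110-162, which is consistent with the gap-free alignment shown above.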

Sequence
Protein Sequence    Length: 2524 aa
MDSNLCGTEQ QQQQQHHQQQ QRLTEPPTSH NPEHDQVWGE RRLENQQQQQ QCGPQEQAAG  60
AAAAADWGRG ASSSSTGPLA EPAGGLAVAA PQETQAASAV PPPEGEAMPG GPLHSVPLPP  120
LPPHLAQPTP GAVPHPCHHG YHHQPQLQRV QPSLAQGFPA PSPVPYHHTP PPYHQQQQQQ  180
QYHQQPYPPQ PYPPHQTHPP HPCQQQPYPQ QQQQQQQQQQ QQQQQQQAQQ QVQQHQQHQQ  240
QVQQHQQQVP QQQLQQQERG IAATVQVTCE AMDCGVLLEA NIPVEHPPEE PFLVRCGNCR  300
RLLEVRLTRV AVDPAPQPQP GGVGRERGPG GASRPAEGGG GEQQHQQQQQ QQKPQQHHHL  360
HPGPAHAPAA LAAWHAAPAA NPAAANPVLP PGYYPPHSTQ PPPALSPQRQ PGPHPWPAEP  420
QHAPPPWGQA PPVSGPRGGP QPQPPQQTPQ PQQPPPQPRQ QPPRPHGGSL GASGISGASS  480
ATGPPGSLPL PSAAGAAGTA QGEGQQRPLL SPVASWPQPQ ARGAGGGGGG EAGGGGGAGG  540
AAGMMQESES VPCGGQGGDV TAGGMGGGGG QYALPQTAVQ YAAQPGGSDL GGGYGTPYGA  600
PGPATGHYWQ QLPAPSNVQQ QQQQQQQPGP QSLLQSLSGP YGGLGAGVLP AHSGGGAGVP  660
YGLASDLPYG YGQGPGGSSA WPAAEPPLYG SAAGALLQQE ALLTGYPGGG VAGFTACAVI  720
PGAGTGRPPL QPEGPAPQQL QLQQPPAKRS REPSQYNLFV RDEVKRLRAE TPGLDNRDAF  780
RQAASKWSVL QSSRRDRSSG GGSALSGPRG ASTVSATTSL GGHAGAQQQV SSGARAASGA  840
ADALGGGGGH GGGGGGGNGE GEGAPGGPLA SGLLPLPPLP PLPVQEAGGG KAGTQGGEKE  900
AEEAHSMDLS SSATSHLGLA PRGPRQRQHS TAAAPAAASG AATGGGNGDG GGGHAGPPAS  960
PAGAAAAAAA GVTALPAVSS FSSLLADACG AQPLAAAEAV DAEPAAAPYT AAVRPAAGEQ  1020
QQHLNQQRQQ QQRGVTEGMG SAAFRNDDML MQLSGGGNGP GTNAGGAALA PTAAAGGASA  1080
DSIMPASGSR SRLDAASMHA PPPTSAWGSG HEQQHHHHQQ QQQQQQQQQQ QQQQQHPNHP  1140
QQQQQHLQHQ QHLQLQHPQP QQQQHPQQQH QHLQLQHQQQ QQHQQQPHPG APWAAAASGA  1200
ATAAVTAAAA ALGRAAGFSG ARTTWGPQAP ETGAAAGRPP AGVAFPGLTH PAGAGAGPQG  1260
PWGPGLPAPA ANVGSFHEQQ RLAPPPPPQQ QQQQQQQLHE QRYYQHQQDQ RQHQQQQHHQ  1320
QEQEQVARVA GSGQMLASQR YGSGGSGGGA GGGGGRASSG GGQLGSGGGG SGRRGSAAAA  1380
GSVGSAGGGA RAQGGPAPAA RRGLLAGVLA ELQQHQQHQQ HQQHQQHQQH QQHQQHQQHQ  1440
QHQQHQHLQQ NQQQHHHQQD HHQQLQQLQQ HQQHNVRQVQ SGGGSEVGGA SGSRRSSSLA  1500
GMLHGGLGSG GGSGAGGAPP HRPWRPPHEG AAPSGPPPAG SQQRRGGASA QRPAAAEWSV  1560
LQSSRRDRSS GGGSALSGPR GASTVSATTS LGGHAGAQQQ VSSGARDASG AADALGGGGG  1620
HGGGGGGGNG EGEGAPGGPL ASGLLPLLPL PPLPVQEAGG GKAGTQGGEK EAEEAHSMDL  1680
SSSATSHLGL APRGPRQRQH STAAAPAAAS GAATGGGNGD GGGGHAGPPA SPAGAAAAAA  1740
AGVTALPAVS SFSSLLADAC GAQPLAAAEA VDAEPAAAPY TAAVRPAAGE QQQHLNQQRQ  1800
QQQRGVTEGM GSAAFRNDDM LMQLSGGGNG PGTNAGGAAL APTAAAGGAS ADSIMPASGS  1860
RSRLDAASMH APPPTSAWGS GHEQQHHHHQ QQQQQQQQQQ QQQHPNHPQQ QQQHLQHQQH  1920
LQLQHPQPQQ QQHPQQQHQH LQLQHQQQQQ HQQQPHPGAP WAAAASGAAT AAVTAAAAAL  1980
GRAAGFSGAR TTWGPQAPET GAAAGRPPAG VAFPGLTHPA GAGAGPQGPW GPGLPAPAAN  2040
VGSFHEQQRL APPPPPQQQQ QQQQQLHEQR YYQHQQDQRQ HQQQQHHQQE QEQVARVAGS  2100
GQMLASQRYG SGGSGGGAGG GGGRASSGGG QLGSGGGGSG RRGSAAAAGS VGSAGGGARA  2160
QGGPAPAARR GLLAGVLAEL QQHQQHQQHQ QHQQHQQHQQ HQQHQQHQQH QQHQHLQQNQ  2220
QQHHHQQDHH QQLQQLQQHQ QHNVRQVQSG GGSEVGGASG SRRSSSLAGM LHGGLGSGGG  2280
SGAGGAPPHR PWRPPHEGAA PSGPPPAGSQ QRRGGASAQR PAAAEAGGGV ASAAAAAHSP  2340
LVIADLSQLL SIVSNLPMMA AAPLLPHPPP ALRRAAPAAA AAAAWLPPPP GGLQGLDSGC  2400
MVDVAGGGAE RQESLGWQWG DWTWGSGSAG GSQGGSEAAV QDAMEAAAGD QAPQAGAGAS  2460
QRLADGAAAP PPEAAAAAVT AAGRGGDRGG GASAPGGALA SELSFSPFSA LEAIQELHPD  2520
ERED
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1352   1362  GGGGRASSGGG
2    1353   1363  GGGGRASSGGG
3    1354   1364  GGGGRASSGGG
4    2120   2130  GGGGRASSGGG
5    2121   2131  GGGGRASSGGG
6    2122   2132  GGGGRASSGGG
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G23420.1  6e-07    YABBY family protein