PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Vocar.0029s0161.1.p
Organism Volvox carteri
Taxonomic ID
Taxonomic Lineage cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Volvocaceae; Volvox
Family Nin-like
Protein Properties Length: 2367 aa    MW: 249278 Da    pI: 8.9081
Description Nin-like family protein
Gene Model
Gene Model ID        Type    Source
Vocar.0029s0161.1.p  genome  JGI
Signature Domain
No.  Domain  Score  E-value  Start  End   HMM Start  HMM End
1    RWP-RK  63.7   3e-20    2241   2289  4          52
               RWP-RK    4 eisledlskyFslpikdAAkeLgvclTvLKriCRqyGIkRWPhRkiksl 52  
                           +i+ledl+k F++p  +AA  Lg ++T LKr+CRq+GI RWPhRk+ sl
  Vocar.0029s0161.1.p 2241 DIRLEDLRKTFNMPAPQAAMALGTSTTNLKRKCRQLGILRWPHRKLDSL 2289
                           799*******************************************997 PP
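The coordinates in the domain table can be cross-checked against the alignment. The following is a minimal sketch, not part of the database output: the helper names `span_length` and `count_identities` are hypothetical, the two rows are copied from the alignment above, and the identity count treats the rows as gap-free, which holds for this alignment.

```python
# HMM consensus and query rows copied from the alignment above.
consensus = "eisledlskyFslpikdAAkeLgvclTvLKriCRqyGIkRWPhRkiksl"
query = "DIRLEDLRKTFNMPAPQAAMALGTSTTNLKRKCRQLGILRWPHRKLDSL"


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range."""
    return end - start + 1


def count_identities(a, b):
    """Count aligned columns whose letters match, ignoring case."""
    if len(a) != len(b):
        raise ValueError("aligned rows must have equal length")
    return sum(x.upper() == y.upper() for x, y in zip(a, b))


seq_len = span_length(2241, 2289)  # query coordinates from the table
hmm_len = span_length(4, 52)       # HMM coordinates from the table
identities = count_identities(consensus, query)
print(seq_len, hmm_len, identities)  # prints: 49 49 29
```

Both ranges span 49 positions, matching the 49 columns of the alignment, and 29 of those columns are identical between the consensus and the query.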

Protein Features
Database         Entry ID  E-value  Start  End   InterPro ID  Description
PROSITE profile  PS51519   17.496   2225   2310  IPR003035    RWP-RK domain
Pfam             PF02042   2.6E-18  2242   2289  IPR003035    RWP-RK domain
Sequence
Protein Sequence    Length: 2367 aa
MQLPYELFPQ HAEAKQAGFS PRTVEAVHVV QPSEDQSFPA AAAANQAPGG GWVPPYIAGA  60
REASGASSHA SRRGPQRPCE PKAQKLHLGG GGGSVEGSWE HVDAQRYDER DVVKVEESLP  120
YKDEVDWTEQ GALHRLGLAA QVAVPWRQRR RLAVWREQES LMQQLLLHAR PVAERGRASG  180
SGGPPATSAA IPSSIPPQNQ SACDTTLRQS YRCRKPLVEP EVASQTDAAG SFEGALSLSP  240
TSPRGPANSD LPWGSVMAPK ANLGDGTGTS PATVRPRLPV QRKPLQPPQR LQPPPQTWRP  300
PGLSELPQPH PHYQQAQCGL QKRQHQQPHQ GNGFVLSRSV LSSLLLAGAA AARRQRGNAA  360
LVYGSGSYPP AHKDFREEVV PTGLSAVRQL TASYGDGVDD RESEEVAAIR KAEAAGSISA  420
AVSARGEYDS VKYDSAAAAA PSSGAAAGVA PAASRQPTDV GHRGATLAAA AAAAGGCGGG  480
DGGGVYINTS GDCHGRTGTT TVGQSVAPPT EGHGRMAVQE KYGYGSDDVQ CVITYEKPDK  540
VPGHARMLKL LQQQQQQQQQ HDLMVTRREM QKNRHYELAS YITDLGCDDG TGLPGGAARE  600
AQRQSLAGGS LIAVSRQHGA TATTTVRATA IAHIAIGQQT QTEAVQKTSA ESGGKPLLPP  660
PAMVAAAAKV AAASSLRQPY IHLKATTPAT PHHANAAAIR GTSGSITTGA RRSESVGASP  720
APTIVRETGT GGLLSGPGAA ATVAEGSPRA AQASPEASFR ICPAENDSPF AAREAAVIAT  780
AAVAPATAVR NGPRRGHLER SEDTSSPDDA AAAGVNIPAA ATEADLFAAA EALLLFKDPL  840
CRGGRDLPGG GGAAATTVAA AGVGVEERPE GVACDSKPVR VRKRRRMSGT PAAEAAQGLP  900
ADAATGVRKP EDMHTHLAPR SRQPTKEQQV TDGCGASELG SGQRPGSGSG QRPGSGSGQR  960
PGSGSGQRPG SGSGQRPGSG SGQRPGSESG QRPGSESGQR PGSESGQRPG SGSGRAKRNK  1020
GAVAGADAGE PPLRPRRSPG GGGGGGSSIQ IIDRSSNRAA AATATATTIT AAATAAPAAV  1080
AAVAPRCVDA ATAVASFDHG DPPMTKDGQW AGASSLPPPQ QRHLLQHHAV LTIPKSPTGN  1140
LNEEALVNDY RRGEMWRRYP SDQRAALVGP GRAPPVNVAT RLLHQNPQHS EPPVRTLIPS  1200
IWFAEQAGSQ ATDQQKVLIN KSLSHQLLLQ QQQQQQQLQP DRQEGEWPQQ QQQQQQPHHH  1260
QQQQQQPRRQ HRQLLQAAQA QNQMRDQHGP PLVPPVSPES NYTQQLRQMD QQQQQQQQAE  1320
WRSEKCGPHA HMPQQHAHQP KQPQQMASQQ EPGSLRADQQ QFQIQIPKQR HGQPQREQRQ  1380
LRHADHQQQQ QVMQQQEQEQ DYGEHVERER QQEQVQPSRI LPSPRSSTLP HAKMRHLAGA  1440
HTDSIFRMPE EQQGLPRNSS GGVSRGQLEG AAALPRRGAP GSAGPSPRPH PHSAPEPHTM  1500
QSHANMSLNI NPASMMDCPG MRAPLDRTQL SAQAVDSRAA LRADRRGEEW EWTSGTATAG  1560
GKGAGAGAGL PAVDLVQWWR ATDDQTETAA AAATAATAAP QSDADAEVRE LEQQLLRMLQ  1620
QQQQQQQPHR QLPSAGLLRE LIDRDAVGFD RMDAIMGPGQ PEPPATGSRL HSNVTGQIDI  1680
PGQQQQQEDH HYHHYHHQQQ HQQQPKQQQE KQRQGEVQEF ARQRLRAALR GTPPSSQPVL  1740
SGAYPDCSGI GPDAGVAGAG VGTEPPFGRS ADGEQQLGTH VRWGASSGGA AAGPWDAAEP  1800
RLERSSRKQR RQEDTEPRVE EEEEEQRRRR KPWQHQHQPQ EQHRGLLQRG QPQLQRALGD  1860
PEEQPLQQQQ PAHHHRNRYH HNHHRHHLPL GFKDDVRSEA MAAPTWSAAR PVNGGPNGSG  1920
GCANSGSGGS AGFVIRSGSH SAVDVEPGQP QQQQQQPATA HRTSGGAVAG RDGDASAASK  1980
LLRDEHSQQQ QQQQQQQPQP QPQQQQALSE MCGAPTHAAA ARQAMQGGLG GDTLRPDSGS  2040
TQGVGPSFAA GGVDATAGRG SGVGGDAGNS TAAAAGKGSA GAAAAVEVGA GSGGSGSAAG  2100
ESTDTHITSG PPPTTAPSPL PGGALAATAA AAAAAAVGAV GECPVPPQPR QRGGTAGTVT  2160
TQQQQQRRRR QRGTRSVGVL RSEHAAIKDG DGDCEGDDDA GAVRDEEEEG GCEAPREGCS  2220
GGGGAAAAAN APAVQRGAGG DIRLEDLRKT FNMPAPQAAM ALGTSTTNLK RKCRQLGILR  2280
WPHRKLDSLN RLRSLVLSDV VEKSQRQALL DSIARNIQQI HQDPNSGLVP SLKNLRQAYY  2340
KRGFDVRARR GGSLVAGTTD SDNCSA*
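The sequence block is formatted in groups of ten residues with a running position counter at the end of each full line, so it needs cleaning before any coordinate lookup. The following is a minimal sketch under stated assumptions: `clean_block` and `region` are hypothetical helper names, only the last three lines of the block (positions 2221 onward) are used as input, and coordinates are taken as 1-based and inclusive, as in the Signature Domain table.

```python
import re


def clean_block(lines):
    """Join numbered sequence lines into one residue string.

    Drops whitespace, the trailing position counter on each line,
    and a terminal '*' (stop symbol) if present.
    """
    residues = []
    for line in lines:
        # Remove the right-hand position counter, e.g. "  2280".
        line = re.sub(r"\s+\d+\s*$", "", line)
        residues.append(re.sub(r"\s+", "", line))
    return "".join(residues).rstrip("*")


def region(seq, start, end, offset=1):
    """Return residues start..end (1-based, inclusive).

    `offset` is the protein position of seq[0].
    """
    return seq[start - offset:end - offset + 1]


# Last three lines of the sequence block; the first residue is 2221.
tail_lines = [
    "GGGGAAAAAN APAVQRGAGG DIRLEDLRKT FNMPAPQAAM ALGTSTTNLK RKCRQLGILR  2280",
    "WPHRKLDSLN RLRSLVLSDV VEKSQRQALL DSIARNIQQI HQDPNSGLVP SLKNLRQAYY  2340",
    "KRGFDVRARR GGSLVAGTTD SDNCSA*",
]
tail = clean_block(tail_lines)
rwp_rk = region(tail, 2241, 2289, offset=2221)
print(rwp_rk)
# prints: DIRLEDLRKTFNMPAPQAAMALGTSTTNLKRKCRQLGILRWPHRKLDSL
```

The extracted 49 residues are the same as the query row of the RWP-RK alignment in the Signature Domain section.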
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    842    850  GGRDLPGGG
2    881    885  RKRRR
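The NLS coordinates can be checked against the sequence block. The following is a minimal sketch, assuming the position counters in the sequence block are 1-based; `locate` is a hypothetical helper, and the input is the single sequence line covering residues 841 to 900 with its spaces and counter removed.

```python
# Residues 841-900 of the protein sequence (one 60-residue line),
# with the spaces and the trailing counter removed.
LINE_START = 841
line = ("CRGGRDLPGG" "GGAAATTVAA" "AGVGVEERPE"
        "GVACDSKPVR" "VRKRRRMSGT" "PAAEAAQGLP")


def locate(motif, segment, segment_start):
    """Return the 1-based, inclusive (start, end) of the first match."""
    i = segment.find(motif)
    if i < 0:
        raise ValueError(f"{motif} not found")
    start = segment_start + i
    return start, start + len(motif) - 1


for motif in ("GGRDLPGGG", "RKRRR"):
    print(motif, locate(motif, line, LINE_START))
# prints:
# GGRDLPGGG (843, 851)
# RKRRR (882, 886)
```

Under that assumption both motifs sit one position higher than the table values (842 to 850 and 881 to 885). This suggests, although the page does not state it, that the NLS table uses 0-based coordinates, whereas the Signature Domain table is 1-based.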
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_002949843.1  0.0      RWP-RK domain transcription factor
TrEMBL  D8TTY4          0.0      D8TTY4_VOLCA; RWP-RK domain transcription factor
STRING  XP_002949843.1  0.0      (Volvox carteri)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
ChlorophytaeOGCP841650
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G53040.1  5e-14    RWP-RK domain-containing protein