Summer of Code - Getting Started

From Octave
Revision as of 12:41, 3 March 2023 by Pr0m1th3as (talk | contribs) (โ†’โ€ŽSuggested projects: added suggested project)
Jump to navigation Jump to search
Info icon.svg

Since 2011 the GNU Octave project has successfully mentored:

in Summer of Code (SoC) programs by Google and ESA.

Those SoC programs aim to advertise open-source software development and to attract potential new Octave developers.

Steps toward a successful application

  1. ๐Ÿ˜‰๐Ÿ’ฌ We want to get to know you (before the deadline). Communicate with us.
    • Join Octave Discourse or IRC. Using a nickname is fine.
    • Show us that you're motivated to work on Octave ๐Ÿ’ป. There is no need to present an overwhelming CV ๐Ÿ†; evidence of involvement with Octave is more important.
    • If you never talked to us, we will likely reject your proposal, even it looks good ๐Ÿšฎ
  2. ๐Ÿ‘ฉโ€๐Ÿ”ฌ Get your hands dirty.
    • We are curious about your programming skills ๐Ÿš€
    • Use Octave!
      • If you come across something that does not work the way you like โžก๏ธ try to fix that ๐Ÿ”ง
      • Or if you find a missing function โžก๏ธ try to implement it.
  3. ๐Ÿ“๐Ÿ’ก Tell us what you are going to do.
    • Do not write just to say what project you're interested in. Be specific about what you are going to do, include links ๐Ÿ”—, show us you know what you are talking about ๐Ÿ’ก, and ask many smart questions ๐Ÿค“
    • Remember, we are volunteer developers and not your boss ๐Ÿ™‚
  4. ๐Ÿ“” Prepare your proposal with us.
    • Try to show us as early as possible a draft of your proposal ๐Ÿ“‘
    • If we see your proposal for the first time after the application deadline, it might easily contain some paragraphs not fully clear to us. Ongoing interaction will give us more confidence that you are capable of working on your project ๐Ÿ™‚๐Ÿ‘
    • Then submit the proposal following the applicable rules, e.g. for GSoC. ๐Ÿ“จ

How do we judge your application?

Depending on the mentors and SoC program there are varieties, but typically the main factors considered would be:

  • You have demonstrated interest in Octave and an ability to make substantial modifications to Octave
    The most important thing is that you've contributed some interesting code samples to judge your skills. It's OK during the application period to ask for help on how to format these code samples, which normally are Mercurial patches.
  • You showed understanding of your topic
    Your proposal should make it clear that you're reasonably well versed in the subject area and won't need all summer just to read up on it.
  • Well thought out, adequately detailed, realistic project plan
    "I'm good at this, so trust me" isn't enough. In your proposal, you should describe which algorithms you'll use and how you'll integrate with existing Octave code. You should also prepare a project timeline and goals for the midterm and final evaluations.

What you should know about Octave

GNU Octave is mostly written in C++ and its own scripting language that is mostly compatible with Matlab. There are bits and pieces of Fortran, Perl, C, awk, and Unix shell scripts here and there. In addition to being familiar with C++ and Octave's scripting language, you as successful applicant will be familiar with or able to quickly learn about Octave's infrastructure. You can't spend the whole summer learning how to build Octave or prepare a changeset and still successfully complete your project ๐Ÿ˜‡

You should know:

  1. How to build Octave from its source code using the GNU build system.
  2. How to submit patches (changesets).

Suggested projects

The following suggested projects are distilled from the Projects page for the benefit of potential SoC participants. You can also look at our completed past projects for more inspiration.

Info icon.svg
Do you use Octave at your working place or university? Do you have some numerical project in mind? You are always welcome to propose your own projects. If you are passionate about your project, it will be easy to find an Octave developer to mentor and guide you.

Adding regression GAM and kNN classification functionality in statistics package

Generalized Additive Models and k-Nearest Neighbor algorithms are two important tools in advanced statistics used for regression and classification problems, respectively. The statistics package, although heavily developed during the past year, still lacks any functionality regarding these two algorithms. The scope is to implement both the respective class def objects as well as the relevant functions in a MATLAB compatible way.

  • Project size [?] and Difficulty
~350 hours (medium)
  • Required skills
Octave, familiarity with statistical methods
  • Potential mentors
Andreas Bertsatos

ode15{i,s} : Matlab Compatible DAE solvers

An initial implementation of Matlab compatible Differential Algebraic Equations (DAE) solvers, ode15i and ode15s, based on SUNDIALS, was done by Francesco Faccio during GSoC 2016. The code is maintained in the main Octave repository and consists mainly of the following three files: libinterp/dldfcn/__ode15__.cc, scripts/ode/ode15i.m and scripts/ode/ode15s.m.

The decic function for selecting consistent initial conditions for ode15i can be made more Matlab compatible by using another algorithm. Another useful extension is to make ode15{i,s} work with datatypes other than double and to improve interpolation at intermediate time steps.

  • Project size [?] and Difficulty
~350 hours (medium)
  • Required skills
Octave, C/C++; familiarity with numerical methods for DAEs
  • Potential mentors
Francesco Faccio, Carlo de Falco, Marco Caliari, Jacopo Corno, Sebastian Schรถps

PolarAxes and Plotting Improvements

Octave currently provides supports for polar axes by using a Cartesian 2-D axes and adding a significant number of properties and callback listeners to get things to work. What is needed is the implementation of a dedicated "polaraxes" object in C++. This will require creating a new fundamental graphics object type, and programming in C++/OpenGL to render the object. When "polaraxes" exists as an object type, then m-files will be written to access them, including polaraxes.m, polarplot.m, rticks.m, rticklabels.m, thetaticks, thetaticklabels.m, rlim.m, thetalim.m. This relates to bug #49804.

  • Project size [?] and Difficulty
~350 hours (medium)
  • Required skills
Octave, C/C++; optional experience with OpenGL programming
  • Potential mentors
Rik

Table datatype

In 2013, Matlab introduced a new table datatype to conveniently organize and access data in tabular form. This datatype has not been introduced to Octave yet (see bug #44571). However, there are two initial implementation approaches https://github.com/apjanke/octave-tablicious and https://github.com/gnu-octave/table.

Based upon the existing approaches, the goal of this project is to define an initial subset of table functions, which involve sorting, splitting, merging, and file I/O and implement it within the given time frame.

  • Project size [?] and Difficulty
~350 hours (hard)
  • Required skills
Octave, C/C++

TISEAN package

The TISEAN package provides an Octave interface to TISEAN is a suite of code for nonlinear time series analysis. In 2015, another GSoC project started with the work to create interfaces to many TISEAN functions, but there is still work left to do. There are missing functions to do computations on spike trains, to simulate autoregresive models, to create specialized plots, etc. These are of importance for many scientific disciplines involving statistical computations and signal processing.

  • Project size [?] and Difficulty
~350 hours (medium)
  • Required skills
Octave, C/C++; FORTRAN API knowledge
  • Potential mentors
KaKiLa

Better tab completion

Links: https://savannah.gnu.org/bugs/index.php?62492 and https://savannah.gnu.org/bugs/?53384

Description: currently pressing Tab at the Octave command prompt attempts autocompletion of all identifiers in scope (variables, functions, classdefs etc) as well as files and directories in the path. It is not context sensitive.

Project: Improve tab completion. For example,

  • Typing
load x

and then pressing tab should ideally give all loadable files and directories starting with x, not unrelated results like variables or functions.

  • Typing
cd

and tab should only give directories.

  • Any file and directory names that are included in the results should include those with spaces and special characters including those that would be interpreted as operators by Octave.
  • Typing commands like
axis

or

format

and pressing tab should give only those options relevant to that command. E.g. format can be followed by short / long / loose / compact etc but not by a file or variable. Similarly axis can be followed by ij / xy / tight / equal / actual limits etc but not by files or directories. And so on for other commands. This should be made possible for both preexisting commands and for yet-to-be-written commands without any rewriting of existing function code or documentation.

To get more examples, see how bash completion works. You can type git or hg and then tab and it will give the list of available commands. If you type "sort --r" and then tab, it gives the list of options to sort starting with "--r", etc.

Graphics rendering back to front sorting

Several incompatibilities have been identified in how Octave plots transparent objects in 3D, causing certain transparent objects to hide opaque objects behind them even though they're not supposed to. The vast majority of them were isolated to one problem: if the objects to be drawn are rendered such that the one farthest away from the viewer is rendered first and nearer objects are rendered on top of that, then transparency would be automatically achieved, but this needs very careful coding to stay performant and to avoid rendering objects that will be overwritten fully by others. See [1] for a summary.

This project can be a GSoC for a student who understands 3D graphics rendering.

Symbolic package

The Symbolic package provides symbolic computing and other computer algebra system tools via the SymPy Python library. GSoC projects in 2016 and 2022 improved the package.

There are no specific plans for Symbolic in GSoC 2023, but improvements elsewhere that would help Symbolic include:

  • Developing the Octave-Pythonic package.
  • Fix the storage of non-expressions by working with upstream SymPy: currently we rely on deprecated functionality in SymPy.
  • Improvements and fixes to classdef-related issues in Octave itself.
  • Developing the Octave Jupyter kernel.

Project sizes

Since GSoC 2022 there exist two project sizes[1][2]:

  • ~175 hours (~12 weeks, Jun 13 - Sept 12)
  • ~350 hours (~22 weeks, Jun 13 - Nov 21)

Footnotes

See also