News – Tagged "neuraxle"

Structuring Machine Learning Code: Design Patterns & Clean Code

by Guillaume Chevalier

Several design patterns are discussed with practical examples and their implications. So not only you want to build neural networks and other machine learning algorithms, but also you want to find the best hyperparameters for them automatically. We’ll here demonstrate how it’s possible in a clean code way.

Read more

How to unit test machine learning code?

by Guillaume Chevalier

Why are unit tests important? Why is testing important? How to do it for machine learning code? Those are questions I will answer. I suggest that ...

Read more

Our top learning resources for AI programmers

by Guillaume Chevalier

You are an Artificial Intelligence (AI) programmer and you'd like to learn how to program well as we do at Neuraxio? Lucky you, we've launched a s...

Read more

AI technologies for eCommerce - The Commerce Show

by Guillaume Chevalier

This is a podcast episode from The Commerce Show. Original description from The Commerce Show: In this episode, we are talking about AI technolog...

Read more

What is Automated Machine Learning (AutoML)? - A Metaphor

by Guillaume Chevalier

Daily, what does a data scientist do? And how can Automated Machine Learning avoid you to babysit your AI, practically?

Here is a metaphor: your data scientist is a mom. A babysitter.

The data scientist creates a nice artificial neural network and trains it on data. Then he’s going to supervise the learning. The data scientist will make sure that the learning converges in the right way so that the artificial neural network can give good predictions and then flourish.

Seriously, that’s all well and good, but it costs time, and it costs money.

Is there anything we can do to automate the process of being a mom - actually being a data scientist? Actually, we can use Automated Machine Learning.

Read more

What's Wrong with Scikit-Learn Pipelines?

by Guillaume Chevalier

Scikit-Learn’s “pipe and filter” design pattern is simply beautiful. But how to use it for Deep Learning, AutoML, and complex production-level pipelines?

Scikit-Learn had its first release in 2007, which was a pre deep learning era. It’s one of the most known and adopted machine learning library, and is still growing. On top of all, it uses the Pipe and Filter design pattern as a software architectural style - it’s what makes Scikit-Learn so fabulous, added to the fact it provides algorithms ready for use. However, it has massive issues when it comes to do the following, which we should be able to do in 2020 already:

Automatic Machine Learning (AutoML),
Deep Learning Pipelines,
More complex Machine Learning pipelines.

Let’s first clarify what’s missing exactly, and then let’s see how we solved each of those problems with building new design patterns based on the ones Scikit-Learn already uses.

TL;DR: How could things work to allow us to do what’s in the above list with the Pipe and Filter design pattern / architectural style that is particular of Scikit-Learn? The API must be redesigned to include broader functionalities, such as allowing the definition of hyperparameter spaces, and allowing a more comprehensive object lifecycle & data flow functionalities in the steps of a pipeline. We coded a solution: that is Neuraxle.

Don’t get me wrong, I used to love Scikit-Learn, and I still love to use it. It is a nice status quo: it offers useful features such as the ability to define pipelines with a panoply of premade machine learning algorithms. However, there are serious problems that they just couldn’t see in 2007, when deep learning wasn’t a thing.

Read more

How to Code Neat Machine Learning Pipelines

by Guillaume Chevalier

Coding Machine Learning Pipelines - the right way.

Have you ever coded an ML pipeline which was taking a lot of time to run? Or worse: have you ever got to the point where you needed to save on disk intermediate parts of the pipeline to be able to focus on one step at a time by using checkpoints? Or even worse: have you ever tried to refactor such poorly-written machine learning code to put it to production, and it took you months? Well, we’ve all been there if working on machine learning pipelines for long enough. So how should we build a good pipeline that will give us flexibility and the ability to easily refactor the code to put it in production later?

First, we’ll define machine learning pipelines and explore the idea of using checkpoints between the pipeline’s steps. Then, we’ll see how we can implement such checkpoints in a way that you won’t shoot yourself in the foot when it comes to put your pipeline to production. We’ll also discuss of data streaming, and then of Oriented Object Programming (OOP) encapsulation tradeoffs that can happen in pipelines when specifying hyperparameters.

Read more

Hello World!

by Guillaume Chevalier

Greetings!

def print_hello_world():
  print("Hello World!")

Hello World!

We’ll be releasing Neuraxle 0.2.0 very soon on PyPI (so you’ll can pip install neuraxle). We’ll also post here tutorials, articles and updates. Stay tuned, register below!

Read more