This post explains the basics of how GPT models work. My goal is for it to be accessible to everyone, even for those of you without a programming background.
This post explains the basics of how GPT models work. The target audience is data scientists, ML engineers, and anyone with a machine learning background.
This post explains in detail the Transformer architecture used by GPT models. The target audience includes data scientists and ML engineers with a strong technical background.
This post explains how you can add your own data to a pre-trained LLM, such as a GPT model, using Retrieval-Augmented Generation (RAG). The target audience is data scientists and developers working with AI.
This post is a great starting point if you’re new to Azure ML. It will give you a succinct high-level overview of the concepts that make up Azure ML, and provide structure for the technical articles that will follow in this series.
In this post, you’ll learn how to train a simple machine learning model in the cloud, and how to deploy it using a managed endpoint. I assume familiarity with machine learning, but no knowledge of Azure or Azure ML.
In this post, I’ll discuss how to break up your training code into Azure ML components, and how to connect those components into an Azure ML pipeline. I assume that you read my blog post on how to do basic training in Azure ML, or that you have equivalent experience.
This post demonstrates how to mix the three main methods for creating Azure ML resources (CLI, SDK and Studio) in a single project. I recommend that you already have some familiarity with these different methods, either from your own experience or by reading my introductory and basic training posts.
This post shows how to use an Azure ML Sweep Job to do hyperparameter tuning. I assume an understanding of hyperparameter tuning, and familiarity with training models in the cloud using Azure ML.
In this post, you’ll learn how to create endpoints on Azure ML that enable real-time predictions using your MLflow model. I assume familiarity with machine learning concepts, such as training and prediction, but no knowledge of Azure.
In this post, you’ll learn how to create endpoints on Azure ML that enable real-time predictions for a non-MLflow model. Before you read this blog post, I suggest that you read my post on “Creating managed online endpoints in Azure ML,” which explains how to deploy an MLflow model using the same type of endpoint.
In this post, you’ll learn how to create endpoints on Azure ML that enable asynchronous batch predictions using your MLflow model. I assume familiarity with machine learning concepts, such as training and prediction, but no knowledge of Azure.
In this post, you’ll learn how to create endpoints on Azure ML that enable asynchronous batch predictions using a model that has not been saved using MLflow. Before you read this post, I suggest that you read my blog post on “Creating batch endpoints in Azure ML,” which explains how to create similar endpoints for MLflow models.
When creating Azure ML resources, you often have to decide which compute to use to run your code. This post will help you understand how to make the right choice for your scenario. I assume some familiarity with Azure ML in order to follow along.
In this post, you will learn about the different types of environments you can choose from when creating a new Azure ML resource. I assume some familiarity with Azure ML in order to follow along.
In this post, you’ll learn how you can easily share your Azure ML assets (models, components, environments) with other users in your organization, using Azure ML registries. I assume basic familiarity with how to train and deploy models in Azure ML.
The purpose of this post is to get PyTorch developers quickly up to speed with Azure ML. You’ll learn how to train and deploy a simple PyTorch model in the cloud, using the Azure ML Python SDK. It assumes familiarity with PyTorch, but no knowledge of Azure ML.
This article helps PyTorch developers become more efficient with using Azure ML. It builds on the knowledge from my previous post on ‘Training and deploying your PyTorch model in the cloud using Azure ML.’
This article talks about the Azure Container for PyTorch, a new curated environment released by Microsoft that greatly optimizes training and inference for PyTorch models.
This article discusses distributed training on Azure ML, when using PyTorch models. It assumes that you’re familiar with PyTorch and that you know the basics of training and deploying a model on Azure ML.
In this post, you’ll learn how to configure your Linux terminal using dotfiles, which will enable you to quickly set up local and remote environments with all the settings you’re familiar with. This is especially useful if you’re using Codespaces, which is why this post focuses on configuring your Linux terminal for Codespaces.
In this post, you’ll learn how to configure your Windows development machine to work on data science projects. I’ll cover how to install and configure VS Code, WSL, Ubuntu, Zsh, Miniconda, Git, and the terminal.
In this post, I’ll explain how I organize the files in my machine learning projects, and I’ll share with you the GitHub template I use. Most aspects of the guidance in this article are relevant regardless of the editor you use, but some are specific to VS Code.
In this post, I’ll explain how to convert time-series signals into spectrograms and scaleograms. In a future post, we’ll use the images created here to classify the signals. I assume that you have basic math skills and are familiar with basic machine learning concepts.
In this post, I’ll discuss the 2016 paper “Discovering Governing Equations from Data by Sparse Identification of Nonlinear Dynamical Systems” by Brunton et al. I’ll explain the main concepts of the paper in an accessible way, and I’ll show how we can use its novel approach to discover the Lorenz system of equations from data. I assume basic familiarity with ordinary differential equations and dynamical systems.
In this post, I will use the PySINDy Python package to discover a system of ordinary differential equations that best represents my experimental data. I assume that you read my post “Discovering equations from data using SINDy,” and that you have basic familiarity with ordinary differential equations and dynamical systems.
This post introduces PyTorch concepts through the creation of a basic neural network using the Fashion MNIST dataset as a data source. I assume that you have a basic conceptual understanding of neural networks, and that you’re comfortable with Python, but I assume no knowledge of PyTorch.
This post provides all the concepts and practical knowledge you need to get started with TensorFlow. We’ll explore Keras, a high-level API released as part of TensorFlow, and we’ll use it to build a basic neural network using the Fashion MNIST dataset as a data source. I assume that you have a basic conceptual understanding of neural networks, and that you’re comfortable with Python, but I assume no knowledge of TensorFlow or Keras.
In this blog post, we’ll re-implement parts of the code from my earlier Keras post, but this time we’ll use lower-level TensorFlow concepts. I assume that you completed my tutorial on Keras or that you have a solid knowledge of Keras, but I assume no knowledge of TensorFlow.
How do PyTorch code and TensorFlow code compare? Maybe you’re in the beginning phases of your machine learning journey and deciding which framework to embrace, or maybe you’re an experienced ML practitioner considering a change of framework. Either way, you’re in the right place. Drawing from my previous posts, I’ll compare the PyTorch and TensorFlow versions of the code used to classify images in the Fashion MNIST dataset.