Events

Events

Past Events

Wednesday, January 27, 1:00 PM EST

Workshop: Scaling LightGBM training with Dask on Saturn Cloud

In this workshop, attendees will get an introduction to LightGBM, a popular lightweight gradient-boosted decision tree (GBDT) library. This introduction will cover GBDTs generally and LightGBM, specifically. It will also describe which parts of a GBDT can be parallelized and how GBDT training works with multiple machines.

Online Livestream Replay available after live event Free
Watch
Tuesday, January 19, 3:00 PM EST

Workshop: Introduction to PyTorch with Dask: Batch Inference

In this hands-on workshop, attendees will have the opportunity to see how image classification tasks in PyTorch can be easily parallelized using Dask clusters on Saturn Cloud.

Online Livestream Replay available after live event Free
Watch the recording
Wednesday, January 13, 2:00 PM EST

Machine Learning Without Limits: Snowflake and Python Together In The Cloud

In this webinar, we will introduce how data scientists can utilize Snowflake and Saturn Cloud together for their machine learning workloads and more.

Online Livestream Replay available after live event Free
Watch the recording
Thursday, December 18, 2:00 PM EST

Workshop: Escaping MemoryError- Machine Learning on Big Data with Dask

In this hands-on workshop, attendees will be introduced to Dask, a Python-native parallel computing framework. Dask extends traditional Python tools to operate at scale across a cluster of machines, removing memory and compute limitations. Instructors will walk step-by-step through setting up a Dask cluster, processing large datasets efficiently, and performing machine learning model training across the cluster.

Online Livestream Replay available after live event Free
Watch the recording
Thursday, December 10, 2:00 PM EST

Workshop: Introduction to PyTorch with Dask

In this hands-on workshop, attendees will learn how Dask and parallelization can be incorporated into standard PyTorch workflows to create faster inference and training, as well as higher quality training results. Instructors will walk step-by-step through how to run two types of computer vision jobs on GPU Dask clusters: large batch inference and transfer learning.

Online Livestream Replay available after live event Free
Watch the recording
Wednesday, December 9, 1:30 PM EST

Accelerating XGBoost With Python

Join us for an interactive discussion with Aaron Richter, Senior Data Scientist at Saturn Cloud, and Mike McCarty, Director of Software Engineering at Capital One. We will be discussing all things XGBoost along with packages, methods, tips for accelerating XGBoost performance in Python, and more.

Online Livestream Replay available after live event Free
Watch the recording
November 11-15, 2020

Parallel Processing in Python

This talk covers the current landscape of parallel processing tools in Python, with a focus on which tools are best suited for various workloads such as arrays, dataframes, machine learning, and deep learning.

Online Livestream Replay available after live event Free
Watch the recording
Tuesday, November 10, 1:00 PM EST

Workshop: Scaling Machine Learning in Python

In this hands-on workshop, you'll have the opportunity to see how a standard data science and machine learning workflow, using pandas and scikit-learn, can easily be parallelized using Dask clusters. Instructors will walk step-by-step through how to migrate existing Python code to Dask, an open-source framework enabling parallelization of Python.

Online Livestream Replay available after live event Free
Watch the recording
Friday, October 30th, 1:30 PM EST

Data & AI Accessibility: The Democratization of Data Science

Join Saturn Cloud's Senior Data Scientist, Aaron Richter, and Travis Oliphant, CEO of OpenTeams and Quansight for an interactive discussion covering: The creation of NumPy and the start of the OSS PyData community and projects, how the data/AI ecosystem has changed over the last 10-20 years, Dask and Numba, how OSS tools will continue to be well-maintained moving forward, and more.

Online Livestream Replay available after live event Free
Watch the recording
Thursday, October 29th, 10:30 AM PDT

Next-Generation Big Data Pipelines with Prefect and Dask

Data pipelines are crucial to an organization’s data science efforts. They ensure data is collected and organized in a timely and accurate manner, and is made available for analysis and modeling. In this talk, we’ll introduce the next-generation stack for big data pipelines built upon Prefect and Dask, and compare it to popular tools like Spark, Airflow, and the Hadoop ecosystem.

Online Livestream Replay available after live event Free
Watch the recording
Wednesday, August 26, 2:00 PM EST

The Future Of High Performance Computing

Meet the engineering leads behind RAPIDS through an interactive discussion and an opportunity to ask questions during the event

Online Livestream Replay available after live event Free
Watch the recording
TUESDAY, AUGUST 4, 2:00-3:00 PM EST

100x Faster Compute: Scaling Python For Data Science on AWS

Learn how to run up to 100x faster data science workloads in Python with Dask and RAPIDS and understand the infrastructure that’s necessary to launch high performance clusters and GPU machines in AWS.

Online Livestream Replay available after live event Free
Watch the recording