Blog

Articles

Sep 9, 2022

10X Faster Hugging Face with Modin

In this post, we show that Modin can speed up machine learning workflows by 10X or more on tasks such as sentiment analysis with the Hugging Face Transformers library....

Articles

Aug 30, 2022

The Intel Distribution of Modin

In a recently released article, Intel describes the benefits of Modin generally, and the Intel Distribution of Modin specifically. We share some excerpts....

Articles

Aug 24, 2022

Professional Pandas: The Pandas Assign Method and Chaining

This is the first in a series of blog posts that teach how to write professional-quality pandas code. In this post, we discuss the pandas assign method, and how it can be used together with chaining to create clean, readable code....

Articles

Aug 18, 2022

Pandas vs. SQL – Part 4: Pandas Is More Convenient

This is the fourth in a series of blog posts that contrast Pandas dataframes with databases (or equivalently, SQL). In this post, we demonstrate how the Pandas dataframe data model provides additional convenience that makes it a great fit for data science and machine learning....

Articles

Aug 12, 2022

Why I Joined Ponder: Hazem Elmeleegy

Hazem Elmeleegy describes his background in computer science, and why he joined Ponder....

Articles

Jul 29, 2022

Pandas Is Now As Popular As Python Was in 2016

In this post, we dig into a decade of Stack Overflow Developer Survey results, and learn that: Python’s popularity more than doubled between 2013 and 2022; people love Python now, but no more than before; pandas is now as popular as Python was in 2016; pandas has high adoption in the Python commun...

Articles

Jul 26, 2022

Pandas vs. SQL – Part 3: Pandas Is More Flexible

This is the third in a series of blog posts that contrast Pandas dataframes with databases (or equivalently, SQL). In this post, we demonstrate how the Pandas dataframe data model provides additional flexibility that makes it a great fit for data science and machine learning, over the relational mod...

Articles

Jul 14, 2022

Pandas vs. SQL – Part 2: Pandas Is More Concise

This is the second in a series of blog posts that contrast Pandas dataframes with databases (or equivalently, SQL). We compare Pandas vs. SQL on the first of three axes: conciseness. We discuss 10 operations that are much more straightforward in Pandas than in SQL, spanning data cleaning, machine le...

Articles

Jun 28, 2022

Pandas vs. SQL – Part 1: The Food Court and the Michelin-Style Restaurant

This is the first in a series of blog posts that contrast Pandas dataframes with databases (or equivalently, SQL). Both dataframes and databases are, in fact, old ideas, dating back multiple decades. Databases opt for scalability, robustness, and efficiency, while Pandas opts for versatility, flexib...
