Data Loading

Learn to ingest data from CSV files, databases, APIs, streaming sources, and more.

6 Topics

API Data Ingestion: REST APIs and JSON Parsing

Intermediate

API (Application Programming Interface) data ingestion is the process of programmatically extracting data from external services and systems through HTTP…

2 prereqs 4 related ~7 min read

Database Connections: SQL with Pandas and SQLAlchemy

Intermediate

Database connectivity in Python enables programmatic access to relational databases for data storage, retrieval, and manipulation. The primary approach…

2 prereqs 3 related ~6 min read

File Formats for Data: CSV, Parquet, JSON, Excel, and HDF5

Intermediate

File formats define how data is structured, stored, and serialized on disk, each optimized for different use cases across the data lifecycle. CSV…

2 prereqs 3 related ~5 min read

NumPy Fundamentals: Arrays, Data Types, and Vectorization

Beginner

NumPy (Numerical Python) is the fundamental package for scientific computing in Python. It provides support for large, multi-dimensional arrays and matrices,…

1 prereq 3 related ~6 min read

Pandas Basics: DataFrames and Data Loading

Beginner

Pandas is an open-source Python library that provides high-performance, easy-to-use data structures and tools for data analysis. At its core, pandas introduces…

1 prereq 3 related ~5 min read

Streaming Data: Chunking, Iterators, and Large Dataset Handling

Advanced

Streaming data processing is an approach to handling datasets that are too large to fit into memory or data that arrives continuously over time. Unlike batch…

3 prereqs 3 related ~6 min read