Scaling Large Language Models - Practical Multi-GPU and Multi-Node Strategies for 2025

Today's largest LLMs no longer fit on a single GPU. A 70B-parameter model needs roughly 140GB just for its weights in FP16, nearly twice the 80GB an A100 offers. Training or serving these models means distributing the work across multiple GPUs, and doing it wrong wastes most of your compute budget.
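
To sanity-check that figure, the arithmetic fits in a few lines of Python (a rough sketch; the weight_memory_gb helper is just an illustration, not part of any library):

# Back-of-the-envelope weight memory: parameters x bytes per parameter.
def weight_memory_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Memory needed to hold only the model weights, in gigabytes."""
    return num_params * bytes_per_param / 1e9

print(f"70B in FP16: {weight_memory_gb(70e9, 2):.0f} GB")  # ~140 GB
print(f"70B in FP32: {weight_memory_gb(70e9, 4):.0f} GB")  # ~280 GB
# An 80GB A100 can't hold even the FP16 weights, before counting
# activations, gradients, or optimizer state.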

This guide covers practical strategies for scaling LLMs across multiple GPUs and nodes, drawing from Hugging Face's Ultra-Scale Playbook.

Quick Guide: Managing Python on macOS with uv

Quick Start

# Install uv
brew install uv

# For new projects (modern workflow)
uv init                # create project structure
uv add pandas numpy    # add dependencies
uv run train.py        # run your script

# For existing projects (legacy workflow)
uv venv                             # create virtual environment
uv pip install -r requirements.txt  # install dependencies
uv run train.py                     # run your script

# Run tools without installing them
uvx ruff check .       # run linter
uvx black .            # run formatter